Journal Papers

1.H. Li, D. Long, L. Yuan, Y. Wang, Y. Tian, X. Wang, F. Mo. Decoupled peak property learning for efficient and interpretable electronic circular dichroism spectrum prediction. Nature Computational Science, 5(3): 234-244 (2025).

2.B. Chen, J. Zhang. Practical Compact Deep Compressed Sensing. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(3): 1610-1626 (2025).

3.B. Chen, Z. Zhang, W. Li, C. Zhao, J. Yu, S. Zhao, J. Chen, J. Zhang. Invertible Diffusion Models for Compressed Sensing. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(5): 3992-4006 (2025).

4.W. Gao, L. Xie, S. Fan, G. Li, S. Liu, W. Gao. Deep Learning-based Point Cloud Compression: An In-depth Survey and Benchmark. IEEE Transactions on Pattern Analysis and Machine Intelligence, (2025).

5.M. Liu, J. Liu, Y. Jiang, B. He. Heatmap Pooling Network for Action Recognition from RGB Videos. IEEE Transactions on Pattern Analysis and Machine Intelligence, (2025).

6.S. Yuan, J. Huang, Y. Shi, Y. Xu, R. Zhu, B. Lin, X. Cheng, L. Yuan, J. Luo. Magictime: Time-lapse video generation models as metamorphic simulators. IEEE Transactions on Pattern Analysis and Machine Intelligence, (2025).

7.W. Yu, J. Xing, L. Yuan, W. Hu, X. Li, Z. Huang, X. Gao, T. Wong, Y. Shan, Y. Tian. Viewcrafter: Taming video diffusion models for high-fidelity novel view synthesis. IEEE Transactions on Pattern Analysis and Machine Intelligence, (2025).

8.M. Geng, L. Wang, L. Zhu, W. Zhang, R. Xiong, Y. Tian. Event-Enhanced Snapshot Mosaic Hyperspectral Frame Deblurring. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(1): 206-223 (2025).

9.H. Zhou, Y. Chang, Z. Shi, W. Yan, G. Chen, Y. Tian, L. Yan. Adverse Weather Optical Flow: Cumulative Homogeneous-Heterogeneous Adaptation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(1): 529-548 (2025).

10.L. Zhu, X. Chen, L. Wang, X. Wang, Y. Tian, H. Huang. Continuous-Time Object Segmentation Using High Temporal Resolution Event Camera. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(2): 807-824 (2025).

11.Y. Zhao, J. Li, Z. Song, Y. Tian. Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(2): 1089-1102 (2025).

12.H. Liu, J. Xu, S. Peng, Y. Chang, H. Zhou, Y. Duan, L. Zhu, Y. Tian, L. Yan. NER-Net+: Seeing Motion at Nighttime With an Event Camera. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(6): 4768-4786 (2025).

13.Y. Zhang, M. Lin, M. Xu, Y. Tian, R. Ji. Spatial Re-Parameterization for N:M Sparsity. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(9): 7704-7714 (2025).

14.P. Jin, H. Li, L. Yuan, S. Yan, J. Chen. Hierarchical Banzhaf Interaction for General Video-Language Representation Learning. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(3): 2125-2139 (2025).

15.Y. Wu, S. Zhang, Y. Liu, L. Zhang, X. Zhan, D. Zhou, J. Feng, M. Cheng, L. Zhen. Low-Resolution Self-Attention for Semantic Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(9): 8180-8192 (2025).

16.F. Fan, Y. Zhao, Y. Chen, N. Li, W. Jia, R. Wang. Local Texture Pattern Estimation for Image Detail Super-Resolution. IEEE Transactions on Pattern Analysis and Machine Intelligence, 47(6): 4517-4534 (2025).

17.Z. Nie, X. Liu, J. Chen, Z. Wang, Y. Liu, H. Si, T. Dong, F. Xu, G. Song, Y. Wang, P. Zhou, W. Gao, Y. Tian. A unified evolution-driven deep learning framework for virus variation driver prediction. Nature Machine Intelligence, 7(1): 131-144 (2025).

18.D. Chen, P. Peng, T. Huang, Y. Tian. Fully Spiking Actor Network With Intralayer Connections for Reinforcement Learning. IEEE Transactions on Neural Networks and Learning Systems, 36(2): 2881-2893 (2025).

19.H. Qin, D. Zhou, T. Xu, Z. Bian, J. Li. Factorization Vision Transformer: Modeling Long-Range Dependency With Local Window Cost. IEEE Transactions on Neural Networks and Learning Systems, 36(2): 3151-3164 (2025).

20.Y. Wang, Y. Zhang, R. Xiong, J. Zhang, X. Zhang, T. Huang. Super-Resolving Dynamic Scenes With Spike Camera via Multi-Frame Sequential Alignment With Motion Propagation. IEEE Transactions on Image Processing, 34: 6537-6549 (2025).

21.J. Liu, H. Liu, X. Li, J. Ren, X. Xu. MiLNet: Multiplex Interactive Learning Network for RGB-T Semantic Segmentation. IEEE Transactions on Image Processing, 34: 1686-1699 (2025).

22.W. Zhao, W. Gao, D. Li, J. Wang, G. Liu. LOD-PCAC: Level-of-Detail-Based Deep Lossless Point Cloud Attribute Compression. IEEE Transactions on Image Processing, (2025).

23.J. Cai, M. Liu, H. Liu, W. Li, S. Zhou. NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation. IEEE Transactions on Image Processing, (2025).

24.B. Chen, X. Zhang, S. Liu, Y. Zhang, J. Zhang. Self-supervised Scalable Deep Compressed Sensing. International Journal of Computer Vision, 133(2): 688-723 (2025).

25.S. Yang, X. Zhang, Y. Wang, J. Yu, Y. Wang, J. Zhang. DiffLLE: Diffusion-based Domain Calibration for Weak Supervised Low-light Image Enhancement. International Journal of Computer Vision, 133(5): 2527-2546 (2025).

26.M. Geng, L. Wang, L. Zhu, W. Zhang, R. Xiong, Y. Tian. Towards Ultra High-Speed Hyperspectral Imaging by Integrating Compressive and Neuromorphic Sampling. International Journal of Computer Vision, 133(4): 1587-1610 (2025).

27.X. Zheng, Y. Ma, T. Xi, G. Zhang, E. Ding, Y. Li, J. Chen, Y. Tian, R. Ji. An Information Theory-Inspired Strategy for Automated Network Pruning. International Journal of Computer Vision, 133(8): 5455-5482 (2025).

28.H. Li, D. Long, L. Yuan, Y. Wang, Y. Tian, X. Wang, F. Mo. High-Rate Monocular Depth Estimation via Cross Frame-Rate Collaboration of Frames and Events. International Journal of Computer Vision, 133(10): 7332-7351 (2025).

29.P. Qiao, Y. Wang, C. Liu, L. Shang, B. Sun, Z. Wang, X. Zheng, R. Ji, J. Chen. Adaptive Fuzzy Positive Learning for Annotation-Scarce Semantic Segmentation. International Journal of Computer Vision, 133(3): 1048-1066 (2025).

30.Y. Zhou, D. Zhou, Y. Wang, J. Feng, Q. Hou. MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask. International Journal of Computer Vision, 133(5): 2805-2824 (2025).

31.Y. Zhu, H. Liu, G. Hua, H. Tang, Y. Li, W. Huang. Dual Attention Guidance Network for Self-Supervised Monocular Depth Estimation. IEEE Transactions on Circuits and Systems for Video Technology, (2025).

32.L. Zhu, W. Yan, Y. Chang, Y. Tian, H. Huang. Simultaneous Learning Intensity and Optical Flow From High-Speed Spike Stream. IEEE Transactions on Circuits and Systems for Video Technology, 35(6): 5126-5139 (2025).

33.Y. Bao, W. Tan, C. Jia, M. Li, Y. Liang, Y. Tian. ShiftLIC: Lightweight Learned Image Compression With Spatial-Channel Shift Operations. IEEE Transactions on Circuits and Systems for Video Technology, 35(9): 9428-9442 (2025).

34.Y. Wu, Q. Gao, R. Zhang, H. Li, J. Zhang. Language-Assisted 3D Scene Understanding. IEEE Transactions on Multimedia, 27: 3869-3879 (2025).

35.T. Wang, M. Liu, H. Liu, B. Ren, Y. You, W. Li, N. Sebe, X. Li. Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation. IEEE Transactions on Multimedia, (2025).

36.Y. Wen, M. Liu, Z. Tang, J. Yuan, S. Li, B. Ding. STAR: Skeletal Token Alignment and Rearrangement for Interaction Recognition. IEEE Transactions on Multimedia, (2025).

37.B. Lin, Z. Tang, Y. Ye, J. Cui, B. Zhu, P. Jin, J. Huang, L. Yuan. Moe-llava: Mixture of experts for large vision-language models. IEEE Transactions on Multimedia, (2025).

38.C. Li, T. Li, F. Meng, Q. Mao, Y. Bao, Y. Tian, Y. Liang. One is All: A Unified Rate-Distortion-Complexity Framework for Learned Image Compression Under Energy Concentration Criteria. IEEE Transactions on Multimedia, 27: 3992-4007 (2025).

39.Y. Zhu, X. Wang, C. Li, B. Jiang, L. Zhu, Z. Huang, Y. Tian, J. Tang. CRSOT: Cross-Resolution Object Tracking Using Unaligned Frame and Event Cameras. IEEE Transactions on Multimedia, 27: 6529-6542 (2025).

40.L. Chen, D. Li, X. Wang, P. Shao, W. Zhang, Y. Wang, Y. Tian, J. Tang. Retain, Blend, and Exchange: A Quality-Aware Spatial-Stereo Fusion Approach for Event Stream Recognition. IEEE Transactions on Multimedia, 27: 8926-8939 (2025).

41.Q. Ma, Z. Zhang, P. Qiao, Y. Wang, R. Ji, C. Liu, J. Chen. Dual-Level Masked Semantic Inference for Semi-Supervised Semantic Segmentation. IEEE Transactions on Multimedia, 27: 4029-4042 (2025).

42.Y. Li, M. Cheng, X. Zheng, R. Ji, J. Chen. Oriented-Derivative Representation for Boundary-Aware Polyp Segmentation. IEEE Transactions on Multimedia, 27: 7608-7618 (2025).

43.Y. Chen, Z. Sun, G. Wang, Q. Liang, X. Yu, D. Hao. From Cryptic to Clear - Training on LLM Explanations to Detect Smart Contract Vulnerabilities. ACM Transactions on Software Engineering and Methodology, (2025).

Conference Papers

1.Y. Li, X. Wang, Z. Zhang, Z. Wang, Z. Yuan, L. Xie, Y. Shan, Y. Zou. Image Conductor: Precision Control for Interactive Video Synthesis. AAAI, (2025).

2.W. Zhang, M. Liu, H. Liu, W. Li. SVTformer: Spatial-View-Temporal Transformer for Multi-View 3D Human Pose Estimation. AAAI, (2025).

3.X. Xu, H. Liu, J. Wu, J. Liu. PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes. AAAI, (2025).

4.L. Tang, J. Yang, R. Peng, Y. Zhai, S. Shen, R. Wang. Compressing Streamable Free-Viewpoint Videos to 0.1 MB per Frame. AAAI, (2025).

5.C. Yang, G. Luo, Y. Zhu, J. Li, X. Liu. Robust Image hashing based on Contrastive Masked Autoencoder with Weak-Strong Augmentation Alignment. AAAI, (2025).

6.C. Zhang, W. Gao. AdaDPCC: Adaptive Rate Control and Rate-Distortion-Complexity Optimization for Dynamic Point Cloud Compression. AAAI, (2025).

7.K. Wang, W. Gao. UniPCGC: Towards Practical Point Cloud Geometry Compression via An Efficient Unified Approach. AAAI, (2025).

8.S. Sun, X. Liang, S. Fan, W. Gao, W. Gao. VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment. AAAI, (2025).

9.Z. Pan, N. Zhang, W. Gao, S. Liu, G. Li. Point Cloud Semantic Segmentation With Sparse and Inhomogeneous Annotations. AAAI, (2025).

10.M. Jia, L. Zhao, G. Li, Y. Zheng. Orchestrating the Symphony of Prompt Distribution Learning for Human-Object Interaction Detection. AAAI, (2025).

11.M. Jia, L. Zhao, G. Li, Y. Zheng. ContextHOI: spatial context learning for human-object interaction detection. AAAI, (2025).

12.H. Tang, M. Cao, J. Huang, R. Liu, P. Jin, G. Li, X. Liang. MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval. AAAI, (2025).

13.Z. Tang, J. Zhang, X. Cheng, W. Yu, C. Feng, Y. Pang, B. Lin, L. Yuan. Cycle3d: High-quality and consistent image-to-3d generation via generation-reconstruction cycle. AAAI, (2025).

14.C. Feng, W. Yu, X. Cheng, Z. Tang, J. Zhang, L. Yuan, Y. Tian. AE-NeRF: Augmenting Event-Based Neural Radiance Fields for Non-ideal Conditions and Larger Scenes. AAAI, (2025).

15.H. Xu, P. Peng, X. Zhang, G. Tan, Y. Li, S. Wang, L. Li. Exploiting Continuous Motion Clues for Vision-Based Occupancy Prediction. AAAI, (2025).

16.Z. Sun, S. Qi, X. Huang, X. Xiao, J. Zhang, X. Wang, P. Peng. Towards Building Human-like Smart Agents in Modern 3D Video Games (Student Abstract). AAAI, (2025).

17.Z. Liu, P. Peng, Y. Tian. Visual Reinforcement Learning with Residual Action. AAAI, (2025).

18.Z. Cheng, K. Li, H. Li, P. Jin, X. Zheng, C. Liu, J. Chen. Aligning Instance Brownian Bridge with Texts for Open-Vocabulary Video Instance Segmentation. AAAI, (2025).

19.S. Li, P. Wei, P. Qiao, C. Liu, J. Chen. DigitalLLaVA: Incorporating Digital Cognition Capability for Physical World Comprehension in Multimodal LLMs. AAAI, (2025).

20.L. Liang, D. Yang, X. Zhuang, Y. Xie, L. Chen, Y. Jin, Y. Yin, Y. Xie, W. Yang, D. Yang, J. Ru, X. Zhuang, L. Liang, Y. Zou. ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors. ACL Main, (2025).

21.H. Wang, H. Liu, J. Ren, M. Tan, Z. Jiang. CLIP-6D: Empowering CLIP as a Zero-Shot 6D Pose Estimator Through Generalizable Object-Specific Representations. ACM MM, (2025).

22.Y. Hu, J. Ma, Y. Yang, J. Liang, J. Yan, J. Wu, J. Yang, Y. Deng, R. Wang. Excavating the Most Critical Gaussians: Sparse Selection and Structural Optimization for Efficient 3DGS Compression. ACM MM, (2025).

23.W. Gao, L. Xie, K. Wang, J. Su, C. Peng, W. Gao. DPCSet: A Large-scale Dynamic Point Cloud Dataset for Compression and Perception. ACM MM, (2025).

24.H. Li, B. Qu, W. Gao. T23D-QA: An Open Dataset and Benchmark for Text-driven 3D Generation Quality Assessment. ACM MM, (2025).

25.H. Zheng, W. Gao. OpenMVC: An Open-Source Library for Learning-based Multi-view Compression. ACM MM, (2025).

26.H. Zheng, L. Zhou, W. Gao. SCID-Compress900: A Multi-Scene Dataset of 4K and 1080P Screen Content Images for Image Compression Research. ACM MM, (2025).

27.Z. Sun, Q. Xu, Q. Zhang, S. Liu, G. Li. Overfitted Point Cloud Attribute Code Using Sparse Hierarchical Implicit Neural Representations. ACM MM, (2025).

28.H. Zhou, W. Yu, J. Guan, X. Cheng, Y. Tian, L. Yuan. Holotime: Taming video diffusion models for panoramic 4d scene generation. ACM MM, (2025).

29.C. Feng, Z. Tang, W. Yu, Y. Pang, Y. Zhao, J. Zhao, L. Yuan, Y. Tian. E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras. ACM MM, (2025).

30.J. Zhai, Z. Mai, D. Zheng, C. Wang, X. Zheng, H. Li, F. Yang, Y. Tian. Learning Transition Patterns by Large Language Models for Sequential Recommendation. COLING, (2025).

31.S. Yang, K. Ning, Y. Liu, J. Yao, Y. Tian, Y. Song, L. Yuan. Is Parameter Collision Hindering Continual Learning in LLMs? COLING, (2025).

32.D. Zheng, H. Zhang, J. Zhai, L. Zhong, L. Wang, J. Feng, X. Liao, Y. Tian, N. Xiao, Q. Liao. FedCSR: A Federated Framework for Multi-Platform Cross-Domain Sequential Recommendation with Dual Contrastive Learning. COLING, (2025).

33.Y. Li, H. Chen, H. Zhang, Z. Ge, T. Li, S. Xu, G. Luo. Unraveling the Mystery: Defending Against Jailbreak Attacks Via Unearthing Real Intention. COLING, (2025).

34.X. Zhang, Z. Tang, Z. Xu, R. Li, Y. Xu, B. Chen, F. Gao, J. Zhang. OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking. CVPR, (2025).

35.G. Li, B. Chen, C. Zhao, L. Zhang, J. Zhang. OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction. CVPR, (2025).

36.H. Li, Y. Wu, J. Meng, Q. Gao, Z. Zhang, R. Wang, J. Zhang. InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception. CVPR, (2025).

37.Y. Wang, Y. Zhang, R. Xiong, J. Zhao, J. Zhang, X. Fan, T. Huang. Spk2SRImgNet: Super-Resolve Dynamic Scene from Spike Stream via Motion Aligned Collaborative Filtering. CVPR, (2025).

38.Y. Wang, Q. Zhao, R. Yu, H. Tsui, A. Zeng, J. Lin, Z. Luo, J. Yu, X. Li, Q. Chen, J. Zhang, L. Zhang, P. Tan. SkillMimic: Learning Basketball Interaction Skills from Demonstrations. CVPR, (2025).

39.B. Chen, G. Li, R. Wu, X. Zhang, J. Chen, J. Zhang, L. Zhang. Adversarial Diffusion Compression for Real-World Image Super-Resolution. CVPR, (2025).

40.X. Zhuang, Z. Zhu, Y. Xie, L. Liang, Y. Zou. VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware Sparsification. CVPR, (2025).

41.J. Yan, R. Peng, Z. Wang, L. Tang, J. Yang, J. Liang, J. Wu, R. Wang. Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting. CVPR, (2025).

42.Z. Yan, Y. Zhao, S. Chen, M. Guo, X. Fu, T. Yao, S. Ding, Y. Wu, L. Yuan. Generalizing deepfake video detection with plug-and-play: Video-level blending and spatiotemporal adapter tuning. CVPR, (2025).

43.W. Yu, C. Feng, J. Li, J. Tang, J. Yang, Z. Tang, M. Cao, L. Yuan, Y. Tian. Evagaussians: Event stream assisted gaussian splatting from blurry images. CVPR, (2025).

44.S. Yuan, J. Huang, X. He, Y. Ge, Y. Shi, L. Chen, J. Luo, L. Yuan. Identity-preserving text-to-video generation by frequency decomposition. CVPR, (2025).

45.Z. Li, B. Lin, Y. Ye, L. Chen, X. Cheng, S. Yuan, L. Yuan. Wf-vae: Enhancing video vae by wavelet-driven energy flow for latent video diffusion model. CVPR, (2025).

46.Y. Pang, B. Zhu, B. Lin, M. Zheng, F. Tay, S. Lim, H. Yang, L. Yuan. Dreamdance: Animating human images by enriching 3d geometry cues from 2d poses. CVPR, (2025).

47.Z. Huang, W. Yu, X. Cheng, C. Zhao, Y. Ge, M. Guo, L. Yuan, Y. Tian. Roompainter: View-integrated diffusion for consistent indoor scene texturing. CVPR, (2025).

48.Q. Zhang, M. Ning, Z. Liu, Y. Huang, S. Yang, Y. Wang, J. Ye, X. Chen, Y. Song, L. Yuan. UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation. CVPR, (2025).

50.D. Li, J. Li, X. Liu, X. Fan, Y. Tian. Asynchronous Collaborative Graph Representation for Frames and Events. CVPR, (2025).

51.Y. Dong, R. Xiong, X. Fan, Z. Yu, Y. Tian, T. Huang. Self-Supervised Learning for Color Spike Camera Reconstruction. CVPR, (2025).

52.K. Wang, Q. Ma, W. Wan, H. Li, K. Wang, Y. Tian. Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body. CVPR, (2025).

53.X. Wang, Y. Jin, W. Wu, W. Zhang, L. Zhu, B. Jiang, Y. Tian. Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset. CVPR, (2025).

54.H. Xu, P. Peng, G. Tan, Y. Chang, L. Li, Y. Tian. VLMs-Guided Representation Distillation for Efficient Vision-Based Reinforcement Learning. CVPR, (2025).

55.J. Wu, R. Peng, J. Jiao, J. Yang, L. Tang, K. Xiong, J. Liang, J. Yan, R. Liu, R. Wang. LocalDyGS: Multi-view Global Dynamic Scene Modeling via Adaptive Local Implicit Feature Decoupling. ICCV, (2025).

56.W. Liu, W. Gao. Omni-scene Perception-oriented Point Cloud Geometry Enhancement for Coordinate Quantization. ICCV, (2025).

57.R. Liu, S. Sun, H. Tang, W. Gao, G. Li. Flow4Agent: Long-form Video Understanding via Motion Prior from Optical Flow. ICCV, (2025).

58.X. Liang, Y. Fan, Q. Yang, X. Wang, W. Gao, G. Li. DGTalker: Disentangled Generative Latent Space Learning for Audio-Driven Gaussian Talking Heads. ICCV, (2025).

60.Z. Wang, P. Li, H. Liu, Z. Deng, C. Wang, J. Liu, J. Yuan, M. Liu. Recognizing Actions from Robotic View for Natural Human-Robot Interaction. ICCV, (2025).

61.P. Li, Z. Wang, Y. Yuan, H. Liu, X. Meng, J. Yuan, M. Liu. UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling. ICCV, (2025).

62.G. Xu, P. Jin, Z. Wu, H. Li, Y. Song, L. Sun, L. Yuan. Llava-cot: Let vision language models reason step-by-step. ICCV, (2025).

63.Z. Xu, X. Zhang, R. Li, Z. Tang, Q. Huang, J. Zhang. FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models. ICLR, (2025).

64.X. Zhang, J. Meng, Z. Xu, S. Yang, Y. Wu, R. Wang, J. Zhang. SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography. ICLR, (2025).

65.X. Zhuang, Z. Zhu, Z. Wang, X. Cheng, Y. Zou. UniCoTT: A Unified Framework for Structural Chain-of-Thought Distillation. ICLR, (2025).

66.J. Wu, R. Peng, Z. Wang, L. Xiao, L. Tang, J. Yan, K. Xiong, R. Wang. Swift4D: Adaptive divide-and-conquer Gaussian Splatting for compact and efficient reconstruction of dynamic scene. ICLR, (2025).

67.P. Jin, B. Zhu, L. Yuan, S. Yan. Moe++: Accelerating mixture-of-experts methods with zero-computation experts. ICLR, (2025).

68.K. Ning, S. Yang, Y. Liu, J. Yao, Z. Liu, Y. Tian, Y. Song, L. Yuan. PiCO: Peer Review in LLMs based on Consistency Optimization. ICLR, (2025).

69.J. Zhai, Z. Mai, C. Wang, F. Yang, X. Zheng, H. Li, Y. Tian. Multimodal Quantitative Language for Generative Recommendation. ICLR, (2025).

70.P. Jin, B. Zhu, L. Yuan, S. Yan. Moh: Multi-head attention as mixture-of-head attention. ICML, (2025).

71.Z. Yan, J. Wang, P. Jin, K. Zhang, C. Liu, S. Chen, T. Yao, S. Ding, B. Wu, L. Yuan. Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection. ICML, (2025).

72.E. Xie, J. Chen, Y. Zhao, J. Yu, L. Zhu, Y. Lin, Z. Zhang, M. Li, J. Chen, H. Cai, B. Liu, D. Zhou, S. Han. SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer. ICML, (2025).

73.D. Xiao, G. Chen, P. Peng, Y. Huang, Y. Zhao, Y. Dai, Y. Tian. When Every Millisecond Counts: Real-Time Anomaly Detection via the Multimodal Asynchronous Hybrid Network. ICML, (2025).

74.J. Xu, H. Liu, J. Wu, X. Xu. PRIDEV: A Plug-and-Play Refinement for Improved Depth Estimation in Videos. ICRA, (2025).

75.J. Ren, H. Liu, J. Liu, P. Jiang. Unified End-to-end Network for Category-level and Instance level Object Pose Estimation from RGB Images. ICRA, (2025).

76.S. Fan, W. Gao, Z. Chen, G. Li, G. Liu, Q. Wang. Stochasticity-aware No-Reference Point Cloud Quality Assessment. IJCAI, (2025).

77.Z. Shu, Y. Deng, H. Zhang, Z. Nie, J. Chen. MTPNet: Multi-Grained Target Perception for Unified Activity Cliff Prediction. IJCAI, (2025).

78.Y. Zhu, H. Liu, J. Wu, M. Liu. TCNet: A Temporally Consistent Network for Self supervised Monocular Depth Estimation. IROS, (2025).

79.B. Yin, J. Lin, J. Wen, Y. Li, J. Liu, Y. Wang, M. Liu. Recognizing Skeleton-Based Actions As Points. IROS, (2025).

80.J. Dong, J. Sun, W. Zhang, J. Dong, D. Hao. ConTested: Consistency-Aided Tested Code Generation with LLM. ISSTA, (2025).

81.J. Xie, Z. Zhang, Z. Weng, Y. Zhu, G. Luo. MedDiff-FT: Data-Efficient Diffusion Model Fine-Tuning with Structural Guidance for Controllable Medical Image Synthesis. MICCAI, (2025).

82.L. Xie, Y. Li, Y. Tang, W. Gao. Efficient Geometry Compression and Communication for 3D Gaussian Splatting Point Clouds. MobiCom, (2025).

83.J. Zhang, Y. Du, Q. Wang, W. Li, Y. Gu, J. Zhang. AlignedGen: Aligning Style Across Generated Images. NeurIPS, (2025).

84.J. Fu, Q. Gao, C. Wen, Y. Wu, S. Ma, J. Zhang, J. Zhang. ReCon-GS: Continuum-Preserved Gaussian Streaming for Fast and Compact Reconstruction of Dynamic Scenes. NeurIPS, (2025).

85.W. Li, X. Zhang, S. Zhao, Y. Zhang, J. Li, L. Zhang, J. Zhang. Q-Insight: Understanding Image Quality via Visual Reinforcement Learning. NeurIPS, (2025).

86.Z. Liu, M. Ning, Q. Zhang, S. Yang, Z. Wang, Y. Yang, X. Xu, L. Yuan. CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step. NeurIPS, (2025).

87.Y. Ye, X. He, Z. Li, B. Lin, S. Yuan, Z. Yan, B. Hou, L. Yuan. Imgedit: A unified image editing dataset and benchmark. NeurIPS, (2025).

88.Y. Li, C. Feng, Z. Tang, K. Deng, W. Yu, Y. Tian, L. Yuan. GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation. NeurIPS, (2025).

89.S. Yuan, X. He, Y. Deng, Y. Ye, J. Huang, B. Lin, J. Luo, L. Yuan. Opens2v-nexus: A detailed benchmark and million-scale dataset for subject-to-video generation. NeurIPS, (2025).

90.H. Li, H. Cao, B. Feng, Y. Shao, X. Tang, Z. Yan, L. Yuan, Y. Tian, Y. Li. Beyond Chemical QA: Evaluating LLM's Chemical Reasoning with Modular Chemical Operations. NeurIPS, (2025).

91.L. Li, G. Zhao, L. Zhu, Z. Cai, L. Yu, J. Zhang, Z. Wang. AssetDropper: Asset Extraction via Diffusion Models with Reward-Driven Optimization. SIGGRAPH, (2025).

92.C. Mou, Y. Wu, W. Wu, Z. Guo, P. Zhang, Y. Cheng, Y. Luo, F. Ding, S. Zhang, X. Li, M. Li, M. Liu, Y. Zhang, S. Wu, S. Zhao, J. Zhang, Q. He, X. Wu. DreamO: A Unified Framework for Image Customization. SIGGRAPH Asia, (2025).

93.Y. Li, L. Li, Z. Zhang, X. Li, G. Wang, H. Li, X. Cun, Y. Shan, Y. Zou. BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing. SIGGRAPH Asia, (2025).
