|
- Hantao Zhou, Runze Hu, Xiu Li. Video
Object Segmentation with Dynamic Query Modulation[C]. In
Proceedings of the 2024 IEEE
International Conference on Multimedia and Expo
(ICME-24). IEEE, 2024.
[Page] [Code]
|
|
- Ronghui Li, YuXiang Zhang, Yachao Zhang, Hongwen Zhang, Jie Guo, Yan
Zhang, Yebin Liu, Xiu Li. Lodge: A
Coarse to Fine Diffusion Network for Long Dance Generation Guided by
the Characteristic Dance Primitives[C]. In Proceedings of
the IEEE/CVF Conference on
Computer Vision and Pattern Recognition (CVPR-24), 2024.
[Page] [Code] [Data]
|
|
- Kai Yang, Jian Tao, Jiafei Lyu, Chunjiang Ge, Qimai Li, Jiaxin Chen,
Weihan Shen, Xiaolong Zhu, Xiu Li. Using
Human Feedback to Fine-tune Diffusion Models without Any Reward
Model[C]. In Proceedings of the IEEE/CVF Conference on
Computer Vision and Pattern Recognition (CVPR-24), 2024.
[Page] [Code] [Data]
|
|
- Yicheng Xiao, Zhuoyan Luo, Yong Liu, Yue Ma, Hengwei Bian, Yatai Ji,
Yujiu Yang, Xiu Li. Bridging the Gap: A Unified Video Comprehension
Framework for Moment Retrieval and Highlight Detection[C].
In Proceedings of the IEEE/CVF Conference on Computer Vision and
Pattern Recognition (CVPR-24), 2024.
[Page] [Code] [Data]
|
|
- Ronghui Li, Yuqin Dai, Yachao Zhang, Jun Li, Jian Yang, Jie Guo, Xiu
Li. Exploring Multi-Modal Control in Music-Driven
Dance Generation[C]. In Proceedings of the International
Conference on Acoustics, Speech and Signal Processing
(ICASSP-24).
[Page]
|
|
- Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, et al. SEABO: A
Simple Search-Based Method for Offline Imitation Learning[C].
In Proceedings of the International Conference on Learning
Representations (ICLR-24), 2024.
[Page] [Code]
|
|
- Chunming He, Kai Li, Yachao Zhang, Yulun Zhang, Zhenhua Guo, Xiu
Li, et al. Strategic Preys Make Acute Predators: Enhancing
Camouflaged Object Detectors by Generating Camouflaged
Objects[C]. In Proceedings of the International Conference
on Learning Representations (ICLR-24), 2024.
[Page] [Code]
|
|
- Yachao Zhang, Runze Hu, Ronghui Li, Yanyun Qu, Yuan Xie, Xiu Li.
Cross-Modal Match for Language Conditioned 3D
Object Grounding[C]. In Proceedings of the AAAI Conference on
Artificial Intelligence (AAAI-24), 2024.
|
|
- Yue Ma, Yingqing He, Xiaodong Cun, Xintao Wang, Siran Chen, Ying Shan,
Xiu Li, et al. Follow Your Pose: Pose-Guided Text-to-Video
Generation using Pose-Free Videos[C]. In Proceedings of
the AAAI Conference on Artificial Intelligence
(AAAI-24), 2024.
[Page] [Code] [Demo]
|
|
- Zunnan Xu, Yachao Zhang, Sicheng Yang, Ronghui Li, Xiu Li. Chain of
Generation: Multi-Modal Gesture Synthesis via Cascaded Conditional
Control[C]. In Proceedings of the AAAI Conference on
Artificial Intelligence (AAAI-24), 2024.
[Page] [Data]
|
|
- C Meng, H Zhang, W Guo, H Guo, H Liu, Y Zhang, H Zheng, R Tang, X
Li, et al. Hierarchical Projection Enhanced Multi-Behavior
Recommendation[C]. In Proceedings of the 29th ACM SIGKDD
Conference on Knowledge Discovery and Data Mining
(SIGKDD-23), 2023: 4649-4660.
[Page] [Code] [Data]
|
|
- Chang Meng, Chenhao Zhai, Yu Yang, Hengyu Zhang, Xiu Li. Parallel
Knowledge Enhancement based Framework for Multi-behavior
Recommendation[C]. In Proceedings of the ACM International
Conference on Information & Knowledge Management
(CIKM-23), 2023: 4649-4660.
[Page] [Code] [Data]
|
|
- C He, K Li, Y Zhang, L Tang, Y Zhang, X Li. Camouflaged object detection with feature
decomposition and edge reconstruction[C]. In Proceedings
of the IEEE/CVF Conference on Computer Vision and Pattern
Recognition (CVPR-23), 2023: 22046-22055.
[Page] [Code] [Data]
|
|
- C He, K Li, Y Zhang, L Tang, Y Zhang, Z Guo, X Li. Weakly-Supervised Concealed Object Segmentation
with SAM-based Pseudo Labeling and Multi-scale Feature
Grouping[C]. Advances in Neural Information Processing
Systems (NeurIPS-23), 2023.
[Page] [Code] [Data]
|
|
- C He, K Li, G Xu, Y Zhang, R Hu, Z Guo, X Li. Degradation-Resistant Unfolding Network for
Heterogeneous Image Fusion[C]. In Proceedings of the
IEEE/CVF International Conference on Computer Vision
(ICCV-23), 2023: 12611-12621.
[Page] [Code] [Data] [Demo]
|
|
- Rui Yang, Lin Song, Yixiao Ge, Xiu Li. BoxSnake: Polygonal Instance Segmentation with
Box Supervision[C]. In Proceedings of the IEEE/CVF
International Conference on Computer Vision (ICCV-23),
2023. arXiv:2303.11630.
[Page] [Code] [Data]
|
|
- Yicheng Xiao, Yue Ma, Shuyan Li, Hantao Zhou, Ran Liao, Xiu Li.
SemanticAC: Semantics-Assisted Framework for
Audio Classification[C]. In Proceedings of the
International Conference on Acoustics, Speech and Signal
Processing (ICASSP-23).
[Page] [Data]
|
|
- L Tang, K Li, C He, Y Zhang, X Li. Consistency Regularization for Generalizable
Source-free Domain Adaptation[C]. In Proceedings of the
IEEE/CVF International Conference on Computer Vision
(ICCV-23), 2023: 4323-4333.
[Page] [Code] [Data]
|
|
- Ronghui Li, Junfan Zhao, Yachao Zhang, Mingyang Su, Zeping Ren, Han
Zhang, Yansong Tang, Xiu Li. FineDance: A Fine-grained Choreography Dataset
for 3D Full Body Dance Generation[C]. In Proceedings of
the IEEE/CVF International Conference on Computer Vision
(ICCV-23), 2023.
[Page] [Code] [Data] [Demo]
|
|
- Y Tang, J Liu, A Liu, B Yang, W Dai, Y Rao, J Lu, J Zhou, X Li.
FLAG3D: A 3D fitness activity dataset
with language instruction[C]. In Proceedings of the
IEEE/CVF Conference on Computer Vision and Pattern Recognition
(CVPR-23), 2023: 22106-22117.
[Page] [Demo]
|
|
- W Li, X Huang, Z Zhu, Y Tang, X Li, J Zhou, J Lu. OrdinalCLIP: Learning rank prompts for
language-guided ordinal regression[C]. Advances in Neural
Information Processing Systems (NeurIPS-22), 2022, 35:
35313-35325.
[Page] [Code] [Data]
|
|
- J Lyu, X Ma, X Li, Z Lu. Mildly conservative Q-learning for offline
reinforcement learning[C]. Advances in Neural Information
Processing Systems (NeurIPS-22), 2022, 35: 1711-1724.
[Page] [Code]
|
|
- J Lyu, X Li, Z Lu. Double Check Your State Before Trusting It:
Confidence-Aware Bidirectional Offline Model-Based
Imagination[C]. Advances in Neural Information Processing
Systems (NeurIPS-22), 2022, 35: 38218-38231.
[Page] [Code]
|
|
- R Yang, H Ma, J Wu, Y Tang, X Xiao, M Zheng, X Li. ScalableViT: Rethinking the context-oriented
generalization of vision transformer[C]. European
Conference on Computer Vision (ECCV-22), 2022: 480-496.
[Page] [Code]
|
|
- Z Chen, C Wang, H Zhao, B Yuan, X Li. D2Animator: Dual Distillation of StyleGAN For
High-Resolution Face Animation[C]. In Proceedings of the
30th ACM International Conference on Multimedia (ACM
MM-22), 2022: 1769-1778.
[Page]
|
|
- Y Ma, Y Wang, Y Wu, Z Lyu, S Chen, X Li, Y Qiao. Visual knowledge graph for human action
reasoning in videos[C]. In Proceedings of the 30th ACM
International Conference on Multimedia (ACM MM-22),
2022: 4132-4141.
[Page] [Code]
|
|
- H Zhang, E Yuan, W Guo, Z He, J Qin, H Guo, B Chen, X Li, R Tang.
Disentangling Past-Future Modeling in Sequential
Recommendation via Dual Networks[C]. In Proceedings of the
31st ACM International Conference on Information & Knowledge
Management (CIKM-22), 2022: 2549-2558.
[Page] [Code] [Data]
|
|
- R Yang, Y Lu, W Li, H Sun, M Fang, Y Du, X Li, et al. Rethinking Goal-conditioned Supervised Learning
and Its Connection to Offline RL[C]. In Proceedings of the
International Conference on Learning Representations
(ICLR-22), 2022.
[Page]
|
|
- Lyu, J., Ma, X., Yan, J., Li, X. Efficient continuous control with double actors
and regularized critics[C]. In Proceedings of the AAAI
Conference on Artificial Intelligence (AAAI-22), 2022.
[Page] [Code]
|
|
- Li, S., Li, X., Lu, J., & Zhou, J. Self-supervised video hashing via bidirectional
transformers[C]. In Proceedings of the IEEE/CVF
Conference on Computer Vision and Pattern Recognition
(CVPR-21), 2021: 13549-13558.
[Page] [Code] [Data]
|
|
- Wang Z, Zhou L, Wang L, Li X. A
Self-boosting Framework for Automated Radiographic Report
Generation[C]. In Proceedings of the IEEE/CVF Conference
on Computer Vision and Pattern Recognition (CVPR-21),
2021: 2433-2442.
[Page]
|
|
- Ma, L., Wang, T., Dong, B., Yan, J., Li, X., Zhang, X. Implicit Feature Refinement for Instance
Segmentation[C]. In Proceedings of the 29th ACM
International Conference on Multimedia (ACM MM-21), 2021:
3088-3096.
[Page]
|
|
- Xu, Z, Lu D, Wang Y, Luo J, Jayender J, Ma K, Zheng Y, Li X. Noisy labels are treasure:
mean-teacher-assisted confident learning for hepatic vessel
segmentation[C]. In Proceedings of the International
Conference on Medical Image Computing and Computer-Assisted
Intervention (MICCAI-21), 2021: 3-13.
[Page]
|
|
- J Yan, H Chen, K Wang, Y Ji, Y Zhu, J Li, D Xie, Z Xu, J Huang, S Cheng,
X Li, J Yao. Hierarchical
attention guided framework for multi-resolution collaborative whole
slide image segmentation[C]. In Proceedings of the
International Conference on Medical Image Computing and
Computer-Assisted Intervention (MICCAI-21), 2021:
153-163.
[Page]
|
|
- Yu B, Li W, Li X, et al. Frequency-aware spatiotemporal transformers for
video inpainting detection[C]. In Proceedings of the IEEE/CVF
International Conference on Computer Vision (ICCV-21),
2021: 8188-8197.
[Page]
|
|
- Xu Z, Yan J, Luo J, Li X, et al. Unsupervised multimodal image registration with
adaptative gradient guidance[C]. In Proceedings of the IEEE
International Conference on Acoustics, Speech and Signal
Processing (ICASSP-21). IEEE, 2021: 1225-1229.
[Page]
|
|
- Ma L, Dong B, Yan J, et al. Matting
enhanced mask R-CNN[C]. In Proceedings of the 2021 IEEE
International Conference on Multimedia and Expo
(ICME-21). IEEE, 2021: 1-6.
[Page]
|
|
- Xu Z, Luo J, Yan J, Li X, et al. Adversarial uni- and multi-modal stream networks
for multimodal image registration[C]. In Proceedings of
the International Conference on Medical Image Computing and
Computer-Assisted Intervention (MICCAI-20), 2020:
222-232.
[Page]
|