39. VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
[Paper]
[Code]
[BibTex]
Wenhai Wang*^, Zhe Chen*, Xiaokang Chen*, Jiannan Wu*, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai#
Neural Information Processing Systems (NeurIPS), 2023.
|
38. EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought
[Paper]
[Code]
[BibTex]
Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo#
Neural Information Processing Systems (NeurIPS), 2023. (Spotlight Paper (3.1%))
|
37. Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection
[Paper]
[Code]
[BibTex]
Linyan Huang, Zhiqi Li, Chonghao Sima, Wenhai Wang, Jingdong Wang, Yu Qiao, Hongyang Li#
Neural Information Processing Systems (NeurIPS), 2023.
|
36. Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
[Paper]
[Code]
[BibTex]
Hongyang Li#, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Enze Xie, Zhiqi Li, Hanming Deng, Hao Tian, Xizhou Zhu, Li Chen, Yulu Gao, Xiangwei Geng, Jia Zeng, Yang Li, Jiazhi Yang, Xiaosong Jia, Bohan Yu, Yu Qiao, Dahua Lin, Si Liu, Junchi Yan, Jianping Shi, Ping Luo
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.
|
35. AvSegformer: Audio-Visual Segmentation with Transformer
[Paper]
[Code]
[BibTex]
Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu#
Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), 2023.
|
34. Feature Selection Based on Intrusive Outliers Rather Than All Instances
[Paper]
[BibTex]
Lixin Yuan, Cheng Mei, Wenhai Wang, Tong Lu#
IEEE Transactions on Image Processing (TIP), 2023.
|
33. FB-BEV: BEV Representation from Forward-Backward View Transformations
[Paper]
[Code]
[BibTex]
Zhiqi Li, Zhiding Yu, Wenhai Wang, Anima Anandkumar, Tong Lu#, Jose M. Alvarez#
IEEE/CVF International Conference on Computer Vision (ICCV), 2023.
|
32. InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
[Paper]
[Code]
[BibTex]
Wenhai Wang*^, Jifeng Dai*, Zhe Chen*†, Zhenhang Huang*, Zhiqi Li*†, Xizhou Zhu*, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao#
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023. (Highlight Paper (2.5%))
|
31. Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
[Paper]
[Code]
[BibTex]
Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023. (Highlight Paper (2.5%))
|
30. Goal-oriented Autonomous Driving
[Paper]
[Code]
[BibTex]
Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023. (Best Paper Award)
|
29. Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers
[Paper]
[Code]
[BibTex]
Bo Dong, Wenhai Wang, Deng-Ping Fan#, Jinpeng Li, Huazhu Fu, Ling Shao
CAAI Artificial Intelligence Research (CAAI AIR), 2023
|
28. Vision Transformer Adapter for Dense Predictions
[Paper]
[Code]
[BibTex]
Zhe Chen*†, Yuchen Duan*†, Wenhai Wang#^, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao
International Conference on Learning Representations (ICLR), 2023. (Spotlight Paper (8.0%))
|