Wenhai Wang (王文海)
Affiliation: Fundamental Vision Department, Shanghai AI Laboratory
Address: 701 Yunjin Road, Xuhui District, Shanghai, China
Email: wangwenhai362[at]{163.com, gmail.com} wangwenhai[at]pjlab.org.cn

About Me ([GitHub] [Google Scholar] [CV])

I am a Research Scientist at Shanghai AI Laboratory, collaborated with Prof. Jifeng Dai and Prof. Yu Qiao

Previously, I obtained the Ph.D. degree from Department of Computer Science and Technology, Nanjing University (NJU) in 2021. My academic supervisor is Prof. Tong Lu. I received my B.E degree from Nanjing University of Science and Technology (NUST) in 2016.
I work very close with my friends Dr. Enze Xie and Prof. Xiang Li. I was fortunate to work with Prof. Ping Luo and Prof. Chunhua Shen.

My recent works are mainly on:
The fundamental vision department at Shanghai AI Laboratory is now hiring. If you are interested in internship/researcher positions related to computer vision, please feel free to contact me through the email.

News

Experience

Recent Works ([Full List])

(* Equal contribution, † Interns, # Corresponding authors)
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang*, Jifeng Dai*, Zhe Chen*†, Zhenhang Huang* Zhiqi Li*†, Xizhou Zhu*, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao#
CVPR, 2023 (highlight paper (2.5%))
[Paper] [Code] [BibTex]
A strong large-scale CNN-based fondamention model.
Vision Transformer Adapter for Dense Predictions
Zhe Chen*†, Yuchen Duan*†, Wenhai Wang#, Junjun He, Tong Lu#, Jifeng Dai, Yu Qiao
ICLR, 2023 (spotlight paper (8.0%))
[Paper] [Code] [BibTex]
We design a ViT adapter for dense prediction tasks.

Selected Works ([Full List])

BEVFormer: Learning Bird’s-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Zhiqi Li*†, Wenhai Wang*, Hongyang Li*, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai#
ECCV, 2022
[Paper] [Code] [BibTex]
[ECCV 2022' Top-10 Influential Papers]
[100 Most Cited AI Papers in 2022]
A versatile camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
PVT v2: Improved Baselines with Pyramid Vision Transformer
Wenhai Wang#, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao
CVMJ, 2021 (ESI highly cited paper (1%), ESI hot papers (0.1%))
[Paper] [Code] [中文解读] [Report] [Talk] [BibTex]
A better PVT.
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan#, Kaitao Song, Ding Liang, Tong Lu#, Ping Luo, Ling Shao
ICCV, 2021 (oral presentation (3.4%))
[Paper] [Code] [中译版] [中文解读] [Report] [Talk] [BibTex]
[ICCV21' Top-10 Influential Papers]
A pure Transformer backbone for dense prediction, such as object detection and semantic segmentation.
PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond
Enze Xie*, Wenhai Wang*, Mingyu Ding, Ruimao Zhang, Ping Luo#
TPAMI, 2021
[Paper] [Code] [BibTex]
[CVPR20' Top-10 Influential Papers]
We extend PolarMask (CVPR'20 oral presentation (5.7%)) to several instance-level detection tasks.
PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text
Wenhai Wang*, Enze Xie*, Xiang Li, Xuebo Liu, Ding Liang, Zhibo Yang, Tong Lu#, Chunhua Shen
TPAMI, 2021
[Paper] [Code1] [Code2] [BibTex]
We extend PSENet (CVPR'19) and PAN (ICCV'19) to a text spotting system.

Honors and Awards

Review Services

Senior Program Committee Member
International Joint Conference on Artificial Intelligence (IJCAI), 2021

Journal Reviewer
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
International Journal of Computer Vision (IJCV)
IEEE Transactions on Image Processing (TIP)
IEEE Transactions on Multimedia (TMM)
Computational Visual Media Journal (CVMJ)
Pattern Recognition (PR)

Program Committee Member/Conference Reviewer
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020, 2021, 2022, 2023
Neural Information Processing Systems (NeurIPS), 2020, 2021
International Conference on Machine Learning (ICML), 2021, 2022
International Conference on Learning Representations (ICLR), 2021
IEEE International Conference on Computer Vision (ICCV), 2021
European Conference on Computer Vision (ECCV), 2022
AAAI Conference on Artificial Intelligence (AAAI), 2022
International Joint Conference on Artificial Intelligence (IJCAI), 2022
IEEE Winter Conference on Applications of Computer Vision (WACV), 2021
Asian Conference on Computer Vision (ACCV), 2020