Wenhai Wang (王文海)
Department of Computer Science and Technology, Nanjing University (NJU)
Address: 163 Xianlin Avenue, Qixia District, Nanjing, China
Email: wangwenhai362 [at] {163.com, smail.nju.edu.cn}

About Me ([GitHub] [Google Scholar] [CV])

I am a Ph.D. candidate at Department of Computer Science and Technology, Nanjing University (NJU). My academic supervisor is Prof. Tong Lu. I received my bachelor degree from School of Computer Science and Engineering, Nanjing University of Science and Technology (NUST) in 2016.

I work very close with my friends Enze Xie and Xiang Li. My recent works are mainly on scene text detection/recognition, deep neural networks exploration, object detection and instance segmentation.

News

Experience

  • Oct. 2019 - Mar. 2020, research assistant at the University of Hong Kong (HKU), hosted by Dr. Ping Luo
  • Aug. 2019 - Mar. 2020, research intern at SenseTime Group Limited, hosted by Xuebo Liu and Ding Liang
  • Jun. 2018 - Dec. 2018, research intern at Momenta, hosted by Xiang Li

Selected Publications ([Full List])

Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao
Technical Report, 2021
[Paper] [Code] [中文解读] [Report] [BibTex]
A pure Transformer backbone for dense prediction, such as object detection and semantic segmentation.
PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text
Wenhai Wang, Enze Xie, Xiang Li, Xuebo Liu, Ding Liang, Zhibo Yang, Tong Lu, Chunhua Shen
TPAMI, 2021
[Paper] [Code] [BibTex]
We extend PSENet (CVPR'19) and PAN (ICCV'19) to a text spotting system.
AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting
Wenhai Wang, Xuebo Liu, Xiaozhong Ji, Enze Xie, Ding Liang, Zhibo Yang, Tong Lu, Chunhua Shen, Ping Luo
in ECCV, 2020
[Paper] [Dataset] [Code] [BibTex]
We introduce linguistic information to eliminate the ambiguity in text detection.
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
Wenhai Wang, Enze Xie, Xiaoge Song, Yuhang Zang, Wenjia Wang, Tong Lu, Gang Yu, Chunhua Shen
in ICCV, 2019
[Paper] [Poster] [Code] [BibTex]
We propose an efficient method for arbitrary-shaped text detection.
Shape Robust Text Detection with Progressive Scale Expansion Network
Wenhai Wang, Enze Xie, Xiang Li, Wenbo Hou, Tong Lu, Gang Yu, Shuai Shao
in CVPR, 2019
[Paper] [Poster] [Code] [BibTex]
We proposed a segmentation-based text detector that can precisely detect text instances with arbitrary shapes.
Mixed Link Networks
Wenhai Wang, Xiang Li, Jian Yang, Tong Lu
in IJCAI, 2018
[Paper] [Poster] [Code] [BibTex]
We proposed an parameter-efficient convolutional neural networks for image classification.
PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond
Enze Xie, Wenhai Wang, Mingyu Ding, Ruimao Zhang, Ping Luo
TPAMI, 2021
[Paper] [Code] [BibTex]
We extend PolarMask(CVPR'20) to several instance-level detection tasks.
PolarMask: Single Shot Instance Segmentation with Polar Representation
Enze Xie, Peize Sun, Xiaoge Song, Wenhai Wang, Chunhua Shen, Ping Luo
in CVPR, 2020 (oral presentation)
[Paper] [Code] [中文解读] [Talk] [CVPR20' Top-10 Influential Papers] [BibTex]
We introduced a Polar Representation to reformulate the instance segmentation problem.
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection
Xiang Li, Wenhai Wang, Lijun Wu, Shuo Chen, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang
in NeurIPS, 2020
[Paper] [Code] [BibTex]
We propose the generalized focal loss for learning the improved representations of dense object detector.
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection
Xiang Li, Wenhai Wang, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang
in CVPR, 2021
[Paper] [Code] [BibTex]
The improved version of GFocal!
Selective Kernel Networks
Xiang Li, Wenhai Wang, Xiaolin Hu, Jian Yang
in CVPR, 2019
[Paper] [Code] [BibTex]
We proposed a dynamic selection mechanism in convolutional neural networks.

Contest

  • National Artificial Intelligence Challenge (NAIC) 2020, Remote Sensing Semantic Segmentation Task, 1st Place (1,000,000 RMB Bonus).
  • ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text, Task1, 1st Place.
  • ICDAR2019 Robust Reading Challenge on Large-scale Street View Text with Partial Labeling, Task1, 2nd Place.
  • AI Challenger 2018 Autonomous Driving Perception Task, 2nd Place (40,000 RMB Bonus)
  • ACM-ICPC Asia Regional Contest, Silver Medal

Review Services

Journal Reviewer
IEEE Transactions on Multimedia (T-MM)
(Senior) Program Committee Member/Conference Reviewer
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020, 2021
Neural Information Processing Systems (NeurIPS), 2020, 2021
International Conference on Machine Learning (ICML), 2021
IEEE International Conference on Computer Vision (ICCV), 2021
International Joint Conference on Artificial Intelligence (IJCAI), 2021
IEEE Winter Conference on Applications of Computer Vision (WACV), 2021
Asian Conference on Computer Vision (ACCV), 2020