Projects per year
Abstract
Image matting requires high-quality pixel-level human annotations to support the training of a deep model in recent literature. Whereas such annotation is costly and hard to scale, significantly holding back the development of the research. In this work, we make the first attempt towards addressing this problem, by proposing a self-supervised pretraining approach that can leverage infinite numbers of data to boost the matting performance. The pre-training task is designed in a similar manner as image matting, where random trimap and alpha matte are generated to achieve an image disentanglement objective. The pre-trained model is then used as an initialisation of the downstream matting task for fine-tuning. Extensive experimental evaluations show that the proposed approach outperforms both the state-of-the-art matting methods and other alternative self-supervised initialisation approaches by a large margin. We also show the robustness of the proposed approach over different backbone architectures. Our project page is available at https://crystraldo.github.io/dpt-mat/.
Original language | English |
---|---|
Title of host publication | 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 168-177 |
Number of pages | 10 |
ISBN (Electronic) | 9798350318920 |
ISBN (Print) | 9798350318937 (PoD) |
DOIs | |
Publication status | Published - 9 Apr 2024 |
Event | 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 - Waikoloa, United States Duration: 4 Jan 2024 → 8 Jan 2024 |
Publication series
Name | IEEE Workshop on Applications of Computer Vision |
---|---|
Publisher | IEEE |
ISSN (Print) | 2472-6737 |
ISSN (Electronic) | 2642-9381 |
Conference
Conference | 2024 IEEE Winter Conference on Applications of Computer Vision, WACV 2024 |
---|---|
Country/Territory | United States |
City | Waikoloa |
Period | 4/01/24 → 8/01/24 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- Algorithms
- Image recognition and understanding
- Low-level and physics-based vision
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Computer Vision and Pattern Recognition
Projects
- 1 Finished
-
COMPaD: Commercial-Oriented Multi-modal Poster Generation and Design
1/10/23 → 30/09/24
Project: Research Councils