Abstract
Benefiting from the inductive biases learned from large-scale datasets, open-vocabulary semantic segmentation (OVSS) leverages the power of vision-language models, such as CLIP, to achieve remarkable progress without requiring task-specific training. However, due to CLIP’s pre-training nature on image-text pairs, it tends to focus on global semantic alignment, resulting in suboptimal performance when associating fine-grained visual regions with text. This leads to noisy and inconsistent predictions, particularly in local areas. We attribute this to a dispersed bias stemming from its contrastive training paradigm, which is difficult to alleviate using CLIP features alone. To address this, we propose a structure-aware feature rectification approach that incorporates instance-specific priors derived directly from the image. Specifically, we construct a region adjacency graph (RAG) based on low-level features (e.g., colour and texture) to capture local structural relationships and use it to refine CLIP features by enhancing local discrimination. Extensive experiments show that our method effectively suppresses segmentation noise, improves region-level consistency, and achieves strong performance on multiple open-vocabulary segmentation benchmarks. Project page: https://qiming-huang.github.io/RAG-OVS/.
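To make the idea of a region adjacency graph concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes a precomputed integer region map (`labels`, with contiguous ids starting at 0 and every id present), uses mean colour as the only low-level descriptor, and swaps in a simple neighbour-averaging step in place of the paper's rectification. The function names `build_rag` and `rectify` and the parameters `sigma` and `alpha` are hypothetical.

```python
import numpy as np


def build_rag(labels, image, sigma=0.1):
    """Build a colour-weighted region adjacency graph.

    labels: (H, W) int array of region ids 0..n-1 (assumed contiguous).
    image:  (H, W, C) float array of low-level features, e.g. colour.
    Returns (means, weights), where weights[i, j] > 0 only if regions
    i and j share a boundary.
    """
    n = int(labels.max()) + 1
    flat = labels.ravel()
    counts = np.bincount(flat, minlength=n).astype(float)
    # Mean colour per region serves as the node descriptor.
    means = np.stack(
        [np.bincount(flat, weights=image[..., c].ravel(), minlength=n) / counts
         for c in range(image.shape[-1])],
        axis=1,
    )
    # Two regions are adjacent if their labels differ across a
    # horizontal or vertical pixel boundary.
    adj = np.zeros((n, n), dtype=bool)
    for a, b in ((labels[:, :-1], labels[:, 1:]), (labels[:-1, :], labels[1:, :])):
        diff = a != b
        adj[a[diff], b[diff]] = True
    adj |= adj.T
    # Edge weight decays with squared colour distance between regions.
    dist2 = ((means[:, None, :] - means[None, :, :]) ** 2).sum(-1)
    weights = np.where(adj, np.exp(-dist2 / sigma), 0.0)
    return means, weights


def rectify(features, labels, weights, alpha=0.5):
    """Blend each region's pooled feature with its graph neighbours.

    features: (H, W, D) dense features (a stand-in for upsampled CLIP features).
    Returns an (H, W, D) array that is constant within each region.
    """
    n, d = weights.shape[0], features.shape[-1]
    flat = labels.ravel()
    counts = np.bincount(flat, minlength=n).astype(float)
    pooled = np.zeros((n, d))
    np.add.at(pooled, flat, features.reshape(-1, d))
    pooled /= counts[:, None]
    # Weighted neighbour average; isolated regions keep their own feature.
    row_sum = weights.sum(1, keepdims=True)
    neigh = np.divide(weights @ pooled, row_sum, out=pooled.copy(), where=row_sum > 0)
    region_feat = (1 - alpha) * pooled + alpha * neigh
    return region_feat[labels]
```

In this sketch, visually similar neighbouring regions pull each other's features together while dissimilar neighbours contribute almost nothing, which is one simple way a structural prior from the image could suppress local noise in dense features.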
| Original language | English |
|---|---|
| Title of host publication | 2026 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) |
| Publisher | IEEE |
| Publication status | Accepted/In press - 11 Nov 2025 |
| Event | 2026 IEEE/CVF Winter Conference on Applications of Computer Vision - JW Marriott Starpass, Tucson, United States; Duration: 6 Mar 2026 → 10 Mar 2026 |
Publication series
| Name | IEEE Workshop on Applications of Computer Vision (WACV) |
|---|---|
| Publisher | IEEE |
| ISSN (Print) | 2472-6737 |
| ISSN (Electronic) | 2642-9381 |
Conference
| Conference | 2026 IEEE/CVF Winter Conference on Applications of Computer Vision |
|---|---|
| Abbreviated title | WACV 2026 |
| Country/Territory | United States |
| City | Tucson |
| Period | 6/03/26 → 10/03/26 |
Fingerprint
Dive into the research topics of 'Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-Free Open-Vocabulary Semantic Segmentation'. Together they form a unique fingerprint.
Projects
- 2 Finished
- Baskerville 2.0: Enhanced Provision for High End and On-Demand Users
  Styles, I. (Principal Investigator)
  Engineering & Physical Science Research Council
  4/01/22 → 3/05/22
  Project: Research Councils
- Baskerville: a national accelerated compute resource
  Cai, B. (Co-Investigator) & Morris, A. (Principal Investigator)
  Engineering & Physical Science Research Council, Lenovo UK Limited
  13/10/20 → 31/03/25
  Project: Research Councils