Semi-supervised Semantic Segmentation with Directional Context-aware Consistency - CVPR 2021#

Information

Title: Semi-supervised Semantic Segmentation with Directional Context-aware Consistency, CVPR 2021
Reference
- paper : https://jiaya.me/papers/semiseg_cvpr21.pdf (CVPR 2021)
- code : https://github.55860.com/dvlab-research/Context-Aware-Consistency
Review By: Chanmin Park
Edited by: Taeyup Song
Last updated on Jan. 5, 2022

Problem statement#

위의 그림과 같이 label이 있는 부분을 overlapping을 해서 patch데이터간의 consistency 를 주어지며 이른 context aware consistency라고 명시함
contextual alignment를 주기 위해서 directional contrastive loss를 제시함 이는 pixel wise로 cosine similarity 주게 되는 방법을 의미함.
데이터의 sampling 하는 새로운 방법을 제시함으로 negative sample와 ambiguous한 postive sample을 filtering함

노란색으로 되어진 overlapping region에서 weak augmentation (gaussian noise, color jitter) 했을때와 다른 위치의 patch를 구헀을때의 이미지임
두번째 행에서 보는 바와 feature에서 T-SNE를 적용하면 weak augmentation에서는 feature space가 전혀 바뀌지 않음.

label,target image, unlabel image: $y_{t}, x_{t}, x_{u}$
overlapping image(w/label),non overlapping image(wo/label) : $x_{u 1}, x_{o}, x_{u 2}$
project feature : $Φ$

low level feature projection을 시킨후 upsacaling을 한결과를 label의 영역간의 pixel wise constrative loss를 적용시켜줌
저자는 low level에서 feature를 projection을 시키면 좀더 context에 대해서 학습할수있다고 ablation result를 통해서 보여줌

negative pair의 양을 조절해야되기때문에 pseudo label에서 negative의 prediction을 값을 통해서 filtering을 함

${\tilde{y}}_{u i} = \arg max C (f_{u i}) i \in {1, 2}$

$l_{d c}^{b, n, s} (ϕ_{o_{1}}, ϕ_{o_{2}}) = - \frac{1}{N} \sum_{h, w} M_{d}^{h, w} \cdot \log \frac{r (ϕ_{o_{1}}^{h, w}, ϕ_{o_{2}}^{h, w})}{r (ϕ_{o_{1}}^{h, w}, ϕ_{o_{2}}^{h, w}) + \sum_{ϕ_{n} \in F_{u}} M_{n, 1}^{h, w} \cdot r (ϕ_{o_{1}}^{h, w}, ϕ_{n})}$
Positive 에서도 prediction의 낮은 값의 경우 $γ$ 를 통해서 filtering을 적용하여줌

l_{d c}^{b, n s, p f} (ϕ_{o_{1}}, ϕ_{o_{2}}) = - \frac{1}{N} \sum_{h, w} M_{d, p f}^{h, w} \cdot \log \frac{r (ϕ_{o_{1}}^{h, w}, ϕ_{o_{2}}^{h, w})}{r (ϕ_{o_{1}}^{h, w}, ϕ_{o_{2}}^{h, w}) + \sum_{ϕ_{n} \in F_{u}} M_{n, 1}^{h, w} \cdot r (ϕ_{o_{1}}^{h, w}, ϕ_{n})}

M_{d, p f}^{h, w} = M_{d}^{h, w} \cdot 1 {max C (f_{o_{2}}^{h, w}) > γ}