Conservative-Progressive Collaborative Learning for Semi-supervised Semantic Segmentation

Siqi Fan, Fenghua Zhu, Zunlei Feng, Yisheng Lv, Mingli Song, Fei-Yue Wang

Introduction

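Semi-supervised learning (SSL) mainly follows two typical approaches: entropy minimization and consistency regularization. Both rely on pseudo supervision, so incorrect pseudo labels lead to confirmation bias. To counter this, most methods select trustworthy pseudo labels by prediction score (with a threshold), which can waste a large share of the unlabeled data.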

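Teacher-Student and Student-Student are two typical dual-branch learning schemes. In Student-Student training, the two networks are prone to model coupling, which produces erroneous results and limits performance.
The authors propose Conservative-Progressive Collaborative Learning (CPCL), which runs two networks in parallel with the same architecture but different initializations. The conservative branch uses only high-quality pseudo labels for intersection supervision; the progressive branch uses a large quantity of pseudo labels for union supervision. Pseudo labels are generated from the predictions of both networks, and because the two networks are trained on different knowledge, the coupling problem is reduced.
In addition, the prediction confidence is used for loss re-weighting to cope with the unavoidable noisy pseudo labels.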

Contribution

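CPCL keeps the differing parts through union supervision while finding common ground through intersection supervision, so that conservative evaluation and progressive exploration work together. Because the two networks are trained on different knowledge, the coupling problem is effectively mitigated.
A class-wise disagreement indicator is used to generate pseudo labels for the disagreement region, and the loss is re-weighted by confidence so that noisy pseudo labels do not cause excessively large errors.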

Framework

Problem Definition

$$D_l = \{(X_l^1, G^1), \ldots, (X_l^N, G^N)\}$$

where $N$ is the number of labeled samples.

$$D_u = \{X_u^1, \ldots, X_u^M\}$$

where $M$ is the number of unlabeled samples and $M \gg N$.

$$X_l^n, X_u^m \in \mathbb{R}^{H \times W}, \quad G^n \in \mathbb{R}^{C \times H \times W}$$

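The labeled data is trained in the conventional supervised manner. The paper focuses on how the unlabeled data $D_u$ is used to train two networks with the same architecture but different initializations, $f_{\theta_c}(X)$ and $f_{\theta_p}(X)$.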

Data augmentation

The authors use CutMix as the strong augmentation: a randomly chosen region is cut from one image and pasted onto another.

$$mix(a, b, mask) = (1 - mask) \cdot a + mask \cdot b$$

$$X_s = mix(X_1, X_2, mask)$$

This yields the strongly augmented image $X_s$.
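The `mix` operation can be sketched in NumPy as below. This is a minimal illustration rather than the authors' implementation: the rectangular mask helper `random_box_mask`, its `ratio` parameter, and the toy 4x4 arrays are assumptions made for the example.

```python
import numpy as np

def mix(a, b, mask):
    """Keep `a` where mask == 0 and take `b` where mask == 1."""
    return (1 - mask) * a + mask * b

def random_box_mask(height, width, rng, ratio=0.5):
    """Binary mask with one random rectangle set to 1 (hypothetical helper)."""
    box_h, box_w = int(height * ratio), int(width * ratio)
    top = rng.integers(0, height - box_h + 1)    # upper bound is exclusive
    left = rng.integers(0, width - box_w + 1)
    mask = np.zeros((height, width), dtype=np.int64)
    mask[top:top + box_h, left:left + box_w] = 1
    return mask

rng = np.random.default_rng(0)
x1 = np.zeros((4, 4), dtype=np.int64)   # stand-in for image X_1
x2 = np.ones((4, 4), dtype=np.int64)    # stand-in for image X_2
mask = random_box_mask(4, 4, rng)
x_s = mix(x1, x2, mask)                 # strongly augmented image X_s
```

Because `x1` is all zeros and `x2` is all ones here, `x_s` equals the mask itself, which makes it easy to see which region came from which image.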

The preprocessing is nearly identical for the two branches. Three images $(X_1, X_2, X_s)$ are fed in first. Taking $f_{\theta_c}(\cdot)$ as an example:

$$Y_{cw}^1 \leftarrow \arg\max_y f_{\theta_c}(y \mid X_1)$$

$$Y_{cw}^2 \leftarrow \arg\max_y f_{\theta_c}(y \mid X_2)$$

$$Y_{cs} \leftarrow \arg\max_y f_{\theta_c}(y \mid X_s)$$

Next, CutMix is applied to $Y_{cw}^1$ and $Y_{cw}^2$ to obtain $Y_{cw}$:

$$Y_{cw} = mix(Y_{cw}^1, Y_{cw}^2, mask)$$

The progressive branch produces $Y_{ps}$ and $Y_{pw}$ in the same way, and $Y_{cw}$ and $Y_{pw}$ are then used to generate the pseudo labels.
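As a rough sketch of this step for one branch, the per-pixel argmax and the label-level mix could look as follows. The random score maps stand in for the outputs of $f_{\theta_c}$, and the fixed mask and the helper name `predict_labels` are assumptions for illustration only.

```python
import numpy as np

def mix(a, b, mask):
    """Keep `a` where mask == 0 and take `b` where mask == 1."""
    return (1 - mask) * a + mask * b

def predict_labels(scores):
    """Per-pixel argmax over the class axis of a (C, H, W) score map."""
    return np.argmax(scores, axis=0)

rng = np.random.default_rng(0)
num_classes, height, width = 3, 4, 4
# Stand-ins for f_theta_c(X_1) and f_theta_c(X_2); a real model would produce these.
scores_1 = rng.normal(size=(num_classes, height, width))
scores_2 = rng.normal(size=(num_classes, height, width))

mask = np.zeros((height, width), dtype=np.int64)
mask[1:3, 1:3] = 1                    # must be the same mask that built X_s

y_cw_1 = predict_labels(scores_1)     # Y_cw^1
y_cw_2 = predict_labels(scores_2)     # Y_cw^2
y_cw = mix(y_cw_1, y_cw_2, mask)      # mixed weak pseudo label Y_cw
```

Reusing the mask from the image-level mix keeps $Y_{cw}$ spatially aligned with $X_s$, so each pixel of the mixed label refers to the image it actually came from.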

pseudo label generation

$$agreement = \begin{cases} True, & y_{cw}^i = y_{pw}^i \\ False, & \text{otherwise} \end{cases}$$

For each pixel, if the two networks give the same prediction the pixel is marked as agreement; otherwise it is marked as disagreement.
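The per-pixel test is an element-wise comparison of the two label maps. A minimal sketch with made-up 2x3 label maps:

```python
import numpy as np

# Toy label maps standing in for Y_cw and Y_pw (values are class indices).
y_cw = np.array([[0, 1, 2],
                 [1, 1, 0]])
y_pw = np.array([[0, 2, 2],
                 [1, 0, 0]])

agreement = (y_cw == y_pw)      # True where both branches predict the same class
disagreement = ~agreement       # the remaining pixels
```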

Agreement

$$I_a^i = y_{cw}^i, \quad \text{if } y_{cw}^i = y_{pw}^i$$

$I_a^i$ is the $i$-th pixel of the agreement part $L_a$. $L_{inter}$ uses $L_a$ directly as the pseudo label.
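One way to materialize the agreement part as a label map is sketched below. The note does not say how pixels outside the agreement region are represented; marking them with an ignore index of 255 is an assumption borrowed from common segmentation practice, not something stated in the source.

```python
import numpy as np

IGNORE_INDEX = 255  # assumed marker for pixels excluded from supervision

y_cw = np.array([[0, 1, 2],
                 [1, 1, 0]])
y_pw = np.array([[0, 2, 2],
                 [1, 0, 0]])

agreement = (y_cw == y_pw)
# Keep the shared prediction on agreement pixels, mask out everything else.
l_a = np.where(agreement, y_cw, IGNORE_INDEX)
```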

Disagreement

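First, a class-wise pseudo-label indicator is computed. The vertical and horizontal axes of the matrix in the figure are the predictions of the two networks, and $M_{j,k}$ is the total number of pixels that network A predicts as class $j$ and network B predicts as class $k$. The green diagonal cells count the agreement predictions per class, and the red cells count the disagreements.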
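The count matrix $M$ can be built in one pass over the two label maps. The sketch below is only an illustration on toy data: the function name `prediction_matrix` is hypothetical, and the indicator derived from $M$ is not reproduced because its formula is not given in the text.

```python
import numpy as np

def prediction_matrix(pred_a, pred_b, num_classes):
    """M[j, k] = number of pixels network A labels j and network B labels k."""
    flat = pred_a.ravel() * num_classes + pred_b.ravel()   # encode each (j, k) pair
    counts = np.bincount(flat, minlength=num_classes * num_classes)
    return counts.reshape(num_classes, num_classes)

y_cw = np.array([[0, 1, 2],
                 [1, 1, 0]])
y_pw = np.array([[0, 2, 2],
                 [1, 0, 0]])

m = prediction_matrix(y_cw, y_pw, num_classes=3)
agree_per_class = np.diag(m)             # diagonal cells: agreement counts
disagree_total = m.sum() - np.trace(m)   # off-diagonal cells: disagreements
```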

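$I_d^i$ is the final pseudo label of the $i$-th pixel, where network A predicts class $j$ and network B predicts class $k$. Based on the computed indicator, if $I_j \geq I_k$, then $y_{cw}^i$ is chosen as the pseudo label of that pixel; if $I_j < I_k$, then $y_{pw}^i$ is chosen.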
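Given a per-class indicator, the selection rule is a vectorized comparison. In the sketch below the indicator values are made up and passed in as an input, since the formula that produces them is not given in the text; `resolve_disagreement` is a hypothetical name.

```python
import numpy as np

def resolve_disagreement(y_cw, y_pw, indicator):
    """Per pixel: take y_cw if indicator[j] >= indicator[k], otherwise y_pw."""
    keep_cw = indicator[y_cw] >= indicator[y_pw]   # look up I_j and I_k per pixel
    return np.where(keep_cw, y_cw, y_pw)

y_cw = np.array([[0, 1, 2],
                 [1, 1, 0]])
y_pw = np.array([[0, 2, 2],
                 [1, 0, 0]])
indicator = np.array([0.2, 0.5, 0.9])   # made-up per-class values, illustration only

l_d = resolve_disagreement(y_cw, y_pw, indicator)
```

On agreement pixels both branches give the same class, so the rule returns that class unchanged and only the disagreement pixels are actually decided by the indicator.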
