Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels

# Large-Scale Object Detection in the Wild from Imbalanced Multi-Labels ###### tags: `Survey` `CVPR 2020` ## Link [click](https://openaccess.thecvf.com/content_CVPR_2020/papers/Peng_Large-Scale_Object_Detection_in_the_Wild_From_Imbalanced_Multi-Labels_CVPR_2020_paper.pdf) --- ## Dataset ![](https://i.imgur.com/rl4uFZ3.jpg) --- ## Abstract * Open Images dataset的問題 * **多個標籤** * **多種標籤類別** `比如玩具車，既可以是車也可以是玩具` ![](https://i.imgur.com/7sHP11r.png) * **漏標** `標籤本身結構是層級的，比如水果和蘋果。如果一個物體獲得子類標籤，那麼它會自動有父類的標籤。但如果獲取的是父類，則會導致子類標籤的缺失` 缺失子標籤的比例 ![](https://i.imgur.com/hhv86SW.png) * **混標** `對於混淆類別會隨機性標註，比如torch 和 flashlight，cheetah（獵豹）和leopard（花豹）` 混標比例 ![](https://i.imgur.com/4eVtAGq.png) * **標籤分佈不均衡** `從開源圖片庫收集，有的不常見的類也會被包含，導致了標籤分佈不均勻。` ![](https://i.imgur.com/9tubv7s.png) * 設計一個concurrent softmax來處理multi-label的問題 * 提出softsampling methods處理label不平衡問題 ## Method * Concurrent Softmax `原本的Softmax只適用於single label` ![](https://i.imgur.com/q31Ccx7.png) 主要是分母的變化，對於$j=i$，$e$不變；$j≠i$，乘兩個係數。 * $y$ 是當前bbox對應的類別$j$ * $r_{ij}$是類別$i$和類別$j$之間的相關係數 * 這樣，有$r$的調節，對於某些相關性高的激活輸出就不會被壓制。 $1-y$ 解決了顯示多標籤的問題，$r$ 解決了隱式多標籤的問題。 * Softmax的問題 `ex.` 如果一個bbox是Apple，那麼不應該壓制label為Fruit的概率 * Compared with **BCE** `BinaryCrossEntropy` * BCE is always a popular solution to mutl-label recognition, but it **does not work well** on multi-label detection task * Soft-balance Sampling 取了隨機採樣和按類採樣的折中 * 隨機採樣 $P_{n}(i)={n_{i} \over \sum\limits_{j = 1}^C{n_{j}}}$ * 按類採樣 $P_{a}(i)={1 \over C}$ 因此採取折衷的方式 $P_{a}(i)={1 \over C}={1 \over CP_{n}(i)}P_n(i)=\alpha P_{n}(i)$ ## Conclusion * 67.17 mAP * 4.29 points higher than the best result of Open Images public test 2018. ---