Learning Deep Features for Discriminative Localization
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, Antonio Torralba
Computer Science and Artificial Intelligence Laboratory
MIT
ๅ็ฎ้
Abstract
- Revisit the GAP
- GAP(global average pooling layer)
- Achieve 37.1% top-5 error on ILSVRC 2014
- ILSVRC(Large Scale Visual Recognition Challenge)
Introduction
- convolutional layers can localize objects, but this ability is lost when fully-connected layers
- Network in Network, GoogLeNet avoid fully-connected layers.
- minimize the number of params
- GAP(known as a kind of structural regularizer) doesn't simply act as a regularizer.
- This approach can be easily transferred to other recognition datasets for generic classification, localization and concept discovery.
- achieves 37.1% top-5 test error, close to the fully supervised AlexNet.
Class Activation Mapping
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
Weakly-supervised Object Localization
Setup
- use AlexNet, VGGnet and GoogLeNet to generate *GAP
- remove some layer (fully connected layer and softmax)
- add some convolution layer
- 3 * 3, stride 1, padding 1 with 1024 units.
- GAP layer
- softmax
Results
- Classification
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
- Localization
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
- weakly supervision still have a long way to go.
- Generic Localization
- to test the ability about feature extraction between original network and the network concatenated with GAP by linear SVM
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
- similar
- to test the ability about localization by weakly supervision
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
- It still can find the position of the object.
Deep Features for Generic Localization
Fine-grained Recognition
- with bounding box
-
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
-
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
Pattern Discovery
- Given a set of images containing a common concept, test the network whether can find where the position f the important regions in this images.
- How to identify the important region before train the network to test the network performance.
- use GoogLeNet-GAP network training by image-level label. use SVM weight and GAP to contruct the CAM to identify the important region.
- Experiment
-
Discovering informative objects in the scenes
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
-
Concept localization in weakly labeled images
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
-
Weakly supervised text detector
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
- postive set: picture with text
- negtive set: picture without text
-
Interpreting visual question answering (???)
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
Visualizing Class-Specific Units
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More โ
Conclusion
- CAM fofr CNN with global average pooling.
- enable to visualize hotmap to the given image
- weakly supervision to find localize the object.