問答題
What is softmax used for?
What are three supervised cnn model usage?
What are the three major issues you need to learn when you study a neural network model?
- Network Architecture
- Activation function
- Learning rule
Basic CNN architecture can be divided into two stages, what are these stages? What are the functions of the corresponding two stages?
- Convolutional layers+ pooling layer
- Fully connected layer
- Mapping of feature maps to target labels(分類)
計算題
TLU
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
原版
- Activation =
- Y=1 if , else y=0
- Update weight =
- is the learning rate, V is the input vector for example
TLU example
- 2 input or gate(if one of the input is true )
- Initial Weight
- Time = 1
- Time = 2
- Time =3
- … Continue until weight fits all condition
Answer
- No, because Activation condition is reversed
- change learning rule to
Perceptron
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
a
- 2 layer, 2 input 2 output percetron
b
- No , not linearly seperable by one line
c
*
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
* Neural Networks and Machine Learning, Simon Haykin, 3nd ed., Pearson, 2009 p85(pdf)
*
Back propogation
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
a
- Input layer 2, hidden 2, output layer 2
b
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
- Note
- is the weight
- is the result of activation on the node
- is the learning rate
|
0.1 |
|
0.2 |
|
0.3 |
|
0.4 |
|
0.5 |
|
0.6 |
|
0.7 |
|
0.8 |
Forward
s
update weight
|
0.0957 |
|
0.1957 |
|
0.2951 |
|
0.3951 |
|
0.4578 |
|
0.5504 |
|
0.7161 |
|
0.8189 |
驗證 tensorflow
https://colab.research.google.com/drive/1uWPPby020fEdusBBwIwjyLRyArkqoEMv?usp=sharing
CNN
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
Architecture
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
A forward
CNN
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
Pooling

Linear
Softmax

Backward
* 因為有softmax層 outputlayer算法為:
*
* 為target,為softmax的結果
* 其他不變
sigmoid s
sigmoid
upsampling
- reverse maxpool

- reverse relu



IOU Calculate
chapter 6 Object detection

Difference between one stage and two stage

R-CNN

- 重複製作對象feature map造成速度較慢
- Hard to optimize
- Have to be trained seperately
yolo family
Yolov1
- backbone: based on GoogLeNet
- Unified detection:

- Non-Maximum Suppression (NMS) 用來選去包圍物件最佳的 Bounding box.獲得最佳的 Intersection over Union (IoU)

- 優點: 快、 訓練較簡單
- 缺點:
- 一個Grid只能有一(或2)個Class,因此對於擁擠及較小的物間偵測能力較差
- bounding box 對於物件的aspect ratio 較為固定
Yolov2
- 更換 backbone 為 Darknet 19
- Remove the fully connected layers with average pooling
- More BBOX(5) in each grid cell(better at small and occlusion)
- multi - scale training on each batch
Yolov3
- Darknet 53
- Residual learning(input is combined with block output)

Yolov4
- CSPDarknet53
- Neck: SPP(splatial pyramid) and PANet(, Path Aggregation Network)
- Head: YOLO layer
- Bounding box regression loss
- CIoU, GIoU, DIoU, MSE 四種
- IOU Loss
- CIoU (Complete-IoU) Loss
- Regularization
- Data augmentation
- Cut mix: 在圖中放另一張圖
- Mosaic data augmentation: 將四張圖組合唯一
Yolov5
- More mosaic data augment
- GIoU (Generalized-IoU) Loss
- PANet only
- Implement in pytorch
YOLOX
- Anchor free
- Faster training/inference speed
- Do not need to determine anchor parameters

- Decoupled head

Else
Chapter 7 Instance segmentation
R-CNN

Fast R-CNN

- Still Use Selective search
Faster R-CNN
- Speed up with region proposal network

Mask R-CNN
- Extends Faster R-CNN with segmentation

SOLO family

SOLO

SOLOV2
difference fromv1
- The object mask generation is decoupled into a mask kernel prediction and mask feature learning, which are responsible for generating convolution kernels and the feature maps to be convolved with, respectively.
- Predict high-resolution object masks
- SOLOv2 significantly reduces inference overhead with matrix non-maximum suppression (NMS) technique.

- Dynamic Convolutions

- More flexible
- It adds 2D offsets to the regular grid sampling locations in the standard convolution. It enables free form deformation of the sampling grid
outdated (no un-supervised)
List three most popular types of generative models.
- Variational Autoencoders
- Pixel RNN/CNN
- Generative adversarial network(GAN)
What is the main difference between Supervised Learning and Un-supervised Learning networks?
- Supervised learning
- Data: Data and label
- Goal: Map input data to label
- Un-supervised learning
- Data: Data, no labels
- Goal: Learn some underlying structure of the data
Competitive network

SOFM

Cross entropy and Gradient Descent Method
