---
title: Artificial Intelligence Final
---
# Short-Answer Questions
## What is softmax used for?
* Softmax converts a vector of raw scores (logits) into a probability distribution over classes (every output lies in $(0,1)$ and the outputs sum to 1), so it is commonly used as the output layer for multi-class classification.

## What are three supervised CNN model use cases?
* Image classification, object detection, and instance segmentation.

## What are the three major issues you need to learn when you study a neural network model?
* Network architecture
* Activation function
* Learning rule

## Basic CNN architecture can be divided into two stages. What are these stages, and what are the functions of the corresponding two stages?
* Convolutional layers + pooling layers
    * Feature extraction
* Fully connected layers
    * Mapping of feature maps to target labels (classification)

# Calculation Problems
## TLU
![](https://i.imgur.com/aGOetOz.png)
### Original formulation
* Activation: $a = w_1x_1 + w_2x_2 + w_3x_3 + \dots$
* $y = 1$ if $a \geq \theta$, else $y = 0$
* Weight update: $w' = w + \alpha(t-y)V$
* $\alpha$ is the learning rate and $V$ is the input vector, e.g. $x_1 = 1, x_2 = 0$

#### TLU example
* 2-input OR gate (output is 1 if at least one input is 1)
* Initial weights $w_1 = 1, w_2 = 2$, threshold $\theta = 2$, learning rate $\alpha = 0.5$
* Time = 1: input $(0,0)$, target $t = 0$
    * $0 \times 1 + 0 \times 2 = 0$
    * $a < \theta \rightarrow y = 0$
    * $t - y = 0$, no change
* Time = 2: input $(0,1)$, target $t = 1$
    * $a = 2 \geq \theta \rightarrow y = 1 = t$, no change
* Time = 3: input $(1,0)$, target $t = 1$
    * $1 \times 1 + 0 \times 2 = 1$
    * $a < \theta \rightarrow y = 0$
    * $w_1' = 1 + 0.5(1-0) \times 1 = 1.5$
    * $w_2' = 2 + 0.5(1-0) \times 0 = 2$
* ...
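This train-until-fit loop can be sketched in plain Python (a minimal sketch assuming the initial weights, threshold, and learning rate of 0.5 from the example above; `train_tlu` is an illustrative helper name, not from the course):

```python
# Minimal TLU (threshold logic unit) trained on the 2-input OR gate,
# using the update rule w' = w + lr * (t - y) * x with a fixed threshold.
def train_tlu(samples, w, theta, lr=0.5, max_epochs=100):
    for _ in range(max_epochs):
        changed = False
        for x, t in samples:
            a = sum(wi * xi for wi, xi in zip(w, x))  # activation
            y = 1 if a >= theta else 0                # threshold output
            if y != t:
                w = [wi + lr * (t - y) * xi for wi, xi in zip(w, x)]
                changed = True
        if not changed:  # every sample already classified correctly
            return w
    return w

or_gate = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
w = train_tlu(or_gate, w=[1, 2], theta=2, lr=0.5)  # converges to [2.0, 2]
```

With these starting values the loop stops after a few epochs, once $w_1$ has been nudged up to the threshold so that input $(1,0)$ fires.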
Continue until the weights satisfy all conditions.
#### Answer
* No, because the activation condition is reversed
* Change the learning rule to $w' = w - \alpha(t-y)V$

## Perceptron
![](https://i.imgur.com/ws2hmYM.png)
### a
* A 2-layer perceptron with 2 inputs and 2 outputs
### b
* No, the classes are not linearly separable by one line
### c
* $\Sigma s_1 = 0.2*1 + (-0.5)*1 = -0.3$
* $\Sigma s_2 = (-0.7)*1 + 0.5*1 = -0.2$
* $w_{1,1}' = w_{1,1} + 0.5(0-0)*1 = 0.2$
* $w_{1,2}' = w_{1,2} + 0.5(0-0)*1 = -0.5$
* $w_{2,1}' = w_{2,1} + 0.5(1-0)*1 = -0.2$
* $w_{2,2}' = w_{2,2} + 0.5(1-0)*1 = 1$

#### Reference
* ![](https://i.imgur.com/ZZdsgMh.png)
* Neural Networks and Learning Machines, Simon Haykin, 3rd ed., Pearson, 2009, p. 85 (PDF)
* $y(n)$ is the output after the activation function

## Backpropagation
![](https://i.imgur.com/d2E9dgl.png)
### a
* Input layer: 2 nodes, hidden layer: 2, output layer: 2
### b
![](https://i.imgur.com/oL9Hor1.png)
* Note
    * $w$ is the weight
    * $a_j$ is the activated output of node $j$: $a_j = f(\Sigma_i w_{ji}a_i)$
    * $\eta$ is the learning rate
    * $\delta_j = (d_j-a_j)*f'(s_j) = (d_j-a_j)a_j(1-a_j)$
    * $\Delta w_{ji} = \eta \delta_j a_i$

| $w_{31}$ | 0.1 | $w_{32}$ | 0.2 |
| -------- | --- | -------- |:---:|
| $w_{41}$ | 0.3 | $w_{42}$ | 0.4 |
| $w_{53}$ | 0.5 | $w_{54}$ | 0.6 |
| $w_{63}$ | 0.7 | $w_{64}$ | 0.8 |

* Input = $(1,1)$

#### Forward
* $a_3 = f(1*0.1+1*0.2) = 0.57$
* $a_4 = f(1*0.3+1*0.4) = 0.67$
* $a_5 = f(0.57*0.5+0.67*0.6) = 0.67$
* $a_6 = f(0.57*0.7+0.67*0.8) = 0.72$

#### $\delta$s
* Target $(d_5, d_6) = (0,1)$
* $\delta_6 = (d_6-a_6)a_6(1-a_6) = (1-0.72)0.72(1-0.72) = 0.0564$
* $\delta_5 = (0-0.67)0.67(1-0.67) = -0.1481$
* $\delta_4 = (\delta_5w_{54}+\delta_6w_{64})*f'(s_4) = (-0.1481*0.6+0.0564*0.8)0.67(1-0.67) = -0.0097$
* $\delta_3 = (\delta_5w_{53}+\delta_6w_{63})*f'(s_3) = (-0.1481*0.5+0.0564*0.7)0.57(1-0.57) = -0.0085$

#### $\Delta w$
* $\Delta w_{64} = \eta \delta_6 a_4 = 0.5*0.0564*0.67 = 0.01889$
* $\Delta w_{63} = 0.5*0.0564*0.57 = 0.0161$
* $\Delta w_{54} = 0.5*(-0.1481)*0.67 = -0.0496$
* $\Delta w_{53} = 0.5*(-0.1481)*0.57 = -0.0422$
* $\Delta w_{42} = 0.5*(-0.0097)*1 = -0.0049$
* $\Delta w_{41} = 0.5*(-0.0097)*1 = -0.0049$
* $\Delta w_{32} = 0.5*(-0.0085)*1 = -0.0043$
* $\Delta w_{31} = 0.5*(-0.0085)*1 = -0.0043$

#### Updated weights $w' = w + \Delta w$

| $w'_{31}$ | 0.0957 | $w'_{32}$ | 0.1957 |
| --------- | ------ | --------- |:------:|
| $w'_{41}$ | 0.2951 | $w'_{42}$ | 0.3951 |
| $w'_{53}$ | 0.4578 | $w'_{54}$ | 0.5504 |
| $w'_{63}$ | 0.7161 | $w'_{64}$ | 0.8189 |

### Verification (TensorFlow)
https://colab.research.google.com/drive/1uWPPby020fEdusBBwIwjyLRyArkqoEMv?usp=sharing

## CNN
![](https://i.imgur.com/33nJT5m.png)
#### Architecture
![](https://i.imgur.com/RrRLjLd.png)
### a: Forward
* TensorFlow check: https://github.com/JINSCOTT/MLHomework/blob/0cd3071fed50c104864d2fe2b8d8b4fbad5122ed/cnn_maxpooling_backpropagate.ipynb
#### CNN
![](https://i.imgur.com/v2mP8ix.png)
![](https://i.imgur.com/JTxBvoZ.png)
#### Pooling
![](https://i.imgur.com/Wtk6M54.png)
#### Linear
* $a_5 = f(0.1*80 + (-0.05)*90 + 0.05*20 + (-0.02)*60) = 0.9644$
* $a_6 = f(0.05*80 + (-0.02)*90 + 0.03*20 + (-0.07)*60) = 0.1978$
* $a_7 = f(-0.4*0.9644 + (-1)*0.1978) = 0.3581$
* $a_8 = f(0.5*0.9644 + (-0.5)*0.1978) = 0.5947$
#### Softmax
![](https://i.imgur.com/LiAYrck.png)
* $s_1 = 0.4411$
* $s_2 = 0.5589$

### Backward
* Because of the softmax layer, the output-layer delta is computed as:
    * $\delta_j = (t_j-y_j)(y_j-y_j^2)a_j(1-a_j)$
    * $t_j$ is the target and $y_j$ is the softmax output
    * Everything else is unchanged
#### Sigmoid $\delta$s
* Target $(1,0)$
* $\delta_8 = (0-0.5589)(0.5589-0.5589^2)0.5947(1-0.5947) = -0.0332$
* $\delta_7 = (1-0.4411)(0.4411-0.4411^2)0.3581(1-0.3581) = 0.0317$
* $\delta_6 = (\delta_8w_{86}+\delta_7w_{76})*f'(a_6) = (-0.0332*(-0.5)+0.0317*(-1))0.1978(1-0.1978) = -0.0024$
* $\delta_5 = (\delta_8w_{85}+\delta_7w_{75})*f'(a_5) = (-0.0332*0.5+0.0317*(-0.4))0.9644(1-0.9644) = -0.001$
* $\delta_4 = (\delta_6w_{64}+\delta_5w_{54})*f'(a_4) = (-0.0024*(-0.07)-0.001*(-0.02))*1 = 0.000188$
* $\delta_3 = (\delta_6w_{63}+\delta_5w_{53})*f'(a_3) = (-0.0024*0.03-0.001*0.05)*1 = -0.000122$
* $\delta_2 = (\delta_6w_{62}+\delta_5w_{52})*f'(a_2) = (-0.0024*(-0.02)-0.001*(-0.05))*1 = 0.0001$
* $\delta_1 = 
(\delta_6w_{61}+\delta_5w_{51})*f'(a_1) = (-0.0024*0.05-0.001*0.1)*1 = -0.00022$

#### Sigmoid $w + \Delta w$
* Learning rate = 0.5
* $w_{8,6} = -0.5 + 0.5*(-0.0332)*0.1978 = -0.5033$
* $w_{8,5} = 0.5 + 0.5*(-0.0332)*0.9644 = 0.4839$
* $w_{7,6} = -1 + 0.5*0.0317*0.1978 = -0.9968$
* $w_{7,5} = -0.4 + 0.5*0.0317*0.9644 = -0.3847$
* $w_{6,4} = -0.07 + 0.5*(-0.0024)*60 = -0.142$
* $w_{6,3} = 0.03 + 0.5*(-0.0024)*20 = 0.006$
* $w_{6,2} = -0.02 + 0.5*(-0.0024)*90 = -0.128$
* $w_{6,1} = 0.05 + 0.5*(-0.0024)*80 = -0.046$
* $w_{5,4} = -0.02 + 0.5*(-0.001)*60 = -0.05$
* $w_{5,3} = 0.05 + 0.5*(-0.001)*20 = 0.04$
* $w_{5,2} = -0.05 + 0.5*(-0.001)*90 = -0.095$
* $w_{5,1} = 0.1 + 0.5*(-0.001)*80 = 0.06$

#### Upsampling
* Reverse max-pool
![](https://i.imgur.com/WvlexvF.png)
* Reverse ReLU
![](https://i.imgur.com/sF3hNme.png)
![](https://i.imgur.com/FAmkRSA.png)
![](https://i.imgur.com/bTw8AA5.png)

## IoU Calculation

# Chapter 6: Object Detection
![](https://i.imgur.com/LJCbGb3.png)
## Difference between one-stage and two-stage detectors
![](https://i.imgur.com/ERYnhFT.png)
## R-CNN
![](https://i.imgur.com/tNq1kYe.png)
* Slow, because a feature map is computed repeatedly for every region proposal
* Hard to optimize
* The components have to be trained separately

## YOLO family
* Single-shot detectors
### YOLOv1
* Backbone: based on GoogLeNet
* Unified detection:
    * ![](https://i.imgur.com/TC7TCtA.png)
* Non-Maximum Suppression (NMS) selects the bounding box that best encloses an object, i.e. the one with the best Intersection over Union (IoU)
    * ![](https://i.imgur.com/oKmv12s.png)
* Pros: fast, relatively simple to train
* Cons:
    * Each grid cell can predict only one class (and at most two boxes), so detection of crowded and small objects is poor
    * Bounding boxes handle only a fairly fixed range of object aspect ratios
### YOLOv2
* Backbone changed to Darknet-19
* Replaces the fully connected layers with average pooling
* More bounding boxes (5) per grid cell (better for small and occluded objects)
* Multi-scale training on each batch
### YOLOv3
* Darknet-53
* Residual learning (the input is added to the block output)
* ![](https://i.imgur.com/KT3fHPK.png)
### YOLOv4
* CSPDarknet53
* Neck: SPP (Spatial Pyramid Pooling) and PANet (Path Aggregation Network)
* Head: YOLO layer
* Bounding box regression loss
    * Four variants: CIoU, GIoU, DIoU, MSE
    * IoU loss
    * CIoU (Complete-IoU) loss
* Regularization
* Data augmentation
    * CutMix: paste a patch of another image into the image
    * Mosaic data augmentation: combine four images into one
### YOLOv5
* More mosaic data augmentation
* GIoU (Generalized-IoU) loss
* PANet only
* Implemented in PyTorch
### YOLOX
* Anchor-free
    * Faster training/inference speed
    * No need to determine anchor parameters
![](https://i.imgur.com/YNLtPyF.png)
* Decoupled head
    * ![](https://i.imgur.com/9nyC9wZ.png)
### Others
* YOLOF/R/S/P

# Chapter 7: Instance Segmentation
## R-CNN family
### R-CNN
![](https://i.imgur.com/8DkioFL.png)
### Fast R-CNN
![](https://i.imgur.com/D1ZNvGP.png)
* Still uses selective search
### Faster R-CNN
* Sped up with a Region Proposal Network
![](https://i.imgur.com/FDsoLJl.png)
### Mask R-CNN
* Extends Faster R-CNN with a segmentation branch
![](https://i.imgur.com/ZrzwXhS.png)

## SOLO family
![](https://i.imgur.com/uHtPEqt.png)
### SOLO
![](https://i.imgur.com/gYbInPh.png)
### SOLOv2
#### Differences from v1
* Object mask generation is decoupled into mask kernel prediction and mask feature learning, which are responsible for generating the convolution kernels and the feature maps to be convolved with, respectively
* Predicts high-resolution object masks
* SOLOv2 significantly reduces inference overhead with its Matrix NMS (non-maximum suppression) technique
![](https://i.imgur.com/2elsXVi.png)
* Dynamic convolutions
    * ![](https://i.imgur.com/YaK406d.png)
    * More flexible
    * 2D offsets are added to the regular grid sampling locations of the standard convolution, enabling free-form deformation of the sampling grid

# Outdated (unsupervised learning is no longer covered)
## List the three most popular types of generative models.
* Variational autoencoders
* PixelRNN/PixelCNN
* Generative adversarial networks (GAN)
## What is the main difference between Supervised Learning and Unsupervised Learning networks?
* Supervised learning
    * Data: data and labels
    * Goal: map input data to labels
* Unsupervised learning
    * Data: data, no labels
    * Goal: learn some underlying structure of the data

## Competitive network
![](https://i.imgur.com/JvNTmGa.png)
## SOFM
![](https://i.imgur.com/7LH5Nwv.png)
## Cross entropy and Gradient Descent Method
![](https://i.imgur.com/bOCX2yO.png)

###### tags: `Artificial Neural Networks and Deep Learning` `CSnote`

<style>
.navbar-brand:before {
    content: ' NTPU × ';
    padding-left: 1.7em;
    background-image: url(https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcTIby6KbceSTNEzAnqE8sMzZMXgAAPJsSDhdu4d16f03Q);
    background-repeat: no-repeat;
    background-size: contain;
}
.navbar-brand > .fa-file-text {
    padding-left: 0.1em;
    display: none;
}
</style>
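As a numerical cross-check tying the cross-entropy material back to the CNN softmax computation earlier in these notes (a minimal plain-Python sketch; the `softmax` helper is illustrative, not from the course): softmax of the output activations $(a_7, a_8) = (0.3581, 0.5947)$ reproduces $s_1 = 0.4411$ and $s_2 = 0.5589$, and for the standard softmax + cross-entropy loss the gradient with respect to each logit is simply $y_j - t_j$.

```python
import math

def softmax(logits):
    # Subtract the max for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Output activations (a7, a8) from the CNN worked example above.
probs = softmax([0.3581, 0.5947])  # rounds to [0.4411, 0.5589]

# Standard softmax + cross-entropy identity: dL/dlogit_j = y_j - t_j.
target = [1, 0]
grad = [y - t for y, t in zip(probs, target)]
```

Rounding `probs` to four decimals gives the $s_1 = 0.4411$, $s_2 = 0.5589$ used in the worked example.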