2022 Computer Vision Midterm Exam

# NTU Computer Vision Midterm Exam ###### tags: `Course` > [toc] ## 歷年期中考共筆 - [2021](https://hackmd.io/@ypMXyOIoTCm6PRSX8z3gVQ/HJgKKVlEt) - [2020](https://hackmd.io/eG2UKqDdSQuXyTlB9uFPVQ) - [2019](https://hackmd.io/UhyNov3ASWm6DdsFvIUcwQ) ## 歷年期中考古題 * [2021](https://hackmd.io/@KC-YEN/r18ACfFtK) * [2019](https://www.ptt.cc/bbs/NTU-Exam/M.1572947541.A.F16.html) * [2017](https://www.ptt.cc/bbs/NTU-Exam/M.1510142305.A.B04.html) ## 重要金句 - 授課教師: 傅楸善 - 金句: 笑話比本文更重要 - To err is human, to forgive divine: 人非聖賢，孰能無過 ## Chapter 1: Overview ### 未出過的名詞解釋 - computer vision: 用電腦視覺模擬人類視覺，可用於物體辨識 - vanishing point: 3D中平行的物體在2D時會在遠處交會，可用於繪圖時做出遠近感 - extremal sharpening: 讓亮的地方更亮、暗的地方更暗，可用於影像銳利化 ### 出過的名詞解釋or高機率出的 - Shape from texture: 用物品材質、紋理來反映出三維空間的形狀，可用於電腦繪圖展現3d感覺 - Shape from shading: 用陰影濃淡、明暗來反映出三維空間的形狀，可用於電腦繪圖展現3d感覺 - 電腦視覺的主要應用 - Vision-guided robot assembly: 以電腦視覺輔助機器人組裝，為電腦視覺的主要用途之一，例如機器人在焊接組裝汽車，用電腦視覺來指引鎖螺絲的位置 - Inspection tasks: 檢測任務，為電腦視覺的主要用途之一，例如用電腦視覺來確定物體表面沒有瑕疵，SART等 - SART (Simultaneous Algebraic Reconstruction Technique) - FBP (filtered back projection) - Pattern recognition: 從圖片中尋找特定的模式，為電腦視覺的主要用途之一，例如協助林務局去分析地景資料，尋找拍攝到的照片有多少面積的水稻等 - Alignment: 協助判斷物件是否對齊，為電腦視覺的主要用途之一，例如製作晶片時用於判斷位置 - Measurement: 量測，為電腦視覺的主要用途之一，可量測如長度、面積等 - Atomic image feature (最小不可分割的特徵單位，如edge, corner, hole….) - Edge: boundaries between objects and background, 影像中一邊亮一邊暗的部分 - Corner: the intersection of two edges - Hole: if $S_1$ is the hole of $S_2$ => (Edge of $S_1$) $\subseteq$ (Edge of $S_1$) and $S_1$ $\not\subseteq$ $S_2$ - Peak: 把亮度看成高度的話，peak就是中間高亮的部分，可用於影像判斷如醫學影像等 - Pit: 把亮度看成高度的話，pit就是中間暗的部分，可用於影像判斷如醫學影像等 - Ridge: 把亮度看成高度的話，中間亮兩邊暗，可用於影像判斷如醫學影像等 - Valley: 把亮度看成高度的話，中間暗兩邊亮，可用於影像判斷如醫學影像等 - Composite feature: 結合數個atomic image後的影像特徵 - Arcs: edge or ridge pixels linked together，為composite feature的一種 - Region: connected sets of pixels with similar properties，為composite feature的一種 - Recognition methodology - Image formation: 影像如何形成的，例如是用透視投射法還是正投影法 - Conditioning: 利用模型去除無用的雜訊，可用來降噪 - Labeling: 將相同特性的像素給予同樣的編號，可用於邊緣檢測、自駕車、機器人等 - Grouping: 識別同一類事件的最大連接像素，利於將群組化後的資訊進行一些計算，可用於segmentation分割 - Extracting: 對grouping好的set of pixels，計算特性，例如centroid, orientation等等 - Matching: 給定特定的三維物件或二維的形狀，在圖形中找出相對應的事件，可用在template matching - Range image: 在一張2d 影像中的每一個pixel存放深度值（depth），可用於重建3d模型 - Pixel: 圖片的基本單位，包含位置與數值 - Feature: 影像中的特殊結構，可用於feature recognition - Orientation: 影像中物體的方向，加上position可用於判斷物體的形狀與位置 - gray level: 灰階亮度，pixel所含的量值，為0(black)-255(white) for a 8 bit pixel ## Chapter 2: Thresholding and Segmantation - 考圖 - 經過演算法作用後的圖 - 聯通圖演算法的名字、內容 - 疊代法 ### 未出過的名詞解釋 - binary value 1: 二進制影像中的數值為1的pixel，通常被認為是物體的一部分 - binary value 0: 二進制影像中的數值為0的pixel，通常被認為是背景的一部分 - binary machine vision: 二元機器視覺，對二元影像做分析（非灰階影像） - ocr: optical character recognition，光學判斷垂直水平有幾個pixel為1 - micr: magnetic ink character recognition，磁性墨水去讀垂直水平有幾個pixel為1 ### 出過的名詞解釋or高機率出的 - thresholding: 將灰階影像轉為二元影像 - B(r,c)=1 if I(r,c) $\geq$ intensity threshold - intensity histogram: X軸為0-255灰階值，Y軸為影像中灰階值出現的次數，可用來決定有效的intensity threshold - connected components labeling: 做分組，可將二元影像中同樣為1且相鄰的像素劃成同一個group，可用於辨識潛在為相同物件的像素 - connected components analysis: - 1. 用connected component labeling對binary value 1的區域做label - 2. 將相連的label給定相同的標籤 - 3. decision making - label: 給定像素特定的編號，可用於辨識物件的區域 - Pixel property: 像素的屬性，位置、灰階值與亮度 - Region property: 區域的屬性，形狀、位置、bounding box - 4-connected: 四連通，東西南北，以線做連接 - 8-connected: 八連通，東西南北、東北、西南.....，以點做連接 - border: 二元影像中，1和0的交界 - signature segmentation: 該方向上所有為1的像素加起來的數量，稱為該方向上的投影，大多會拿兩個彼此差90度的投影做成一個長方形，投影完之後，若整行、列為0，則捨棄並切割成子影像，因此在數次重複後，會得到數個子影像，他們就是可能的物件區域 - signature: 投影，寫上面 - segmentation: 分割，寫上面 ### Binary Thesholding Algorithms - **Minimizing within-group variance (Otsu):** 最小化組內變異，組間差異越大越好，越能區分群體，目標是找到 $t$ 去minize組內變異 (p.15) - **Minimizing kullback information distance:** 最小化 kullback 有效散度J，假設目標與背景都常態分佈，目標也是找到 $t$ 去 minimize 有效散度 $J$ (p.17) ### ==Connected Component Algorithms== - ==**Iterative Alogorithm:**== - initialize each pixel to a unique label - iterate top-down followed by bottom-up passes until no change (上到下->下到上->重複直到收斂) - **Classical Alogorithm:** - first top-down pass: performs label propagation as iterative algorithm (第一次從上到下傳播跟迭代法一樣) - when two different labels propagate to the same pixel, the smaller label propagates and equivalence entered into table (如果當有兩個標籤傳播到同一個像素，會挑小的標籤，同時將標籤計入到==等價表==中) - equivalence classes are found by transitive closure (找到等價表中一樣的標籤，並且挑出最小的) - second top-down pass: performs a translation (挑出最小的標籤，去取代等價的標籤) - **A Space Efficient ==Two-Pass Algorithm== That Uses a Local Equivalence Table:** - 一行行處理，邊傳播邊檢查 - small table stores only equivalences from current and preceding lines - maximum number of equivalences = image width - relabel each line with equivalence labels when equivalence detected - **An Efficient ==Run Length== Implementation of the Local Table Method:** - 用一連串 1 跟 0 去紀錄整串的 pixel value - 用 table 去紀錄每一行每個 1 的 start 和 end - 當 p 結束早於 q 的開始或 q 結束早於p的開始->沒有交集 - 當結束晚於開始->有交集，進行label取代 ## Chapter 3: Region Analysis - border pixel: neighboring pixel outside the region - perimeter pixel - border pixel is 8-connected，perimeter pixel is 4-connected $P_4=\{(r, c)\in R \mid N_8(r,c) \setminus R \neq \varnothing \}$ (8-connected 有一點在外面) - border pixel is 4-connected，perimeter pixel is 8-connected $P_8=\{(r, c)\in R \mid N_4(r,c) \setminus R \neq \varnothing \}$ (4-connected 有一點在外面) - $P=\langle(r_0, c_0), ..., (r_{k-1}, c_{k-1})\rangle$ (起點=終點，扣掉終點) - ==length perimeter==: (水平+鉛直) + 對角線 * $\sqrt{2}$ - ==co-occurrence matrix (共生矩陣) (p29)==: $P_{i,j}$ 看原始影像有多少組滿足 $(i,j)$ 的 pattern - $P_H$: 水平 (24組) - $P_V$: 鉛直 (24組) - $P_{SD}$: 45 度 (18組) - $P_{LD}$: 135 度 (18組) - 8 distinct extremal points’ names of the bounding rectangle ({ }most { }): - topmost right - rightmost top - rightmost bottom - bottommost right - bottommost left - leftmost bottom - leftmost top - topmost left ## Chapter 4: Statistical Pattern Recognition - p54 - false alarm rate (假警報率): $P(b|g)=\frac{P(g, b)}{P(g)}=\frac{P(g, b)}{P(g|g)+P(g|b)}=FN$ - misdetection rate (漏檢率): $P(g|b)=\frac{P(b, g)}{P(b)}=\frac{P(b, g)}{P(b|g)+P(b|b)}=FP$ - ==**例題(p33, 34):**== $P(g)=0.95, P(b)=0.05$ 以及給定下面的表，求$E[e]=?$ | | Good | Bad | | ---- | ------------- | ------------- | | Good | $P(g\|g)=0.8$ | $P(b\|g)=0.2$ | | Bad | $P(g\|b)=0.1$ | $P(b\|b)=0.9$ | | | Good | Bad | | ---- | --------------- | ------------- | | Good | $e(g,g)=2000$ | $e(g,b)=-100$ | | Bad | $e(b,g)=-10000$ | $e(b,b)=-100$ | $\begin{align*} E(e)&=e(g,g)P(g, g)+e(g,b)P(g, b)+e(b,g)P(b, g)+e(b,b)P(b, b) \\ &=e(g,g)P(g| g)P(g)+e(g,b)P(b|g)P(g)+e(b,g)P(g| b)P(b)+e(b,b)P(b|b)P(b) \\ &=[e(g,g)P(g|g)+e(g,b)P(b|g)]P(g)+[e(b,g)P(g| b)+e(b,b)P(b|b)]P(b) \\ &=[2000 \times 0.8 + (-100) \times 0.2] \times 0.95 + [(-10000) \times 0.1 + (-100) \times 0.9] \times 0.05 \\ &=1580 \times 0.95 + (-1090) \times 0.05 \\ &=1580-2670 \times 0.05 \\ &= 1446.5 \end{align*}$ - pattern recognition: 圖形辨識，藉由將圖形分類的方式來辨識圖形 - unit: 一個影像區域，被放進圖形辨識的單位 - measurement vector: 對應unit的一個measurement vector，之後放進去decision rule來辨識屬於哪一個category - curse of dimensionality: 當維度增加，會使得資料變得更稀疏，更難找到想找的目標，花更多時間並且更難視覺化 - (t, a, d): 真實情況是t的情況下，觀測到d(measurement vector)後assign給他a - e(t, a): 經濟增益值，那個情況下會對經濟的影響，正負決定正影響or負影響 - economic gain matrix: 2*2 答對答錯列連表 - identity gain matrix: economic gain matrix對角線（答對）設為1，其他為0 - P(g,g) -> p(assigned|true) - TP: good diagnosed good - TN: bad diagnosed bad - FP: bad diagnosed good, misdetection rate, Type I error, P(g|b) - FN: good diagnosed bad, False alarm rate, Type II error - identity gain matrix可以用來算分類對的期望值，因為只有分類對的有e(t,a)=1，乘上機率加總即可 - fair game assumption: 分類的結果，不會因為有decision rule得到真實類別的資訊，就會分的比較好， P(a |t,d)=P(a |d) <a和t獨立> - P(a,t|d)=P(a|d)P(t|d) - p(a|d): Conditional probability of decision rule - misidentification error (alpha): 真實是K但assigned不是K - 口訣- $\alpha_1$=P($C_1$)f($C_2$)/P($C_1$) - $\alpha_2$=P($C_2$)f($C_1$)/P($C_2$) - false-identification error (beta):assigned K但真實不是K - $\beta_1$=P($C_2$)f($C_1$)/1-P($C_2$) - $\beta_2$=P($C_1$)f($C_2$)/1-P($C_1$) ## Chapter 5: Mathematical Morphology - mathematical morphology: - 內容: 運用數學領域中拓樸、集合論等手法處理影像 - 應用: 膨脹、侵蝕、去雜訊、填補小洞 - linear shift-invariant operator: 利用鄰居的線性組合做運算，且運算結果不受平移影響 - dilation (膨脹): - 內容: - 膨脹操作會把高亮 (即值為255) 的邊界膨脹開。 - 設定卷積核尺寸，卷積核沿著圖像滑動，與卷積核對應的原圖像的像素值中只要有一個是 1，中心元素的像素值就是 1 - $A \oplus B=\{x \in E^n \mid x=a+b \text{ for some } a \in A \text{ and } b \in B \}$ - 目的: 如果以白色部分為前景，會使圖片中較亮的範圍變大 - 應用：可用於填補圖片中的小黑洞 - erosion (侵蝕): - 內容： - 腐蝕操作會把高亮 (即值為255) 的邊界腐蝕掉 - 設定卷積核尺寸，卷積核沿著圖像滑動，如果與卷積核對應的原圖像的所有像素值都是 1，那麼中心元素就保持原來的像素值，否則就變為 0。 - $A \ominus B = \{x \in E^n \mid x+b \in A, \forall b \in B \}$ - 目的: 如果以白色的部分為前景，會使圖片中較亮的部分縮小 - 應用: 能消除部分雜訊 - opening (開運算): - 內容: - 在數學形態學中，開運算被定義為先侵蝕後膨脹 - $A \circ B=(A \ominus B) \oplus B$ - 目的: 能夠消除雜訊 - 應用: 能夠消除雜訊 - closing (閉運算): - 內容: 在數學形態學中，閉運算被定義為先膨脹後腐蝕 - $A \bullet B=(A \oplus B) \ominus B$ - 目的: 能夠消除小型黑洞 - 應用: 能夠消除小型黑洞 - hit and miss: - 內容: $A \otimes (J, K) = (A \ominus J) \cap (A^c \ominus K)$ 且 $J \cap K = \varnothing$ - 目的: 找角落、孤立點 - 應用: 計算 genus - $A \subseteq B$: - dilation: $A \oplus K \subseteq B \oplus K$ - erosion: $A \ominus K \subseteq B \ominus K$ - opening: $A \circ K \subseteq B \circ K$ - closing: $A \bullet K \subseteq B \bullet K$ - top: $T[A](x)=\max \{y \mid (x, y) \in A \}$ - umbra (陰影): $U[f]=\{(x, y) \in F \times E \mid y \leq f(x) \}$ - distance transform: 走到背景最短距離需要幾步 - medial axis: 用最少的資料能還原出原始影像 - genus $g(I)$: number of connected components minus mumber of holes of I - $g_4$: 4 connected for object, 8 connected for background - $g_8$: 8 connected for object, 4 connected for background - 灰階的dilation and erosion - dilation: K相反，相加後取最大（外擴） - erosion: K相同，I-K(相減)後取最小（完全放進去才有值） ## Chapter 6: Neighborhood Operator - neighborhood operator: - 內容: define neighborhood and operation. - 目的: do calculation based on neighborhood and operation. Make the relationship between pixels local - 應用: dilation, erosion, or convolution等 - neighborhood of $x_0$ (counterclockwise): - 4-connected: $x_1, x_2, x_3, x_4$ - 8-connected: $x_1, ..., x_8$ | $x_7$ | $x_2$ | $x_6$ | | ----- | ----- | ----- | | $x_3$ | $x_0$ | $x_1$ | | $x_8$ | $x_4$ | $x_5$ | - recursive neighborhood operator: output 和前面有關的 neighborhood operator - nonrecursive neighborhood operator: output 和前面無關的 neighborhood operator，可平行計算 - ==Euclidean distance== (continuous circle): ![](https://i.imgur.com/BdTotVR.png) - 先四聯通，再八聯通(交替) => 近似 Euclidean distance - ==connectivity number==: - 0: isolated - 1: edge - 2: connecting - 3: branching - 4: crossing - 5: interior - conditioning (調節): - 內容: 調節是基於模型，將所觀察到的圖像由帶有資訊的形狀組成，這些帶有資訊的圖形被我們不會感興趣的變化所修改，這些變化通常是加法或乘法 - 目的: 去除無用的資訊、雜訊 - 應用: 降躁、背景均一化 - thinning operator: - 內容: 標記內部、邊界像素算子、對關係算子、連接收縮算子 - 目的: 對稱收縮，直到沒有內部像素留下 - 應用: 得到影像骨架 - symbolic domain - 內容: domains containing symbolic (包含符號的域) - 目的: classifying pixels, each pixel belongs to one category. - 應用: domain type of neighborhood operator (鄰域算子的域類型) - extremum sharpening: - 內容: Output closer of neighborhood maximum or minimum. 先找出neighborhood 最大的和最小的，看看中心點元素的值靠近最大還是最小，靠近最大的話中心點元素就變成最大，靠近最小中心點元素就變成最小 - 目的：可以讓整張圖片亮的更亮，暗的更暗 - 應用：影像銳利化 - correlation - 內容: - $(f*g)(x, y)=\sum_{i=-n}^n\sum_{j=-n}^nf(x, y)g(x+i, y+j)$ - 目的: 提取影像特徵 - 應用: CNN - convolution - 內容: - $(f*g)(x, y)=\sum_{i=-n}^n\sum_{j=-n}^nf(x, y)g(x-i, y-j)$ - 目的: 提取影像特徵，提取出的特徵和 corrrelation 相反，和 kernel 的 pattern 相同 - 應用: CNN - cross correlation: - 內容: 將 kernel 除上總和，計算 correlation - 目的: 防止灰階超過範圍 ## 助教研究 ### Super-Resolution - **Method:** Use ==USRNet== to generate high-resolution images - **Steps:** 1. Input LR image and parameters 2. USRNet 3. Generate reconstructed HR images - **Results:** Make the reconstructed HR image as similar as possible to the original HR image ### Image Quality Tuning Parameter Recommendation for Smartphone - **Methods:** ==Differential evolution (DE)== algorithm with machine learning (ML) - **Steps:** 1. DE generate recommended parameters 2. ML predict whether the recommended parameters are good or bad 3. If bad, regenerate / If good, push to camera - **Result:** Recommend ISP 晶片 parameter successfully ### Introduction of Adaptive EV Charging - **Methods:** ==Model Predictive Control (MPC)== with ==unbalanced three-phased infrastructure constraints== - **Steps:** 1. Request for EV charging demand or Rescheduling (請求電動汽車充電需求或重新排程) 2. MPC Optimization 3. Update the Charging Rate - **Results:** - Lightly constrained system -> Affine constraints - Highly constrained system -> SOC constraints ### Contrast Enhancement and Dehazing Based on Fast Single Image Processing Algorithmg - **Method:** 利用 ==MCPA== 去除霧埋 - 步驟: - minimum channel - $I = tJ+(1-t)A$ $I$: 原始影像、$J$: 乾淨影像、$t$: 傳輸率、$A$: 霧的亮度 - histogram equalization (HE) - image multiplication - 結果: 比其他人快 4、5 倍 ### Chinese Optical Character Recognition - **Method:** ==CTW-CRNN==、==Parseq== (SOTA method) - **Step:** - 利用 paper 提供的方法，電腦生成模擬 training set - 訓練 CTW-CRNN (CNN+RNN+Decoder) 和 Parseq - 實驗 Parseq 在中文 dataset 上是否能超越舊方法 - **Result:** CTW-CRNN 和 Parseq 結果差不多 ## 考古題名詞解釋 - [ ] conditioning - 內容: 調節是==基於一個模型==，該模型觀察到的圖像由帶有資訊的形狀組成，==這些帶有資訊的圖形被我們不會感興趣的變化所修改==，這些變化通常是加法或乘法 - 目的: ==去除無用的資訊、雜訊== - 應用: ==降躁、背景均一化== - [ ] labeling - 內容: - 標記是基於一個模型，該模型帶有資訊的形狀具有作為事件的空間排列的結構，每個空間事件都是一組連接的像素 - 給定相同特徵的 pixels 相同的編號 - 目的: 讓電腦將相同特性的像素給予相同編號 - 應用: 閥值化、邊緣檢測、自駕車、機器人、相片搜尋、影片檢索、商品搜尋、辨識影像中的各種物體、thresholding、edge detection、corner finding - [ ] grouping - 內容: 分組操作通過==收集或識別==參與==同一類事件的最大連接像素集==來識別事件 - 目的: 有利於將群組化後的資訊進行一些計算 - 應用: segmentation 分割、edge linking 邊緣鏈接 - [ ] range image - 內容: 視線距離 (range image 又稱為2.5度空間影像) - 目的: 在一張 2D 平面圖的每個 pixel 存放深度值，方便進行重建 - 應用: 重建 3D model - [ ] pixel - 內容: - picture element - 影像的基本單位，包含位置和灰階值 - 目的: has properties of position and value - 應用: 將圖像數字化 - [ ] feature - 內容: 圖像中的特殊結構 - 目的: 特徵是與解決與特定應用程序相關的計算任務相關的任何信息 - 應用: feature recognition - [ ] orientation - 內容: 影像中物體的方向 - 目的: orientation 加上 position可以得知影像中的物體形狀、位置等 - 應用：orientation 加上 position可以得知影像中的物體形狀、位置等 - [ ] dilation - [ ] erosion - [ ] closing - [ ] neighborhood operator - [ ] thinning operator - 內容: ==標記內部/ 邊界像素算子==、對關係算子、連接收縮算子 - 目的: ==對稱收縮==，直到沒有內部像素留下 - 應用: ==獲取圖像骨架== - [ ] symbolic domain - 內容: 包含符號的域 - 目的: ==classifying pixels==, each pixel belongs to one category - 應用: domain type of neighborhood operator (鄰域算子的域類型) - [ ] extremum sharping - 內容: output closer of neighborhood maximum or minimum. 先找出 neighborhood 最大的和最小的，看看中心點元素的值靠近最大還是最小，靠近最大的話中心點元素就變成最大，靠近最小中心點元素就變成最小 - 目的: 可以讓整張圖片亮的更亮，暗的更暗 - 應用: 影像銳利化 - [ ] roberts edge detector - 內容: 用 -1 0 mask 算出 r1，0 -1 mask 算出 r2 (中心點放左上角) 用 0 1 mask 算出 r1，1 0 mask 算出 r2 (中心點放左上角) 然後比較 $\sqrt{r1^2 + r2^2}$ 是否大於門檻值，是的話該點為邊，設0 - 目的: 找出一張圖邊的位置 - 應用: 找出一張圖邊的位置 - [ ] background normalization - 內容: 為一種 ==conditioning== - 目的: 透過正規化背景去除無用資訊 - 應用: 透過正規化背景去除無用資訊 - [ ] peak noise - 內容: 峰值雜訊，透過設立的各式各樣閥值，判斷該點是否和周圍的臨域像素的平均值相差超過門檻，也可透過差了幾個標準差判斷，是的話就是峰值雜訊。 - 目的: 找出並移除峰值雜訊，影像相對會模糊 - 應用: outlier removal 和 contrast-dependent outlier removal - [ ] outlier removal - 內容: 如果中心像素值和臨域像素的平均值相差超過門檻，視該中心值為峰值雜訊，就把中心像素值換成臨域像素的平均值，反之，中心像素值維持不變 - 目的: 把峰值雜訊移除 - 應用: 把峰值雜訊移除 - [ ] shape from texture: 用物品材質、紋理來反映出三維空間的形狀，可用於電腦繪圖展現3d感覺 - [ ] shape from shading: 用陰影濃淡、明暗來反映出三維空間的形狀，可用於電腦繪圖展現3d感覺 - [ ] alignment: 協助判斷物件是否對齊，為電腦視覺的主要用途之一，例如製作晶片時用於判斷位置 - [ ] measurement vector - [ ] pattern recognition: 從圖片中尋找特定的模式，為電腦視覺的主要用途之一，例如協助林務局去分析地景資料，尋找拍攝到的照片有多少面積的水稻等 - [ ] virtual reality - [ ] augmented reality - [ ] stereo vision - [ ] segmentation - [ ] intensity histogram: 對region中像素的灰階值去畫出intensity histogram，可用直方圖感受圖形的亮度 - [ ] Gray-Level Co-occurrence Matrix (GLCM) - [ ] region - [ ] classifier - [ ] bounding rectangle - [ ] area - [ ] centroid - [ ] Statistical Pattern Recognition - [ ] maximin decision rule - [ ] Bayesian decision rule - [ ] recursive neighborhood operator - [ ] linear shift invariant operator - [ ] correlation - [ ] shape - [ ] preserve order: 保持大小順序 - [ ] hexagonal grid: 將各列像素平移 0.5 個 pixel，使得像素至周圍每個像素的距離相同 - [ ] corner: the intersection of two edges - [ ] edge: boundaries between objects and background - [ ] linear shift-invariant operator - [ ] mathematical morphology - [ ] convolution - [ ] cross correlation - [ ] weight mask - [ ] noise cleaning