luoluochou - HackMD

luoluochou

@pipibear

一開始是為了找個地方紀錄一些很混亂的 XD 研究所準備筆記，現在成功上岸變成分享一般的讀書筆記啦～因為是自己理解過後所做的筆記，因此可能有錯誤的地方，如果有覺得哪裡怪怪的，歡迎討論、指正！希望可以透過寫筆記的方式整理自己的思緒，還有練習把一件事從最基本的地方開始講清楚的能力。

Joined on Mar 25, 2023

這是目前主要在持續更新的筆記，底下的各章節還在緩慢增加中，內容除了 ML 也包含了大量的機率與統計（篇幅快相當了⋯⋯）也是這個地方唯一有系統性地在整理的筆記，其餘準備研究所時的雜亂小筆記有機會再整理！

Published notes105
Likes6

Select all Clear all

AND

OR

Untagged(4)

Coherence Problem(1)

Convex Optimization(2)

DMA(1)

FAT(1)

Interrupt Driven I/O(1)

ML(75)

NP-Completeness(2)

POSIX shared memory(1)

Polling(1)

Probability(19)

allocation method(1)

buffer(1)

contiguous allocation(1)

direct communication(1)

directory(1)

hard link(1)

indexed allocation(1)

indirect communication(1)

inode(2)

link(1)

linked allocation(1)

linux threads(1)

mmap(1)

non symbolic link(1)

pipe(1)

producer consumer(1)

programmed I/O(1)

pthreads(1)

shared memory(1)

soft link(1)

thread pool(1)

symbolic link(1)

數學(4)

研究所(22)

考古訂正(1)

計系(9)

資演(9)

New to old

Notes sorted by

New to old

Old to new

A to Z

Z to A

README
整理這一系列文章，主要是為了加強我自己對 ML 的理解。雖然有點花時間，但是覺得用自己的話重新把東西講一遍，常常能夠發現一些原本自以為懂，但其實概念模糊的地方。建立在這樣的出發點，內容大部分是我每讀完一段後大致的課本原話翻譯（有時候可能省略部分例子或是一些額外的描述），有些是我了解大概的意思，加上回想過去所學後整理出的理解，除此之外，也有網路上到處看看後做的整理。關於網路資源，如果只有一兩句話就不會特別註明，但如果有參考較多的地方會在最下方「參考資料」處留下連結。總之是綜合各方面的東西做出來的筆記，因此可能有誤，歡迎指出～另外，在撰寫這一系列筆記的一學期過後，我修了台大林軒田老師的機器學習，發現所有這些筆記裡最有幫助的部分應該只有機率，其他老實說用不太上（或過於簡單）XDD
luoluochou changed 6 months agoView mode Like 4 Bookmark
Fibonacci Heap
Fibonacci Heap Pasted Graphic 2 Definition 由很多個 rooted tree 組合而成，而每棵 tree 都具 Min Heap property <font color = "snake">root list</font>: 用Circular doubly linked list 串連所有的 root <font color = "snake">Child list</font>: 用Circular doubly linked list 串連所有的 child 用Circular doubly linked list的好處：
luoluochou changed 6 months agoView mode Like 1 Bookmark
Convex Optimization 筆記說明
這篇筆記主要用來記錄 $113$ 學年度上學期，黃文良老師的「訊號處理和機器學習之數學基礎」課程筆記和課本閱讀的整理。因為開學後感覺時間或許沒有那麼多，不知道能更新多少，紀錄筆記的方式也不會像寫 ML 筆記時那麼詳細，但筆記的作法仍然和當初類似，每篇筆記會對應課本同一小節的內容，課本主要用的是： :::info Boyd and Vandenberghe 的 Convex Optimization ::: 點課本名稱的連結，能下載課本和 lecture slides。
luoluochou changed 9 months agoView mode Like 2 Bookmark
Convex Optimization 筆記
Convex Optimization 筆記說明 Ch1 Introduction 1.1 Mathematical Optimization 1.2 Least-squares and linear programming
luoluochou changed 10 months agoBook mode Like Bookmark
5.2 Parameter Estimation
背景前面一節我們有講到，我們可以將 sample 中的 $N$ 個 data points，每個具 $d$ 個 features，用 matrix $X$ 表示： \begin{equation} X = \begin{bmatrix} X_1^1 & X_2^1 & ... & X_d^1 \ X_1^2 & X_2^2 & ... & X_d^2 \ . & \ . & \
luoluochou changed 10 months agoView mode Like Bookmark
ML 筆記
README Ch1: Introduction 1.1 What is Machine Learning? 1.2 Examples of Machine Learning Applications 1.3 Notes Ch2: Supervised Learning 2.1 Learning a Class from Examples 2.2 Vapnik-Chervonenkis Dimension
luoluochou changed 10 months agoBook mode Like Bookmark
6.3 Principal Component Analysis
我們在 $6.1$ 的 Introduction 中有提到，feature extraction 有兩種常用的 method： Principal Component Analysis (PCA) linear discriminant analysis 兩者都是利用 linear projection。在像這樣的 projection methods 中，我們想要做的是： :::info 找到一個 mapping，在喪失的資訊最小化的情況下，將原本在 $d$-dimensional space 中的 original inputs，map 到另一個小於 $d$ 的 $k$-dimensional space。 :::
luoluochou changed 10 months agoView mode Like 1 Bookmark
補充：multivariate outliers
本篇文章的內容為筆記「補充：Exploratory Data Analysis」的延續，在那篇筆記裡，最後當我們在講 outliers 時，因為篇幅太長，所以我們只講了課本的 univarite 的情形。當時我們說，利用 box-plot 或是其他的 graphical technique，我們可以用看的就能找出 outliers，但是這樣的方法只適用於 univariate data。當我們的 data 是 multivariate 時，我們是沒有辦法用看的就找出哪些是 outliers，挑出 outliers 的工作也就變得複雜許多。實務上，當我們的 variable 數 $d>2$ 時，就很難去辨認 outliers。在這篇筆記裡，我只會簡單講一下 multivariate case 的情況下我們大致上可以怎麼做，舉簡單的例子，停在大概有點概念的程度。如果有興趣深入看看，參考資料的兩篇文章一篇介紹了如何用 python 來計算 MD，另一篇為 multivariate outliers detection 的論文，這篇論文前半部分還是很容易懂的，後半部我就沒看了，總之有興趣可參考！
luoluochou changed a year agoView mode Like Bookmark
5.4 Multivariate Normal Distribution
課本中的向量皆用粗體，以往我都是省略直接用 $x$ 表示，看得懂就好，但現在符號比較多，為避免混淆，還是決定將向量都標上向量符號。本節內容為以前我們學過的 univariate normal distribution 推廣到 multivariate 的情形。首先，recall 我們在 univariate case 裡如何去定義一個 random variable $X$ 具 normal distribution，我們當時說，只要 $X$ 的 pdf 滿足： \begin{equation} p(x) = \frac{1}{\sqrt{2\pi}\sigma}e^{-\frac{(x-\mu)^2}{2\sigma^2}} \end{equation}
luoluochou changed a year agoView mode Like 2 Bookmark
6.2 Subset Selection
前一節的 Introduction 裡面我們有提到，feature selection 的一種做法叫做 subset selection。在 subset selection 中，我們的目標是從原本所有的 features 裡面，挑出一個最好的 subset。什麼叫「最好的 subset」呢？ :::info The best subset ++contains the least number of dimensions++ that ++most contribute to accuracy++. ::: 至於剩下的 features，我們就直接捨棄掉。
luoluochou changed a year agoView mode Like 1 Bookmark
6.1 Introduction
不管是在 regression 或 classification 裡： :::warning 任何 regressor / classifier 的 complexity，都 depends on input (feature / attribute) 的數量。 ::: 在前面第五章裡，我們在討論 multivariate data 時，都是令每個 data point 為： \begin{equation} \vec{x} = [,x_1,...,x_d],
luoluochou changed a year agoView mode Like Bookmark
File allocation method
判斷 Performance 的前提：要決定一個系統要用哪種 allocation method，需要先判斷這個系統比較多是需要 sequential access 還是 random access Contiguous allocation <font color = "red">需要事先宣告大小</font> ==support sequential access, random access== 不管是 sequential access 還是 random access，contiguous allocation 只需要一次 access 就能取得一個 block。因為我們可以直接把一個檔案的起始位置放在 memory 裡面，所以我們很容易可以計算任何第 i 個 Block 的位址，然後就能直接去讀取那個 block。 Contiguous allocation 適合 small files
luoluochou changed a year agoView mode Like Bookmark
5.8 Multivariate Regression
在 <font color = "snake">multivariate linear regression</font> 中，我們會假設 output ==$r$== 可以被寫成一個 ++linear function++，也就是我們的 ++input variables $x_1,...,x_d$ 的 weighted sum，再加上 noise++。其實在統計學的術語裡面，這應該要叫作 <font color = "snake">multiple regression</font>，<font color = "snake">multivariate</font> 是用在當我們有 multiple outputs 的情形。總之，如果我們把上面文字的描述透過數學表示，即為： image 接下來，就像我們在筆記「4.6 Regression」中講過的，為了要 fit 這些 polynomial functions $g(\vec{x}^t|w_0,...,w_d)$ to data sets，也就是說：
luoluochou changed a year agoView mode Like Bookmark
5.7 Discrete Features
binary features 在一些應用裡，我們的 attributes（即 features）為 discrete，為 $n$ 種不同的值中的其中一種。舉例來說，某種 attribute 可能會是： $\text{color} \in {\text{red, blue, green, black}}$ $\text{pixel} \in {\text{on, off}}$ 我們來假設一個 attribute $x_j$ 是 binary（Bernoulli）的情形：
luoluochou changed a year agoView mode Like Bookmark
5.6 Tuning Complexity
本節的內容不多，基本上都是前一節：筆記「5.5 Multivariate Classification」的整理與延伸，因此如果有沒特別說明的地方，基本上回頭看這篇筆記，就可以在裡面找到答案 :) discriminant function 在 simplicity 和 generality 之間的 tradeoff 在前一小節最後整理的表格裡，也就是這張表： image
luoluochou changed a year agoView mode Like Bookmark
5.5 Multivariate Classification
我們以前學過：在 classification 的問題裡，我們希望透過 sample 的 data 來對每個 class 找出一個適合的 function（叫做 discriminant function），來使得當我們有新的 input 時，我們可以代入這些 functions 裡，並藉由找出最大的 output 來決定要把這個新的 input 分類到哪個 class。最初我們的討論都建立在 input 只是某個值（一維）的情況下，而在這個小節裡，我們要做的是延續前幾節對 multivariate 的介紹，來看看一個 $d$ 維的 random vector 要怎麼去做 classification。也就是說，當 feature 很多時，要怎麼去分類我們的 input？ normal class conditional densitiy 定義
 luoluochou changed a year agoView mode Like Bookmark
補充：Conditional Distributions
寫這篇的時候生病中，腦袋不清楚，忘記很多內容已經在其他筆記寫過了，不過反正也很簡短，就當作是重點整理好了。不過 conditional distribution 的內容應該是沒有寫過，主要也是誤了講這個才回顧了一堆重複的內容，其他已經會了的話，可以跳著看這部分。兩篇和這篇內容有重疊的筆記，可以交互參考： A.2.2 Joint Distribution and Density Functions 補充： Joint distribution functions 背景知識：joint / marginal distribution
luoluochou changed a year agoView mode Like Bookmark
補充：Basic Multivariate Normal Theory
本節內容完全對應參考資料中的網頁，基本上都是定義和證明，用來補充其它筆記中的定義、定理的證明。證明我寫的基本上和網頁中的差不多，只是比較詳細，少數那裡省略沒證的我有自己證在這裡。基本上都是數學，所以就保留使用英文了。 1 Random Vector random vector mean, variance 定義 <font color = "snake">Random vector</font> 也就是我們第五章中一直在用的 <font color = "snake">multivariate random variable</font>，它的 mean 和 variance 定義如下：
luoluochou changed a year agoView mode Like Bookmark
補充：Exploratory Data Analysis
stem-and-leaf display 當我們要去探索一個 unknown distribution 的 characteristics 時，我們會去取一個含 $n$ 個 observations 的 sample： \begin{equation} x_1, \ x_2, \ ..., \ x_n \end{equation} 得到這些 observations，我們常常會需要把它們由小排到大。其中一種簡便的方式叫做 <font color = "snake">stem-and-leaf display</font>。直接來看個例子，假設我們有五十筆考試成績的資料如下：
luoluochou changed a year agoView mode Like Bookmark
A.2.6 Variance
variance variance 定義 ==$Var(X)$==（==$\sigma^2$==），稱作 <font color = "snake">variance</font>，也就是國高中學過的變異量，是用來測量 ++$X$ 和 expected value $\mu$ 之間的變化有多大++，定義如下： :::info \begin{equation} Var(X) = E[(X - E[X])^2] \end{equation} :::
luoluochou changed a year agoView mode Like Bookmark

1
2
3
...
6

Create a note from template

Create a note from template

Oops...

This template has been removed or transferred.

Upgrade

All
Team

No template.

Create a template

Upgrade

Template name

Save as option

Save this note as a template Save to another note as a template

Delete template

Do you really want to delete this template?

Keep the content

Turn this template into a regular note and keep its content, versions, and comments.

Link with GitHub

Please authorize HackMD on GitHub

Please sign in to GitHub and install the HackMD app on your GitHub repo.
HackMD links with GitHub through a GitHub App. You can choose which repo to install our App.

Learn more Sign in to GitHub

Create team

Team Name

Team Description

Team URL Path

https://hackmd.io/team/

Allow a-Z, 0-9 and dash not at beginning.

Are you sure?

Do you really want to remove this member?

Leave team

Are you sure to leave this team?

Delete team

Once you delete your team, all team notes will be deleted and cannot be recovered. Please ensure you've exported or transfered these notes.

Enter team name before deleting it:

Change your password

Use a minimum password length of 8-128 characters, including lowercase and uppercase characters and numbers.

Setup your password

Use a minimum password length of 8-128 characters, including lowercase and uppercase characters and numbers.

Sign in

Email

Password

Forgot password

or

By clicking below, you agree to our terms of service.

Sign in via Facebook Sign in via Twitter Sign in via GitHub Sign in via Dropbox

Sign in with Wallet

Connect another wallet

New to HackMD? Sign up