# ML project note
## Instructions
- no ML package :cry:
- code + report
- instructions on how to run code
- submit system's output (same format as training set)
## Summary
main goal:
- sequence labelling model for informal texts using Hidden Markov Model (HMM)
- build two sentiment analysis systems from scratch for a different language, using our own annotations (?)
- also using annotations from others (?)
En.zip contains:
- `train`: labelled training set
```
Municipal B-NP
bonds I-NP
are B-VP
generally B-ADVP
a B-ADJP
bit I-ADJP
```
- `dev.in`: unlabelled development set
- `dev.out`: `dev.in` but with labels
```
HBO B-NP
has B-VP
close B-NP
to I-NP
24 I-NP
million I-NP
subscribers I-NP
```
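Both files use the same line-per-token layout, so a single reader covers them. Below is a minimal sketch, assuming one `word tag` pair per line separated by a single space and blank lines between sentences (the separator and helper name are my assumptions, not part of the handout):
```python
def read_labelled(path):
    """Read a labelled file (e.g. train or dev.out): one 'word tag' pair
    per line, sentences separated by blank lines.
    Returns a list of sentences, each a list of (word, tag) tuples."""
    sentences, current = [], []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.rstrip("\n")
            if not line:                     # blank line ends a sentence
                if current:
                    sentences.append(current)
                    current = []
                continue
            word, tag = line.rsplit(" ", 1)  # tag is the last field
            current.append((word, tag))
    if current:                              # handle a missing trailing blank line
        sentences.append(current)
    return sentences
```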
::: info
labels:
- O: outside of any entity
- B-{sentiment}, I-{sentiment}: beginning and inside of a sentiment entity
--> sentiment can be "positive", "negative", or "neutral"
--> what is "B-NP" then?
:::
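To make the B-/I- scheme concrete, here is a small sketch (my own helper, not from the handout) that groups a tag sequence into labelled spans; it also shows how the chunking tags in the example above decode:
```python
def bio_spans(tags):
    """Collect (label, start, end) spans from a B-/I-/O tag sequence.
    'B-x' opens a span of type x, 'I-x' extends it, 'O' closes any open span."""
    spans, start, label = [], None, None
    for i, tag in enumerate(tags):
        if tag.startswith("B-"):
            if label is not None:
                spans.append((label, start, i))
            label, start = tag[2:], i
        elif tag.startswith("I-") and label == tag[2:]:
            continue                          # extend the current span
        else:                                 # 'O' or an inconsistent I- tag
            if label is not None:
                spans.append((label, start, i))
            label, start = None, None
    if label is not None:
        spans.append((label, start, len(tags)))
    return spans

# e.g. the dev.out snippet above:
# bio_spans(["B-NP", "B-VP", "B-NP", "I-NP", "I-NP", "I-NP", "I-NP"])
# -> [('NP', 0, 1), ('VP', 1, 2), ('NP', 2, 7)]
```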
## Refs
### HMM
A stochastic process is a collection of random variables indexed by a mathematical set.
e.g.
states $S = \{\text{hot}, \text{cold}\}$
a state sequence over $T$ time steps --> $z \in S^T$
weather over 4 days can be a sequence --> $\{z_1=\text{hot}, z_2=\text{cold}, \dots\}$
#### Assumptions
1. Limited horizon assumption
The probability of the state at time $t$ depends only on the state at time $t-1$:
$$
P(z_t|z_{t-1},z_{t-2},...)=P(z_t|z_{t-1})
$$
2. Stationary process assumption
The conditional probability does not change over time, i.e.
$$
P(z_t|z_{t-1})=P(z_2|z_1),\quad t\in\{2,\dots,T\}
$$
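Together, the two assumptions let the joint probability of a whole state sequence factorize into an initial term and pairwise transition terms:
$$
P(z_1, z_2, \dots, z_T) = P(z_1)\prod_{t=2}^{T}P(z_t|z_{t-1})
$$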
#### Maximum Likelihood Estimation
> [Theory](https://towardsdatascience.com/the-path-from-maximum-likelihood-estimation-to-hidden-markov-models-61aba5ba901c)
MLE is a method to estimate the parameters of a distribution based on observed samples.
First, define the problem. We have:
- distribution $D_\theta$
- samples $S = (x_1, \dots, x_n)$
- parameter space: range of possible values for the parameter $\theta$ of $D_\theta$
  - Bernoulli: $(0, 1)$
  - Gaussian: $\mathbb{R} \times \mathbb{R}_{>0}$ (mean and variance)
We do not know the actual $\theta$, so we want to estimate it from $S$.
the likelihood is defined as
$$
L(\theta; S) = \prod_{i=1}^{n} P_\theta[X=x_i]
$$
For Bernoulli, it is defined as:
$$
\prod_{i=1}^{n} \theta^{x_i}(1-\theta)^{1-x_i}
$$
For Bernoulli, compute the log likelihood:
> [derivation](https://towardsdatascience.com/the-path-from-maximum-likelihood-estimation-to-hidden-markov-models-61aba5ba901c)
$$
\ell(\theta; x) = \sum_{i=1}^{n} \log\left(\theta^{x_i}(1-\theta)^{1-x_i}\right) = \sum_{i=1}^{n}\left[x_i\log\theta + (1-x_i)\log(1-\theta)\right]
$$
Setting the derivative to zero:
$$
\frac{1}{\theta}\sum_{i=1}^{n}x_i - \frac{1}{1-\theta}\sum_{i=1}^{n}(1-x_i) = 0
\iff
(1-\theta)\sum_{i=1}^{n}x_i = \theta\sum_{i=1}^{n}(1-x_i)
$$
which solves to $\hat{\theta} = \frac{1}{n}\sum_{i=1}^{n}x_i$, the sample mean.
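As a quick numerical sanity check of this result (a minimal sketch; the coin-flip samples are made up for illustration):
```python
import math

samples = [1, 0, 1, 1, 0, 1, 1, 0]            # hypothetical Bernoulli draws

def log_likelihood(theta, xs):
    """l(theta; x) = sum_i [x_i*log(theta) + (1 - x_i)*log(1 - theta)]"""
    return sum(x * math.log(theta) + (1 - x) * math.log(1 - theta) for x in xs)

# closed-form MLE from the derivation above: the sample mean
theta_hat = sum(samples) / len(samples)       # 0.625

# no theta on a fine grid should beat theta_hat
grid = [i / 1000 for i in range(1, 1000)]
best = max(grid, key=lambda t: log_likelihood(t, samples))
print(theta_hat, best)                        # 0.625 0.625
```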
For HMMs, we can use Expectation Maximization (EM), which uses an iterative process to perform MLE in statistical models with latent variables.
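Note that when the training set is fully labelled, as in this project, the states are observed and EM isn't needed: MLE for the HMM parameters reduces to normalized counts. A minimal sketch under that assumption (the function name and the `START`/`STOP` markers are mine, not from the handout):
```python
from collections import defaultdict

def estimate_hmm(sentences):
    """MLE for a first-order HMM from labelled sentences, i.e. lists of
    (word, tag) pairs as returned by read_labelled above.
    Emission:   P(word|tag)  = count(tag -> word)  / count(tag)
    Transition: P(tag|prev)  = count(prev -> tag)  / count(prev)"""
    emit_counts = defaultdict(lambda: defaultdict(int))
    trans_counts = defaultdict(lambda: defaultdict(int))
    for sentence in sentences:
        prev = "START"                        # artificial start-of-sentence state
        for word, tag in sentence:
            emit_counts[tag][word] += 1
            trans_counts[prev][tag] += 1
            prev = tag
        trans_counts[prev]["STOP"] += 1       # artificial end-of-sentence state
    emission, transition = {}, {}
    for tag, words in emit_counts.items():
        total = sum(words.values())
        emission[tag] = {w: c / total for w, c in words.items()}
    for prev, tags in trans_counts.items():
        total = sum(tags.values())
        transition[prev] = {t: c / total for t, c in tags.items()}
    return emission, transition
```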