# HTML 2021 HW3 Grading Policy
## P5
- Option 1: Using Hoeffding inequality.
- -1 points if there is a minor mistake, e.g. notation error
- -3 points if there is a major mistake or unclear derivation, e.g. error in using Hoeffding
- -5 points if the derivation of option 1 is unrelated or incorrect.
- Option 2: You should either (a) mention the likelihood is proportional to $\prod \mu^{y_i} (1-\mu)^{1-y_i}$ and take partial derivative to maximize it , or (b) mention Binomial distribution, or \(c\) mention the equivalence to option 4.
- -1 points if there is a minor mistake, e.g. notation error, only mention Bernoulli.
- -3 points if there is a major mistake or the derivation is unclear, e.g. likelihood function is wrong , "by statistics", "$\nu$ is mean so maximize", “by definition” ...
- -5 points if the derivation of option 2 is unrelated or incorrect.
- Option 3: You should take partial derivative to show it minimize the squared error.
- -1 points if there is a minor mistake.
- -3 points if there is a major mistake or unclear derivation, e.g. use simulation, ...
- -5 points if the derivation of option 3 is unrelated or incorrect.
- Option 4: You should either (a) take partial derivative to show it minimize the error, or (b) mention the equivalence to option 2
- -1 points if there is a minor mistake.
- -3 points if there is a major mistake or unclear derivation, e.g. "mean will minimize", ...
- -5 points if the derivation of option 4 is unrelated or incorrect.
## P7
- 20 Points
- Correct answer, your steps need clear and are necessary.
- 10 Points
- Almost correct, for example, you didn't show $V = - \nabla_{err}$ in detail or you didn't show how you caculate it.
- 5 Points
- Correct answer but you use excultion to choose the answer.
- 0 Points
- Wrong answer
## P9
- 20 Points
- The steps are totally correct regardless of your final answer choice. An example solution would be like writing down both $\textbf{w}_{\text{LIN}}$ and $\tilde{\textbf{w}}$, and derive the answer by comparing these two weights.
- 15 Points
- The steps are almost correct, but with some minor mistakes. (1) Making mistakes on matrix multiplication, such as $(AB)^{-1}=A^{-1}B^{-1}$ or $(AB)^{T}=A^{T}B^{T}$.
- 10 Points
- The solution includes one major mistake, but some part of it is reasonable. (1) Assuming $\textbf{X}$ is invertible. (2) Transformed data is not in the form of $\textbf{X}\Gamma^T$. (3) Assuming that there exists a $\textbf{w}$ such that $\textbf{X}\textbf{w}=\textbf{y}$. (4) Serious mistakes on matrix multiplication.
- 10 Points
- The answer is simulated by a program.
- 0 Point
- The solution is incorrect or includes more than 1 major mistakes. (1) Assuming $\textbf{X}$ is invertible. (2) Transformed data is not in the form of $\textbf{X}\Gamma^T$. (3) Assuming that there exists a $\textbf{w}$ such that $\textbf{X}\textbf{w}=\textbf{y}$. (4) Serious mistakes on matrix multiplication.
## P11
**Correct answer: \(c\)**
Basically, there're 3 cases:
- 20 Points
- Choosing the correct answer
- 10 Points
- Choosing (a), (b)
- The bounds are not tight enough.
- 5 Points
- Choosing (d), (e)
- We can find a counter-example to show that (d), (e) cannot upper-bound the given term.
Exceptions
- 15 Points
- Choosing the correct answer, with a minor issue.
- 10 Points
- Choosing the correct answer, with only high-level intuition / unreasonable explanation.
- 0 Points
- Totally unrelated / unreasonable explanations
- Without explanations