--- tags: 298 NLP --- # Introduction to Statistical NLP ## Why? Here are some links of what statistical NLP can do today. - [OpenAI Codex Live Demo](https://www.youtube.com/watch?v=SGUCcjHTmGY#t=13m8s) ## What is a Language Model? https://en.wikipedia.org/wiki/Language_model ## References The book [Supervised Machine Learning for Text Analysis in R](https://smltar.com/) has a lot of good material about topics we will not discuss, such as the importance of tokenizing, stop words, stemming, bias in data. You can also find a good account of word embeddings. If you want to see the math background of the language model of $n$-grams, have a look at Chapter 3.1 of [Speech and Language Processing](https://web.stanford.edu/~jurafsky/slp3/) by Jurafsky and Martin. This [podcast with Ilya Sutskever](https://www.youtube.com/watch?v=13CZPWmke6A) for the deep learning side of things.