---
tags: ResBaz2021
---

# Basics of text processing with spaCy

## Friday, May 21st, 2021 3\:00-5\:00

[Back to Resbaz HackMD Directory](https://hackmd.io/@ResBaz21/directory)

In this workshop, we will dive into the basics of spaCy for text processing. spaCy is a free Python library for Natural Language Processing, developed specifically for production environments.

The first part will be dedicated to introducing spaCy and navigating through its documentation. Then, we will work in small groups and code. Using Jupyter notebooks, we will create a pipeline for text tokenization, part-of-speech tagging, parsing, and extracting named entities. We will dedicate the last 15 minutes to discussing the advantages of using spaCy and its ecosystem (Prodigy, Thinc, and other tools).

## Getting Started

You will need:

- basic Git and GitHub knowledge to clone the repository to your machine,
- at least Intro to Python 1 and 2 from this ResBaz edition, and
- experience using Jupyter notebooks at least once before.

If you are interested in following along with the code, this link provides more information, including an introduction to Jupyter notebooks: https://tinyurl.com/rb21-spacy

---

## Introductions

Name, Affiliation, Title, Email, Social Media

- Damian Romero (workshop instructor), University of Arizona, Ph.D. Candidate, Spanish and Portuguese, damianijil@email.arizona.edu, [Twitter @DamianRomero_CL](https://twitter.com/DamianRomero_CL), [LinkedIn @damian-romero](https://www.linkedin.com/in/damian-romero/)
- Your Name, University of Arizona, Your title, youremail@email.arizona.edu, your social media

## Questions and Answers

In this section, you can post your questions, and feel free to answer someone else's question if you know the answer. Questions will be answered during or after the workshop.

1. Ask your question.
    - Here is an answer

---

## Freebies & swag

Check back at the end of the workshop for free stuff for workshop attendees, courtesy of [Explosion](https://explosion.ai/).

---

:::info
**Session Feedback :mega:**

Use the link below to provide your feedback on the session:

[**Session Feedback Form**](https://forms.gle/TrnJpr9qRBEKdnVVA)
:::