
Basics of text processing with spaCy

Friday, May 21st, 2021 3:00-5:00


In this workshop, we will dive into the basics of spaCy for text processing. spaCy is a free Python library for Natural Language Processing, designed specifically for production environments.
The first part will introduce spaCy and show how to navigate its documentation. Then we will work in small groups and code. Using Jupyter notebooks, we will create a pipeline for tokenization, part-of-speech tagging, parsing, and named entity extraction. We will dedicate the last 15 minutes to discussing the advantages of using spaCy and its ecosystem (Prodigy, Thinc, and other tools).
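As a rough preview of the kind of pipeline described above, here is a minimal sketch. It is not the workshop notebook. It assumes spaCy is installed (`pip install spacy`) and that the small English model `en_core_web_sm` has been downloaded (`python -m spacy download en_core_web_sm`). The model choice, the helper names `load_pipeline` and `analyze`, and the sample sentence are illustrative assumptions. If the model is missing, the sketch falls back to a blank English pipeline, which only tokenizes, so the tag, dependency, and entity fields will be empty.

```python
import spacy


def load_pipeline(model_name="en_core_web_sm"):
    """Load a trained pipeline, or fall back to a tokenizer-only pipeline.

    The model name is an assumption for this sketch; any trained English
    pipeline should work the same way.
    """
    try:
        return spacy.load(model_name)
    except OSError:
        # spacy.load raises OSError when the model package is not installed.
        # A blank pipeline still tokenizes, but has no tagger, parser, or NER.
        return spacy.blank("en")


def analyze(nlp, text):
    """Run the pipeline and collect per-token and per-entity annotations."""
    doc = nlp(text)
    # For each token: surface text, part-of-speech tag, dependency label, head.
    tokens = [(t.text, t.pos_, t.dep_, t.head.text) for t in doc]
    # For each named entity: the entity text and its label.
    entities = [(ent.text, ent.label_) for ent in doc.ents]
    return tokens, entities


nlp = load_pipeline()
tokens, entities = analyze(
    nlp, "Apple is looking at buying a U.K. startup for $1 billion."
)

for text, pos, dep, head in tokens:
    print(f"{text:<10} {pos:<6} {dep:<10} {head}")

for text, label in entities:
    print(f"{text:<12} {label}")
```

A single call to `nlp(text)` runs every component in the pipeline, so tokens, tags, dependencies, and entities are all read from the same `Doc` object. `nlp.pipe_names` lists the components that are actually loaded.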

Getting Started

You will need: basic Git and GitHub knowledge (enough to clone the repository to your machine), at least Intro to Python 1 and 2 from this ResBaz edition, and some prior experience with Jupyter notebooks (having used them at least once).

If you would like to follow along with the code, this link provides the information you need, including an introduction to Jupyter notebooks: https://tinyurl.com/rb21-spacy


Introductions

Name, Affiliation, Title, Email, Social Media

Questions and Answers

Post your questions in this section, and feel free to answer other people's questions if you know the answer. Questions will be answered during or after the workshop.

  1. Ask your question.
    • Here is an answer

Freebies & swag

Check back at the end of the workshop for free stuff for workshop attendees, courtesy of Explosion.


Session Feedback


Use the link below to provide your feedback on the session:
Session Feedback Form
