References

Creating and compiling textual data

“Data Prep and Cleaning | Digital Humanities.” Accessed October 27, 2020. https://digitalhumanities.berkeley.edu/data-prep-and-cleaning.

McGinnis, Scott Paul. “All Mimsy Were the Borogoves: A Brief Introduction to the Unicode Standard | Townsend Center for the Humanities.” Accessed October 27, 2020. https://townsendcenter.berkeley.edu/blog/all-mimsy-were-borogoves-brief-introduction-unicode-standard.

Padilla, Thomas. “Getting Started with OpenRefine.” Accessed October 27, 2020. http://thomaspadilla.org/dataprep/.

“TextCleanr - Text Cleaner Tool.” Accessed October 27, 2020. http://www.textcleanr.com/#.

“Bookworm HathiTrust.” Accessed October 27, 2020. https://bookworm.htrc.illinois.edu/develop/#.

“Google Books Ngram Viewer.” Accessed October 27, 2020. https://books.google.com/ngrams/.

“NYPL Digital Collections API.” Accessed October 27, 2020. http://api.repo.nypl.org/.

Corpora and data sources

Piper, Andrew. “TxtLAB450. A Multilingual Data Set of Novels for Teaching and Research – .TxtLAB @ McGill.” Accessed October 27, 2020. https://txtlab.org/2016/01/txtlab450-a-data-set-of-multilingual-novels-for-teaching-and-research/.

Barbaresi, Adrien. 2019. “German Political Speeches Corpus.” Zenodo. https://doi.org/10.5281/ZENODO.3611246.

“Dh resources for project building [Licensed for Non-Commercial Use Only] / Data Collections and Datasets.” Accessed October 29, 2020. http://dhresourcesforprojectbuilding.pbworks.com/w/page/69244469/Data%20Collections%20and%20Datasets.

“DOAB: Directory of Open Access Books.” Accessed October 28, 2020. https://www.doabooks.org/.

“Eighteenth Century Collections Online.” Accessed October 28, 2020. https://www.gale.com/primary-sources/eighteenth-century-collections-online.

“The Early Modern OCR Project.” Accessed October 28, 2020. https://emop.tamu.edu/.

“English Corpora: Most Widely Used Online Corpora. Billions of Words of Data: Free Online Access.” n.d. Accessed October 26, 2020. https://www.english-corpora.org/.

“HTRC Analytics.” n.d. Accessed October 26, 2020. https://analytics.hathitrust.org/.

“Perseus Collections/Texts.” n.d. Accessed October 28, 2020. http://www.perseus.tufts.edu/hopper/collections.

“Project Gutenberg.” n.d. Project Gutenberg. Accessed October 27, 2020. https://www.gutenberg.org/.

Svaikovsky, Victoria, Anne Meisner, Eve Kraicer, and Matthew Sims. 2018. “Racial Lines.” Harvard Dataverse. https://doi.org/10.7910/DVN/KERZQY.

“The Situationist International Text Library.” n.d. Accessed October 27, 2020. http://library.nothingness.org/articles/SI/.

University of Alberta Libraries. 2016. “Canadiana (CIHM) Collection - Metadata Records.” https://doi.org/10.7939/DVN/10710.

Natural Language Processing

“Analyzing Movie Subtitles.” Accessed October 30, 2020. https://mubaris.com/posts/movie-analysis/.

Leemans, Inger, Janneke M. van der Zwaan, Isa Maks, Erika Kuijpers, and Kristine Steenbergh. 2018. “Mining Embodied Emotions: A Comparative Analysis of Sentiment and Emotion in Dutch Texts, 1600-1800.” Digital Humanities Quarterly 11 (4).

Mager, Manuel, Ximena Gutierrez-Vasques, Gerardo Sierra, and Ivan Meza-Ruiz. 2018. “Challenges of Language Technologies for the Indigenous Languages of the Americas.” In Proceedings of the 27th International Conference on Computational Linguistics, 55–69. Santa Fe, New Mexico, USA: Association for Computational Linguistics. https://www.aclweb.org/anthology/C18-1006.

Saldaña, Zoë Wilkinson. 2018. “Sentiment Analysis for Exploratory Data Analysis.” Programming Historian, January. https://programminghistorian.org/en/lessons/sentiment-analysis.

Wordseer/Wordseer. (2014) 2020. JavaScript. Wordseer. https://github.com/Wordseer/wordseer.

Topic Modeling

Meeks, Elijah, and Scott Weingart. 2012. “The Digital Humanities Contribution to Topic Modeling.” Journal of Digital Humanities. Accessed October 28, 2020. http://journalofdigitalhumanities.org/2-1/dh-contribution-to-topic-modeling/.

Arnold, Taylor, and Lauren Tilton. 2015. Humanities Data in R: Exploring Networks, Geospatial Data, Images, and Text. Quantitative Methods in the Humanities and Social Sciences. Springer International Publishing. https://doi.org/10.1007/978-3-319-20702-5.

Blei, David M. n.d. “Topic Modeling and Digital Humanities.” Journal of Digital Humanities. Accessed October 29, 2020. http://journalofdigitalhumanities.org/2-1/topic-modeling-and-digital-humanities-by-david-m-blei/.

Blevins, Cameron. 2010. “Topic Modeling Martha Ballard’s Diary.” Cameron Blevins (blog). April 1, 2010. http://www.cameronblevins.org/posts/topic-modeling-martha-ballards-diary/.

Jähnichen, Patrick, Patrick Oesterling, Gerhard Heyer, Tom Liebmann, Gerik Scheuermann, and Christoph Kuras. 2017. “Exploratory Search Through Visual Analysis of Topic Models.” Digital Humanities Quarterly 11 (2).

“Mining the Dispatch.” n.d. Accessed October 29, 2020. https://dsl.richmond.edu/dispatch/pages/intro.

Navarro-Colorado, Borja. 2018. “On Poetic Topic Modeling: Extracting Themes and Motifs From a Corpus of Spanish Poetry.” Frontiers in Digital Humanities 5. https://doi.org/10.3389/fdigh.2018.00015.

Posner, Miriam. n.d. “Very Basic Strategies for Interpreting Results from the Topic Modeling Tool – Miriam Posner’s Blog.” Accessed October 29, 2020. http://miriamposner.com/blog/very-basic-strategies-for-interpreting-results-from-the-topic-modeling-tool/.

Schöch, Christof. 2017. “Topic Modeling Genre: An Exploration of French Classical and Enlightenment Drama.” Digital Humanities Quarterly 11 (2).

Underwood, Ted. 2015. “Seven Ways Humanists Are Using Computers to Understand Text.” The Stone and the Shell (blog). June 4, 2015. https://tedunderwood.com/2015/06/04/seven-ways-humanists-are-using-computers-to-understand-text/.

Tools

Champagne, Ashley. n.d. “LibGuides: Digital Scholarship Resources for Courses: Web Scraping.” Accessed October 27, 2020. https://libguides.brown.edu/c.php?g=1049232&p=7615082.

“Download Xpdf and XpdfReader.” n.d. Accessed October 26, 2020. https://www.xpdfreader.com/download.html.

“Homebrew.” n.d. Homebrew. Accessed October 26, 2020. https://brew.sh/.

“Laurence Anthony’s AntConc.” n.d. Accessed October 26, 2020. https://www.laurenceanthony.net/software/antconc/.

“OpenRefine.” n.d. Accessed October 26, 2020. https://openrefine.org/.

“Overview — Visualize Your Documents.” n.d. Accessed October 26, 2020. https://www.overviewdocs.com/.

“Pandoc - Demos.” n.d. Accessed October 26, 2020. https://pandoc.org/demos.html.

“PDF Split And Merge.” n.d. Pdfsam.Org. Accessed October 26, 2020. https://pdfsam.org/.

“SameDiff.” n.d. Accessed October 26, 2020. https://databasic.io/en/samediff/.

“TAPoR.” n.d. Accessed October 27, 2020. http://tapor.ca/home.

“Text Mining Support.” n.d. About JSTOR (blog). Accessed October 26, 2020. https://about.jstor.org/whats-in-jstor/text-mining-support/.

“Try Pandoc!” n.d. Accessed October 26, 2020. https://pandoc.org/try/.

Wordseer/Wordseer. (2014) 2020. JavaScript. Wordseer. https://github.com/Wordseer/wordseer.