Category: School of Information

Seminar: Characterization and Gender, 1800-2008

 

Ted Underwood teaches in the School of Information Sciences and the English Department at the University of Illinois, Urbana-Champaign. He was trained as a Romanticist and now applies machine learning to large digital collections. His most recent book, Distant Horizons: Digital Evidence and Literary Change (Univ of Chicago, Spring 2019) addresses new perspectives opened up by large digital libraries.

Seminar reading:

Read more

Seminar | Quantifying without Computers

 

Laura McGrath is the Associate Director of the Stanford University Literary Lab and a postdoctoral fellow in English. Her primary interests lie in computational approaches to post45 American fiction. She is at work on a manuscript, a literary history of the agent, entitled Middlemen: Making Literature in the Age of Multimedia Conglomerates. She is also working on a second, trade book called Comps: The Big Data Behind the Book Business.

Seminar readings:

Read more

Natural Language Processing

This course introduces students to natural language processing and exposes them to the variety of methods available for reasoning about text in computational systems. NLP is deeply interdisciplinary, drawing on both linguistics and computer science, and helps drive much contemporary work in text analysis (as used in computational social science, the digital humanities, and computational journalism).

Read more

Predicting Dates of First Publication in the HathiTrust

The rise of large-scale digitized book collections—such as those provided by Google Books, the HathiTrust and the
Internet Archive—is enabling a fundamentally new kind of text analysis that exploits the scale of collections to ask
questions not possible with smaller corpora. Many of these research questions are driven by historically deep textual
collections—corpora that span several decades or centuries in their publication. Moretti (2007) analyzes the changing

Read more

Pages