Do you have a project or tool using the HTRC Extracted Features Dataset? Let us know at firstname.lastname@example.org
A web-based tool for viewing similar words to a query, for each year from 1800 to 1923.
you can notice a few interesting things: from the generated table for "caste", it appears that "suffering" was a pretty frequent contextual word for "caste" in the early nineteenth century, but then "suffering" seemed to drop out of the context by the late nineteenth century. On the other hand, "degradation" seemed to remain, more or less, part of the context of the word "caste" throughout. You can also notice that the occurrence of "race" in the same context as "caste" becomes more frequent after 1870 or so. Before 1870, the instances of "race" in the same context as "caste" was less pronounced.
An interactive, faceted, visualization of terms across the HathiTrust collection, built on the EF dataset.
An approach for visualizing thematic trends within a book.
A Python library that scaffolds Pandas use of EF data. With example scripts.
Send us your Lessons or Tutorials related to the EF Dataset.
Python code for some simple examples of "literary sleuthing":
Underwood, Ted. June 3, 2014. "A window on the twentieth century may be about to open." The Stone and the Shell. Blog. http://tedunderwood.com/2014/06/03/a-window-on-the-twentieth-century-may-be-about-to-open/
Mimno, David. 2014. "Word counting, squared." David Mimno. Blog. http://www.mimno.org/articles/wordsim/