


Share your work

Do you have a project or tool using the HTRC Extracted Features Dataset? Let us know at


Word Similarity Tool, David Mimno


A topic model of fiction, based on the genre-classified dataset (only 1920-22); it may be extended once extracted features are available after 1922.

Berkeley Data Science Module, Chris Hench and Cody Hennesy

A Jupyter Notebook-based curriculum, developed at the University of California, Berkeley, for using HTRC Extracted Features in the classroom.



HTRC Feature Reader

A Python library that scaffolds working with EF data in pandas. It comes with example scripts.
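To give a sense of the kind of data such a library wraps, here is a minimal sketch that tallies token counts for one volume using only the Python standard library. It does not use the Feature Reader API. The sample record is hand-made and hypothetical, and the layout it assumes (pages under `features`, each with a `body` section holding `tokenPosCount`) is an assumption about the EF file format that should be checked against the dataset documentation. The helper name `volume_token_counts` is also hypothetical.

```python
import json
from collections import Counter

# Hypothetical, hand-made record shaped like an Extracted Features file.
# Assumed layout: features -> pages -> <section> -> tokenPosCount,
# where tokenPosCount maps token -> {part-of-speech tag: count}.
SAMPLE_EF = json.loads("""
{
  "id": "example.0001",
  "features": {
    "pages": [
      {"seq": "00000001",
       "body": {"tokenPosCount": {"whale": {"NN": 2}, "the": {"DT": 3}}}},
      {"seq": "00000002",
       "body": {"tokenPosCount": {"whale": {"NN": 1, "VB": 1}, "sea": {"NN": 4}}}}
    ]
  }
}
""")


def volume_token_counts(ef, section="body"):
    """Sum per-page token counts for one page section across a volume."""
    counts = Counter()
    for page in ef["features"]["pages"]:
        # A page may lack the requested section; treat that as empty.
        token_pos = (page.get(section) or {}).get("tokenPosCount") or {}
        for token, pos_counts in token_pos.items():
            # Fold case and collapse part-of-speech tags into one total.
            counts[token.lower()] += sum(pos_counts.values())
    return counts


counts = volume_token_counts(SAMPLE_EF)
print(counts.most_common(3))
```

For real files, a tabular form of the same counts (one row per page and token) is usually more convenient, which is the kind of structure a pandas-based reader provides.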




Tutorials and Lessons

Send us your lessons or tutorials related to the EF Dataset.

Python code for some simple examples of "literary sleuthing":