Child pages
  • Extracted Features in the Wild

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.


See how researchers and developers are using the EF dataset in real-world projects.

titleShare your work

Do you have a project or tool using the HTRC Extracted Features Dataset? Let us know at


Word Similarity Tool, David Mimno


An approach for visualizing thematic trends within a book.


A Topic Model of Fiction, Jonathan Goodwin

A topic model of fiction, based on the genre-classified dataset (only 1920-22); it may be extended once extracted features are available after 1922.

Berkeley Data Science Module, Chris Hench and Cody Hennesy

A Jupyter Notebooks-based curriculum for using HTRC Extracted Features in the classroom developed at the University of California, Berkeley.

Image Added


HTRC Feature Reader

A Python library that scaffolds Pandas use of EF data. With example scripts.




and Lessons

Send us your Lessons or Tutorials related to the EF Dataset.

Python code for some simple examples of "literary sleuthing": 


Underwood, Ted. 2015. "How Scholars Can Support Digital Libraries." Europeana Research.