Note: Your capsule has been pre-loaded with the packages required to follow these examples.
Tutorials
HTRC Data Capsule Tutorial
Detailed tutorial for using the HTRC Data Capsule System
Fetching Volume OCR Content in HTRC Data Capsule
How to access volume text in secure mode in a capsule
Use Cases
Use Solr API to Retrieve Volume IDs
HTRC provides a search engine API, Solr API, for scholars to search volumes of their interest. Scholars can search by full-text, or MARC catalog fields. An example query is http:
//
chinkapin.pti.indiana.edu:9994/solr/meta/select/?q=title:war
which returns all volumes of which the titles contain "war".
Perform Text Analytics Using IPython
Use the IPython interactive interface to fetch volume content, and then run vector space model and topic modeling on volumes' OCR content.