Detailed tutorial for using the HTRC Data Capsule System
To learn about the technical specifications and general functionality of the HTRC Data Capsule environment, see the HTRC Data Capsule Specifications and Usage Guide page.
Note: Your capsule has been pre-loaded with the packages required to follow these examples.
Since it's performed within the capsule's virtual machine environment, it will be helpful to open a browser in the capsule, e.g. Firefox, and go to the url http://wiki.htrc.illinois.edu/pages/viewpage.action?pageId=22085965 or http://bit.ly/1whzT6H Then you can easily copy and paste the hyperlinks and the commands from the Wiki.
HTRC provides a search engine API, Solr API, for scholars to search volumes of their interest. Scholars can search by full-text, or MARC catalog fields. An example query is
chinkapin.pti.indiana.edu:9994/solr/meta/select/?q=title:war which returns all volumes of which the titles contain "war".
Detailed step-by-step instructions for using the HTRC Data Capsule system, including creating, using, and requesting to export derived data from your Capsule.
User guide for the Workset Toolkit, a command line interface that facilitates data and metadata transfer and analysis in a capsule.
HTRC Data Capsules come with Voyant Tools installed.
Use the IPython interactive interface to fetch volume content, and then run vector space model and topic modeling on volumes' OCR content.
Using the HTRC API within a capsule.