Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

The HathiTrust Research Center

HathiTrust Research Center (HTRC) enables computational analysis of works in the HathiTrust Digital Library (HTDL) to facilitate non-profit research and educational uses of the collection. HTRC, which is co-located at Indiana University and the University of Illinois at Urbana-Champaign, engages in research and development for computational text analysis of massive digital libraries. Leveraging data storage and computational infrastructure at Indiana University and the University of Illinois at Urbana-Champaign, the Center creates and maintains a suite of tools and services for text-based, data-driven research--such as HTRC Algorithms and Data Capsule--and engages in cutting-edge research on large-scale data analysis.

HTRC operates under a non-consumptive research paradigm: HTRC makes available the collection for computational analysis, while remaining  within the bounds of the fair use rights courts have recognized as applying to text analysis. The Center is committed to breaking new ground in the areas of non-consumptive text mining, allowing scholars to fully utilize content of the HathiTrust Digital Library.


Learn more by watching an introductory video!

Relationship to HathiTrust

HathiTrust is a partnership of academic and research institutions, offering a collection of millions of titles digitized from libraries around the world. The HathiTrust Digital Library (HTDL) is a digital preservation repository and highly functional access platform that continues to grow as HathiTrust partners, primarily academic libraries in the United States, contribute newly digitized content. Individual access and preservation are important concerns for HathiTrust. It allows users to search for and build collections of digitized works, and to read those in the public domain.

The Research Center’s focus is on the aggregate strengths: what can we learn from so many books? Digitization has enabled large-scale questions that we couldn’t ask before, and the Research Center is here to help you ask them while working within the restrictions of intellectual property law.






Panel
borderStylesolid
titleQuick links

HTRC Analytics - gateway to HTRC tools

HTRC, Help! - get help and request assistance

HathiTrust - HTRC's parent organization

Introductory video

Find all tutorials




Panel
borderStylesolid
titleResearch examples

Advanced Collaborative Support (ACS) projects

Research Examples and Use Cases

Extracted Features in the Wild


HTRC Services

Many of the HTRC services require an account to log in and interact with the tools via the HTRC Analytics website. Register for an account by going to the main page of the HTRC Analytics. Anyone possessing an email address from a nonprofit institution of higher education is allowed to register, including those whose institutions are not HathiTrust members. 

Tools & Data

HTRC Analytics

The primary gateway to HTRC!

Button Hyperlink
titleGo to HTRC Analytics
typeprimary
urlhttps://analytics.hathitrust.org/
 
Button Hyperlink
titleLearn more
typestandard
urlHTRC Analytics Documentation

HTRC Algorithms

Web-based, click-and-run tools in HTRC Analytics that perform computational text analysis on worksets, which are user-created collections of volumes. No programming required.

Button Hyperlink
titleCreate a workset
typeprimary
urlhttps://analytics.hathitrust.org/staticworksets
targettrue
 
Button Hyperlink
titleLearn more
typestandard
urlHTRC Worksets

Button Hyperlink
titleRun an algorithm
typeprimary
urlhttps://analytics.hathitrust.org/statisticalalgorithms
targettrue
 
Button Hyperlink
titleLearn more
typestandard
urlHTRC Analytics Algorithms

HTRC Data Capsules

Secure virtual environments in HTRC Analytics for non-consumptive text analysis, where researchers can implement their own data analysis and visualization tools.

Button Hyperlink
titleUse a Data Capsule
typeprimary
urlhttps://analytics.hathitrust.org/staticcapsules
targettrue
 
Button Hyperlink
titleLearn more
typestandard
urlHTRC Data Capsule Environment

HTRC Extracted Features

An unrestricted dataset of metadata and word counts for each page in the HathiTrust Digital Library. Download and explore on your own machine.

Button Hyperlink
titleDownload Extracted Features
typeprimary
urlHTRC Derived Datasets
targettrue
 

HathiTrust+Bookworm

Create a line graph showing word use trends in 13.7 million HathiTrust volumes.

Button Hyperlink
titleTry HT+BW
typeprimary
urlhttps://bookworm.htrc.illinois.edu/develop/
targettrue
 
Button Hyperlink
titleLearn more
typestandard
urlHathiTrust+Bookworm

Research & Teaching Support

Advanced Collaborative Support: Assisting on specialized questions

Program that offers specialized expertise, developer time, and compute resources to researchers who apply for and are awarded support.

Button Hyperlink
titleAdvanced Collaborative Support (ACS)
typestandard
urlINT:Advanced Collaborative Support (ACS)

HTRC, Help!

Researcher support via email, through monthly office hours, and anonymized frequently asked questions about HTRC. 

Button Hyperlink
titleHTRC, Help!
typestandard
urlHTRC, Help!

Training researchers and librarians

HTRC provides training and researcher support for those teaching with and using HTRC. Affiliates of the Scholarly Commons are available for workshops and webinars, and they will also consult about specific scholarly projects or pedagogical applications.

Button Hyperlink
titleEducational Materials
typestandard
urlEducational Materials and Workshops