This use case obtains some HTRC volume content, builds topic models based on the content, and then visualizes the topic models in a web browser.

VM Mode

This use case can be run only in secure mode in the VM. To export experiment results from the VM, release the result files while in secure mode; the results are then delivered to you by email.

Example Use

First, switch the VM mode to secure mode (done in the HTRC portal). 

In the VM, open a terminal and change directory to the htrc-data folder:

cd ~/demo/htrc-data/home/dcuser/HTRC-Demos/Python/topicexplorer-demo

List the files in this folder:

...

This demo code:

  • loads data from 3 volumes in HathiTrust using the HTRC Data API
  • builds an LDA topic model from the corpus
  • saves the trained LDA model
  • displays the topics interactively in a web browser
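
To make the model-building and saving steps more concrete, here is a minimal, self-contained sketch of LDA training by collapsed Gibbs sampling in plain Python. It is not the demo's code: the demo relies on Topic Explorer and the HTRC Data API, neither of which is used here. The names train_lda and top_words, the hyperparameter values, and the toy_docs word lists are all assumptions made up for illustration, and the toy documents stand in for real volume text.

```python
import json
import random


def train_lda(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA by collapsed Gibbs sampling (illustrative, not the demo's method).

    docs is a list of token lists. Returns a dict of count tables.
    """
    rng = random.Random(seed)
    vocab = sorted({word for doc in docs for word in doc})
    word_id = {word: i for i, word in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]               # tokens per (doc, topic)
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # tokens per (topic, word)
    topic_total = [0] * num_topics                              # tokens per topic
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        doc_assignments = []
        for word in doc:
            z = rng.randrange(num_topics)
            doc_assignments.append(z)
            doc_topic[d][z] += 1
            topic_word[z][word_id[word]] += 1
            topic_total[z] += 1
        assignments.append(doc_assignments)

    # Resample each token's topic, conditioned on all other assignments.
    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, word in enumerate(doc):
                wid = word_id[word]
                z = assignments[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][wid] -= 1
                topic_total[z] -= 1

                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][wid] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]

                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][wid] += 1
                topic_total[z] += 1

    return {"vocab": vocab, "topic_word": topic_word, "doc_topic": doc_topic}


def top_words(model, topic, n=3):
    """Return the n words assigned most often to the given topic."""
    counts = model["topic_word"][topic]
    ranked = sorted(range(len(counts)), key=lambda i: counts[i], reverse=True)
    return [model["vocab"][i] for i in ranked[:n]]


# Hypothetical toy corpus standing in for tokenized volume text.
toy_docs = [
    "whale ship sea captain whale sea".split(),
    "ship sea voyage captain whale".split(),
    "garden flower rose garden spring".split(),
    "flower spring rose garden bloom".split(),
]

model = train_lda(toy_docs, num_topics=2)

# "Saving" the model here just means serializing the count tables as JSON.
serialized = json.dumps(model)

for k in range(2):
    print("topic", k, top_words(model, k))
```

The serialized string could be written to a file and reloaded later with json.loads. A real run would use a much larger corpus, stop-word filtering, and many more topics, and would leave training and browsing to Topic Explorer itself.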

Here are the scripts used in this example: htrc-topicexplorer-demo.zip