In the virtual machine you created in HTRC Data Capsule, in maintenance mode, run the command:
Then, switch to secure mode, to fetch content by using Data API in Data Capsule (it won't work in maintenance mode to prevent data leak)
Once in secure mode, run the following command to download OCR data:
- the htrc-id is a file containing a volume id list that you're interested in, with one ID per file.
- output.zip is the .zipped folder for the fetched OCR content