Hathibooks (How To Download From It)
Hathibooks (How To Download From It)
2. Uncheck the box next to 'Ask where to save each file before downloading' in
Downloads section under Chrome Setting>Advanced. Set a directory location where you
want your book to be downloaded.
---- Skip to (3) if you are already reading a book under ETAS ----
a. Go to [https://babel.hathitrust.org/cgi/wayf]
b. Find and select your university in the list of partner institutions. Select
“Continue”.
c. You will arrive at the usual login screen for your university. Enter your
account details.
d. When you have successfully logged in, you will be returned to the HathiTrust
website.
e. Enter your search terms in the search bar, and click on the “Search HathiTrust”
button.
f. You *may* see books having the label “Temporary Access”. These are all non-
public-domain-works, available only under ETAS, and are designed to be read online
w/o being downloadable.
g. Select the “Temporary Access” link to open the necessary book. A new screen will
load with this message :- “Access to this work is provided through the Emergency
Temporary Access Service.”
h. Select the “Check Out” button. You can now read the book via their interface.
[The banner at the top of the browser informs you how long the book is checked out
to you. Your access to the book will automatically renew unless another user
requests the book.]
---- Jump to (2) if you are not certain about accessing a book under ETAS ----
3. Check the URL of the page. This URL will be of the generic form :-
https://babel.hathitrust.org/cgi/pt?id= ... lV3VNSEj1c. Note the value of
numericalstring and alphabetstring.
4. Be aware that HT often adds random number of blank white pages to the end of the
book, for some indecipherable reason. [I have seen a 40 page long book containing
1968 extra pages.]
So, switch to thumbnail view and jump-scroll to the page range which ought to
contain the end of the book. Note the precise end page number. [The corresponding
catalog entry over HT typically mentions total pages + front matter pages + back
matter pages, if any. Add them up for a rough guess about the end page.]
5. We assume, *for this guide*, that the alphabetical and numerical string (Step 3)
were respectively mdp and 123456789 whilst end page number (Step 4) was 337.
We (thus) create a corresponding URL :- https://babel.hathitrust.org/cgi/imgsrv ...
56789;seq=[1:337]
Note the syntax of this URL, carefully. [The "imgsrv/image" bit.]
6. Click the DTA extension, click on "add download", and paste the above URL. Keep
all other options unchanged except choosing subfolder as downthemall, from the
dropdown menu.
7. Click the download button, and wait for completion. A new browser tab will
document progress of download for each file. If any part file ain't downloaded for
some reason, just right click on it and chose resume option.
If you have done everything correctly, an image corresponding to every single page
of the book will now be automatically downloaded to Download_directory/downthemall.
They can come in a variety of formats including svg, jpeg, jpg and (mostly) png.
8. Merge all these images to a single PDF using [https://fm-pdf.com/.../Free-JPG-
To-PDF-Converter-Setup.exe], courtesy a name sort. Adobe Acrobat Pro etc. can do
this merge, as well. For Mac, just use the Preview app.