This is a virtual machine and walkthrough for Heritrix. Heritrix documentation can be found here.
The virtual machine is built with 2GB of RAM, so your host machine needs at least that much memory free to run it.
The walkthrough can be found in the walkthrough subdirectory.
Download each of the following dependencies.
You'll need to start the virtual machine from the command line. For a basic walkthrough of how to use the command line, please consult this lesson at the Programming Historian.
git clone https://github.com/web-archive-group/heritrix-walkthrough.git
cd heritrix-walkthrough
vagrant up
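Before running the commands above, it may help to confirm that the tools they call are installed. The following is a minimal sketch, not part of this repository: the `missing_tools` helper is a hypothetical name, and it only checks for `git` and `vagrant` because those are the two programs the commands above invoke. Any virtualization provider that Vagrant relies on is not checked here.

```shell
#!/bin/sh
# Hypothetical helper: print the name of each listed tool that is not on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Check only the two programs used by the commands above.
missing=$(missing_tools git vagrant)
if [ -n "$missing" ]; then
    # $missing is left unquoted on purpose so the names print on one line.
    echo "Missing from PATH:" $missing
else
    echo "git and vagrant found; ready to run the commands above."
fi
```

Once `vagrant up` has finished, `vagrant ssh` opens a shell inside the virtual machine and `vagrant halt` shuts it down.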
- Ubuntu 14.04
- Java 7
- Heritrix 3.2.0
This research has been supported by the Social Sciences and Humanities Research Council with Insight Grant 435-2015-0011.