I have discovered wget missed some files on the first run. It's picking up the deltas now in a second run, minor, shouldn't take too long. I will upload the revised hashes as soon as it's done. Sorry all!
The Azimuth Backup Project respectfully submits the FTP site of the EPA, mirrored at:
for your copy if you make one, consider making life easy and using rsync to the same path:
rsync -Hhartv --progress rsync://mirror.math.princeton.edu/pub/climatemirror/ftp.epa.gov/ /var/local/whereever
Great mirror. I think there is a problem with the EmisInventory folder, though. It should contain ~50 subfolders chock-full of data files, but instead it is just an archived web page (and an old one at that). This is what it should look like; this is what it looks like instead.
@Ken Miller There is no problem. The ftp.epa.gov link isn't auto-delivering the index.html file, while the mirror.math.princeton.edu link is delivering it. The dirs are all still there, manually accessible, for instance:
Thanks Benjamin! For what it's worth, that index file is ancient and for just one of several EPA websites that call data from the EmisInventory folder. By delivering it the mirror hides all the directories, so unless someone knows what they are they won't be able to access them manually even though they're all there. So I'd suggest not delivering the index file and showing the list of directories instead, like the EPA site does.
@Ken Miller Usually I am mirroring open source code, many of whom rely on having an index.html that links to their main webpage to initiate downloads. But I see what you are saying, and since for climate data I am hitting ftp servers that don't auto-serve index.html, I've now recursively disabled AutoIndex inside of the /pub/climatemirror/ directory.