Friday, December 16, 2011

Blog Archiving

A quick question.  As far as I know the export blog feature on Blogger just saves the text and links to the pics, right?  It must, because my xml file is only 10megs.  Are there any good tools you know of that will archive posts pics and all?  Maybe even slurp down linked files?


Here's a little silo for your time:

5 comments:

  1. I think HTtrack will do the trick for you!

    http://www.httrack.com/

    ReplyDelete
  2. There's curl or wget, if you're comfortable on the command line.

    ReplyDelete
  3. I don't think you're going to get the images with httrack or wget, but you could use them to get everything else and then use the firefox plugin DownThemAll to retrieve all the images (so you also get the ones outside of those stored in picassa). Be patient with wget - I just dumped your site and it came out to 372mb. :)

    It'd only be useful in that format for offline viewing. The code is theirs so you couldn't host it elsewhere, just the data you get in the export they let you have.

    ReplyDelete
  4. Thanks, all. I used webhttrack years ago and it was pretty easy, also gets the pics images ZB. But after letting it run for an hour yesterday I lost patience, I'll have to let it run over night some time.

    ReplyDelete
  5. I'm with Paul: I'd use wget or curl. In fact, I have already used curl for blognapping. A quick search for "wget blogspot backup" turned up links such as Using wget To Back Up a Blogger Blog and How To Automate Backup Of Blogspot Blog.

    ReplyDelete