Send to a Friend

mdy's avatar

Do you know of software that can generate .mht, .pdf, or similar single-file documents from the many .html and image files I've saved on my hard disk?

Asked by mdy (1152points) May 27th, 2008

Over the years, I’ve saved a lot of web pages on my hard disk on various topics. Although they’re neatly organized by topic in subdirectories, a lot of these web pages were not saved in .mht (Microsoft’s web archive) or .pdf format.

Consequently, not only do I have a lot of .html files in my hard drive… I’ve also got a lot of subdirectories that contain the related images for each of these .html files.

I’m therefore hoping to find software that will:

1. Accept a directory on my hard disk as input parameter

2. Go through all the .html files in that directory and locate the related child subdirectory that contains the related images and stylesheets of each .html file

3. For each .html file, generate or produce a single .mht (or .pdf, .rtf, or .doc) file that contains the .html and all the related images and stylesheets in the relevant child directories, packaged neatly as a single, searchable file.

I’ll be happy to look at both free and non-free software.

PS. Whew! Being a packrat is tough work.

PPS. Google searches have so far only yielded software that can create .mht files from web pages that are on the ‘net, rather than web pages that are already sitting on my hard disk. Other file-merging solutions talk about merging multiple .html files together, which isn’t what I’m looking for. And still others require user intervention with each file. I’m hoping for something that’s batch-oriented and can work on all the .html files in an entire directory in one go.

Using Fluther

or

Using Email

Separate multiple emails with commas.
We’ll only use these emails for this message.