[Techtalk] compiling a bunch of HTML files

Jaroslaw Fedevych (UALUG jaroslaw at linux.org.ua
Thu Sep 9 13:35:16 EST 2004


On Thu, Sep 09, 2004 at 11:11:22AM +0100, Noir wrote:
> I have a bunch of (about 100!) HTML files that I'd
> like to compile in one file. All the files have
> frames, redundant pictures I'd like to exclude; they

Find out which files are actually content; sort them out;
you can also write a simple perl thing which eliminates
<img> tags (I bet there's already one-liner for you); cat
them together.

> also have their individual directories where the gif/
> jpg files reside.

You may easily forget about those directories if you don't
need the stuff inside.

> 
> My plan is to put them in one file and convert to pdf
> (OpenOffice).

I have used html2latex and TeXmacs. The first is rather
selective about html it's gonna be fed, and the latter is
better, but painfully slow. I have tried OOO, but it's too
slow for me, either :) You may have a look at the two 
alternatives I've mentioned if the results don't satisfy 
you.


-- 
X Windows: You'll envy the dead.



More information about the Techtalk mailing list