[Techtalk] compiling a bunch of HTML files
Jaroslaw Fedevych (UALUG
jaroslaw at linux.org.ua
Thu Sep 9 13:35:16 EST 2004
On Thu, Sep 09, 2004 at 11:11:22AM +0100, Noir wrote:
> I have a bunch of (about 100!) HTML files that I'd
> like to compile in one file. All the files have
> frames, redundant pictures I'd like to exclude; they
Find out which files actually hold content and sort them
out. You can also write a simple Perl script that strips
the <img> tags (I bet there's already a one-liner for
that), then cat the results together.
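
A rough sketch of the strip-and-cat idea above, in Python rather
than Perl. The function names and the regex are only an
illustration, not an existing tool. It assumes the <img> tags are
reasonably well-formed (no ">" inside attribute values); badly
broken markup would need a real HTML parser.

```python
import re
from pathlib import Path

# Matches one whole <img ...> tag, case-insensitively.
# [^>]* also spans newlines, so tags broken across lines are caught.
# Assumption: no ">" appears inside an attribute value.
IMG_TAG = re.compile(r"<img\b[^>]*>", re.IGNORECASE)


def strip_images(html: str) -> str:
    """Return html with every <img> tag removed."""
    return IMG_TAG.sub("", html)


def concatenate(paths, out_path):
    """Strip <img> tags from each file, in order, and write them to out_path."""
    with open(out_path, "w", encoding="utf-8") as out:
        for path in paths:
            # errors="replace" keeps going on files in an unexpected encoding
            text = Path(path).read_text(encoding="utf-8", errors="replace")
            out.write(strip_images(text))
            out.write("\n")
```

With the content files collected in one place, something like
concatenate(sorted(Path("content").glob("*.html")), "all.html")
would produce the single file; "content" and "all.html" are
placeholder names.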
> also have their individual directories where the gif/
> jpg files reside.
You can safely ignore those directories if you don't
need the stuff inside.
>
> My plan is to put them in one file and convert to pdf
> (OpenOffice).
I have used html2latex and TeXmacs. The first is rather
picky about the HTML it is fed; the latter copes better
but is painfully slow. I have also tried OOo, but it was
too slow for me as well :) Have a look at the two
alternatives I've mentioned if its results don't satisfy
you.
--
X Windows: You'll envy the dead.