No problem!

I don't suppose you can access them via lynx or other browser?  Then
maybe you could [curl|wget|lynx] -> [mht-rip|tetex|LaTEX] -> txt2pdf?

Or even simply (presuming that the IIS server isn't doing some insanity
with browser matching or something):
$> curl | txt2pdf

Just a thought.

R

On Tue, 2009-03-17 at 09:50 -0400, Portelance, Brad wrote:
> Hi Rubin, These are web pages. I just found out that we can't get the files 
> off the server easily thanks to M$ web extensions. Grrr. Guess we'll be 
> converting the old fashion way.
> 
> Thanks for the tip!
> Brad
> 
> -----Original Message-----
> From: Vermont Area Group of Unix Enthusiasts [mailto:[email protected]] On 
> Behalf Of Rubin Bennett
> Sent: Tuesday, March 17, 2009 9:07 AM
> To: [email protected]
> Subject: Re: Convert MHT files to PDF
> 
> On Tue, 2009-03-17 at 08:57 -0400, Portelance, Brad wrote:
> > Happy St. Patrick's Day!
> > 
> > Does anyone know of a good way to batch convert .MHT files to .PDF in 
> > Linux? I have done some googling and searching the system and can't come up 
> > with anything that would help me batch convert 200 documents.
> > 
> > Has anyone done this?
> > Thanks!
> > Brad
> Are you converting emails or webpages?
> http://linuxgazette.net/160/misc/lg/2_cent_tip__reading_mht_files_in_linux.html
> 
> Seems to be about emails, but it's probably similar enough.  LaTEX I
> believe can also do some of what you want.
> 
> mht-rip will rip the text from an MHT archive and dump it to test, and
> from there you could text2pdf easily enough.  I guess it depends on
> whether you need images to come over or not.
> 
> Happy Green Beer day!
> 
> Rubin
> 
-- 
Rubin Bennett
rbTechnologies, LLC
80 Carleton Boulevard
East Montpelier, VT 05651

(802)223-4448
http://thatitguy.com

"Think for yourselves and let others enjoy the privilege to do so too."
  Voltaire, Essay on Tolerance
  French author, humanist, rationalist, & satirist (1694 - 1778)

Reply via email to