I'm trying to use plucker-build to slurp travel info from my travel
agent.  The info is nicely displayed through a web interface, and they
even offer an avantgo download (yuck), so i thought i'd give it a try
with plucker.

when i run plucker-build, i get:

Processing https://www.virtuallythere.com/cgi-bin/n.....=0&clocktype=12.
           1 collected, 2 still to do
Guessing type for /cgi-bin/nph-itinerary
  Moved to '...'
  Retrieved ok
unknown/unknown type not yet handled
  Parsing failed!

So it looks like the site is not correctly setting the MIME type.
Anyone got a good solution for this?  I've considered:

1) hacking Parser.py to default to text/html rather than give up on an
   unknown MIME type.  
2) Using some other browser or sitescooper to download the pages and
   then using plucker.

ian

Reply via email to