Hello people,

I've got trouble in distilling web page
http://teorver-online.narod.ru/teorver73.html

Here is is the traceback:
( between == TRACEBACK BEGIN == and == TRACEBACK END == ):

== TRACEBACK BEGIN ==

Initializing Plucker spidering engine...
 
-----------------------------------------------------------
Updating channel: TeorVer...
-----------------------------------------------------------
Pluckerdir is 'C:\Program Files\Plucker'...
Using proxy '' with authentication for user ''...
---- 0 collected, 1 to do ----
Processing http://teorver-online.narod.ru/teorver73.html...
Traceback (most recent call last):
  File "C:\Program Files\Plucker/parser/python/PyPlucker/Spider.py", line 1734, 
in ?
    sys.exit(realmain(None))
  File "C:\Program Files\Plucker/parser/python/PyPlucker/Spider.py", line 1719, 
in realmain
    retval = main (config, exclusion_lists)
  File "C:\Program Files\Plucker/parser/python/PyPlucker/Spider.py", line 1124, 
in main
    spider.process_all(verbose=verbosity)
  File "C:\Program Files\Plucker/parser/python/PyPlucker/Spider.py", line 623, 
in process_all
    self.process (verbose, estimate, statusfile)
  File "C:\Program Files\Plucker/parser/python/PyPlucker/Spider.py", line 732, 
in process
    post_data=post_data)
  File "C:\Program Files\Plucker/parser/python\PyPlucker\Retriever.py", line 
313, in retrieve
    result = self._retrieve (url, alias_list, post_data)
  File "C:\Program Files\Plucker/parser/python\PyPlucker\Retriever.py", line 
212, in _retrieve
    webdoc = self._urlopener.open (real_url, post_data)
  File "C:\Program Files\Plucker\parser\python\vm\lib\urllib.py", line 176, in 
open
    return getattr(self, name)(url)
  File "C:\Program Files\Plucker\parser\python\vm\lib\urllib.py", line 277, in 
open_http
    h = httplib.HTTP(host)
  File "C:\Program Files\Plucker\parser\python\vm\lib\httplib.py", line 666, in 
__init__
    self._conn = self._connection_class(host, port)
  File "C:\Program Files\Plucker\parser\python\vm\lib\httplib.py", line 342, in 
__init__
    self._set_hostport(host, port)
  File "C:\Program Files\Plucker\parser\python\vm\lib\httplib.py", line 348, in 
_set_hostport
    port = int(host[i+1:])
ValueError: invalid literal for int(): 
Installing channel output to destinations...
Setting new due date...
Tasks completed for all channels.

== TRACEBACK END ==

Plucker distiller seem to parse a port number while there is no port
number in the URL...

Any suggestions?

I'm using Plucker Desktop 1.6.2.0 for Windows.

Sincerely,
   Alexei Agafonov
   mailto:[EMAIL PROTECTED]


_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to