Michael Wechner wrote:
> ok. So what about adding a comment to nutch-site.xml, e.g.
> <!-- NOTE: In order to use https please add protocol-httpclient, but
> be aware of possible performance problems! -->
They were not performance problems. There were some issues related to
using multiple threads, which would sometimes cause the httpclient
library to fail. There was also a logging message produced in the
internals of httpclient that was difficult to turn off - but now that we
are using log4j this should be straightforward. There was a bug in
chunked encoding handling that would cause hangs.
There were also other intermittent problems with this library, so after
much deliberation we decided to leave the simpler plugin as the default ...
These issues may have been solved in a newer version of the httpclient library.
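
For reference, a rough sketch of how such a note might look in nutch-site.xml,
with the wording adjusted to reflect the issues above rather than performance.
The plugin list in the value is only illustrative (an assumption, not the
actual default), so it should be merged with whatever plugin.includes is
already set to in your configuration:

```xml
<!-- Sketch only: the plugin list below is illustrative, not a verified default. -->
<property>
  <name>plugin.includes</name>
  <!-- NOTE: To fetch https URLs, replace protocol-http with
       protocol-httpclient. Be aware that this plugin has had intermittent
       problems with multi-threaded fetching and with chunked encoding,
       which may or may not be fixed in newer httpclient versions. -->
  <value>protocol-httpclient|urlfilter-regex|parse-(text|html)|index-basic|query-(basic|site|url)</value>
</property>
```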
--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com