[
https://issues.apache.org/jira/browse/NUTCH-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14322166#comment-14322166
]
Fabio Santagostino commented on NUTCH-1086:
-------------------------------------------
Hi,
I've made an attempt to rewrite the component using httpclient 4.4. It works
for me!
My main goal was to use a correct implementation of NTLMv2 authentication for my
corporate web sites.
Anyway, it seems to be backward compatible with the previous implementation. Proxy
support is the only part I have not tested yet.
I had to change only 2 classes (attached):
- /src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java
- /src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/HttpResponse.java
Of course, the package dependency files must also be modified. In /ivy/ivy.xml:
+ added httpclient version 4.4
{code:xml}
<dependency org="org.apache.httpcomponents" name="httpclient" rev="4.4"
conf="*->master" />
{code}
+ updated the commons-codec version from
{code:xml}
<dependency org="commons-codec" name="commons-codec" rev="1.3" conf="*->default" />
{code}
to
{code:xml}
<dependency org="commons-codec" name="commons-codec" rev="1.4" conf="*->default" />
{code}
The attached files were tested against the v1.9 branch; minor changes are
probably needed to make them suitable for v2.3.
Regards,
Fabio
> Rewrite protocol-httpclient
> ---------------------------
>
> Key: NUTCH-1086
> URL: https://issues.apache.org/jira/browse/NUTCH-1086
> Project: Nutch
> Issue Type: Improvement
> Components: protocol
> Affects Versions: nutchgora, 1.5
> Reporter: Markus Jelsma
> Fix For: 2.4
>
> Attachments: Http.java, HttpResponse.java
>
>
> There are several issues about protocol-httpclient and several comments about
> rewriting the plugin with the new http client libraries. There is, however,
> not yet an issue for rewriting/reimplementing protocol-httpclient.
> http://hc.apache.org/httpcomponents-client-ga/