[ 
https://issues.apache.org/jira/browse/NUTCH-987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Markus Jelsma updated NUTCH-987:
--------------------------------

    Description: 
At the moment we cannot send data directly to a public HTTP auth protected Solr 
instance. I've a WIP that passes a configured HTTPClient object to 
CommonsHttpSolrServer, it works. This issue should add this ability to 
indexing, dedup and clean and be configured from some configuration file.

Enable Solr HTTP auth communication by setting the following parameters in your 
nutch-site config:
* solr.auth=true
* solr.auth.username=USERNAME
* solr.auth.password=PASSWORD

  was:
At the moment we cannot send data directly to a public HTTP auth protected Solr 
instance. I've a WIP that passes a configured HTTPClient object to 
CommonsHttpSolrServer, it works. This issue should add this ability to 
indexing, dedup and clean and be configured from some configuration file.

The question is, is the current httpclient-auth.xml the correct place? It does 
provide a nice means to configure the AuthScope objects but it is used for 
fetching. But, since AuthScope is used we could easily add the credentials for 
Solr there as well and add a new nutch-default option for toggling HTTP auth.

Thoughts?


> Support HTTP auth for Solr communication
> ----------------------------------------
>
>                 Key: NUTCH-987
>                 URL: https://issues.apache.org/jira/browse/NUTCH-987
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.4, 2.0
>
>         Attachments: NUTCH-987-1.3-hack.patch, NUTCH-987-1.4-3.patch, 
> NUTCH-987-1.4.1-2.patch, SolrUtils.java
>
>
> At the moment we cannot send data directly to a public HTTP auth protected 
> Solr instance. I've a WIP that passes a configured HTTPClient object to 
> CommonsHttpSolrServer, it works. This issue should add this ability to 
> indexing, dedup and clean and be configured from some configuration file.
> Enable Solr HTTP auth communication by setting the following parameters in 
> your nutch-site config:
> * solr.auth=true
> * solr.auth.username=USERNAME
> * solr.auth.password=PASSWORD

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to