date:20081007

[jira] Commented: (NUTCH-442) Integrate Solr/Nutch

2008-10-07 Thread Doğacan Güney (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637545#action_12637545
 ] 

Doğacan Güney commented on NUTCH-442:
-

Great!

(I am obviously +1 on adding this before 1.0 :D )

So, can I get some reviews on what people think of this patch then?

On SolrJ: I will send an updated patch that uses SolrJ instead.

> Integrate Solr/Nutch
> 
>
> Key: NUTCH-442
> URL: https://issues.apache.org/jira/browse/NUTCH-442
> Project: Nutch
> Issue Type: New Feature
> Environment: Ubuntu Linux
> Reporter: rubdabadub
> Attachments: Crawl.patch, Indexer.patch, NUTCH-442_v4.patch, 
> NUTCH-442_v5.patch, NUTCH-442_v6.patch.txt, NUTCH-442_v7.patch.txt, 
> NUTCH-442_v7a.patch.txt, NUTCH_442_v3.patch, 
> RFC_multiple_search_backends.patch, schema.xml
>
>
> Hi:
> I tried out Sami's patch for Solr/Nutch integration, which can be found here
> (http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html),
> and I can confirm it worked :-) That led me to request the following:
> I would be very grateful if this could be included in Nutch 0.9, as I
> am trying to eliminate my Python-based crawler, which posts documents to Solr.
> As I am in a corporate environment, I can't install a trunk version in
> production, so I am asking for this to be included in the 0.9 release. I
> hope my wish will be granted.
> I look forward to getting some feedback.
> Thank you.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (NUTCH-442) Integrate Solr/Nutch

2008-10-07 Thread Andrzej Bialecki (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637507#action_12637507
 ] 

Andrzej Bialecki commented on NUTCH-442:
-

+1 on adding this before 1.0 - it would be a shame to miss this functionality 
when it's been asked for over and over. One change that should be made (either 
in this patch or as a follow-up) is to use SolrJ instead of plain HTTP.

I don't think we need to branch for this - as long as the patch passes tests 
and runs basic commands IMHO it's good enough to expose a wider audience to it. 
Applying this to trunk/ actually gives us better chances that it will be tested 
by more people.


[jira] Commented: (NUTCH-442) Integrate Solr/Nutch

2008-10-07 Thread Enis Soztutar (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637489#action_12637489
 ] 

Enis Soztutar commented on NUTCH-442:
-

I personally believe this patch should be in before 1.0, since it does not make
sense to make such a change in 1.1. However, since the patch needs more thorough
testing, I guess we can make a branch and commit it there, so that people can
test it easily. Branching has its own problems, though, especially since keeping
in sync with trunk would get harder and harder.

Since this issue has a large number of votes and watchers, I suggest we branch
and commit it, test it a little more, and merge to trunk before 1.0.


[jira] Commented: (NUTCH-442) Integrate Solr/Nutch

2008-10-07 Thread Doğacan Güney (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637419#action_12637419
 ] 

Doğacan Güney commented on NUTCH-442:
-

Thanks to everyone for the comments. Unfortunately, this patch will probably
have to wait until after 1.0 to get in.

But since many people are interested in having some sort of Solr integration in
trunk, maybe we can update Sami Siren's Solr patch and commit it for 1.0.

What do others think?

