[jira] Created: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit

2010-09-08 Thread Markus Jelsma (JIRA)
Confusion in nutch-default between http.content.limit and file.content.limit Key: NUTCH-900 URL: https://issues.apache.org/jira/browse/NUTCH-900 Project: Nutch

[jira] Updated: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit

2010-09-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-900: Attachment: NUTCH-900.MarkusJelsma.100908.patch.txt Confusion in nutch-default between

[jira] Updated: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit

2010-09-08 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-900: Patch Info: [Patch Available] Confusion in nutch-default between http.content.limit and

[jira] Created: (NUTCH-901) Make index-more plug-in configurable

2010-09-08 Thread Markus Jelsma (JIRA)
Make index-more plug-in configurable -- Key: NUTCH-901 URL: https://issues.apache.org/jira/browse/NUTCH-901 Project: Nutch Issue Type: Improvement Components: indexer Reporter:

[Nutch Wiki] Update of GORA_HBase by JulienNioche

2010-09-08 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The GORA_HBase page has been changed by JulienNioche. http://wiki.apache.org/nutch/GORA_HBase -- New page: This document describes how to

[Nutch Wiki] Update of FrontPage by JulienNioche

2010-09-08 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification. The FrontPage page has been changed by JulienNioche. http://wiki.apache.org/nutch/FrontPage?action=diff&rev1=137&rev2=138 -- Please

Re: Nutch 2.0 Help

2010-09-08 Thread Julien Nioche
Hi guys, I've summarized the steps to follow for having GORA+Hbase with Nutch 2.0 on http://wiki.apache.org/nutch/GORA_HBase Feel free to amend and improve as you see fit. Please bear in mind that Nutch 2.0 is at a very early stage and is far from being bug-proof, see in particular [1]. HTH

[jira] Updated: (NUTCH-901) Make index-more plug-in configurable

2010-09-08 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-901: Summary: Make index-more plug-in configurable (was: Make index-more plug-in configurable

[jira] Updated: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit

2010-09-08 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-900: Fix Version/s: 2.0 Affects Version/s: 2.0 To be fixed in the trunk as well Confusion in

[jira] Assigned: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit

2010-09-08 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche reassigned NUTCH-900: --- Assignee: Julien Nioche Confusion in nutch-default between http.content.limit and

[jira] Closed: (NUTCH-900) Confusion in nutch-default between http.content.limit and file.content.limit

2010-09-08 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche closed NUTCH-900. --- Resolution: Fixed Committed revision 994984 (trunk) Committed revision 994985 (1.2) Thanks!

[jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable

2010-09-08 Thread Andrey Sapegin (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907182#action_12907182 ] Andrey Sapegin commented on NUTCH-407: -- Please accept the original patch or find a

Re: Nutch 2.0 Help

2010-09-08 Thread Enis Soztutar
Hi, I think we need to commit all the necessary files to Nutch so that it can work out of the box for SQL, HBase and Cassandra. We can even write commented-out entries in gora.properties, nutch-site.xml, etc. so that using Nutch with different backends becomes a configuration change. I will open

[jira] Created: (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box

2010-09-08 Thread Enis Soztutar (JIRA)
Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box -- Key: NUTCH-902 URL:

[jira] Created: (NUTCH-903) RESUME_KEY field in FetcherJob.Java has not been get correctly

2010-09-08 Thread JIRA
RESUME_KEY field in FetcherJob.Java has not been get correctly -- Key: NUTCH-903 URL: https://issues.apache.org/jira/browse/NUTCH-903 Project: Nutch Issue Type: Bug

[jira] Updated: (NUTCH-903) RESUME_KEY field in FetcherJob.Java has not been get correctly

2010-09-08 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] faruk berksöz updated NUTCH-903: Description: Source modification request for nutch 2.0 .xx FetcherJob.Java ...

[jira] Updated: (NUTCH-903) RESUME_KEY field in FetcherJob.Java has not been get correctly

2010-09-08 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] faruk berksöz updated NUTCH-903: Description: Source modification request for nutch 2.0 . FetcherJob.Java ...

[jira] Closed: (NUTCH-903) RESUME_KEY field in FetcherJob.Java has not been get correctly

2010-09-08 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] faruk berksöz closed NUTCH-903. --- Resolution: Fixed I'm so sorry... The description is not readable. Why, I don't know. I close this one and

[jira] Created: (NUTCH-904) -resume option is always processed as false in FetcherJob.

2010-09-08 Thread JIRA
-resume option is always processed as false in FetcherJob. --- Key: NUTCH-904 URL: https://issues.apache.org/jira/browse/NUTCH-904 Project: Nutch Issue Type: Bug

[jira] Commented: (NUTCH-407) Make Nutch crawling parent directories for file protocol configurable

2010-09-08 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907275#action_12907275 ] Chris A. Mattmann commented on NUTCH-407: - Hmmm: I agree here. If no one objects in

[jira] Updated: (NUTCH-904) -resume option is always processed as false in FetcherJob.

2010-09-08 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] faruk berksöz updated NUTCH-904: Attachment: NUTCH-904.patch patch -resume option is always processed as false in FetcherJob.

[jira] Commented: (NUTCH-893) DataStore.put() silently loses records when executed from multiple processes

2010-09-08 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12907297#action_12907297 ] Andrzej Bialecki commented on NUTCH-893: - Very good catch - yes, the test now