[jira] [Updated] (NUTCH-1570) Add filtering capability to Datastore Queries

2014-05-11 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1570: - Component/s: REST_api Add filtering capability to Datastore Queries

[jira] [Updated] (NUTCH-1570) Add filtering capability to Datastore Queries

2014-05-11 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1570: - Issue Type: Improvement (was: Bug) Add filtering capability to Datastore Queries

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4

2014-05-11 Thread Navid Shekoufa (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13994461#comment-13994461 ] Navid Shekoufa commented on NUTCH-1714: --- Did anybody notice my previous comment?! I

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4

2014-05-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13994464#comment-13994464 ] Lewis John McGibbney commented on NUTCH-1714: - I'll have a look in an hour or

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4

2014-05-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13994466#comment-13994466 ] Lewis John McGibbney commented on NUTCH-1714: - It would appear that I've had

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4

2014-05-11 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13993488#comment-13993488 ] Julien Nioche commented on NUTCH-1714: -- [~alpar] I presume you added the methods

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4

2014-05-11 Thread Navid Shekoufa (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13994476#comment-13994476 ] Navid Shekoufa commented on NUTCH-1714: --- Yes, I've been actively testing this patch

[jira] [Commented] (NUTCH-1770) Nutch is failing to parse all PDFs

2014-05-11 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13993483#comment-13993483 ] Julien Nioche commented on NUTCH-1770: -- IIRC this mechanism was put in place as

[jira] [Resolved] (NUTCH-1770) Nutch is failing to parse all PDFs

2014-05-11 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-1770. -- Resolution: Not a Problem Rogerio - please close it once you've checked that changing the

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4

2014-05-11 Thread Ralf (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13992939#comment-13992939 ] Ralf commented on NUTCH-1714: - OK, what do I have to do in order to use Gora 0.4? which

[jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2014-05-11 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13993492#comment-13993492 ] Julien Nioche commented on NUTCH-1679: -- Ralf : 2.3 has not been released. The current

[jira] [Closed] (NUTCH-1764) readdb to show command-line help if no action (-stats, -dump, etc.) given

2014-05-11 Thread Diaa (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diaa closed NUTCH-1764. --- Works readdb to show command-line help if no action (-stats, -dump, etc.) given

Re: [jira] [Commented] (NUTCH-1766) Generator to unlock crawldb and remove tempdir if generate job fails

2014-05-11 Thread Diaa Abdallah
Anyone wanna commit this? On Mon, Apr 28, 2014 at 12:04 AM, Diaa (JIRA) j...@apache.org wrote: [ https://issues.apache.org/jira/browse/NUTCH-1766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13982485#comment-13982485] Diaa commented on

[jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page.

2014-05-11 Thread Ralf (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13993547#comment-13993547 ] Ralf commented on NUTCH-1679: - Checked out revision 1593523. UpdateDb using batchId, link

[jira] [Updated] (NUTCH-1766) Generator to unlock crawldb and remove tempdir if generate job fails

2014-05-11 Thread Diaa (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diaa updated NUTCH-1766: Priority: Major (was: Minor) Generator to unlock crawldb and remove tempdir if generate job fails

[jira] [Commented] (NUTCH-1622) Create Outlinks with metadata

2014-05-11 Thread Daniel Kugel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13994665#comment-13994665 ] Daniel Kugel commented on NUTCH-1622: - We were talking about the uncommitted patch for

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4

2014-05-11 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13993515#comment-13993515 ] Julien Nioche commented on NUTCH-1714: -- A few other things I noticed in my test crawl

[jira] [Commented] (NUTCH-1770) Nutch is failing to parse all PDFs

2014-05-11 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13994833#comment-13994833 ] Tilman Hausherr commented on NUTCH-1770: [~ararog] there is no such thing as