Release 1.0?

2009-01-28 Thread Marko Bauhardt
Hi all, is there a timeline for the release 1.0? Currently it exists 33 issues (9 Bugs). Is there a plan for a feature freeze? Maybe some big issues can be moved to version 1.1? Thanks for response Marko

[jira] Updated: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects

2009-01-28 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney updated NUTCH-626: Attachment: NUTCH-626_v2.patch I updated your patch to apply and compile in latest trunk. I am not

[jira] Closed: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3

2009-01-28 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney closed NUTCH-571. --- Resolution: Fixed Fix Version/s: 1.0.0 This seems simple enough. I committed it as of rev.

[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password

2009-01-28 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668001#action_12668001 ] Doğacan Güney commented on NUTCH-643: - So... Can we commit this patch and pdfbox? It

[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password

2009-01-28 Thread Guillaume Smet (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668010#action_12668010 ] Guillaume Smet commented on NUTCH-643: -- Hi Doğacan, The problem isn't the license of

[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password

2009-01-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668015#action_12668015 ] Andrzej Bialecki commented on NUTCH-643: - +1. Yes, it's compatible.

[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password

2009-01-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668018#action_12668018 ] Andrzej Bialecki commented on NUTCH-643: - (sorry Guillame, missed your comment) -

[jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password

2009-01-28 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668021#action_12668021 ] Doğacan Güney commented on NUTCH-643: - Right, we should update tika to 0.2

[jira] Closed: (NUTCH-680) Update external jars to latest versions

2009-01-28 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney closed NUTCH-680. --- Resolution: Fixed Updating jdom and jaxen causes parse-oo tests to fail. So I am closing this issue.

[jira] Commented: (NUTCH-628) Host database to keep track of host-level information

2009-01-28 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668135#action_12668135 ] Otis Gospodnetic commented on NUTCH-628: Thanks for the update. Sorry, I don't

[jira] Commented: (NUTCH-628) Host database to keep track of host-level information

2009-01-28 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668141#action_12668141 ] Doğacan Güney commented on NUTCH-628: - When someone thinks of crawldb, he would probably

[jira] Commented: (NUTCH-628) Host database to keep track of host-level information

2009-01-28 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668164#action_12668164 ] Andrzej Bialecki commented on NUTCH-628: - I agree that the crawldb/current/ subdir

[jira] Commented: (NUTCH-628) Host database to keep track of host-level information

2009-01-28 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668170#action_12668170 ] Doğacan Güney commented on NUTCH-628: - This tool can also read crawl_fetch and other

[jira] Commented: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3

2009-01-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12668324#action_12668324 ] Hudson commented on NUTCH-571: -- Integrated in Nutch-trunk #708 (See