[jira] [Updated] (NUTCH-2055) Random Crawl Delay

2015-07-02 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-2055: Attachment: (was: NUTCH-2055.patch) Random Crawl Delay --

[jira] [Created] (NUTCH-2055) Random Crawl Delay

2015-07-01 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-2055: --- Summary: Random Crawl Delay Key: NUTCH-2055 URL: https://issues.apache.org/jira/browse/NUTCH-2055 Project: Nutch Issue Type: New Feature Affects Versions:

[jira] [Updated] (NUTCH-2055) Random Crawl Delay

2015-07-01 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-2055: Attachment: NUTCH-2055.patch can someone review this patch ? Random Crawl Delay

[jira] [Updated] (NUTCH-1940) Port HTTP POST Authentication to 2.X

2015-07-01 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1940: Attachment: NUTCH-1940.patch [~lewismc] could you review it ? I found a bug in orginal code. If

[jira] [Created] (NUTCH-2054) When Using Form Auth settings can not read response body

2015-07-01 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-2054: --- Summary: When Using Form Auth settings can not read response body Key: NUTCH-2054 URL: https://issues.apache.org/jira/browse/NUTCH-2054 Project: Nutch Issue

[jira] [Commented] (NUTCH-1170) Write JUnit tests for urlfilter-validator

2015-05-24 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14557721#comment-14557721 ] Talat UYARER commented on NUTCH-1170: - [~halil] This issue looks fixed. Could you

[jira] [Resolved] (NUTCH-1169) Write JUnit tests for urlfilter-prefix

2015-05-24 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER resolved NUTCH-1169. - Resolution: Duplicate I guess This is done. It was my first patch to Nutch :) I close this issue.

[jira] [Created] (NUTCH-2003) topN is not work correctly

2015-04-29 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-2003: --- Summary: topN is not work correctly Key: NUTCH-2003 URL: https://issues.apache.org/jira/browse/NUTCH-2003 Project: Nutch Issue Type: Bug Affects Versions:

[jira] [Updated] (NUTCH-1741) Support of Sitemaps in Nutch 2.x

2015-03-16 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1741: Labels: gsoc2015 (was: ) Support of Sitemaps in Nutch 2.x

[jira] [Commented] (NUTCH-1924) Nutch + HBase Docker

2015-02-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302984#comment-14302984 ] Talat UYARER commented on NUTCH-1924: - Hi [~rrydziu] and [~lewismc], First of all

[jira] [Updated] (NUTCH-1855) Upgrade Hadoop dependencies to Hadoop 2

2015-01-05 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1855: Attachment: NUTCH-1855.patch This issue is related with Gora than Nutch. If Gora works on hadoop 2

[jira] [Assigned] (NUTCH-1899) upgrade restlet lib to prevent build failure

2014-12-16 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER reassigned NUTCH-1899: --- Assignee: Talat UYARER upgrade restlet lib to prevent build failure

[jira] [Updated] (NUTCH-1899) upgrade restlet lib to prevent build failure

2014-12-16 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1899: Attachment: NUTCH-1899.patch Hi [~wastl-nagel], Thanks. I create a patch. If it is OK, I will

[jira] [Created] (NUTCH-1900) DockerFile for Nutch 2.x

2014-12-16 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-1900: --- Summary: DockerFile for Nutch 2.x Key: NUTCH-1900 URL: https://issues.apache.org/jira/browse/NUTCH-1900 Project: Nutch Issue Type: New Feature

[jira] [Updated] (NUTCH-1900) DockerFile for Nutch 2.x

2014-12-16 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1900: Description: We can create Dockerfile can be used to build a Docker image running the latest Nutch

[jira] [Updated] (NUTCH-1900) DockerFile for Nutch 2.x

2014-12-16 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1900: Labels: docker (was: ) DockerFile for Nutch 2.x Key:

[jira] [Commented] (NUTCH-1709) Generated classes o.a.n.storage.Host and o.a.n.storage.ProtocolStatus contain methods not defined in source .avsc

2014-12-16 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14248181#comment-14248181 ] Talat UYARER commented on NUTCH-1709: - Sorry [~lewismc], I have seen yet. Can you have

[jira] [Comment Edited] (NUTCH-1709) Generated classes o.a.n.storage.Host and o.a.n.storage.ProtocolStatus contain methods not defined in source .avsc

2014-12-16 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14248181#comment-14248181 ] Talat UYARER edited comment on NUTCH-1709 at 12/16/14 12:08 PM:

[jira] [Updated] (NUTCH-1644) Should have a parser that uses xpath

2014-11-01 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1644: Attachment: filter-xpath.patch Hi [~lewismc], This is Emir's filter xpath. I refactor for 2.x and

[jira] [Commented] (NUTCH-1644) Should have a parser that uses xpath

2014-10-23 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181070#comment-14181070 ] Talat UYARER commented on NUTCH-1644: - [~lewismc], We used Emir's Xpath filter. We

[jira] [Commented] (NUTCH-1660) Index filter for Page's latitude and longitude

2014-10-01 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14154752#comment-14154752 ] Talat UYARER commented on NUTCH-1660: - [~lewismc] I agree with you. This will be very

[jira] [Assigned] (NUTCH-1660) Index filter for Page's latitude and longitude

2014-10-01 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER reassigned NUTCH-1660: --- Assignee: Talat UYARER Index filter for Page's latitude and longitude

[jira] [Commented] (NUTCH-1660) Index filter for Page's latitude and longitude

2014-10-01 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14154761#comment-14154761 ] Talat UYARER commented on NUTCH-1660: - You are tight. IP is not reliable. But Location

[jira] [Commented] (NUTCH-1660) Index filter for Page's latitude and longitude

2014-09-30 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14153470#comment-14153470 ] Talat UYARER commented on NUTCH-1660: - It used maxmind 1.2.1 If everybody is OK, can I

[jira] [Commented] (NUTCH-1848) Bug in DashboardPage.html instances counter

2014-09-28 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151360#comment-14151360 ] Talat UYARER commented on NUTCH-1848: - Congras Nima. Nice patch, Good job Bug in

[jira] [Created] (NUTCH-1855) Upgrade Hadoop dependencies to Hadoop 2

2014-09-25 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-1855: --- Summary: Upgrade Hadoop dependencies to Hadoop 2 Key: NUTCH-1855 URL: https://issues.apache.org/jira/browse/NUTCH-1855 Project: Nutch Issue Type: Improvement

[jira] [Closed] (NUTCH-1852) Runtime error on Hadoop 2.4.0 caused by hadoop-core

2014-09-25 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER closed NUTCH-1852. --- Resolution: Invalid Runtime error on Hadoop 2.4.0 caused by hadoop-core

[jira] [Commented] (NUTCH-1852) Runtime error on Hadoop 2.4.0 caused by hadoop-core

2014-09-25 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14147479#comment-14147479 ] Talat UYARER commented on NUTCH-1852: - Hi [~dobromyslov], Now We do not support

[jira] [Updated] (NUTCH-1843) Upgrade to Gora 0.5

2014-09-24 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1843: Attachment: NUTCH-1843.patch Hi [~lewismc], I test with Hbase. There does not seem any problem. I

[jira] [Commented] (NUTCH-1843) Upgrade to Gora 0.5

2014-09-22 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14143556#comment-14143556 ] Talat UYARER commented on NUTCH-1843: - Is not anybody working on this ? If this is

[jira] [Commented] (NUTCH-1843) Upgrade to Gora 0.5

2014-09-22 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14144309#comment-14144309 ] Talat UYARER commented on NUTCH-1843: - Ok [~lewismc] Good deal, I start to work today

[jira] [Commented] (NUTCH-1845) Nutch cannot save inlinks

2014-09-21 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142879#comment-14142879 ] Talat UYARER commented on NUTCH-1845: - Hi Zhiwen, Thank you for your attention.

[jira] [Created] (NUTCH-1788) Tika may return multiple values for Title on PDF's

2014-05-25 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-1788: --- Summary: Tika may return multiple values for Title on PDF's Key: NUTCH-1788 URL: https://issues.apache.org/jira/browse/NUTCH-1788 Project: Nutch Issue Type:

[jira] [Closed] (NUTCH-1784) modifiedTime and prevmodifiedTime never set

2014-05-17 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER closed NUTCH-1784. --- Resolution: Duplicate We fiexed it. modifiedTime and prevmodifiedTime never set

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to Gora 0.4

2014-05-14 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13997408#comment-13997408 ] Talat UYARER commented on NUTCH-1714: - +1 Thanks [~jnioche] Nutch 2.x upgrade to

[jira] [Resolved] (NUTCH-1725) CleaningJob's reducer does not commit deleted docs.

2014-05-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER resolved NUTCH-1725. - Resolution: Fixed Thanks [~ilhamikalkan], Good catch Committed revision 1592202.

[jira] [Commented] (NUTCH-1662) Indexer Plugin for Solr Cloud

2014-05-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988684#comment-13988684 ] Talat UYARER commented on NUTCH-1662: - Can Someone review it ? Indexer Plugin for

[jira] [Commented] (NUTCH-1657) ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION never set in HTMLParser

2014-05-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988692#comment-13988692 ] Talat UYARER commented on NUTCH-1657: - Committed revision 1592207.

[jira] [Closed] (NUTCH-1677) ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION are not set in Parse HTML

2014-05-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER closed NUTCH-1677. --- Resolution: Duplicate ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION are not set in Parse

[jira] [Commented] (NUTCH-1643) Unnecessary fetching with http.content.limit when using protocol-http

2014-05-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13988706#comment-13988706 ] Talat UYARER commented on NUTCH-1643: - Hi [~lewismc], Sorry for late reply. I can

[jira] [Updated] (NUTCH-1618) Turn speculative execution off for Fetching

2014-05-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1618: Summary: Turn speculative execution off for Fetching (was: Fetches some websites multiple times

[jira] [Assigned] (NUTCH-1618) Turn speculative execution off for Fetching

2014-05-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER reassigned NUTCH-1618: --- Assignee: Talat UYARER Turn speculative execution off for Fetching

[jira] [Updated] (NUTCH-1618) Turn speculative execution off for Fetching

2014-05-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1618: Attachment: NUTCH-1618-v2.patch I update for nonprefix format Turn speculative execution off for

[jira] [Resolved] (NUTCH-1618) Turn speculative execution off for Fetching

2014-05-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER resolved NUTCH-1618. - Resolution: Fixed Thanks [~jnioche] Committed revision 1592218. Turn speculative execution off

[jira] [Updated] (NUTCH-1662) Indexer Plugin for Solr Cloud

2014-05-02 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1662: Attachment: NUTCH-1662-v2.patch Solr Url's changes with Zookeeper hosts. It works with fine. I

[jira] [Updated] (NUTCH-1662) Indexer Plugin for Solr Cloud

2014-05-02 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1662: Patch Info: Patch Available Indexer Plugin for Solr Cloud -

[jira] [Commented] (NUTCH-1753) Eclipse dependecy problem for 2.x

2014-05-02 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13987473#comment-13987473 ] Talat UYARER commented on NUTCH-1753: - I committed this issue. Eclipse dependecy

[jira] [Resolved] (NUTCH-1753) Eclipse dependecy problem for 2.x

2014-05-02 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER resolved NUTCH-1753. - Resolution: Fixed Fix Version/s: (was: 2.4) 2.3 Committed revision

[jira] [Resolved] (NUTCH-1728) indexer-solr plugin is not delete docs from solr

2014-05-02 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER resolved NUTCH-1728. - Resolution: Fixed Committed revision 1591849. indexer-solr plugin is not delete docs from solr

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to use GORA_94 branch

2014-04-18 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13973910#comment-13973910 ] Talat UYARER commented on NUTCH-1714: - Thanks [~alxksn] for updating. I will test it.

[jira] [Updated] (NUTCH-1753) Eclipse dependecy problem for 2.x

2014-04-09 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1753: Attachment: NUTCH-1753.patch This patch can solve it. Eclipse dependecy problem for 2.x

[jira] [Created] (NUTCH-1753) Eclipse dependecy problem for 2.x

2014-04-09 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-1753: --- Summary: Eclipse dependecy problem for 2.x Key: NUTCH-1753 URL: https://issues.apache.org/jira/browse/NUTCH-1753 Project: Nutch Issue Type: Bug Affects

[jira] [Comment Edited] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13938960#comment-13938960 ] Talat UYARER edited comment on NUTCH-1738 at 3/18/14 8:29 AM: --

[jira] [Updated] (NUTCH-1738) Expose number of URLs generated per batch in GeneratorJob

2014-03-18 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1738: Attachment: NUTCH-1738.patch Hi [~lewis] , I attached a patch for this information. Can you

[jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-03-12 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1478: Attachment: NUTCH-1478v6.patch I update unnecessary configuration. Some trival updates.

[jira] [Updated] (NUTCH-1253) Incompatible neko and xerces versions

2014-03-11 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1253: Attachment: NUTCH-1253-2.x-eclipse.patch [~icebergx5] is right. At the present 2.x branch does

[jira] [Commented] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-03-06 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13922462#comment-13922462 ] Talat UYARER commented on NUTCH-1478: - Hi [~vagkarv], - First question is correct.

[jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-02-28 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1478: Attachment: NUTCH-1478v5.patch I fixed several mistakes within the patch. This is final.

[jira] [Comment Edited] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-02-28 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13915619#comment-13915619 ] Talat UYARER edited comment on NUTCH-1478 at 2/28/14 10:03 AM:

[jira] [Commented] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-02-04 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13890587#comment-13890587 ] Talat UYARER commented on NUTCH-1478: - Hi [~popalka], Your problem because of your

[jira] [Commented] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-02-04 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13890602#comment-13890602 ] Talat UYARER commented on NUTCH-1478: - Hi [~popalka], Can you share your seedlist and

[jira] [Commented] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series

2014-02-04 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13890600#comment-13890600 ] Talat UYARER commented on NUTCH-1478: - It is not related with the topic but this patch

[jira] [Commented] (NUTCH-1676) Add rudimentary SSL support to protocol-http

2014-01-24 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13880962#comment-13880962 ] Talat UYARER commented on NUTCH-1676: - hi [~markus17], Could you port this for 2.x ?

[jira] [Commented] (NUTCH-1572) Nutch 2.x should use o.a.g.mem.store.MemStore for testing

2014-01-21 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13877723#comment-13877723 ] Talat UYARER commented on NUTCH-1572: - Hi [~lewismc], I tested this issue your right.

[jira] [Commented] (NUTCH-1630) How to achieve finishing fetch approximately at the same time for each queue (a.k.a adaptive queue size)

2014-01-19 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875926#comment-13875926 ] Talat UYARER commented on NUTCH-1630: - Hi [~tejasp], At First depth, we accept zero

[jira] [Commented] (NUTCH-1630) How to achieve finishing fetch approximately at the same time for each queue (a.k.a adaptive queue size)

2014-01-19 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875972#comment-13875972 ] Talat UYARER commented on NUTCH-1630: - Hi [~tejasp], I guess You miss understood me.

[jira] [Commented] (NUTCH-1630) How to achieve finishing fetch approximately at the same time for each queue (a.k.a adaptive queue size)

2014-01-19 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13875987#comment-13875987 ] Talat UYARER commented on NUTCH-1630: - Hi again [~tejasp] :), Thanks for your

[jira] [Commented] (NUTCH-1655) Indexer Plugin for Elastic Search

2014-01-15 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13872009#comment-13872009 ] Talat UYARER commented on NUTCH-1655: - Hi [~markus17], I have already included

[jira] [Commented] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-14 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870662#comment-13870662 ] Talat UYARER commented on NUTCH-1568: - Hi [~lewismc], Thanks for update. In your

[jira] [Commented] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-14 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870671#comment-13870671 ] Talat UYARER commented on NUTCH-1568: - Thanks [~lewis], I have an objection :) can you

[jira] [Comment Edited] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-14 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870671#comment-13870671 ] Talat UYARER edited comment on NUTCH-1568 at 1/14/14 12:33 PM:

[jira] [Updated] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-13 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1568: Attachment: NUTCH-1568-v3.path hi [~lewismc], I create with no prefix. Sorry for duplicate work.

[jira] [Updated] (NUTCH-1655) Indexer Plugin for Elastic Search

2014-01-13 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1655: Attachment: NUTCH-1655-v2.path Updated for no-prefix. Indexer Plugin for Elastic Search

[jira] [Commented] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-13 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13869550#comment-13869550 ] Talat UYARER commented on NUTCH-1568: - This patch contains pluggable IndexJob and Solr

[jira] [Updated] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-11 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1568: Attachment: (was: NUTCH-1568-v2.patch) port pluggable indexing architecture to 2.x

[jira] [Updated] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-11 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1568: Attachment: NUTCH-1568-v2.patch Hi [~lewismc], Can you check my patch ? port pluggable indexing

[jira] [Updated] (NUTCH-1655) Indexer Plugin for Elastic Search

2014-01-11 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1655: Attachment: (was: NUTCH-1655.patch) Indexer Plugin for Elastic Search

[jira] [Updated] (NUTCH-1655) Indexer Plugin for Elastic Search

2014-01-11 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1655: Attachment: NUTCH-1655.patch I forget add some parts (ivy.xml etc.) . Now I update my patch.

[jira] [Commented] (NUTCH-1371) Replace Ivy with Maven Ant tasks

2014-01-07 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13864133#comment-13864133 ] Talat UYARER commented on NUTCH-1371: - Is there any opinion ? Replace Ivy with

[jira] [Commented] (NUTCH-1371) Replace Ivy with Maven Ant tasks

2014-01-07 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13864182#comment-13864182 ] Talat UYARER commented on NUTCH-1371: - Hi [~lewismc], Different from the [~jnioche]'s

[jira] [Commented] (NUTCH-1371) Replace Ivy with Maven Ant tasks

2014-01-07 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13864215#comment-13864215 ] Talat UYARER commented on NUTCH-1371: - Hi [~jnioche], Actually I have some problems

[jira] [Commented] (NUTCH-1371) Replace Ivy with Maven Ant tasks

2014-01-06 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13862941#comment-13862941 ] Talat UYARER commented on NUTCH-1371: - May I learn Why do we completely migrate to

[jira] [Created] (NUTCH-1677) ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION are not set in Parse HTML

2013-11-29 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-1677: --- Summary: ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION are not set in Parse HTML Key: NUTCH-1677 URL: https://issues.apache.org/jira/browse/NUTCH-1677

[jira] [Updated] (NUTCH-1661) Language based crawling

2013-11-18 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1661: Attachment: (was: NUTCH-1661.patch) Language based crawling ---

[jira] [Updated] (NUTCH-1661) Language based crawling

2013-11-18 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1661: Attachment: NUTCH-1661.patch it is cleaned for unnecessary files. Language based crawling

[jira] [Commented] (NUTCH-1659) Custom partitioner for Adaptive Queue Size

2013-11-04 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13812771#comment-13812771 ] Talat UYARER commented on NUTCH-1659: - [~markus17] If you plan to use this patch you

[jira] [Commented] (NUTCH-1659) Custom partitioner for Adaptive Queue Size

2013-11-03 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13812351#comment-13812351 ] Talat UYARER commented on NUTCH-1659: - [~lewismc], Adaptive queue size is completed

[jira] [Updated] (NUTCH-1643) Unnecessary fetching with http.content.limit when using protocol-http

2013-11-02 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1643: Attachment: NUTCH-1643v3.patch Hİ [~lewismc], Today You work very hard :) I add necessary codes

[jira] [Created] (NUTCH-1660) Index filter for Page's latitude and longitude

2013-11-02 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-1660: --- Summary: Index filter for Page's latitude and longitude Key: NUTCH-1660 URL: https://issues.apache.org/jira/browse/NUTCH-1660 Project: Nutch Issue Type: New

[jira] [Created] (NUTCH-1661) Language based crawling

2013-11-02 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-1661: --- Summary: Language based crawling Key: NUTCH-1661 URL: https://issues.apache.org/jira/browse/NUTCH-1661 Project: Nutch Issue Type: Improvement Affects

[jira] [Updated] (NUTCH-1661) Language based crawling

2013-11-02 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1661: Attachment: NUTCH-1661.patch Patch Attached. Language based crawling ---

[jira] [Commented] (NUTCH-1643) Unnecessary fetching with http.content.limit when using protocol-http

2013-10-29 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13808006#comment-13808006 ] Talat UYARER commented on NUTCH-1643: - Hi [~lewismc], I can look every protocol for

[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified

2013-10-29 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13808122#comment-13808122 ] Talat UYARER commented on NUTCH-1564: - [~amuseme.lu] How do you check this problem. Do

[jira] [Updated] (NUTCH-1413) Fetcher to record response time

2013-10-28 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1413: Attachment: NUTCH-1413_metadata_v3.patch Sorry for my fault. Thanks Sebastian. Now I updated my

[jira] [Commented] (NUTCH-1564) AdaptiveFetchSchedule: sync_delta forces immediate refetch for documents not modified

2013-10-28 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13806860#comment-13806860 ] Talat UYARER commented on NUTCH-1564: - Hi Feng, I have same problem. I do some search

[jira] [Created] (NUTCH-1657) ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION never set in HTMLParser

2013-10-28 Thread Talat UYARER (JIRA)
Talat UYARER created NUTCH-1657: --- Summary: ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION never set in HTMLParser Key: NUTCH-1657 URL: https://issues.apache.org/jira/browse/NUTCH-1657 Project:

[jira] [Updated] (NUTCH-1657) ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION never set in HTMLParser

2013-10-28 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1657: Patch Info: Patch Available ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION never set in

[jira] [Updated] (NUTCH-1657) ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION never set in HTMLParser

2013-10-28 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1657: Attachment: NUTCH-1657.patch I create a patch. ORIGINAL_CHAR_ENCODING and

[jira] [Commented] (NUTCH-1413) Fetcher to record response time

2013-10-27 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13806402#comment-13806402 ] Talat UYARER commented on NUTCH-1413: - You are right. It needs a configuration

[jira] [Updated] (NUTCH-1413) Fetcher to record response time

2013-10-27 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1413: Attachment: NUTCH-1413_metadata_v2.patch I add configuration option. Fetcher to record response

  1   2   >