[
https://issues.apache.org/jira/browse/NUTCH-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14332417#comment-14332417
]
Sebastian Nagel commented on NUTCH-1944:
This issue duplicates NUTCH-1785 but this
[
https://issues.apache.org/jira/browse/NUTCH-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1870:
---
Attachment: NUTCH-1870-trunk-v4.patch
New patch including:
* load all configuration files from
[
https://issues.apache.org/jira/browse/NUTCH-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338198#comment-14338198
]
Sebastian Nagel commented on NUTCH-1950:
Is it really a good idea to take the syst
[
https://issues.apache.org/jira/browse/NUTCH-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338684#comment-14338684
]
Sebastian Nagel commented on NUTCH-1950:
Great! For a MD5 calculation, see o.a.had
[
https://issues.apache.org/jira/browse/NUTCH-1957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14357691#comment-14357691
]
Sebastian Nagel commented on NUTCH-1957:
Just a few thoughts to finally solve this
[
https://issues.apache.org/jira/browse/NUTCH-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14358349#comment-14358349
]
Sebastian Nagel commented on NUTCH-1956:
+1
> Members to be public in URLCrawlDat
[
https://issues.apache.org/jira/browse/NUTCH-1967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14366044#comment-14366044
]
Sebastian Nagel commented on NUTCH-1967:
+1
MimeUtil.cleanMimeType() could be an a
[
https://issues.apache.org/jira/browse/NUTCH-1971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371018#comment-14371018
]
Sebastian Nagel commented on NUTCH-1971:
+1 Since NUTCH-1786 crawldb.url.filters a
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14375693#comment-14375693
]
Sebastian Nagel commented on NUTCH-1941:
Hi [~asitangm], thanks! The patch needs s
[
https://issues.apache.org/jira/browse/NUTCH-1958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14376227#comment-14376227
]
Sebastian Nagel commented on NUTCH-1958:
Scoring-oping is not that bad, scores are
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14378821#comment-14378821
]
Sebastian Nagel commented on NUTCH-1941:
Great, that's a step forward. Before goin
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14379895#comment-14379895
]
Sebastian Nagel commented on NUTCH-1941:
It's not about concurrent write accesses
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381679#comment-14381679
]
Sebastian Nagel commented on NUTCH-1941:
Solution 2 is simpler because it does not
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1941:
---
Attachment: NUTCH-1941-v5.patch
Attached new patch v5
- including descriptions in nutch-defaul
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1941:
---
Patch Info: Patch Available
Affects Version/s: 2.3
1.9
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-1941:
--
Assignee: Sebastian Nagel
> Optional rolling http.agent.name's
> --
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14384636#comment-14384636
]
Sebastian Nagel commented on NUTCH-1941:
Great! Also protocol-httpclient is now ro
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on NUTCH-1941 started by Sebastian Nagel.
--
> Optional rolling http.agent.name's
> --
>
>
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1941:
---
Attachment: NUTCH-1941-2x-v6.patch
Patch for 2.x
> Optional rolling http.agent.name's
> -
[
https://issues.apache.org/jira/browse/NUTCH-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1941.
Resolution: Fixed
Committed to trunk and 2.x, r1669692. Thanks, [~asitang]!
> Optional roll
[
https://issues.apache.org/jira/browse/NUTCH-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14388715#comment-14388715
]
Sebastian Nagel commented on NUTCH-1979:
+1
> CrawlDbReader to implement Tool
> -
[
https://issues.apache.org/jira/browse/NUTCH-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389221#comment-14389221
]
Sebastian Nagel commented on NUTCH-1979:
Needs a trivial fix in TestCrawlDbMerger:
[
https://issues.apache.org/jira/browse/NUTCH-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14389351#comment-14389351
]
Sebastian Nagel commented on NUTCH-1771:
>From [~chongli] in NUTCH-1978:
{quote}
S
[
https://issues.apache.org/jira/browse/NUTCH-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1978.
Resolution: Duplicate
Hi [~chongli], this is clearly a duplicate of NUTCH-1771. It's better
[
https://issues.apache.org/jira/browse/NUTCH-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1771:
---
Affects Version/s: 1.10
> Solrindex fails if a segment is corrupted or incomplete
> --
[
https://issues.apache.org/jira/browse/NUTCH-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14391647#comment-14391647
]
Sebastian Nagel commented on NUTCH-1771:
Hi [~chongli], the patch looks clean and
[
https://issues.apache.org/jira/browse/NUTCH-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14394230#comment-14394230
]
Sebastian Nagel commented on NUTCH-1771:
Again: nice patch.
* SegmentChecker holds
[
https://issues.apache.org/jira/browse/NUTCH-1771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14483527#comment-14483527
]
Sebastian Nagel commented on NUTCH-1771:
+1 : will commit soon. Thanks, [~chongli]
[
https://issues.apache.org/jira/browse/NUTCH-1981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1981:
---
Fix Version/s: 1.11
2.4
> Upgrade icu4j to version 51.1
> -
[
https://issues.apache.org/jira/browse/NUTCH-1981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14485157#comment-14485157
]
Sebastian Nagel commented on NUTCH-1981:
There should be no problem to upgrade the
[
https://issues.apache.org/jira/browse/NUTCH-1247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14485297#comment-14485297
]
Sebastian Nagel commented on NUTCH-1247:
Close this issue? With NUTCH-578 and NUTC
[
https://issues.apache.org/jira/browse/NUTCH-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14487035#comment-14487035
]
Sebastian Nagel commented on NUTCH-1854:
Definitely: fetcher.store.content=false a
[
https://issues.apache.org/jira/browse/NUTCH-1981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1981.
Resolution: Fixed
Fix Version/s: (was: 1.11)
1.10
Committed to
[
https://issues.apache.org/jira/browse/NUTCH-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14491240#comment-14491240
]
Sebastian Nagel commented on NUTCH-1984:
Thanks, [~aspa]! That's 3 problems which
[
https://issues.apache.org/jira/browse/NUTCH-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492286#comment-14492286
]
Sebastian Nagel commented on NUTCH-1854:
Thanks, [~asitang]!
* NUTCH-1771 is commi
[
https://issues.apache.org/jira/browse/NUTCH-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492286#comment-14492286
]
Sebastian Nagel edited comment on NUTCH-1854 at 4/13/15 11:24 AM:
--
[
https://issues.apache.org/jira/browse/NUTCH-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492419#comment-14492419
]
Sebastian Nagel commented on NUTCH-1927:
* http.robot.rules.whitelist should be em
[
https://issues.apache.org/jira/browse/NUTCH-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14493655#comment-14493655
]
Sebastian Nagel commented on NUTCH-1854:
+1 Great! Needs formatting. Will commit s
[
https://issues.apache.org/jira/browse/NUTCH-1987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14496986#comment-14496986
]
Sebastian Nagel commented on NUTCH-1987:
Agreed: it's time to skip the Solr-URL be
[
https://issues.apache.org/jira/browse/NUTCH-1986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14497000#comment-14497000
]
Sebastian Nagel commented on NUTCH-1986:
+1 that's the default values you have to
[
https://issues.apache.org/jira/browse/NUTCH-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14497149#comment-14497149
]
Sebastian Nagel commented on NUTCH-1988:
+1
Could be alternatively {{-dirlevels n}
[
https://issues.apache.org/jira/browse/NUTCH-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14497251#comment-14497251
]
Sebastian Nagel commented on NUTCH-1927:
Hi Chris, the class WhiteListRobotRules s
[
https://issues.apache.org/jira/browse/NUTCH-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1927:
---
Attachment: NUTCH-1927.2015-04-16.patch
Hi Chris,
bq. Can you please reply with code?
yep, att
[
https://issues.apache.org/jira/browse/NUTCH-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14500544#comment-14500544
]
Sebastian Nagel commented on NUTCH-1927:
Hi, Chris: agreed to log more verbosely.
[
https://issues.apache.org/jira/browse/NUTCH-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1927:
---
Attachment: test_NUTCH-1927.2015-04-17.txt
NUTCH-1927.2015-04-17.patch
Patch t
[
https://issues.apache.org/jira/browse/NUTCH-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14500652#comment-14500652
]
Sebastian Nagel commented on NUTCH-1927:
Committed to trunk r1674399. Should be ea
[
https://issues.apache.org/jira/browse/NUTCH-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1854.
Resolution: Fixed
Committed to trunk, r1674581. Thanks!
> ./bin/crawl fails with a parsing
[
https://issues.apache.org/jira/browse/NUTCH-1990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1990:
---
Attachment: NUTCH-1990-trial1.patch
Sounds reasonable and would "en passant" resolve NUTCH-106
[
https://issues.apache.org/jira/browse/NUTCH-1697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1697:
---
Attachment: NUTCH-1697-trunk-v2.patch
Patch which applies to recent trunk. Both variants to pa
[
https://issues.apache.org/jira/browse/NUTCH-1990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1990:
---
Attachment: NUTCH-1990-v1.patch
Uuuh, a lot of garbage :( I've also run the test after spendi
[
https://issues.apache.org/jira/browse/NUTCH-1991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1991:
---
Attachment: NUTCH-1991-trunk.v2.patch
Thanks, [~ilopata1]! Updated patch to apply against trun
[
https://issues.apache.org/jira/browse/NUTCH-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14507865#comment-14507865
]
Sebastian Nagel commented on NUTCH-1993:
+1
> Nutch does not use backup parsers
>
[
https://issues.apache.org/jira/browse/NUTCH-1990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14507895#comment-14507895
]
Sebastian Nagel commented on NUTCH-1990:
Applied also to 2.x, r1675499 to finally
[
https://issues.apache.org/jira/browse/NUTCH-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1062.
Resolution: Fixed
Fix Version/s: (was: 1.11)
1.10
[
https://issues.apache.org/jira/browse/NUTCH-1994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14509678#comment-14509678
]
Sebastian Nagel commented on NUTCH-1994:
+1
> Upgrade to Apache Tika 1.8
> --
[
https://issues.apache.org/jira/browse/NUTCH-1998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reopened NUTCH-1998:
Commit r1678520 breaks the Jenkins build: TestCommonCrawlDataDumper needs to be
adapted to the
[
https://issues.apache.org/jira/browse/NUTCH-1998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1998.
Resolution: Fixed
Fixed unit test and issue ID in change log, r1678824.
> Add support for u
Sebastian Nagel created NUTCH-2007:
--
Summary: add test libs to classpath of bin/nutch junit
Key: NUTCH-2007
URL: https://issues.apache.org/jira/browse/NUTCH-2007
Project: Nutch
Issue Type: B
[
https://issues.apache.org/jira/browse/NUTCH-2007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2007:
---
Attachment: NUTCH-2007-trunk-v1.patch
> add test libs to classpath of bin/nutch junit
> --
Sebastian Nagel created NUTCH-2008:
--
Summary: IndexerMapReduce to use single instance of
NutchIndexAction for deletions
Key: NUTCH-2008
URL: https://issues.apache.org/jira/browse/NUTCH-2008
Project:
[
https://issues.apache.org/jira/browse/NUTCH-2008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2008:
---
Attachment: NUTCH-2008-trunk-v1.patch
> IndexerMapReduce to use single instance of NutchIndexA
[
https://issues.apache.org/jira/browse/NUTCH-2008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2008:
---
Attachment: NUTCH-2008-trunk-v2.patch
Right, could be static. Thanks!
> IndexerMapReduce to u
[
https://issues.apache.org/jira/browse/NUTCH-2008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2008.
Resolution: Fixed
Assignee: Sebastian Nagel
Committed to trunk, r1679335.
> IndexerMa
[
https://issues.apache.org/jira/browse/NUTCH-2002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545473#comment-14545473
]
Sebastian Nagel commented on NUTCH-2002:
+1 makes ParserChecker a more powerful de
Sebastian Nagel created NUTCH-2012:
--
Summary: Merge parsechecker and indexchecker
Key: NUTCH-2012
URL: https://issues.apache.org/jira/browse/NUTCH-2012
Project: Nutch
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/NUTCH-2006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545527#comment-14545527
]
Sebastian Nagel commented on NUTCH-2006:
+1 to complete indexchecker (opened NUTCH
[
https://issues.apache.org/jira/browse/NUTCH-2002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545532#comment-14545532
]
Sebastian Nagel commented on NUTCH-2002:
one point: also redirects should be check
Sebastian Nagel created NUTCH-2013:
--
Summary: Fetcher: missing logs "fetching ..." on stdout
Key: NUTCH-2013
URL: https://issues.apache.org/jira/browse/NUTCH-2013
Project: Nutch
Issue Type:
Sebastian Nagel created NUTCH-2014:
--
Summary: Fetcher hang-up on completion
Key: NUTCH-2014
URL: https://issues.apache.org/jira/browse/NUTCH-2014
Project: Nutch
Issue Type: Bug
R
[
https://issues.apache.org/jira/browse/NUTCH-2014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2014:
---
Attachment: NUTCH-2014-v1.patch
The reason is a mix-up of the counters for active threads and
[
https://issues.apache.org/jira/browse/NUTCH-2014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2014:
---
Component/s: fetcher
Patch Info: Patch Available
Affects Version/s: 1.11
[
https://issues.apache.org/jira/browse/NUTCH-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reopened NUTCH-2011:
Sorry, but this needs some rework:
- after 35.000+ fetched pages and the default max. heap size
[
https://issues.apache.org/jira/browse/NUTCH-2015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14547213#comment-14547213
]
Sebastian Nagel commented on NUTCH-2015:
Ok. Ev. this could be changed to make it
[
https://issues.apache.org/jira/browse/NUTCH-1995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14547337#comment-14547337
]
Sebastian Nagel commented on NUTCH-1995:
Hi Guiseppe, wild cards are ok if it is a
[
https://issues.apache.org/jira/browse/NUTCH-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14547676#comment-14547676
]
Sebastian Nagel commented on NUTCH-2011:
Yes, that's because of the nodeDB feature
[
https://issues.apache.org/jira/browse/NUTCH-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14547760#comment-14547760
]
Sebastian Nagel commented on NUTCH-2011:
Hi [~sujenshah], first a few questions to
Sebastian Nagel created NUTCH-2016:
--
Summary: Remove OldFetcher from trunk
Key: NUTCH-2016
URL: https://issues.apache.org/jira/browse/NUTCH-2016
Project: Nutch
Issue Type: Wish
Com
Sebastian Nagel created NUTCH-2017:
--
Summary: Remove debug log from MimeUtil
Key: NUTCH-2017
URL: https://issues.apache.org/jira/browse/NUTCH-2017
Project: Nutch
Issue Type: Bug
Affects
[
https://issues.apache.org/jira/browse/NUTCH-2017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2017:
---
Attachment: NUTCH-2017.patch
> Remove debug log from MimeUtil
> --
[
https://issues.apache.org/jira/browse/NUTCH-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14547845#comment-14547845
]
Sebastian Nagel commented on NUTCH-2011:
??about modifying the CrawlDb to hold one
[
https://issues.apache.org/jira/browse/NUTCH-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2013:
---
Attachment: NUTCH-2013-v1.patch
Patch to make all classes in the fetcher package pulled out fr
[
https://issues.apache.org/jira/browse/NUTCH-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2013:
---
Patch Info: Patch Available
> Fetcher: missing logs "fetching ..." on stdout
> ---
[
https://issues.apache.org/jira/browse/NUTCH-2014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2014.
Resolution: Fixed
Committed to trunk/1.x, r1680109. Thanks for the review, [~lewismc]!
> Fe
[
https://issues.apache.org/jira/browse/NUTCH-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14549280#comment-14549280
]
Sebastian Nagel commented on NUTCH-2013:
Thanks! Committed to trunk/1.x, r1680110.
[
https://issues.apache.org/jira/browse/NUTCH-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2013.
Resolution: Fixed
Assignee: Sebastian Nagel
> Fetcher: missing logs "fetching ..." on
[
https://issues.apache.org/jira/browse/NUTCH-2014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2014:
--
Assignee: Sebastian Nagel
> Fetcher hang-up on completion
> ---
[
https://issues.apache.org/jira/browse/NUTCH-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14552524#comment-14552524
]
Sebastian Nagel commented on NUTCH-2011:
Yes, relying on CrawlDb should be the rig
[
https://issues.apache.org/jira/browse/NUTCH-1995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14553219#comment-14553219
]
Sebastian Nagel commented on NUTCH-1995:
Hi Chris, it's not about Guiseppe's use c
[
https://issues.apache.org/jira/browse/NUTCH-1995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14554954#comment-14554954
]
Sebastian Nagel commented on NUTCH-1995:
Agreed. If you know how to modify the cod
[
https://issues.apache.org/jira/browse/NUTCH-1995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14559883#comment-14559883
]
Sebastian Nagel commented on NUTCH-1995:
+1, yes
* there is already an output / lo
[
https://issues.apache.org/jira/browse/NUTCH-1995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560659#comment-14560659
]
Sebastian Nagel commented on NUTCH-1995:
The result of {{conf.getStrings("http.rob
[
https://issues.apache.org/jira/browse/NUTCH-1995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14560659#comment-14560659
]
Sebastian Nagel edited comment on NUTCH-1995 at 5/27/15 9:04 AM:
---
[
https://issues.apache.org/jira/browse/NUTCH-2007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2007:
--
Assignee: Sebastian Nagel
> add test libs to classpath of bin/nutch junit
> ---
[
https://issues.apache.org/jira/browse/NUTCH-1995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14561567#comment-14561567
]
Sebastian Nagel commented on NUTCH-1995:
Great work, [~gostep]! Please, resolve!
[
https://issues.apache.org/jira/browse/NUTCH-2007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2007.
Resolution: Fixed
Committed to trunk, r1682103.
> add test libs to classpath of bin/nutch j
[
https://issues.apache.org/jira/browse/NUTCH-1247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-1247.
Resolution: Not A Problem
Resolving. This is hardly necessary and would make CrawlDb incompa
[
https://issues.apache.org/jira/browse/NUTCH-2015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14567037#comment-14567037
]
Sebastian Nagel commented on NUTCH-2015:
+1 to commit [~sujenshah]'s latest patch
[
https://issues.apache.org/jira/browse/NUTCH-2035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14571680#comment-14571680
]
Sebastian Nagel commented on NUTCH-2035:
Thanks, [~betolink]!
* is it possible to
[
https://issues.apache.org/jira/browse/NUTCH-2032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14571694#comment-14571694
]
Sebastian Nagel commented on NUTCH-2032:
Hi [~betolink], your solution/patch alrea
[
https://issues.apache.org/jira/browse/NUTCH-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14571698#comment-14571698
]
Sebastian Nagel commented on NUTCH-2034:
Thanks, good idea! But strictly speaking
701 - 800 of 3450 matches
Mail list logo