[
https://issues.apache.org/jira/browse/NUTCH-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13256423#comment-13256423
]
Julien Nioche commented on NUTCH-1314:
--
I was under the impression that the patch did
[
https://issues.apache.org/jira/browse/NUTCH-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13256437#comment-13256437
]
Julien Nioche commented on NUTCH-1314:
--
This makes a good case for the merging of URL
[
https://issues.apache.org/jira/browse/NUTCH-1297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13256531#comment-13256531
]
Julien Nioche commented on NUTCH-1297:
--
Hi Ferdy
Indeed, it is related but does not
[
https://issues.apache.org/jira/browse/NUTCH-1331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13251684#comment-13251684
]
Julien Nioche commented on NUTCH-1331:
--
This can be done with the ScoringFilters
[
https://issues.apache.org/jira/browse/NUTCH-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13244143#comment-13244143
]
Julien Nioche commented on NUTCH-1234:
--
Markus - you need to update the list of
[
https://issues.apache.org/jira/browse/NUTCH-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13243096#comment-13243096
]
Julien Nioche commented on NUTCH-1234:
--
Sure, will have a look at it next week
[
https://issues.apache.org/jira/browse/NUTCH-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13241085#comment-13241085
]
Julien Nioche commented on NUTCH-1024:
--
Hi Markus
Will have a closer look later. 2
[
https://issues.apache.org/jira/browse/NUTCH-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13235468#comment-13235468
]
Julien Nioche commented on NUTCH-809:
-
Hi Lewis
bq. Can you confirm what you would
[
https://issues.apache.org/jira/browse/NUTCH-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13234316#comment-13234316
]
Julien Nioche commented on NUTCH-809:
-
Trunk : Committed revision 1303371.
Not
[
https://issues.apache.org/jira/browse/NUTCH-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13229290#comment-13229290
]
Julien Nioche commented on NUTCH-1310:
--
code location - same as
property
[
https://issues.apache.org/jira/browse/NUTCH-1297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13221908#comment-13221908
]
Julien Nioche commented on NUTCH-1297:
--
This can already be addressed by giving a
[
https://issues.apache.org/jira/browse/NUTCH-1293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13220087#comment-13220087
]
Julien Nioche commented on NUTCH-1293:
--
wrong patch?
[
https://issues.apache.org/jira/browse/NUTCH-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13220101#comment-13220101
]
Julien Nioche commented on NUTCH-1258:
--
Weird. Yes, please do fix and commit if you
[
https://issues.apache.org/jira/browse/NUTCH-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13212502#comment-13212502
]
Julien Nioche commented on NUTCH-1281:
--
Behnam,
I suppose that you are seeing this
[
https://issues.apache.org/jira/browse/NUTCH-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13210870#comment-13210870
]
Julien Nioche commented on NUTCH-1079:
--
Don't rely on me for this one. I am not in
[
https://issues.apache.org/jira/browse/NUTCH-1246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13210871#comment-13210871
]
Julien Nioche commented on NUTCH-1246:
--
open as not done in nutchgora AFAIK
[
https://issues.apache.org/jira/browse/NUTCH-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13207724#comment-13207724
]
Julien Nioche commented on NUTCH-1259:
--
good catch. Had overlooked the fact that the
[
https://issues.apache.org/jira/browse/NUTCH-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13207753#comment-13207753
]
Julien Nioche commented on NUTCH-1259:
--
bq. But what about segments fetched with and
[
https://issues.apache.org/jira/browse/NUTCH-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13205490#comment-13205490
]
Julien Nioche commented on NUTCH-1259:
--
I haven't looked at NUTCH-1024. Does it take
[
https://issues.apache.org/jira/browse/NUTCH-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13205670#comment-13205670
]
Julien Nioche commented on NUTCH-1259:
--
Nah, might as well do it in this one. Will
[
https://issues.apache.org/jira/browse/NUTCH-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13204463#comment-13204463
]
Julien Nioche commented on NUTCH-1259:
--
bq. // DO NOT ADD Content-Type FROM
[
https://issues.apache.org/jira/browse/NUTCH-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13202474#comment-13202474
]
Julien Nioche commented on NUTCH-1259:
--
bq. I'll commit this one tomorrow unless
[
https://issues.apache.org/jira/browse/NUTCH-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13201378#comment-13201378
]
Julien Nioche commented on NUTCH-1264:
--
Attached a second version which does not
[
https://issues.apache.org/jira/browse/NUTCH-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13197853#comment-13197853
]
Julien Nioche commented on NUTCH-1005:
--
Markus, the parser should store the MD in
[
https://issues.apache.org/jira/browse/NUTCH-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13197888#comment-13197888
]
Julien Nioche commented on NUTCH-1005:
--
bq. I assume i have to disable the indexing
[
https://issues.apache.org/jira/browse/NUTCH-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13197890#comment-13197890
]
Julien Nioche commented on NUTCH-1005:
--
BTW if you can think of a better name for
[
https://issues.apache.org/jira/browse/NUTCH-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13197898#comment-13197898
]
Julien Nioche commented on NUTCH-1005:
--
bq. index-meta comes to mind! It's exactly
[
https://issues.apache.org/jira/browse/NUTCH-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13196829#comment-13196829
]
Julien Nioche commented on NUTCH-1262:
--
Just wondering, does not Tika's Mimetype
[
https://issues.apache.org/jira/browse/NUTCH-1242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13196950#comment-13196950
]
Julien Nioche commented on NUTCH-1242:
--
Shouldn't this test for noFilter and
[
https://issues.apache.org/jira/browse/NUTCH-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13192997#comment-13192997
]
Julien Nioche commented on NUTCH-1258:
--
What about using a similar mechanism for the
[
https://issues.apache.org/jira/browse/NUTCH-1254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13188492#comment-13188492
]
Julien Nioche commented on NUTCH-1254:
--
Should be done as part of
[
https://issues.apache.org/jira/browse/NUTCH-1246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13185033#comment-13185033
]
Julien Nioche commented on NUTCH-1246:
--
trunk : Committed revision 1230610
[
https://issues.apache.org/jira/browse/NUTCH-1244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182575#comment-13182575
]
Julien Nioche commented on NUTCH-1244:
--
Not tested but looks Ok, compiles and passes
[
https://issues.apache.org/jira/browse/NUTCH-1244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13180434#comment-13180434
]
Julien Nioche commented on NUTCH-1244:
--
duplicates
[
https://issues.apache.org/jira/browse/NUTCH-1244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13180458#comment-13180458
]
Julien Nioche commented on NUTCH-1244:
--
yep. Would be good to add an optional filter
[
https://issues.apache.org/jira/browse/NUTCH-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13179382#comment-13179382
]
Julien Nioche commented on NUTCH-1241:
--
Entering '.+/product/.*' is not that
[
https://issues.apache.org/jira/browse/NUTCH-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13179501#comment-13179501
]
Julien Nioche commented on NUTCH-1241:
--
bq. However, due to NUTCH-1029 i cannot test
[
https://issues.apache.org/jira/browse/NUTCH-1184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13172397#comment-13172397
]
Julien Nioche commented on NUTCH-1184:
--
Just managed to have a look and haven't seen
[
https://issues.apache.org/jira/browse/NUTCH-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13163481#comment-13163481
]
Julien Nioche commented on NUTCH-1047:
--
bq. If you'd need WARC files, for some
[
https://issues.apache.org/jira/browse/NUTCH-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13163704#comment-13163704
]
Julien Nioche commented on NUTCH-1047:
--
The class NutchIndexWriter and
[
https://issues.apache.org/jira/browse/NUTCH-1047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13162704#comment-13162704
]
Julien Nioche commented on NUTCH-1047:
--
It would be nice to have a plugin
[
https://issues.apache.org/jira/browse/NUTCH-1213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13158429#comment-13158429
]
Julien Nioche commented on NUTCH-1213:
--
Looks fine to me, feel free to go ahead and
[
https://issues.apache.org/jira/browse/NUTCH-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13156645#comment-13156645
]
Julien Nioche commented on NUTCH-1205:
--
Why upgrading the sub-dependencies such as
[
https://issues.apache.org/jira/browse/NUTCH-1184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13150453#comment-13150453
]
Julien Nioche commented on NUTCH-1184:
--
Markus, can you hold it until 1.4 is
[
https://issues.apache.org/jira/browse/NUTCH-1200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13148393#comment-13148393
]
Julien Nioche commented on NUTCH-1200:
--
I'm definitely against the idea of putting
[
https://issues.apache.org/jira/browse/NUTCH-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13143038#comment-13143038
]
Julien Nioche commented on NUTCH-1098:
--
@Radim
Sounds like I am not going to is your
[
https://issues.apache.org/jira/browse/NUTCH-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13141267#comment-13141267
]
Julien Nioche commented on NUTCH-1188:
--
+1 to commit. See corresponding class in
[
https://issues.apache.org/jira/browse/NUTCH-882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13140008#comment-13140008
]
Julien Nioche commented on NUTCH-882:
-
nope, go ahead
Design a Host
[
https://issues.apache.org/jira/browse/NUTCH-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13140047#comment-13140047
]
Julien Nioche commented on NUTCH-1185:
--
Or we could catch the exceptions (OOME or
[
https://issues.apache.org/jira/browse/NUTCH-672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117213#comment-13117213
]
Julien Nioche commented on NUTCH-672:
-
You are welcome. Looks fine to me +1 to commit
[
https://issues.apache.org/jira/browse/NUTCH-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13116308#comment-13116308
]
Julien Nioche commented on NUTCH-1078:
--
I had modified LogUtil in 2.0 (see
51 matches
Mail list logo