[
https://issues.apache.org/jira/browse/NUTCH-1075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1075:
-
Attachment: NUTCH-1075-v3.patch
Added parameter to bypass the check on isReasonablyCertain
[
https://issues.apache.org/jira/browse/NUTCH-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche reassigned NUTCH-1064:
Assignee: Julien Nioche
> o.a.n.util.MimeUtil uses deprecated Tika meth
[
https://issues.apache.org/jira/browse/NUTCH-1075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13087685#comment-13087685
]
Julien Nioche commented on NUTCH-1075:
--
the identification should not be affecte
[
https://issues.apache.org/jira/browse/NUTCH-1075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1075.
--
Resolution: Fixed
Committed revision 1159621.
Thanks for reviewing it!
> Delegate langu
[
https://issues.apache.org/jira/browse/NUTCH-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1085:
-
Attachment: NUTCH-1085.patch
> Nutch script does not require HADOOP_H
Reporter: Julien Nioche
Assignee: Julien Nioche
Fix For: 1.4, 2.0
The Nutch script currently requires HADOOP_HOME to be set and point to a valid
Hadoop setup in order to run in distributed mode. What it actually needs is not
the location of the whole Hadoop setup but just to
[
https://issues.apache.org/jira/browse/NUTCH-1067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13088602#comment-13088602
]
Julien Nioche commented on NUTCH-1067:
--
Looks good but 2 comments th
[
https://issues.apache.org/jira/browse/NUTCH-1073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13088597#comment-13088597
]
Julien Nioche commented on NUTCH-1073:
--
Will commit shortly unless someone ha
[
https://issues.apache.org/jira/browse/NUTCH-1067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13088677#comment-13088677
]
Julien Nioche commented on NUTCH-1067:
--
{quote}
* this is going to be diffi
[
https://issues.apache.org/jira/browse/NUTCH-1073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13088712#comment-13088712
]
Julien Nioche commented on NUTCH-1073:
--
Don't think it can be applied as is
[
https://issues.apache.org/jira/browse/NUTCH-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1085.
--
Resolution: Fixed
Trunk : Committed revision 1160738
1.4 : Committed revision 1160734
> Nu
[
https://issues.apache.org/jira/browse/NUTCH-1089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1089.
--
Resolution: Fixed
1.4 Committed revision 1160753.
trunk Committed revision 1160754
Thanks
[
https://issues.apache.org/jira/browse/NUTCH-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13090081#comment-13090081
]
Julien Nioche commented on NUTCH-1057:
--
Haven't you committed it already?
[
https://issues.apache.org/jira/browse/NUTCH-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13090082#comment-13090082
]
Julien Nioche commented on NUTCH-1024:
--
Do you mind if we wait a bit? I'
[
https://issues.apache.org/jira/browse/NUTCH-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13090115#comment-13090115
]
Julien Nioche commented on NUTCH-1024:
--
There is a JIRA issue for 2.0 h
[
https://issues.apache.org/jira/browse/NUTCH-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13090477#comment-13090477
]
Julien Nioche commented on NUTCH-1095:
--
+1 thanks!
> remove i18n from Nutch
[
https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13090811#comment-13090811
]
Julien Nioche commented on NUTCH-937:
-
Markus,
The param plugin.folder is multiva
[
https://issues.apache.org/jira/browse/NUTCH-990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche reopened NUTCH-990:
-
> protocol-httpclient fails with short pa
[
https://issues.apache.org/jira/browse/NUTCH-990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-990.
-
Resolution: Fixed
Fix Version/s: (was: 1.3)
1.4
A patch has been
[
https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13091747#comment-13091747
]
Julien Nioche commented on NUTCH-937:
-
@Radim : Nutch is based on the Ap
[
https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13093807#comment-13093807
]
Julien Nioche commented on NUTCH-937:
-
@Ferdy - good detective work! I like
[
https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche reassigned NUTCH-937:
---
Assignee: Julien Nioche (was: Markus Jelsma)
> When nutch is run on hadoop > 0.20.2 (
[
https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-937:
Priority: Minor (was: Major)
Patch Info: [Patch Available]
Issue Type: Improvement (was
[
https://issues.apache.org/jira/browse/NUTCH-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13095229#comment-13095229
]
Julien Nioche commented on NUTCH-937:
-
Works fine on Hadoop-0.20.203.0 and
[
https://issues.apache.org/jira/browse/NUTCH-1073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1073.
--
Resolution: Fixed
Committed revision 1164064.
> Rename paramet
[
https://issues.apache.org/jira/browse/NUTCH-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1096.
--
Resolution: Fixed
trunk : Committed revision 1164107
1.4 : Committed revision 1164108
Thanks
[
https://issues.apache.org/jira/browse/NUTCH-1102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13097948#comment-13097948
]
Julien Nioche commented on NUTCH-1102:
--
@Markus : in the future maybe try and ha
[
https://issues.apache.org/jira/browse/NUTCH-1067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13097950#comment-13097950
]
Julien Nioche commented on NUTCH-1067:
--
see comments on NUTCH-1102
Patch for
[
https://issues.apache.org/jira/browse/NUTCH-1101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13097955#comment-13097955
]
Julien Nioche commented on NUTCH-1101:
--
The functionality makes sense but I am
[
https://issues.apache.org/jira/browse/NUTCH-1108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102011#comment-13102011
]
Julien Nioche commented on NUTCH-1108:
--
Can you parse the file successfully with
[
https://issues.apache.org/jira/browse/NUTCH-914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche closed NUTCH-914.
---
Resolution: Fixed
Thanks Lewis
> Implement Apache Project Branding Requireme
[
https://issues.apache.org/jira/browse/NUTCH-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13104427#comment-13104427
]
Julien Nioche commented on NUTCH-1005:
--
Can't you do that with urlmet
[
https://issues.apache.org/jira/browse/NUTCH-1067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche reopened NUTCH-1067:
--
At revision 1170548.
ant clean then ant =>
compile-core:
[javac] /data/nutch-1.4/build.
[
https://issues.apache.org/jira/browse/NUTCH-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13105456#comment-13105456
]
Julien Nioche commented on NUTCH-1005:
--
you are right. I'd read your com
[
https://issues.apache.org/jira/browse/NUTCH-1112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche closed NUTCH-1112.
Resolution: Duplicate
https://issues.apache.org/jira/browse/NUTCH-1089 already fixed this. Thanks
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108633#comment-13108633
]
Julien Nioche commented on NUTCH-1052:
--
I like the original idea and agree
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108701#comment-13108701
]
Julien Nioche commented on NUTCH-1052:
--
Yep, that's the idea.
The class
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108757#comment-13108757
]
Julien Nioche commented on NUTCH-1052:
--
{quote}
Julien, will it break on Ha
[
https://issues.apache.org/jira/browse/NUTCH-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13109435#comment-13109435
]
Julien Nioche commented on NUTCH-1005:
--
let's try and come up with a sing
[
https://issues.apache.org/jira/browse/NUTCH-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13112576#comment-13112576
]
Julien Nioche commented on NUTCH-1115:
--
+1 Don't forget to add the same
[
https://issues.apache.org/jira/browse/NUTCH-1129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13114077#comment-13114077
]
Julien Nioche commented on NUTCH-1129:
--
Any23 might graduate into a Tika subpro
Reporter: Julien Nioche
Fix For: 2.0
We had to build GORA locally prior to building Nutch 2.0 but can now rely on
the published artefacts with version 0.1.1-incubation
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http
[
https://issues.apache.org/jira/browse/NUTCH-1131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche closed NUTCH-1131.
> Rely on published artefacts for GORA
[
https://issues.apache.org/jira/browse/NUTCH-1131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1131.
--
Resolution: Fixed
Committed revision 1175571.
> Rely on published artefacts for G
[
https://issues.apache.org/jira/browse/NUTCH-882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13262500#comment-13262500
]
Julien Nioche commented on NUTCH-882:
-
Ferdy I'll let you close it. I don
[
https://issues.apache.org/jira/browse/NUTCH-1347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13265728#comment-13265728
]
Julien Nioche commented on NUTCH-1347:
--
Not clear what the issue is. You can g
[
https://issues.apache.org/jira/browse/NUTCH-1347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13265804#comment-13265804
]
Julien Nioche commented on NUTCH-1347:
--
bq. i can not recognize your solution
th
[
https://issues.apache.org/jira/browse/NUTCH-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13270304#comment-13270304
]
Julien Nioche commented on NUTCH-809:
-
Kristof, please use the mailing list ins
[
https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1370:
-
Priority: Minor (was: Major)
Running in pseudo-distributed mode gives you more information if
Julien Nioche created NUTCH-1371:
Summary: Replace Ivy with Maven Ant tasks
Key: NUTCH-1371
URL: https://issues.apache.org/jira/browse/NUTCH-1371
Project: Nutch
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/NUTCH-1371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1371:
-
Attachment: NUTCH-1371.patch
Preliminary version. Needs maven-ant-tasks-2.1.3.jar in ivy dir
[
https://issues.apache.org/jira/browse/NUTCH-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13281024#comment-13281024
]
Julien Nioche commented on NUTCH-1375:
--
your patch generates nois
[
https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1370:
-
Affects Version/s: (was: 1.4)
1.5
Fix Version/s: (was: 1.5
Julien Nioche created NUTCH-1396:
Summary: Upgrade to Tika 1.1
Key: NUTCH-1396
URL: https://issues.apache.org/jira/browse/NUTCH-1396
Project: Nutch
Issue Type: Bug
Affects Versions
[
https://issues.apache.org/jira/browse/NUTCH-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1396:
-
Attachment: NUTCH-1396.patch
> Upgrade to Tika
[
https://issues.apache.org/jira/browse/NUTCH-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche closed NUTCH-1396.
Assignee: Julien Nioche
Thanks Lewis
> Upgrade to Tika
[
https://issues.apache.org/jira/browse/NUTCH-1081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295672#comment-13295672
]
Julien Nioche commented on NUTCH-1081:
--
The tests for nutchgora seem to work
Julien Nioche created NUTCH-1398:
Summary: Upgrade to Hadoop 1.0.3
Key: NUTCH-1398
URL: https://issues.apache.org/jira/browse/NUTCH-1398
Project: Nutch
Issue Type: Improvement
Affects
[
https://issues.apache.org/jira/browse/NUTCH-1398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295674#comment-13295674
]
Julien Nioche commented on NUTCH-1398:
--
trunk : Committed revision 1350630.
[
https://issues.apache.org/jira/browse/NUTCH-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295738#comment-13295738
]
Julien Nioche commented on NUTCH-1397:
--
Lewis, the language identification
Julien Nioche created NUTCH-1399:
Summary: TestProtocolHttpClient fails
Key: NUTCH-1399
URL: https://issues.apache.org/jira/browse/NUTCH-1399
Project: Nutch
Issue Type: Bug
Affects
[
https://issues.apache.org/jira/browse/NUTCH-1399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1399:
-
Attachment: NUTCH-1399.patch
> TestProtocolHttpClient fa
Julien Nioche created NUTCH-1401:
Summary: Upgrade to Hadoop 1.0.3
Key: NUTCH-1401
URL: https://issues.apache.org/jira/browse/NUTCH-1401
Project: Nutch
Issue Type: Improvement
Affects
Julien Nioche created NUTCH-1402:
Summary: Create AbstractScoringFilter
Key: NUTCH-1402
URL: https://issues.apache.org/jira/browse/NUTCH-1402
Project: Nutch
Issue Type: Improvement
Julien Nioche created NUTCH-1403:
Summary: Add default ScoringFilter for manipulating metadata
Key: NUTCH-1403
URL: https://issues.apache.org/jira/browse/NUTCH-1403
Project: Nutch
Issue
[
https://issues.apache.org/jira/browse/NUTCH-1401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1401:
-
Fix Version/s: (was: 1.6)
1.5.1
Assignee: Julien Nioche
[
https://issues.apache.org/jira/browse/NUTCH-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1400:
-
Fix Version/s: (was: 2.1)
(was: 1.6)
1.5.1
Julien Nioche created NUTCH-1404:
Summary: Nutch script fails to find job file in deploy mode
Key: NUTCH-1404
URL: https://issues.apache.org/jira/browse/NUTCH-1404
Project: Nutch
Issue Type
[
https://issues.apache.org/jira/browse/NUTCH-1398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1398:
-
Affects Version/s: (was: nutchgora)
Fix Version/s: (was: 2.1)
> Upgrade
[
https://issues.apache.org/jira/browse/NUTCH-1398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1398:
-
Fix Version/s: (was: 1.6)
1.5.1
> Upgrade to Hadoop 1.
[
https://issues.apache.org/jira/browse/NUTCH-1398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1398.
--
Resolution: Fixed
> Upgrade to Hadoop 1.
[
https://issues.apache.org/jira/browse/NUTCH-1401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1401.
--
Resolution: Fixed
Committed revision 1351705 in branch nutchgora
> Upgr
[
https://issues.apache.org/jira/browse/NUTCH-1401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1401:
-
Affects Version/s: (was: 1.5)
Fix Version/s: (was: 1.5.1)
> Upgrade
[
https://issues.apache.org/jira/browse/NUTCH-1404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1404.
--
Resolution: Fixed
Nutchgora : Committed revision 1351707.
Trunk : Committed revision 1351709
[
https://issues.apache.org/jira/browse/NUTCH-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1400.
--
Resolution: Fixed
Trunk => Committed revision 1352008.
NutchGora => Committed revision 1
[
https://issues.apache.org/jira/browse/NUTCH-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1391:
-
Fix Version/s: (was: 2.1)
nutchgora
I think we should fix it for 2.0
[
https://issues.apache.org/jira/browse/NUTCH-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13397411#comment-13397411
]
Julien Nioche commented on NUTCH-1391:
--
A repeat of NUTCH-1110 -> we d
[
https://issues.apache.org/jira/browse/NUTCH-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche resolved NUTCH-1391.
--
Resolution: Fixed
Committed revision 1352037.
> readdb -stats fi
[
https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398247#comment-13398247
]
Julien Nioche commented on NUTCH-1406:
--
See http://wiki.apache.org/n
[
https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398340#comment-13398340
]
Julien Nioche commented on NUTCH-1031:
--
crawler-commons is not super active a
[
https://issues.apache.org/jira/browse/NUTCH-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398363#comment-13398363
]
Julien Nioche commented on NUTCH-1388:
--
Let's release 1.
[
https://issues.apache.org/jira/browse/NUTCH-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398362#comment-13398362
]
Julien Nioche commented on NUTCH-1341:
--
Let's release 1.5.1 first then add
[
https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13398420#comment-13398420
]
Julien Nioche commented on NUTCH-1406:
--
bq. index-metatags plugin (sometimes
[
https://issues.apache.org/jira/browse/NUTCH-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13399250#comment-13399250
]
Julien Nioche commented on NUTCH-1406:
--
BTW we have formatting rules for Eclips
[
https://issues.apache.org/jira/browse/NUTCH-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13401465#comment-13401465
]
Julien Nioche commented on NUTCH-1405:
--
Correct me if I'm wrong but doe
[
https://issues.apache.org/jira/browse/NUTCH-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13402940#comment-13402940
]
Julien Nioche commented on NUTCH-1405:
--
Can you please add some tests for thi
[
https://issues.apache.org/jira/browse/NUTCH-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13402973#comment-13402973
]
Julien Nioche commented on NUTCH-1405:
--
what about the command "nu
[
https://issues.apache.org/jira/browse/NUTCH-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403052#comment-13403052
]
Julien Nioche commented on NUTCH-1405:
--
Markus, make sure you generate a patch
[
https://issues.apache.org/jira/browse/NUTCH-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1087:
-
Attachment: crawl
WORK IN PROGRESS
Need to add more comments + include the injection, linkd and
[
https://issues.apache.org/jira/browse/NUTCH-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche reassigned NUTCH-1087:
Assignee: Julien Nioche
> Deprecate crawl command and replace with example scr
[
https://issues.apache.org/jira/browse/NUTCH-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13406551#comment-13406551
]
Julien Nioche commented on NUTCH-1405:
--
db.injector.preserve.metadata is prob
[
https://issues.apache.org/jira/browse/NUTCH-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13406564#comment-13406564
]
Julien Nioche commented on NUTCH-1405:
--
the way I was thinking about it was
[
https://issues.apache.org/jira/browse/NUTCH-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13406942#comment-13406942
]
Julien Nioche commented on NUTCH-1405:
--
Passes the tests, all good! +1 Thanks Ma
[
https://issues.apache.org/jira/browse/NUTCH-1414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13408055#comment-13408055
]
Julien Nioche commented on NUTCH-1414:
--
I'm concerned about the prolife
[
https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13409578#comment-13409578
]
Julien Nioche commented on NUTCH-1360:
--
Guys, unless a change is trivial pleas
[
https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1360:
-
Fix Version/s: (was: nutchgora)
2.1
> Suport the storing of
[
https://issues.apache.org/jira/browse/NUTCH-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1087:
-
Attachment: NUTCH-1087.patch
First version of the nutch crawl script. Please test and review
[
https://issues.apache.org/jira/browse/NUTCH-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1087:
-
Attachment: (was: crawl)
> Deprecate crawl command and replace with example scr
[
https://issues.apache.org/jira/browse/NUTCH-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13410344#comment-13410344
]
Julien Nioche commented on NUTCH-1087:
--
Good catch Markus. Ideally we'd ne
[
https://issues.apache.org/jira/browse/NUTCH-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-1087:
-
Attachment: NUTCH-1087-1.6-3.patch
The script now determines where the nutch script is located
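For readers unfamiliar with the technique, one common way for a shell script to find a sibling script is to resolve its own directory from `$0`. The following is only a sketch under that assumption, not the code from the attached patch; the helper name `resolve_script_dir` and the variables `SCRIPT_DIR` and `NUTCH` are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper (not taken from the patch): print the absolute
# directory that contains the script whose path is passed as $1.
resolve_script_dir() {
  # Run in a subshell so the caller's working directory is unchanged;
  # CDPATH is cleared so that cd does not print or search elsewhere.
  (CDPATH= cd -- "$(dirname -- "$1")" && pwd -P)
}

# Assumption: the nutch launcher sits next to the calling script.
SCRIPT_DIR=$(resolve_script_dir "$0")
NUTCH="$SCRIPT_DIR/nutch"
```

This resolves symlinked directories through `pwd -P`, but it does not follow a symlink that points at the script file itself.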