tickets for nutch beginners

2013-04-27 Thread Michael Aro
Hi, Could you curate some tickets for a beginner? Mike.

version for apache nutch giraph integration and irc

2013-04-27 Thread Michael Aro
Hi All, I have being reading the nutch wiki. Currently, there are branches for 2.x e.g. 2.1 and 1.x e.g. 1.6. The trunk is the 1.6 version. The trunk contains the ".webgraph" package including classes LinkRank.java and WebGraph.java. These classes do not exist in Nutch 2.1. Have they being modifi

Re: version for apache nutch giraph integration and irc

2013-04-27 Thread Sebastian Nagel
Hi Mike > I have being reading the nutch wiki. Currently, there are branches for 2.x > e.g. 2.1 and 1.x e.g. 1.6. The trunk is the 1.6 version. Trunk is for the 1.x releases (last release is 1.6), while branches/2.x is 2.1, etc. > The trunk contains the ".webgraph" package including classes Link

Re: tickets for nutch beginners

2013-04-27 Thread Tejas Patil
Hi Mike, There were few jiras for some plugins not having junits. You can go through the corresponding plugin code, understand its working and then write a junit for it. That would give you some idea about the nutch code. Below are those jira links: https://issues.apache.org/jira/browse/NUTCH-1116

[Nutch Wiki] Update of "AdminGroup" by kiranchitturi

2013-04-27 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "AdminGroup" page has been changed by kiranchitturi: http://wiki.apache.org/nutch/AdminGroup?action=diff&rev1=10&rev2=11 * MarkusJelsma * FerdyGalema * kiranchitturi + * Te

[Nutch Wiki] Update of "ContributorsGroup" by kiranchitturi

2013-04-27 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "ContributorsGroup" page has been changed by kiranchitturi: http://wiki.apache.org/nutch/ContributorsGroup?action=diff&rev1=4&rev2=5 * AdminGroup * ElisabethAdler * EdwardDr

[jira] [Commented] (NUTCH-969) FTP erro encoding

2013-04-27 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13643796#comment-13643796 ] Sebastian Nagel commented on NUTCH-969: --- The problem is that the URL encoded path (pe

[jira] [Updated] (NUTCH-969) FTP erro encoding

2013-04-27 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-969: -- Fix Version/s: 1.8 > FTP erro encoding > - > > Key: NUTCH-96

[jira] [Updated] (NUTCH-969) protocol-ftp with configurable encoding

2013-04-27 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-969: -- Summary: protocol-ftp with configurable encoding (was: FTP erro encoding) > protocol-ftp wi

[jira] [Updated] (NUTCH-969) FTP erro encoding

2013-04-27 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-969: -- Issue Type: Improvement (was: Bug) > FTP erro encoding > - > >

[Nutch Wiki] Update of "bin/nutch generate" by TejasPatil

2013-04-27 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The "bin/nutch generate" page has been changed by TejasPatil: http://wiki.apache.org/nutch/bin/nutch%20generate?action=diff&rev1=1&rev2=2 Comment: added the usage for generate in 2.x

[jira] [Updated] (NUTCH-829) duplicate hadoop temp files

2013-04-27 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tejas Patil updated NUTCH-829: -- Attachment: NUTCH-829.v2.patch Hi Lewis, There was one more place in Generator where this change could h

[jira] [Commented] (NUTCH-829) duplicate hadoop temp files

2013-04-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13643845#comment-13643845 ] Lewis John McGibbney commented on NUTCH-829: +1 for commit. @Tejas, please remo

[jira] [Resolved] (NUTCH-829) duplicate hadoop temp files

2013-04-27 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tejas Patil resolved NUTCH-829. --- Resolution: Fixed Thanks Lewis for pointing that out. Committed @ revision 1476702 >

[jira] [Commented] (NUTCH-1528) Port nutch-mongodb-indexer to Nutch

2013-04-27 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13643858#comment-13643858 ] Tejas Patil commented on NUTCH-1528: As this change ain't going to the repo, should we

[jira] [Commented] (NUTCH-346) Improve readability of logs/hadoop.log

2013-04-27 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13643863#comment-13643863 ] Tejas Patil commented on NUTCH-346: --- I think that this will be a good addition as current

[jira] [Commented] (NUTCH-1528) Port nutch-mongodb-indexer to Nutch

2013-04-27 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13643877#comment-13643877 ] Lewis John McGibbney commented on NUTCH-1528: - For the sake of clarity and whe

[jira] [Commented] (NUTCH-1528) Port nutch-mongodb-indexer to Nutch

2013-04-27 Thread Tejas Patil (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13643879#comment-13643879 ] Tejas Patil commented on NUTCH-1528: I think that [~jnioche] and [~wastl-nagel] might

[jira] [Commented] (NUTCH-829) duplicate hadoop temp files

2013-04-27 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13643890#comment-13643890 ] Hudson commented on NUTCH-829: -- Integrated in Nutch-trunk #2183 (See [https://builds.apache.o

Jenkins build is back to normal : Nutch-trunk #2183

2013-04-27 Thread Apache Jenkins Server
See

Build failed in Jenkins: Nutch-nutchgora #584

2013-04-27 Thread Apache Jenkins Server
See -- [...truncated 2883 lines...] clean-lib: resolve-default: [ivy:resolve] :: loading settings :: file = /zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/ivy/ivysetti

Build failed in Jenkins: Nutch-trunk #2184

2013-04-27 Thread Apache Jenkins Server
See -- [...truncated 1096 lines...] AU src/plugin/subcollection/src/java/org/apache/nutch/collection/CollectionManager.java AU src/plugin/subcollection/src/java/org/apache/nutch/collection/pac