Re: Next Nutch release

2007-01-18 Thread Stefan Groschupf
Hi Scott, feel free - I have no objections to that. From my limited point of view, the nutch > .8 source stream is a one-way street. In all my projects we move as far away from nutch as possible. I like hadoop a lot, and writing customer tools on top of it is that easy. But nutch .8 was a p

Re: Next Nutch release

2007-01-18 Thread Stefan Groschupf
The old hadoop patch is here: https://issues.apache.org/jira/browse/NUTCH-251 Also we had this conversation: http://www.mail-archive.com/hadoop-dev@lucene.apache.org/msg00314.html I guess after this we neglected to post the patches we use internally. If someone feels strongly about getting the gui work

Re: Next Nutch release

2007-01-18 Thread Scott Green
Stefan, I also dived into contrib/web2 in nutch. Both it and admin-gui own some plugins based on the nutch plugin architecture. So I think it would be great if we extracted something at a high level, since they should have a lot in common. Well, I don't know if it is the right time to do this job. On 1/19/0

Re: Next Nutch release

2007-01-18 Thread Doug Cutting
Stefan Groschupf wrote: We run the gui in several production environments with patched hadoop code - since this is from our point of view the clean approach. Everything else feels like a workaround to fix some strange hadoop behaviors. Are there issues in Hadoop's Jira for these? If so, do t

Re: Next Nutch release

2007-01-18 Thread Stefan Groschupf
Hi, I just finished reading all the source code of the nutch gui. Personally, I don't like putting a lot of code snippets into jsp files, since it takes a lot of time when refactoring. So how about adopting velocity/freemarker with servlets? In general I agree it is the view layer and should ha

Re: java.io.EOFException in latest nightly in mergesegs from hadoop.io.DataOutputBuffer

2007-01-18 Thread Brian Whitman
On Jan 18, 2007, at 4:44 PM, Andrzej Bialecki wrote:
  java.io.EOFException
    at java.io.DataInputStream.readFully(DataInputStream.java:178)
    at org.apache.hadoop.io.DataOutputBuffer$Buffer.write(DataOutputBuffer.java:57)
    at org.apache.hadoop.io.DataOutputBuffer.write

Re: java.io.EOFException in latest nightly in mergesegs from hadoop.io.DataOutputBuffer

2007-01-18 Thread Andrzej Bialecki
Brian Whitman wrote: I wanted to try last night's nightly for the new freegen command. On my test case, which is:
  rm -rf crawl
  bin/nutch inject crawl/crawldb urls/ # a single URL is in urls/urls
  bin/nutch generate crawl/crawldb crawl/segments
  bin/nutch fetch crawl/segments/2007...
  bin/nutch upd

java.io.EOFException in latest nightly in mergesegs from hadoop.io.DataOutputBuffer

2007-01-18 Thread Brian Whitman
I wanted to try last night's nightly for the new freegen command. On my test case, which is:
  rm -rf crawl
  bin/nutch inject crawl/crawldb urls/ # a single URL is in urls/urls
  bin/nutch generate crawl/crawldb crawl/segments
  bin/nutch fetch crawl/segments/2007...
  bin/nutch updatedb crawl/crawldb cr

Re: Next Nutch release

2007-01-18 Thread Scott Green
Hi, I just finished reading all the source code of the nutch gui. Personally, I don't like putting a lot of code snippets into jsp files, since it takes a lot of time when refactoring. So how about adopting velocity/freemarker with servlets? On 1/17/07, Enis Soztutar <[EMAIL PROTECTED]> wrote: H

[jira] Commented: (NUTCH-39) pagination in search result

2007-01-18 Thread fantoni benjamin (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-39?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465775 ] fantoni benjamin commented on NUTCH-39: Can somebody explain to me what the pivot is? > pagination in search re

Re: Field.index...

2007-01-18 Thread Paul Sponagl
Now I wonder if it's possible to implement a 'pattern topic search' in the form of 'topic:foo*' to match topics with a value 'foobar' too? OK - this could be the answer ... http://www.mail-archive.com/nutch-user@lucene.apache.org/msg04519.html

Field.index...

2007-01-18 Thread Paul Sponagl
Hi, I wrote a test plugin with indexer and searcher extension points. The wiki and the creative commons plugin provided good help, so searches for 'topic:foo' result in matches with an index field 'topic' of value 'foo'. Now I wonder if it's possible to implement a 'pattern topic search' in

[jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content

2007-01-18 Thread Armel Nene (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465700 ] Armel Nene commented on NUTCH-61: I have attached a new patch, as the old one needed updating before using with Nutch 0.

[jira] Updated: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content

2007-01-18 Thread Armel Nene (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Armel Nene updated NUTCH-61: Attachment: nutch-61-492176.patch > Adaptive re-fetch interval. Detecting umodified content > ---