Hi Scott,
feel free - I have no options on that.
From my very little point of view the nutch > .8 source stream is a
one way street.
In all my projects we move as far as possible away from nutch. I like
hadoop a lot and writing customer tools on top of it is - that easy.
But nutch .8 was a p
Th old hadoop patch is here:
https://issues.apache.org/jira/browse/NUTCH-251
Also we had this conversation:
http://www.mail-archive.com/hadoop-dev@lucene.apache.org/msg00314.html
I guess after this we missed to post the patches we use internally.
If someone feels strong about getting the gui work
Stefan,
I also dived into contrib/web2 in nutch. The one and admin-gui are
both owns some plugins based on nutch plugin architecture. So I think
it is great if we extract something in high level and they should have
a lot commons. Well, i dont know it is the right time to do this job.
On 1/19/0
Stefan Groschupf wrote:
We run the gui in several production environemnts with patched hadoop
code - since this is from our point of view the clean approach.
Everything else feels like a workaround to fix some strange hadoop
behaviors.
Are there issues in Hadoop's Jira for these? If so, do t
Hi,
I just finished reading all source code about nutch gui. And
personally i don't like putting a lot of code snippets into jsp files
since it takes a lot time when refactoring. So how about to adopt
using velocity/freemarker with servlet?
In general I agree it is the view layer and should ha
On Jan 18, 2007, at 4:44 PM, Andrzej Bialecki wrote:
java.io.EOFException
at java.io.DataInputStream.readFully(DataInputStream.java:
178)
at org.apache.hadoop.io.DataOutputBuffer$Buffer.write
(DataOutputBuffer.java:57)
at org.apache.hadoop.io.DataOutputBuffer.write
Brian Whitman wrote:
I wanted to try last night's nightly for the new freegen command.
On my test case, which is:
rm -rf crawl
bin/nutch inject crawl/crawldb urls/ # a single URL is in urls/urls
bin/nutch generate crawl/crawldb crawl/segments
bin/nutch fetch crawl/segments/2007...
bin/nutch upd
I wanted to try last night's nightly for the new freegen command.
On my test case, which is:
rm -rf crawl
bin/nutch inject crawl/crawldb urls/ # a single URL is in urls/urls
bin/nutch generate crawl/crawldb crawl/segments
bin/nutch fetch crawl/segments/2007...
bin/nutch updatedb crawl/crawldb cr
Hi,
I just finished reading all source code about nutch gui. And
personally i don't like putting a lot of code snippets into jsp files
since it takes a lot time when refactoring. So how about to adopt
using velocity/freemarker with servlet?
On 1/17/07, Enis Soztutar <[EMAIL PROTECTED]> wrote:
H
[
https://issues.apache.org/jira/browse/NUTCH-39?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465775
]
fantoni benjamin commented on NUTCH-39:
---
Somebody can explain to me what is the pivot?
> pagination in search re
Now i wonder if its possible to implement a 'pattern topic search'
in the form of 'topic:foo*' to match topics with a value 'foobar'
too ?
ok - this could be the answer ...
http://www.mail-archive.com/nutch-user@lucene.apache.org/msg04519.html
Hi,
i wrote a test plugin with indexer and searcher extension points.
The wiki and the creative commons plugin provided good help so
searches for 'topic:foo' result in matches with an index field
'topic' of value 'foo'.
Now i wonder if its possible to implement a 'pattern topic search' in
[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465700
]
Armel Nene commented on NUTCH-61:
-
I have attached a new patch as the old one need updating before using with
Nutch 0.
[
https://issues.apache.org/jira/browse/NUTCH-61?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Armel Nene updated NUTCH-61:
Attachment: nutch-61-492176.patch
> Adaptive re-fetch interval. Detecting umodified content
> ---
14 matches
Mail list logo