Storing the index on a DFS works: just change the configuration to
use DFS in nutch.war/WEB-INF/classes/nutch-default.xml and set the
correct path in the property searcher.dir.
However, it is slow.
Anyway, since you mention a little search app, I strongly suggest
using a local file system.
Nutch 0.8 runs on a local file system by default and requires neither
many boxes nor a DFS!
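
A rough sketch of what that configuration change might look like, not a
tested setup. Only searcher.dir is named above; fs.default.name is
assumed here to be the property that selects the file system, and the
host, port, and path values are placeholders. The entries go inside the
existing root element of the configuration file.

```xml
<!-- Assumption: fs.default.name selects the file system ("local" by default). -->
<property>
  <name>fs.default.name</name>
  <!-- Placeholder host:port of the DFS namenode -->
  <value>namenode.example.com:9000</value>
</property>

<!-- Named in the message above: where the searcher looks for the index. -->
<property>
  <name>searcher.dir</name>
  <!-- Placeholder path to the crawl directory on the DFS -->
  <value>/user/nutch/crawl</value>
</property>
```

To go back to a local index, fs.default.name would stay at its default
and searcher.dir would point to a directory on the local disk.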
Stefan
On 08.03.2006 at 18:57, Olive g wrote:
Thank you! Sorry, I am a newbie. I meant searching an index located
on DFS for a term.
I would like to run my little search app from the command line on Linux.
Help please!
From: Stefan Groschupf <[EMAIL PROTECTED]>
Subject: Re: how to search data on DSF (0.8)
Date: Wed, 8 Mar 2006 18:46:25 +0100
What do you mean by "search data"?
You can run
bin/hadoop dfs -ls to browse the DFS.
Also, there are some JUnit tests in the Hadoop project that
illustrate how to use the API (TestDFS).
cheers
Stefan
On 08.03.2006 at 18:40, Olive g wrote:
Hello,
Does anyone have sample code (using the Nutch API and running
from the command line) to search data
on DFS? I am using version 0.8.
Thank you.
Olive
_________________________________________________________________
Don’t just search. Find. Check out the new MSN Search!
http://search.msn.click-url.com/go/onm00200636ave/direct/01/
---------------------------------------------------------------
company: http://www.media-style.com
forum: http://www.text-mining.org
blog: http://www.find23.net