Okay, I actually just wrote you a long email of what to do, step by step but
when I tried to send it, my web mail session timed out and forced me to
re-login, losing it all... I'm not happy :(
But straight to the point, since your using the older 0.7 code-base you can use
partially fetched
On 1/2/07, Sean Dean [EMAIL PROTECTED] wrote:
There actually isn't much of a reason to generate huge multi-million page
fetch lists when you can create lots of smaller ones and merge them together. This allows
for more of a ladder-style approach, and in some cases reduces the risk of errors in
thank you Sean Dean
that sounds good .. i will try it out .
tell me if i am rite :
i case of a dmoz index file is injected in the db .. then i generate only
few segments by using -subset and then fetch them ..
and then go on and generate the next set of segments i hope i am heading the
right
@lucene.apache.org
Sent: Tuesday, January 2, 2007 4:18:36 AM
Subject: Re: fetcher : some doubts
On 1/2/07, Sean Dean [EMAIL PROTECTED] wrote:
There actually isn't much of a reason to generate huge multi-million page
fetch lists when you can create lots of smaller ones and merge them together
AM
Subject: Re: fetcher : some doubts
thank you Sean Dean
that sounds good .. i will try it out .
tell me if i am rite :
i case of a dmoz index file is injected in the db .. then i generate only
few segments by using -subset and then fetch them ..
and then go on and generate the next set
o i understand it now ..
well and thanks again for ur help sean
i was wondering if anyone wud be interested in making a gui to setup and run
the crawl .. say for no voice users
i dont know if there is any ..
i wud be glad to help if people are keen on making one
Thanks Regards
Shrinivas
]
To: nutch-user@lucene.apache.org
Sent: Tuesday, January 2, 2007 5:03:19 AM
Subject: Re: fetcher : some doubts
o i understand it now ..
well and thanks again for ur help sean
i was wondering if anyone wud be interested in making a gui to setup and run
the crawl .. say for no voice users
i
On 1/2/07, Sean Dean [EMAIL PROTECTED] wrote:
You need to delete the old index before you re-index when working within the
same directory structure
This is the procedure I follow, which is pretty much what your doing. This
assumes you already have at least one active segment and index. Edit as
[mailto:[EMAIL PROTECTED]
Sent: 02 January 2007 10:42
To: nutch-user@lucene.apache.org
Subject: Re: fetcher : some doubts
On 1/2/07, Sean Dean [EMAIL PROTECTED] wrote:
You need to delete the old index before you re-index when working within
the same directory structure
This is the procedure I follow
, 2007 5:41:51 AM
Subject: Re: fetcher : some doubts
On 1/2/07, Sean Dean [EMAIL PROTECTED] wrote:
You need to delete the old index before you re-index when working within the
same directory structure
This is the procedure I follow, which is pretty much what your doing. This
assumes you
10 matches
Mail list logo