Re: Map-Reduce

2005-08-03 Thread Cheolgoo Kang
Yeah, it would be great if we had a Directory subclass like MapReduceDirectory. I'm looking for the ComputeFarm that is implemented a distributed parallel computing environment on the JINI technology. On 8/4/05, Paul Smith <[EMAIL PROTECTED]> wrote: > I've been reading the Nutch MapReduce stuff[

Map-Reduce

2005-08-03 Thread Paul Smith
I've been reading the Nutch MapReduce stuff[1], and the original Google paper [2]. I know there's a mapreduce branch in the nutch project, but is there any plan/talk of perhaps integrating something like that directly into the Lucene API? For projects that need a lower-level API like Luc

Re: duplicates from multiple index

2005-08-03 Thread Kashif Khadim
Hi David, It works very well and thanks a lot for your help. Kashif --- David Spencer <[EMAIL PROTECTED]> wrote: > Kashif Khadim wrote: > > > Hi , > > > > I have multiple index of lucene and want know how > can > > i delete duplicates from these index. I am using > > MultiSearcher to search

Re: duplicates from multiple index

2005-08-03 Thread David Spencer
Kashif Khadim wrote: Hi , I have multiple index of lucene and want know how can i delete duplicates from these index. I am using MultiSearcher to search on these. I have duplicates "urls" in these index, any sample code or tool will be a big help. Here's some ancient code that I've used - co

duplicates from multiple index

2005-08-03 Thread Kashif Khadim
Hi , I have multiple index of lucene and want know how can i delete duplicates from these index. I am using MultiSearcher to search on these. I have duplicates "urls" in these index, any sample code or tool will be a big help. Thanks, Kashif. __

[ANN] Nux-1.3 released

2005-08-03 Thread Wolfgang Hoschek
The Nux-1.3 release has been uploaded to http://dsd.lbl.gov/nux/ Nux is an open-source Java toolkit making efficient and powerful XML processing easy. Changelog: •Upgraded to saxonb-8.5 (saxon-8.4 and 8.3 should continue to work as well). •Upgraded to xom-1.1-rc1 (w