Google Desktop Could be Better

2004-10-15 Thread Kevin A. Burton
http://www.peerfear.org/rss/permalink/2004/10/15/GoogleDesktopCouldBeBetter/

Re: How extract a Field.Text(String, String) field to process it with a Stylesheet?

2004-10-15 Thread Otis Gospodnetic
That's likely because you used an Analyzer that stripped the XML markup from the original text. If you want to preserve the original text, use an Analyzer that doesn't throw your XML away. You can write your own Analyzer that doesn't discard anything, for instance. Otis --- Juan A.

Re: Index + Searching

2004-10-15 Thread Fred Toth
Hi, I built a MetaTagsManager class. In HTMLDocument, there is a line of code that causes the doc to be parsed: HTMLParser parser = new HTMLParser(f); After parsing, all of the parse results are available. I added: Properties prop = parser.getMetaTags(); //prop.list(System.out);

Re: How extract a Field.Text(String, String) field to process it with a Stylesheet?

2004-10-15 Thread Morus Walter
Otis Gospodnetic writes: That's likely because you used an Analyzer that stripped the XML markup from the original text. If you want to preserve the original text, use an Analyzer that doesn't throw your XML away. You can write your own Analyzer that doesn't discard anything, for

Re: How extract a Field.Text(String, String) field to process it with a Stylesheet?

2004-10-15 Thread Otis Gospodnetic
That's true, sorry for the confusion. The original text is stored verbatim. Otis --- Morus Walter [EMAIL PROTECTED] wrote: Otis Gospodnetic writes: That's likely because you used an Analyzer that stripped the XML markup from the original text. If you want to preserve the original
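Since the stored field value comes back as a plain String (for example from Lucene's Document.get), it can be handed to an XSLT processor directly. The following is only a minimal sketch under that assumption: it uses the JDK's javax.xml.transform API, and the class name, sample XML, and stylesheet are hypothetical illustrations, not anything taken from the thread.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

// Hypothetical helper: applies a stylesheet to XML held in a String.
final class StoredXmlTransform {

    // Runs the XSLT in `stylesheet` over `xml` and returns the result as text.
    static String transform(String xml, String stylesheet) {
        try {
            Transformer transformer = TransformerFactory.newInstance()
                    .newTransformer(new StreamSource(new StringReader(stylesheet)));
            StringWriter out = new StringWriter();
            transformer.transform(new StreamSource(new StringReader(xml)),
                    new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            // Wrap the checked exception so callers need no throws clause.
            throw new IllegalStateException("XSLT transformation failed", e);
        }
    }

    public static void main(String[] args) {
        // Assumption: in a real application this string would be the stored
        // field value read back from the index, not a literal.
        String storedXml = "<doc><title>Hello</title></doc>";
        String xsl =
            "<xsl:stylesheet version=\"1.0\""
            + " xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\">"
            + "<xsl:output method=\"text\"/>"
            + "<xsl:template match=\"/\">"
            + "<xsl:value-of select=\"doc/title\"/>"
            + "</xsl:template>"
            + "</xsl:stylesheet>";
        System.out.println(transform(storedXml, xsl));
    }
}
```

Because the transform works on the stored string, it does not depend on which Analyzer was used for indexing.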

Re: Google Desktop Could be Better

2004-10-15 Thread petite_abeille
On Oct 15, 2004, at 16:10, Tom Cunningham wrote: I'd be interested in trying to implement some of these ideas on Mac OS X, mostly because it's not already covered by Google Desktop, and I think the screensaver idea would work pretty well there. Anyone else want to give this a shot? Google

Re: your mail

2004-10-15 Thread Andrzej Bialecki
Christoph Mangold wrote: Hi Andrzej, I found this Email of yours on the lucene newsgroup. Hi Christoph, You need to remember that what I described is more a hack than a proper solution. However, it worked for me well enough, without the need to modify the core of Lucene... There are other

Re: Google Desktop Could be Better

2004-10-15 Thread Jonathan Hager
Kevin, I have to agree, Google Desktop could be better. Last spring, Greg Moulliet put together a quick little searching application in Swing using Lucene. It was more of a proof of concept or prototype than something ready to be open sourced or even supported. But for less than a weeks

RE: Multi + Parallel

2004-10-15 Thread Will Allen
I am using 6 indexers / indexes to balance the speed of indexing against query performance for 40+ million documents. I came to this number through trial and error, and performance testing on the indexing side with a fast 4 processor machine. The trick is to max out the I/O throughput. -Will

Atomicity in Lucene operations

2004-10-15 Thread Christian Rodriguez
Hello guys, I need additions and deletions of documents to the index to be ATOMIC (they either happen to completion or not at all). On top of this, I need updates (which I currently implement with a deletion of the document followed by an addition) to be ATOMIC and DURABLE (once I return from

Re: Atomicity in Lucene operations

2004-10-15 Thread Nader Henein
We use Lucene over 4 replicated indices and we have to maintain atomicity on deletions and updates with multiple fallback points. I'll send you the write-up; it's too big to CC the entire board. Nader Henein Christian Rodriguez wrote: Hello guys, I need additions and deletions of documents to

threading and indexing......

2004-10-15 Thread Chris Fraschetti
If I have four threads all trying to call my index function, will Lucene do what is necessary for each thread to wait until the writer is available, or will the threads get an exception? -- Chris Fraschetti, Student CompSci System Admin

Re: threading and indexing......

2004-10-15 Thread Otis Gospodnetic
If all 4 threads use the same instance of IndexWriter everything should be okay, as Lucene synchronizes vital blocks. Otis --- Chris Fraschetti [EMAIL PROTECTED] wrote: if i have four threads all trying to call my index function, will lucene do what is necessary for each thread to wait until
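The pattern described in the reply above, several threads funnelling work through one shared writer instance, can be sketched with plain JDK classes. This is only an illustration of the idea: SharedWriter is a hypothetical stand-in with a synchronized method, not Lucene's IndexWriter, and the thread and document counts are arbitrary.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

final class SharedWriterDemo {

    // Hypothetical stand-in for a writer; NOT Lucene's IndexWriter.
    static final class SharedWriter {
        private final List<String> docs = new ArrayList<>();

        // synchronized: a caller waits until the writer is free
        // instead of failing with an exception.
        synchronized void addDocument(String doc) {
            docs.add(doc);
        }

        synchronized int size() {
            return docs.size();
        }
    }

    // Starts `threads` workers that each add `perThread` documents
    // to the same writer, then returns how many documents were added.
    static int indexConcurrently(int threads, int perThread) {
        SharedWriter writer = new SharedWriter();
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        for (int t = 0; t < threads; t++) {
            final int id = t;
            pool.submit(() -> {
                for (int i = 0; i < perThread; i++) {
                    writer.addDocument("doc-" + id + "-" + i);
                }
            });
        }
        pool.shutdown();
        try {
            if (!pool.awaitTermination(30, TimeUnit.SECONDS)) {
                throw new IllegalStateException("workers did not finish in time");
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return writer.size();
    }

    public static void main(String[] args) {
        System.out.println(indexConcurrently(4, 1000));
    }
}
```

The key design point is that every thread holds a reference to the same writer object, so the writer's own locking serializes the calls.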