Remove a token from a field

2003-10-31 Thread Albert Vila Puig
Hi, Is there a way to remove a token from a document field entry?. For example, I've got a UnStored field in my index and I want to remove a token from this field without doing the delete and add document (because I'm inserting the documents by date and I don't want to loose that sort).

Re: Remove a token from a field

2003-10-31 Thread Erik Hatcher
On Friday, October 31, 2003, at 03:53 AM, Albert Vila Puig wrote: Hi, Is there a way to remove a token from a document field entry?. For example, I've got a UnStored field in my index and I want to remove a token from this field without doing the delete and add document (because I'm

The best way forward

2003-10-31 Thread jt oob
Hi, I am currently indexing around 6 million text documents using lucene. We have a new server arriving in the next few weeks which the queries will be run on. With the following stats: Dell 6650 - 4 x Xeon HT CPU's, 16 GB RAM, 36GB SCSI Ultra160 Hdd. (connected to 1.5TB IDE RAID with actual

Re: The best way forward

2003-10-31 Thread Otis Gospodnetic
Wow, with 16GB RAM, I would definitely load the index into RAM. You can use RAMDirectory(Directory) constructor for that. As for RAMDrives. I have no experience with those, but I have heard of some people using ramfs under Linux. Ramfs is a memory based filesystem. Mount it and you have

Seting boost fsactor for keywords in one field

2003-10-31 Thread Dragan Jotanovic
I have an index with data about images (those data are obtained from database). In Document among other fields I have one field that I use for sorting. That field could take 10 different values (1 to 10). I set boost for that field like following: viewPriority:1^10 viewPriority:2^9

RE: Seting boost fsactor for keywords in one field

2003-10-31 Thread Jason Haruska
The boost factor is just one part of the relevance equation. It adjusts the weight of the term for the query. It can still be trumped by other more relevant parts of the query. Try adjusting the boost to larger values. I do have one question. If you know the order of documents to be retrieved,

Re: Remove a token from a field

2003-10-31 Thread Albert Vila Puig
I know there is no way to update a document without doing a delete/add. But I'm asking if this feature is viable to be implemented in an efficient way. Thanks Erik Hatcher wrote: On Friday, October 31, 2003, at 03:53 AM, Albert Vila Puig wrote: Hi, Is there a way to remove a token from

Re: Seting boost fsactor for keywords in one field

2003-10-31 Thread Dragan Jotanovic
I do have one question. If you know the order of documents to be retrieved, why use Lucene? Why not just display the results to the user in your hand picked order? I need a very fast search engine because I am working with thousends of images. Lucine is providing me with that. Does somebody know

Re: The best way forward

2003-10-31 Thread Philippe Laflamme
Having more RAM does not necessarily mean you can use it in your process. Keep in mind that a Xeon is a 32 bit x86 architecture, hence can only physically address 4GB of RAM. This means that theoretically a process cannot access more than 4GB (all of them accounted for can add up to 16GB and more

RE: Seting boost fsactor for keywords in one field

2003-10-31 Thread Jason Haruska
Another way to get the results you want it to store ViewPriority in the index. Before you return the results to the user, sort the list by this field outside of lucene. -Original Message- From: Dragan Jotanovic [mailto:[EMAIL PROTECTED] Sent: Friday, October 31, 2003 10:44 AM To: Lucene

One access at a time ?

2003-10-31 Thread Guilherme Barile
Hello I'm building a web application that uses lucene, the problem I'm facing is that only one user may write to the index each time, and I simply can't imagine a way to deal with this. Anyone ever did something like this ? Should I write a non-web app to add documents to the index ? Thanks

Re: One access at a time ?

2003-10-31 Thread Stephane Vaucher
You can have a central component (a singleton for example) that can be responsible for writes to the index. sv On 31 Oct 2003, Guilherme Barile wrote: Hello I'm building a web application that uses lucene, the problem I'm facing is that only one user may write to the index each time,

Re: The best way forward

2003-10-31 Thread Dror Matalon
There was a recent discussion on the postgres PERFORMENCE mailing list (check out their archive) about using over 4GM of ram on linux. AFAIR the gist of it was that while a single process can't use more than 4G, the kernel can and *will* use the additional memory for caching. So I'd recommend

crash in Lucene

2003-10-31 Thread Chong, Herb
i get this error when indexing a collection of 120,000 small text documents. java.lang.NullPointerException at org.apache.lucene.index.IndexWriter.close(Unknown Source) at org.apache.lucene.demo.IndexFiles.main(IndexFiles.java:73) the index appears to be usable when i run