Using one physical lucene index for multiple projects

2005-08-31 Thread Markus Fischer
Hi, I've about five different projects which would need to access a lucene index for searching. The projects are completely unrelated to each other; however, it's all about the same: indexing HTML documents. Since all these projects are also running inside the same hosting company, I thought ab

Re: Ideal Index Fragmentation

2005-08-31 Thread Erik Hatcher
On Aug 30, 2005, at 9:53 PM, Friedland, Zachary (EDS - Strategy) wrote: More assorted questions: *I have been reading the posts on using Filter vs. BooleanQuery. To implement a search-within-a-search, it seems the Filter is advantageous due to its cacheability, but are there other pr

Re: Using one physical lucene index for multiple projects

2005-08-31 Thread Maik Schreiber
I'm running XML-RPC to allow access to the index. My idea was that I share a SecretKey between the projects indexers and projects clients. So the indexer can only add/remove documents with his key and the client can only access documents with his key. In our projects we're using HTTP authenti

Re: Announcement: Lucene powering CNET.com Product Category Listings

2005-08-31 Thread David Spencer
Nice write up. One other nice thing I noticed is you seem to sort numeric attributes numerically instead of alphabetically e.g. here: http://reviews.cnet.com/4566-3156_7-0.html?filter=500193_5314692_ see the 3rd col, "Find by max speed", and note that it has choices in this order: < 2

Re: Using one physical lucene index for multiple projects

2005-08-31 Thread Markus Fischer
Maik Schreiber wrote: I'm running XML-RPC to allow access to the index. My idea was that I share a SecretKey between the projects indexers and projects clients. So the indexer can only add/remove documents with his key and the client can only access documents with his key. In our projects w

Indexing source files

2005-08-31 Thread Axel
Hi I'm quite new to lucene, and I'm looking for information, how I can start implementing a search engine for code completion, phpdoc hovering etc. Currently I'm using an approach similar to ctags: http://ctags.sourceforge.net/, which isn't very fast in the startup time for large projects. For the

Re: custom sort

2005-08-31 Thread raymondcreel (sent by Nabble.com)
Actually in this case I am sorting by score already but I'm not sure if that helps. Regardless of how I do my primary sort, I want to tweak the results such that some hardcoded number of documents that match some criteria get pushed or frontloaded to the top of the results. For instance think

Re: custom sort

2005-08-31 Thread Jason Haruska
I had to do something similar, but I plan on re-writing it into something more elegant. I hope this helps give you some ideas. 1. Create a QueryFilter on only those items that matched the criteria (have a required clause in your boolean query) 2. Create a BitFilter which takes a BitSet from step

Problem With RAMDirectory.

2005-08-31 Thread Mark Horan
Hi, I'm using the Lucene demo to create an index for 10 text files, just for testing. I use the demo to do searches on the created index, and all seems well. I want to perform the search via an applet, for reasons not relevant to the problem I'm having. I zip up the three files in the index so the

JDBCDirectory Information

2005-08-31 Thread Anthony Vito
I finally put up a small page on my site with the JDBCDirectory information. Nothing special, but centralized information makes progress... Now.. to write the manual http://www.anthonyvito.com/xwiki/bin/view/Main/JDBCDirectory -vito --

Re: custom sort

2005-08-31 Thread Chris Hostetter
: What I'm doing now is taking the whole resulting document collection, : iterating through it and manually moving these 10 documents to the front : of the collection. This is slow and ugly. I was hoping there might be : a slicker way to do it as part of the actual sort. I will play around : wi
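For reference, the manual approach quoted above (walking the result list and moving a fixed number of matching documents to the front) can be sketched in plain Java as a single-pass stable partition. This is only an illustration of that quoted approach, not the reply that is cut off in this listing. The class name `FrontLoad`, the method `frontLoad`, and the `sponsored-` prefix are hypothetical, and no Lucene API is used.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical helper, not part of Lucene: front-loads up to maxPromoted
// items that match a criterion while keeping the primary sort order
// inside both groups (a stable partition done in one pass).
public class FrontLoad {

    public static <T> List<T> frontLoad(List<T> ranked, Predicate<T> promote, int maxPromoted) {
        List<T> front = new ArrayList<>();
        List<T> rest = new ArrayList<>();
        for (T item : ranked) {
            // Promote only while the cap has not been reached.
            if (front.size() < maxPromoted && promote.test(item)) {
                front.add(item);
            } else {
                rest.add(item);
            }
        }
        front.addAll(rest);
        return front;
    }

    public static void main(String[] args) {
        // Assumed example data: ids already ordered by the primary sort.
        List<String> ranked = Arrays.asList("a", "sponsored-b", "c", "sponsored-d", "sponsored-e");
        // prints [sponsored-b, sponsored-d, a, c, sponsored-e]
        System.out.println(frontLoad(ranked, s -> s.startsWith("sponsored-"), 2));
    }
}
```

The pass is linear in the number of results, so any slowness in the original approach is more likely to come from loading every hit than from the reordering itself.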

RE: Announcement: Lucene powering CNET.com Product Category Listings

2005-08-31 Thread Chris Hostetter
Zach, Let's simplify the issue to displaying counts for a single "Attribute" -- manufacturer. Imagine that the main search you are executing is "+category:cameras +price:[0 to 10]" and you want to sort it by name. You also want to display the counts per manufacturer for all products that match
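One way to picture per-manufacturer counts is as bit set intersections: one set of document numbers for the main query, one per manufacturer, and the count is the size of their overlap. The message is cut off in this listing, so the sketch below is an assumption about the general technique rather than the method it goes on to describe. The class `FacetCounts`, the method `countPerValue`, and the sample document numbers are hypothetical, and only `java.util.BitSet` is used, not Lucene classes.

```java
import java.util.BitSet;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: counts, for each attribute value (e.g. manufacturer),
// how many documents matching the main query also carry that value.
public class FacetCounts {

    public static Map<String, Integer> countPerValue(BitSet queryMatches,
                                                     Map<String, BitSet> docsPerValue) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (Map.Entry<String, BitSet> entry : docsPerValue.entrySet()) {
            // Clone first so the (possibly cached) per-value bit set is not modified.
            BitSet both = (BitSet) entry.getValue().clone();
            both.and(queryMatches);
            counts.put(entry.getKey(), both.cardinality());
        }
        return counts;
    }

    public static void main(String[] args) {
        // Assumed data: document numbers 0, 2 and 3 match the main query.
        BitSet queryMatches = new BitSet();
        queryMatches.set(0);
        queryMatches.set(2);
        queryMatches.set(3);

        Map<String, BitSet> docsPerManufacturer = new LinkedHashMap<>();
        BitSet canon = new BitSet();
        canon.set(0);
        canon.set(1);
        canon.set(2);
        BitSet nikon = new BitSet();
        nikon.set(3);
        nikon.set(4);
        docsPerManufacturer.put("canon", canon);
        docsPerManufacturer.put("nikon", nikon);

        // prints {canon=2, nikon=1}
        System.out.println(countPerValue(queryMatches, docsPerManufacturer));
    }
}
```

Because the per-value sets do not depend on the main query, they can be built once and reused across searches; only the intersection is repeated per request.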