Re: boolean query or

2008-07-09 Thread Karl Wettin
On 9 Jul 2008 at 07:37, Cam Bazz wrote: Hello, is it possible to make a boolean query where a word matches in fieldA or fieldB? In other words, I would like to search for a word in two fields; if the word matches in fieldA or fieldB, then it is a hit. XOR? +(A B) -(+A +B) karl
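
As a rough sketch of the two readings of the question, the helper below builds query strings in Lucene's query parser syntax: a plain OR across the two fields, and the XOR form Karl gives, +(A B) -(+A +B). The class and method names (TwoFieldQuery, either, exactlyOne) are hypothetical and are not part of Lucene. The sketch assumes the word is a single term with no characters that need escaping, and that the parser's default operator is OR.

```java
public class TwoFieldQuery {
    // Hypothetical helper: matches documents where the word occurs in
    // fieldA, in fieldB, or in both (inclusive OR).
    public static String either(String fieldA, String fieldB, String word) {
        return fieldA + ":" + word + " OR " + fieldB + ":" + word;
    }

    // Hypothetical helper: the XOR form from the reply, +(A B) -(+A +B).
    // The first group requires at least one field to match; the second
    // group excludes documents where both fields match.
    public static String exactlyOne(String fieldA, String fieldB, String word) {
        String a = fieldA + ":" + word;
        String b = fieldB + ":" + word;
        return "+(" + a + " " + b + ") -(+" + a + " +" + b + ")";
    }

    public static void main(String[] args) {
        System.out.println(either("fieldA", "fieldB", "lucene"));
        System.out.println(exactlyOne("fieldA", "fieldB", "lucene"));
    }
}
```

If a hit in either field is all that is wanted, the inclusive form is probably enough; the XOR form only matters when documents matching in both fields must be excluded.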

how to get IndexReader Remote?

2008-07-09 Thread lutan
I have a MultiSearcher over remote searchables, bound with Naming.bind("rmi://" + IP + ":" + PORT + "/" + NAME, RemoteSearchable), but MultiSearcher doesn't have getIndexReader(). How can I get an IndexReader?

performance feedback

2008-07-09 Thread Beard, Brian
I just did an update from lucene 2.2.0 to 2.3.2 and thought I'd give some kudos for the indexing performance enhancements. The lucene indexing portion is about 6-8 times faster. Previously we were doing ~60-120 documents per second, now we're between 400-1000, depending on the type of document,

Re: My own nalyzer in lucene

2008-07-09 Thread starz10de
I think there is a problem with creating the jar file. It is created correctly, and I can see myanalyzer inside, but I made a test where I changed the name of StandardAnalyzer and then did the same steps again, and I faced the same error inside IndexFile.java: the error the constructor

Re: My own nalyzer in lucene

2008-07-09 Thread Erick Erickson
Let's see your analyzer code. Actually, I'm really only curious about the class declaration. Did you subclass your analyzer from the proper Lucene superclass? Say Analyzer? And why bother putting it in a separate jar file in the first place? Unless you really need to share this jar across

Searching for instances within a document

2008-07-09 Thread jnance
Hi, I am indexing lots of text files and need to see how many times a certain word comes up in each text file. Right now I have this constructor for search: static void search(Searcher searcher, String queryString) throws ParseException, IOException { QueryParser parser = new

Re: Searching for instances within a document

2008-07-09 Thread Erick Erickson
I know this has been discussed before, so if you search the archive you might find an answer more quickly. I don't remember what the resolution was, so I can't help there. Best Erick On Wed, Jul 9, 2008 at 9:49 AM, jnance [EMAIL PROTECTED] wrote: Hi, I am indexing lots of text files and

Re: Searching for instances within a document

2008-07-09 Thread jnance
Ok, I'll see if I can find anything. Thanks, James

Re: performance feedback

2008-07-09 Thread Michael McCandless
This is great to hear! If you tweak things a bit (increase RAM buffer size, use autoCommit=false, use threads, etc.) you should be able to eke out some more gains... Are you storing fields or using term vectors on any of your fields? Mike Beard, Brian wrote: I just did an update from

RE: performance feedback

2008-07-09 Thread Beard, Brian
I will try tweaking RAM, and check about autoCommit=false. It's on the future agenda to multi-thread through the index writer. The indexing time I quoted includes the document creation time which would definitely improve with multi-threading. I'm doing batch updates of up to 1000 a pop, and

Re: performance feedback

2008-07-09 Thread Yonik Seeley
On Wed, Jul 9, 2008 at 11:35 AM, Beard, Brian [EMAIL PROTECTED] wrote: I will try tweaking RAM, and check about autoCommit=false. It's on the future agenda to multi-thread through the index writer. The indexing time I quoted includes the document creation time which would definitely improve

Payloads and SpanScorer

2008-07-09 Thread Peter Keegan
If a SpanQuery is constructed from one or more BoostingTermQuery(s), the payloads on the terms are never processed by the SpanScorer. It seems to me that you would want the SpanScorer to score the document both on the spans distance and the payload score. So, either the SpanScorer would have to

.fdt file

2008-07-09 Thread blazingwolf7
Hi, I recently found out that Lucene retrieves the content of a document from the .fdt file. I am trying to retrieve the entire file in one go instead of retrieving it by document number. Can it be done?

Re: .fdt file

2008-07-09 Thread Yonik Seeley
On Wed, Jul 9, 2008 at 9:01 PM, blazingwolf7 [EMAIL PROTECTED] wrote: I had recently found out that Lucene will retrieve the content of a document from a file .fdt. I am trying to retrieve the entire file in one go instead of retrieving it based on document number. can it be done? cp can

Re: .fdt file

2008-07-09 Thread blazingwolf7
Sorry, but I am still quite new to Lucene. What exactly is cp? Yonik Seeley wrote: On Wed, Jul 9, 2008 at 9:01 PM, blazingwolf7 [EMAIL PROTECTED] wrote: I had recently found out that Lucene will retrieve the content of a document from a file .fdt. I am trying to retrieve the entire file

Re: .fdt file

2008-07-09 Thread Yonik Seeley
On Wed, Jul 9, 2008 at 11:13 PM, blazingwolf7 [EMAIL PROTECTED] wrote: Sorry, but I am still quite new to Lucene. What exactly is cp? The Unix command for copy (hence the smiley). Some of your recent questions seem to be suffering from an XY problem: