Re: Need help with filtering

2004-11-17 Thread Paul Elschot
On Wednesday 17 November 2004 01:20, Edwin Tang wrote: Hello, I have been using DateFilter to limit my search results to a certain date range. I am now asked to replace this filter with one where my search results have document IDs greater than a given document ID. This document ID is

Re: COUNT SUBINDEX [IN MERGERINDEX]

2004-11-17 Thread Paul Elschot
On Wednesday 17 November 2004 07:10, Karthik N S wrote: Hi guy's Apologies. So A Mergeed Index is again a Single [ addition of subIndexes... ), If that case , If One of the Field Types is of type 'Field.Keyword' whic is Unique across the subIndexes [Before Merging].

Re: Index Locking Issues Resolved...I hope

2004-11-17 Thread jeichels
I was thinking that perhaps I can pre-stem words before sticking them in a search field in the database perhaps using Lucene stemming code, then try to use the Natural Language Search found in MySql 4.1.1. I am confident the MySql product can't keep up with Lucene yet, but at least they hvae

RE: COUNT SUBINDEX [IN MERGERINDEX]

2004-11-17 Thread Karthik N S
Hi Guys Apologies.. I am Still Confused.. ;( Let me make it more simple Question On using Search from a Index without any SearchWord, I would like to count the total number of Documents present in it. [ I Only have the Field Types 'Field.Keyword' which stores the Unique

tool to check the index field

2004-11-17 Thread lingaraju
HI ALL I am having index file created by other people Now i want to know how many field are there in the index Is there any third party tool to do this I saw some where some GUI tool to do this but forgot the name. Regards LingaRaju

RE: tool to check the index field

2004-11-17 Thread Viparthi, Kiran (AFIS)
Try using : Luke : http://www.getopt.org/luke/ Limo : http://limo.sourceforge.net/ Regards, Kiran. -Original Message- From: lingaraju [mailto:[EMAIL PROTECTED] Sent: 17 November 2004 16:00 To: Lucene Users List Subject: tool to check the index field HI ALL I am having index file

RE: best ways of using IndexSearcher

2004-11-17 Thread Aviran
Yes, IndexSearcher is thread safe. Aviran http://www.aviransplace.com -Original Message- From: Abhay Saswade [mailto:[EMAIL PROTECTED] Sent: Tuesday, November 16, 2004 15:16 PM To: Lucene Users List Subject: Re: best ways of using IndexSearcher Hello, Can I use single instance of

Re: Need help with filtering

2004-11-17 Thread Edwin Tang
Ah... recoding DateFilter. I will look into this today. Thanks for the help. Ed --- Paul Elschot [EMAIL PROTECTED] wrote: On Wednesday 17 November 2004 01:20, Edwin Tang wrote: Hello, I have been using DateFilter to limit my search results to a certain date range. I am now asked to

Re: Whitespace Analyzer not producing expected search results

2004-11-17 Thread lee . a . carroll
Thanks for the suggestions Erik. Displaying the query string is really usefull and this is what i've found. I issue a search using the search term ResponseHelper.writeNoCachingHeaders\(response\); The search is parsed using a query parser and produces the following query string

Re: tool to check the index field

2004-11-17 Thread Luke Shannon
Try this: http://www.getopt.org/luke/ Luke - Original Message - From: lingaraju [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED] Sent: Wednesday, November 17, 2004 10:00 AM Subject: tool to check the index field HI ALL I am having index file created by other people Now

Index copy

2004-11-17 Thread Ravi
Whats the bestway to copy an index from one directory to another? I tried opening an IndexWriter at the new location and used addIndexes to read from the old index. But that was very slow. Thanks in advance, Ravi. - To

Re: Whitespace Analyzer not producing expected search results

2004-11-17 Thread Erik Hatcher
On Nov 17, 2004, at 7:44 AM, [EMAIL PROTECTED] wrote: I then try a search using the term ResponseHelper.writeNoCachingHeaders\(*\); now I'm expecting this to be a wider search term and it should find at least two, possibly more docs? the query parser produces the query

Re: Whitespace Analyzer not producing expected search results

2004-11-17 Thread lee . a . carroll
Thanks a lot for the solution / explanation. Saved the day Erik. Summary Observation: Using a wild carded search term with queryParser and the WhitespaceAnalyser returned no hits when when hits where expected. Reason: This was caused by the default behaviour of queryParser to lower case

index document pdf

2004-11-17 Thread Miguel Angel
Hi, i downloading pdfbox 0.6.4 , what add in the source code the demo`s lucene -- Miguel Angel Angeles R. Asesoria en Conectividad y Servidores Telf. 97451277 - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional

Re: WildcardTermEnum skipping terms containing numbers?!

2004-11-17 Thread Yonik Seeley
test __ Do you Yahoo!? The all-new My Yahoo! - Get yours free! http://my.yahoo.com - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL

Re: WildcardTermEnum skipping terms containing numbers?!

2004-11-17 Thread Sanyi
Enumerating the terms using WildcardTermEnum and an IndexReader seems to be too buggy to use. I'm now reimplementing my code using WildcardTermEnum.wildcardEquals which seems to be better so far. --- Sanyi [EMAIL PROTECTED] wrote: Hi! I have following problem with 1.4.2: I'm searching for

Re: Index copy

2004-11-17 Thread Justin Swanhart
You could lock your index for writes, then copy the file using operating system copy commands. Another way would be to lock your index, make a filesystem snapshot, then unlock your index. You can then safely copy the snapshot without interupting further index operations. On Wed, 17 Nov 2004

Something missing !!???

2004-11-17 Thread abdulrahman galal
i noticed in the last period that alot of people disscus with each others about the bugs of lucene ... but something is missing ... i consider lucene is an indexing tool for text files and so one ... but there are alot of tools that makes this indexing like access ... what about compression

Re: Something missing !!???

2004-11-17 Thread Justin Swanhart
The HEAD version of CVS supports gz compression. You will need to check it out using cvs if you want to use it. On Wed, 17 Nov 2004 21:43:36 +0200, abdulrahman galal [EMAIL PROTECTED] wrote: i noticed in the last period that alot of people disscus with each others about the bugs of lucene ...

version documents

2004-11-17 Thread Luke Shannon
Hey all; I have ran into an interesting case. Our system has notes. These need to be indexed. They are xml files called default.xml and are easily parsed and indexed. No problem, have been doing it all week. The problem is if someone edits the note, the system doesn't update the default.xml.

mergeFactor

2004-11-17 Thread Ravi
Can somebody explain the difference between the parameters minMergeDocs and mergeFactor in IndexWriter. When I read the documentation, it looks like both of them represent number of documents to be in buffer before they are merged into a new segment. Thanks in advance, Ravi.

Re: version documents

2004-11-17 Thread Justin Swanhart
Split the filename into basefilename and version and make each a keyword. Sort your query by version descending, and only use the first basefile you encounter. On Wed, 17 Nov 2004 15:05:19 -0500, Luke Shannon [EMAIL PROTECTED] wrote: Hey all; I have ran into an interesting case. Our

Re: version documents

2004-11-17 Thread Luke Shannon
That is a good idea. Thanks! - Original Message - From: Justin Swanhart [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED] Sent: Wednesday, November 17, 2004 3:38 PM Subject: Re: version documents Split the filename into basefilename and version and make each a keyword. Sort

Lucene and SVD

2004-11-17 Thread DES
Hi I need some kind of implementation of SVD (singular value decomposition) or LSI with Lucene engine. Have anyone any ideas how to create a query table for decomposition? The table must have documents as rows and terms as columns, if a term is presented in the docuement, the corresponding

Considering intermediary solution before Lucene question

2004-11-17 Thread jeichels
Is there a way to use Lucene stemming and stop word removal without using the rest of the tool? I am downloading the code now, but I imagine the answer might be deeply burried. I would like to be able to send in a phrase and get back a collection of keywords if possible. I am thinking of

Re: Considering intermediary solution before Lucene question

2004-11-17 Thread Otis Gospodnetic
Yes, you can use just the Analysis part. For instance, I use this for http://www.simpy.com and I believe we also have this in the Lucene book as part of the source code package: /** * Gets Tokens extracted from the given text, using the specified Analyzer. * * @param analyzer

Re: Considering intermediary solution before Lucene question

2004-11-17 Thread jeichels
This is so cool Otis. I was just to write this off of something in the FAQ, but this is better then what I was doing. This rocks!!! Thank you. JohnE P.S.: I am assuming you use org.apache.lucene.analysis.Token? There are three Token's under Lucene. - Original Message - From:

Re: Considering intermediary solution before Lucene question

2004-11-17 Thread Chris Lamprecht
John, It actually should be pretty easy to use just the parts of Lucene you want (the analyzers, etc) without using the rest. See the example of the PorterStemmer from this article: http://www.onjava.com/pub/a/onjava/2003/01/15/lucene.html?page=2 You could feed a Reader to the tokenStream()

Re: Considering intermediary solution before Lucene question

2004-11-17 Thread jeichels
I thank you both. I have it already partly implemented here. It seems easy. At least this should carry through my product until I can really get to use Lucene. I am not sure how far I can take MySql with stemmed, indexed key words, but should give me maybe 6 monthes at least of something

RE: Index copy

2004-11-17 Thread Ravi
Thanks. I was looking for an o/s independent way of copying. Probably I can use BufferedInputStream and BufferedOutputStream classes to copy the index to a different location. -Original Message- From: Justin Swanhart [mailto:[EMAIL PROTECTED] Sent: Wednesday, November 17, 2004 2:35 PM