Re: How to implement Lucene

2013-02-06 Thread Bas Gijzen
Hey, The Lucene documentation is very good: http://lucene.apache.org/core/4_1_0/ The example code from the link below is very helpful http://lucene.apache.org/core/4_1_0/core/overview-summary.html#overview_description Kind regards, Bas 2013/2/5 VIGNESH S > Hi, > > For Basics on Lucene How to

Re: Info on core search classes

2013-02-06 Thread Bas Gijzen
Hey, I read recently read a blog post about AtomicReader and CompositeReaders http://www.searchworkings.org/blog/-/blogs/uwe-says%3A-is-your-reader-atomic I found it very interesting Kind regards, Bas 2013/2/5 Igor Shalyminov > Hi! > > I wonder where one can get information about current Luce

Re: How to implement Lucene

2013-02-06 Thread Swapnil Patil
Hi, Try to use solr with DIH (Data import handler). It can import data from database to lucene index directly. http://wiki.apache.org/solr/DataImportHandler On Wed, Feb 6, 2013 at 2:03 PM, Bas Gijzen wrote: > Hey, > > The Lucene documentation is very good: > http://lucene.apache.org/core/4_1_0/

Handle expression in the index

2013-02-06 Thread Nicolas Roduit
I'm starting with Lucene 4 and have built my own analyzer with stemming and synonyms. This works perfectly. I built a Lucene index with several documents (with an ID) containing a text (with TextField) and a list of words or expressions related to the text (a kind of tag). Everything is OK wh

Re: Handle expression in the index

2013-02-06 Thread Alon Muchnick
hi Nicolas , if i understand correctly what you are describing is that your tag field will contain Lucine queries syntax - one word = exact match , 2 words "xx yy" = phrase match , and so on . there is a search method called "Prospective search" which fits this situation . you can try and use th

Re: How to get field names and types from an IndexSearcher

2013-02-06 Thread Rolf Veen
Just for the record, the solution that I adopted is as follows: - Create a setType(String field, String type) and call it for any known numeric fields, before adding any document. This method saves the type definition in a file and also sets the Map that is used in the method StandardQueryPa

updateDocument question

2013-02-06 Thread Becker, Thomas
I've built a search prototype feature for my application using Lucene, and it works great. The application monitors a remote system and currently indexes just a few core attributes of the objects on that system. I get notifications when objects change, and I then update the Lucene index to kee

Re: Lucene vs RDBMS indexing at scale

2013-02-06 Thread Andrew Gilmartin
Drew Kutcharian wrote: I'm trying to figure out what would be a better approach to indexing when it comes to a large number of records (say 1 billion) A rule of thumb is that if you want a list of exact matches use a database. If you want a ranked list of matches use Lucene. -- Andrew

RE: How to implement Lucene

2013-02-06 Thread Álvaro Vargas Quezada
Thanks to all! I'll see your links and I'll tell you. Thanks again :D > Date: Wed, 6 Feb 2013 14:22:30 +0530 > Subject: Re: How to implement Lucene > From: ping.swap...@gmail.com > To: java-user@lucene.apache.org > > Hi, > Try to use solr with DIH (Data import handler). It can import data from >

Re: updateDocument question

2013-02-06 Thread Adrien Grand
Hi Thomas, On Wed, Feb 6, 2013 at 2:50 PM, Becker, Thomas wrote: > I've built a search prototype feature for my application using Lucene, and it > works great. The application monitors a remote system and currently indexes > just a few core attributes of the objects on that system. I get > n

Re: Setting Similarity classes in Benchmark .alg scripts

2013-02-06 Thread Robert Muir
Just to be sure what you are trying to do: A) compare the relevance of different similarities? this is something the benchmark.quality package (actually pretty much unrelated from the rest of the benchmark package!) does, if you have some e.g. TREC collection or whatever to test with. B) compare

testing whether a field has terms before adding document to Index

2013-02-06 Thread Jon Stewart
Hello, I have an application where a great many documents may not have any terms after StandardAnalyzer has had its way with the body. In that case, depending on some other metadata, I may not wish to add the document to the index altogether. Is there a way to tell? i.e., current I'm doing this:

Re: testing whether a field has terms before adding document to Index

2013-02-06 Thread Michael McCandless
You could just create the TokenStream yourself, try to read the first token, and if you don't get a token (incrementToken returns false) then skip it? It's a bit wasteful since you'd then init a new TokenStream again if you do index it ... but maybe it's not so bad since you only read one token.