Newbie Questions

2016-08-08 Thread lukes
I am new to Lucene and couple of questions. I have worked with SOLR previously, now trying to work directly with Lucene to get similar functionality(but without SOLR). Below are my questions, thanks in advance. 1) In SOLR, there is functionality of copyfields, which dynamically copies one field

Re: Newbie Questions

2016-08-08 Thread lukes
Can anyone please reply ? . Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Newbie-Questions-tp4290817p4290854.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To

Partial update to document

2016-08-08 Thread lukes
Hi There, Is there a way to partially update the document ? I know there's a API updateDocument on IndexWriter, but that seems to create a new document with just a field i am specifying. What i want is delete some fields from existing(indexed) document, and then add some new fields(could or not

Re: Newbie Questions

2016-08-09 Thread lukes
Thanks for the reply. Is there a way to partially update the document ? I know there's a API updateDocument on IndexWriter, but that seems to create a new document with just a field i am specifying. What i want is delete some fields from existing(indexed) document, and then add some new

Approach for indexing and queryin good volume data.

2016-09-08 Thread lukes
Hi all, I am planning to use Lucene(not in cluster) for indexing and querying good volume data. Use case is, 10-20 documents / second(roughly around 15-20 fields) and in parallel doing query. Below is the approach i am planning to take, can anyone please let me know from their past experience if

Re: Approach for indexing and queryin good volume data.

2016-09-08 Thread lukes
Hi all, Can anyone please respond ? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Approach-for-indexing-and-queryin-good-volume-data-tp4295109p4295218.html Sent from the Lucene - Java Users mailing list archive at Nabble.com.

Re: Handling multiple locale

2016-09-25 Thread lukes
1 more question :). Are numbers analyzed ? Like IntField, LongField, etc. ? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4297949.html Sent from the Lucene - Java Users mailing list archive at Nabble.com.

Re: Handling multiple locale

2016-09-25 Thread lukes
Hi Michael, Thanks for the reply. Yeah, that's one brute force way of doing it. But then one has to find all the locales in the system. Is there any other way ? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4297948.html Sent

PhraseQuery

2016-10-05 Thread lukes
Hi all, I am trying to do phrase query search, but hitting into problem of not getting any results. Basically, i am indexing the document with some fields in it(String, Text, Numbers and so on). My basic queries are working, but when i am trying to do PhraseQueries i am not finding any results.

Re: Optimising segments merges

2016-09-20 Thread lukes
Thanks Mike... Regards -- View this message in context: http://lucene.472066.n3.nabble.com/Optimising-segments-merges-tp4296997p4297021.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To

Re: Optimising segments merges

2016-09-20 Thread lukes
Thanks a lot Mike. Can we control of how often natural merge should happen, or what are the factors that define when to kick off "natural merging" ? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Optimising-segments-merges-tp4296997p4297017.html Sent from the

Optimising segments merges

2016-09-20 Thread lukes
Hi, In my application i am committing(indexWriter.commit() on every single/batch of documents, but now as a result there are lots of segments getting generated(One thing can be, i don't do commit, but just add document to indexWriter. But now, if system crashes then uncommited documents wouldn't

Re: Handling multiple locale

2016-09-25 Thread lukes
Hi all, Any suggestions from the experts ? I assume, this problem is not coming for the first time. Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4297927.html Sent from the Lucene - Java Users mailing list archive at

Getting list of committed documents

2016-11-09 Thread lukes
Hi all, I need some feedback on getting hold of documents which got committed during commit call on indexwriter. There are multiple threads which keeps on adding documents to indexWriter in parallel, and there's another thread which wakes up after n number of minutes and does the commit. Below

Re: Getting list of committed documents

2016-11-10 Thread lukes
Hi, Can anyone please suggest or point in some directions. Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Getting-list-of-committed-documents-tp4305258p4305503.html Sent from the Lucene - Java Users mailing list archive at Nabble.com.

Re: Getting list of committed documents

2016-11-12 Thread lukes
Hi Michael, Thanks for the reply. Regarding IW(IndexWriter) returning long sequence number, i looked at the signature of commit and it seems to be void. Can you please point me in the direction ? I am using Lucene 5.5.2. Also is this number aggregation of deletes, updates and new documents ? Is

Re: Getting list of committed documents

2016-11-13 Thread lukes
Thanks Mike. Yeah, i saw the changelist you mentioned. Unfortunately i can't upgrade to 6.2 because of stack limitations :( . Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Getting-list-of-committed-documents-tp4305258p4305728.html Sent from the Lucene - Java

Exclusion List for standard tokenizer

2016-11-18 Thread lukes
Hi, Is there any exclusion list of characters which can be defined for StandardTokenizer ? In my case, i want to use StandardTokenizer(as it solves many problems of when to tokenization across languages) but i don't want to tokenize the stream on certain characters for example '@'. Is there a

Re: Exclusion List for standard tokenizer

2016-11-18 Thread lukes
Actually ClassicTokenizer seems to do the job. Any side effects of using ClassicTokenizer rather than StandardTokenizer ? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Exclusion-List-for-standard-tokenizer-tp4306511p4306516.html Sent from the Lucene - Java Users

Re: PhraseQuery

2016-10-11 Thread lukes
Thanks Mike. I discovered that earlier. Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/PhraseQuery-tp4299871p4300752.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. -

Re: getBestFragments with SimpleSpanFragmenter

2016-10-13 Thread lukes
Please pass false to mergeContiguousFragments in getBestTextFragments(TokenStream tokenStream, String text, boolean mergeContiguousFragments, int maxNumFragments) and it should work as expected. Regards. -- View this message in context:

Re: getBestFragments with SimpleSpanFragmenter

2016-10-13 Thread lukes
If you open the source, you will see it internally calls this.getBestFragments(tokenStream, text, maxNumFragments) which in turn calls this.getBestTextFragments(tokenStream, text, true, maxNumFragments) (*with flag true*) which will merge the fragments automatically. Regards. -- View this