from:"lukes"

Re: Exclusion List for standard tokenizer

2016-11-18 Thread lukes

Actually ClassicTokenizer seems to do the job. Are there any side effects of using ClassicTokenizer rather than StandardTokenizer? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Exclusion-List-for-standard-tokenizer-tp4306511p4306516.html Sent from the Lucene - Java Users

Exclusion List for standard tokenizer

2016-11-18 Thread lukes

Hi, Is there any exclusion list of characters which can be defined for StandardTokenizer? In my case, I want to use StandardTokenizer (as it solves many problems of when to tokenize across languages) but I don't want to tokenize the stream on certain characters, for example '@'. Is there a wa
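
The behaviour being asked for can be pictured with a small plain-JDK sketch. This is only an illustration of a "do not split on these characters" list; it is not Lucene's StandardTokenizer (which follows Unicode segmentation rules), and the class name KeepCharsTokenizer and the parameter keepChars are made up for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Plain-JDK stand-in, NOT a Lucene class: it only demonstrates the idea of
// an exclusion list of characters that must not cause a token break.
final class KeepCharsTokenizer {
    private KeepCharsTokenizer() {}

    // Splits text on every char that is neither a letter, a digit,
    // nor listed in keepChars (hypothetical parameter).
    // Simplification: works char by char, so characters outside the
    // Basic Multilingual Plane are treated as separators.
    static List<String> tokenize(String text, Set<Character> keepChars) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetterOrDigit(c) || keepChars.contains(c)) {
                current.append(c);          // still inside a token
            } else if (current.length() > 0) {
                tokens.add(current.toString()); // separator ends the token
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString()); // flush the last token
        }
        return tokens;
    }
}
```

With keepChars containing only '@', the input "mail lukes@example.com" yields the tokens mail, lukes@example and com; with an empty set the '@' also acts as a separator.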

Re: Getting list of committed documents

2016-11-13 Thread lukes

Thanks Mike. Yeah, I saw the changelist you mentioned. Unfortunately I can't upgrade to 6.2 because of stack limitations :(. Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Getting-list-of-committed-documents-tp4305258p4305728.html

Re: Getting list of committed documents

2016-11-12 Thread lukes

Hi Michael, Thanks for the reply. Regarding IW (IndexWriter) returning a long sequence number, I looked at the signature of commit and it seems to be void. Can you please point me in the right direction? I am using Lucene 5.5.2. Also, is this number an aggregation of deletes, updates and new documents? Is

Re: Getting list of committed documents

2016-11-10 Thread lukes

Hi, Can anyone please suggest something or point me in some direction? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Getting-list-of-committed-documents-tp4305258p4305503.html

Getting list of committed documents

2016-11-09 Thread lukes

Hi all, I need some feedback on getting hold of the documents which got committed during a commit call on IndexWriter. There are multiple threads which keep adding documents to the indexWriter in parallel, and there's another thread which wakes up every n minutes and does the commit. Below a

Re: getBestFragments with SimpleSpanFragmenter

2016-10-13 Thread lukes

If you open the source, you will see it internally calls this.getBestFragments(tokenStream, text, maxNumFragments) which in turn calls this.getBestTextFragments(tokenStream, text, true, maxNumFragments) (*with flag true*) which will merge the fragments automatically. Regards.

Re: getBestFragments with SimpleSpanFragmenter

2016-10-13 Thread lukes

Please pass false to mergeContiguousFragments in getBestTextFragments(TokenStream tokenStream, String text, boolean mergeContiguousFragments, int maxNumFragments) and it should work as expected. Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/getBestFragments-with-

Re: PhraseQuery

2016-10-11 Thread lukes

Thanks Mike. I discovered that earlier. Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/PhraseQuery-tp4299871p4300752.html

PhraseQuery

2016-10-05 Thread lukes

Hi all, I am trying to do a phrase query search, but I am running into the problem of not getting any results. Basically, I am indexing the document with some fields in it (String, Text, Numbers and so on). My basic queries are working, but when I try PhraseQueries I am not finding any results.

Re: Handling multiple locale

2016-09-26 Thread lukes

Hi Michael, Sorry for the confusion. I didn't mean System, but the various types of locales I am indexing till that point :). Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4298170.html

Re: Handling multiple locale

2016-09-25 Thread lukes

One more question :). Are numbers analyzed? Like IntField, LongField, etc.? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4297949.html

Re: Handling multiple locale

2016-09-25 Thread lukes

Hi Michael, Thanks for the reply. Yeah, that's one brute force way of doing it. But then one has to find all the locales in the system. Is there any other way? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4297948.html

Re: Handling multiple locale

2016-09-25 Thread lukes

Hi all, Any suggestions from the experts? I assume this problem is not coming up for the first time. Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4297927.html

Handling multiple locale

2016-09-23 Thread lukes

Hi all, I am trying to handle multiple locales (for indexing and querying). Below is the problem I am trying to solve. I am indexing a list of documents in Lucene with the naming convention: (fieldName) + "_" + locale (language part only, e.g. en, ar, ...) and providing PerFieldAnalyzerWrapper to indexWri
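
The naming convention described above could be captured in a small helper like the one below. It is a minimal sketch that uses only the JDK; the class LocalizedFields and both method names are hypothetical and come neither from the thread nor from Lucene. The idea is that the same helper builds the field name at index time and recovers the language part at query time, so the two sides cannot drift apart.

```java
import java.util.Locale;

// Hypothetical helper for the (fieldName) + "_" + language convention.
final class LocalizedFields {
    private LocalizedFields() {}

    // Builds e.g. "title_en" from ("title", en-US): only the language
    // part of the locale is used, the country/region is dropped.
    static String fieldNameFor(String baseName, Locale locale) {
        return baseName + "_" + locale.getLanguage();
    }

    // Recovers the language suffix from a localized field name,
    // e.g. "title_en" -> "en". Splits on the LAST underscore so that
    // base names containing underscores still work.
    static String languageOf(String fieldName) {
        int idx = fieldName.lastIndexOf('_');
        if (idx < 0 || idx == fieldName.length() - 1) {
            throw new IllegalArgumentException("no language suffix in: " + fieldName);
        }
        return fieldName.substring(idx + 1);
    }
}
```

For example, fieldNameFor("title", Locale.forLanguageTag("ar-EG")) gives "title_ar", and the language returned by languageOf could then serve as the key for choosing a per-language analyzer.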

Re: Optimising segments merges

2016-09-20 Thread lukes

Thanks Mike... Regards -- View this message in context: http://lucene.472066.n3.nabble.com/Optimising-segments-merges-tp4296997p4297021.html

Re: Optimising segments merges

2016-09-20 Thread lukes

Thanks a lot Mike. Can we control how often a natural merge happens, or what are the factors that define when to kick off "natural merging"? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Optimising-segments-merges-tp4296997p4297017.html

Optimising segments merges

2016-09-20 Thread lukes

Hi, In my application I am committing (indexWriter.commit()) on every single document or batch of documents, but as a result lots of segments are getting generated (one option is that I don't commit, but just add documents to the indexWriter. But then, if the system crashes, the uncommitted documents wouldn't

Re: Approach for indexing and queryin good volume data.

2016-09-08 Thread lukes

Hi all, Can anyone please respond? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Approach-for-indexing-and-queryin-good-volume-data-tp4295109p4295218.html

Approach for indexing and queryin good volume data.

2016-09-07 Thread lukes

Hi all, I am planning to use Lucene (not in a cluster) for indexing and querying a good volume of data. The use case is 10-20 documents / second (roughly 15-20 fields) with queries running in parallel. Below is the approach I am planning to take; can anyone please let me know from their past experience if

Re: Newbie Questions

2016-08-09 Thread lukes

Thanks for the reply. Is there a way to partially update a document? I know there's an API, updateDocument on IndexWriter, but that seems to create a new document with just the field I am specifying. What I want is to delete some fields from an existing (indexed) document, and then add some new fields(c

Re: Newbie Questions

2016-08-08 Thread lukes

Can anyone please reply? Regards. -- View this message in context: http://lucene.472066.n3.nabble.com/Newbie-Questions-tp4290817p4290854.html

Partial update to document

2016-08-08 Thread lukes

Hi there, Is there a way to partially update a document? I know there's an API, updateDocument on IndexWriter, but that seems to create a new document with just the field I am specifying. What I want is to delete some fields from an existing (indexed) document, and then add some new fields(could or not

Newbie Questions

2016-08-08 Thread lukes

I am new to Lucene and have a couple of questions. I have worked with SOLR previously, and am now trying to work directly with Lucene to get similar functionality (but without SOLR). Below are my questions, thanks in advance. 1) In SOLR, there is the copyField functionality, which dynamically copies one field va

24 matches
