Actually ClassicTokenizer seems to do the job. Any side effects of using
ClassicTokenizer rather than StandardTokenizer ?
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Exclusion-List-for-standard-tokenizer-tp4306511p4306516.html
Sent from the Lucene - Java Users
Hi,
Is there any exclusion list of characters which can be defined for
StandardTokenizer ? In my case, i want to use StandardTokenizer(as it solves
many problems of when to tokenization across languages) but i don't want to
tokenize the stream on certain characters for example '@'. Is there a wa
Thanks Mike. Yeah, i saw the changelist you mentioned. Unfortunately i can't
upgrade to 6.2 because of stack limitations :( .
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Getting-list-of-committed-documents-tp4305258p4305728.html
Sent from the Lucene - Java User
Hi Michael,
Thanks for the reply. Regarding IW(IndexWriter) returning long sequence
number, i looked at the signature of commit and it seems to be void. Can you
please point me in the direction ? I am using Lucene 5.5.2. Also is this
number aggregation of deletes, updates and new documents ? Is
Hi,
Can anyone please suggest or point in some directions.
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Getting-list-of-committed-documents-tp4305258p4305503.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
--
Hi all,
I need some feedback on getting hold of documents which got committed
during commit call on indexwriter. There are multiple threads which keeps on
adding documents to indexWriter in parallel, and there's another thread
which wakes up after n number of minutes and does the commit. Below a
If you open the source, you will see it internally calls
this.getBestFragments(tokenStream, text, maxNumFragments) which in turn
calls
this.getBestTextFragments(tokenStream, text, true, maxNumFragments) (*with
flag true*) which will merge the fragments automatically.
Regards.
--
View this
Please pass false to mergeContiguousFragments in
getBestTextFragments(TokenStream tokenStream, String text, boolean
mergeContiguousFragments, int maxNumFragments) and it should work as
expected.
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/getBestFragments-with-
Thanks Mike. I discovered that earlier.
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/PhraseQuery-tp4299871p4300752.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
Hi all,
I am trying to do phrase query search, but hitting into problem of not
getting any results. Basically, i am indexing the document with some fields
in it(String, Text, Numbers and so on). My basic queries are working, but
when i am trying to do PhraseQueries i am not finding any results.
Hi Michael,
Sorry for the confusion. I didn't mean System, but various types of
locales i am indexing till that point :).
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4298170.html
Sent from the Lucene - Java Users mailing
1 more question :). Are numbers analyzed ? Like IntField, LongField, etc. ?
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4297949.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
--
Hi Michael,
Thanks for the reply. Yeah, that's one brute force way of doing it. But
then one has to find all the locales in the system. Is there any other way ?
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4297948.html
Sent f
Hi all,
Any suggestions from the experts ? I assume, this problem is not coming
for the first time.
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Handling-multiple-locale-tp4297805p4297927.html
Sent from the Lucene - Java Users mailing list archive at Nabb
Hi all,
I am trying to handle multiple locale(for indexing and querying). Below is
the problem i am trying to solve.
I am indexing list of documents in lucene with naming conventions :
(fieldName) + "_" + locale(language part only ex en, ar, ... ) and proving
PerFieldAnalyzerWrapper to indexWri
Thanks Mike...
Regards
--
View this message in context:
http://lucene.472066.n3.nabble.com/Optimising-segments-merges-tp4296997p4297021.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
To unsubscri
Thanks a lot Mike. Can we control of how often natural merge should happen,
or what are the factors that define when to kick off "natural merging" ?
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Optimising-segments-merges-tp4296997p4297017.html
Sent from the Luce
Hi,
In my application i am committing(indexWriter.commit() on every
single/batch of documents, but now as a result there are lots of segments
getting generated(One thing can be, i don't do commit, but just add document
to indexWriter. But now, if system crashes then uncommited documents
wouldn't
Hi all,
Can anyone please respond ?
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Approach-for-indexing-and-queryin-good-volume-data-tp4295109p4295218.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
Hi all,
I am planning to use Lucene(not in cluster) for indexing and querying good
volume data. Use case is, 10-20 documents / second(roughly around 15-20
fields) and in parallel doing query. Below is the approach i am planning to
take, can anyone please let me know from their past experience if
Thanks for the reply.
Is there a way to partially update the document ? I know there's a API
updateDocument on IndexWriter, but that seems to create a new document with
just a field i am specifying. What i want is delete some fields from
existing(indexed) document, and then add some new fields(c
Can anyone please reply ? .
Regards.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Newbie-Questions-tp4290817p4290854.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
To unsub
Hi There,
Is there a way to partially update the document ? I know there's a API
updateDocument on IndexWriter, but that seems to create a new document with
just a field i am specifying. What i want is delete some fields from
existing(indexed) document, and then add some new fields(could or not
I am new to Lucene and couple of questions. I have worked with SOLR
previously, now trying to work directly with Lucene to get similar
functionality(but without SOLR). Below are my questions, thanks in advance.
1) In SOLR, there is functionality of copyfields, which dynamically copies
one field va
24 matches
Mail list logo