Re: Lucene deleteDocument

2015-06-20 Thread Đạt Cao Mạnh
You can check following possibilities : - doc.id.toString() return same string for all of your documents. - your tokenstream for doc.id.toString() return same token for all of your documents. Ex: id 1 23 - {1,23}. id 1 56 - {1,56}. So when you delete document by using iw.deleteDocuments(new

Re: Exception While searching through indices.

2015-06-14 Thread Đạt Cao Mạnh
, On Sat, Jun 13, 2015 at 5:40 AM, Đạt Cao Mạnh caomanhdat...@gmail.com wrote: Hi, the total number of documents in an index of lucene is Integer.MAX_VALUE. So using a single lucene index to index billions documents is not a proper ways. You should consider using Solr Cloud or Elasticsearch to index

Re: Exception While searching through indices.

2015-06-12 Thread Đạt Cao Mạnh
Hi, the total number of documents in an index of lucene is Integer.MAX_VALUE. So using a single lucene index to index billions documents is not a proper ways. You should consider using Solr Cloud or Elasticsearch to index your documents. On 19:43, Fri, 12 Jun 2015 Gimantha Bandara

Re: Cannot comment on Jira issues

2016-04-22 Thread Đạt Cao Mạnh
t; Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > -Original Message- > > From: Đạt Cao Mạnh [mailto:caomanhdat...@gmail.com] > > Sent: Friday, April 22, 2016 12:13 PM > > To: java-user@lucene.apache.o

Cannot comment on Jira issues

2016-04-22 Thread Đạt Cao Mạnh
Recently, I cant comment on any jira issues include the one that i created ( https://issues.apache.org/jira/browse/LUCENE-6968). I tried to create a new account but the new one cannot comment too.

Re: Similarity Implementation

2016-07-07 Thread Đạt Cao Mạnh
Hi Siraj, I think https://lucene.apache.org/core/6_1_0/core/index.html?org/apache/lucene/search/ConstantScoreQuery.html should be good enough. On Fri, Jul 8, 2016 at 12:27 AM Siraj Haider wrote: > We are in the process of upgrading from 2.x to 6.x. In 2.x we implemented >

Re: Setting up Lucene 6.x in IntelliJ

2016-09-05 Thread Đạt Cao Mạnh
Hi Reth, You can follow the guide that I posted on http://datcm.blogspot.com/2016/02/setup-enviroment-hacking-into-solr.html. You will have an environment for both Lucene and SOLR. On Tue, Sep 6, 2016 at 10:11 AM Reth RM wrote: > Hello, > > > According to the wiki( >

Re: How to use LSH Filter

2016-09-05 Thread Đạt Cao Mạnh
Hi Reth, MinHashFilterTest class have enough information on how to test MinHashFilter as well as how to use it. On Tue, Sep 6, 2016 at 8:41 AM Reth RM wrote: > Re: this jira regarding LSH > https://issues.apache.org/jira/browse/LUCENE-6968 > > Is there any documentation

Re: Duplicate filtering

2016-09-20 Thread Đạt Cao Mạnh
Solr already support de-duplication when adding new documents. You can refer to the doc at https://cwiki.apache.org/confluence/display/solr/De-Duplication On Tue, Sep 20, 2016 at 12:18 PM Vjeran Marcinko < vjeran.marci...@email.t-com.hr> wrote: > Hello, > > I'm pretty much Lucene newb, so