Lucene 4.3.1 CheckIndex limitation 100 trillion tokens?

2013-07-29 Thread Tom Burton-West
We have very large indexes, almost a terabyte for a single index, and normally it takes overnight to run a checkindex. I started a CheckIndex on Friday and today (Monday) it seems to be stuck testing vectors although we haven't got vectors turned on. (See below) The output file was last written J

Re: getNumericDocValues

2013-07-29 Thread Yonghui Zhao
Got it, thank you very much. On 2013-7-29 at 11:34 PM, "Adrien Grand" wrote: > Hi, > > On Mon, Jul 29, 2013 at 4:56 PM, Yonghui Zhao > wrote: > > I want to know what will be returned if the input docID is not a valid > id, > > for examples: > > > > 1. the docID beyonds the reader scope > > In that case,

RE: Detect a corrupted index

2013-07-29 Thread ABlaise
Thanks a lot! I must have been very tired on Friday not to be able to find the CheckIndex class... I looked for it in the documentation, but I guess I was not even able to type correctly on my keyboard... This works well but doesn't detect all corruption; I had to add the following test: if

Re: Did you Mean search on Indexes created by Different Files.

2013-07-29 Thread Varun Thacker
Hi, On Mon, Jul 29, 2013 at 4:36 PM, Ankit Murarka < ankit.mura...@rancoretech.com> wrote: > Since I am new to this, I can't stop exploring it and trying to use > different features. > > I am now trying to implement "Did you Mean " search using SpellChecker jar > and Lucene jar. > > The problem

Re: getNumericDocValues

2013-07-29 Thread Adrien Grand
Hi, On Mon, Jul 29, 2013 at 4:56 PM, Yonghui Zhao wrote: > I want to know what will be returned if the input docID is not a valid id, > for examples: > > 1. the docID beyonds the reader scope In that case, the behavior is not defined: it might throw an exception or return a random value. You sh

getNumericDocValues

2013-07-29 Thread Yonghui Zhao
In Lucene 4.3, AtomicReader has this interface: public abstract NumericDocValues getNumericDocValues(String field) throws IOException. If I get a NumericDocValues for one field from a reader, NumericDocValues has a get interface. /** * Returns the numeric value for the specified document ID.

Re: Did you Mean search on Indexes created by Different Files.

2013-07-29 Thread Ankit Murarka
Can anyone guide me on how to achieve the objective mentioned below? I am kind of stuck and cannot figure out what is wrong. I also tried spellChecker.setSpellIndex(directory); but still get no suggestions. On 7/29/2013 4:36 PM, Ankit Murarka wrote: Since I am new to this, I can't stop exploring it

Re: ERROR: could not read any segments file in directory

2013-07-29 Thread Prakash Chinnakannan
Mike, Just out of curiosity, how do I find the number of commits to an index? Do you have any links to read about this? Thanks, Prakash On Mon, Jul 29, 2013 at 4:25 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > On Mon, Jul 29, 2013 at 5:03 AM, Prakash Chinnakannan > wrote: > > Tha

Did you Mean search on Indexes created by Different Files.

2013-07-29 Thread Ankit Murarka
Since I am new to this, I can't stop exploring it and trying to use different features. I am now trying to implement "Did you Mean" search using the SpellChecker jar and the Lucene jar. The problems I faced are plenty, although I have got it working. Code snippet: File dir = new File("D:\\Inde\\");

Re: ERROR: could not read any segments file in directory

2013-07-29 Thread Michael McCandless
On Mon, Jul 29, 2013 at 5:03 AM, Prakash Chinnakannan wrote: > Thanks Mike, we'd restored the index content from the backup. I will also > suggest to move the index to local storage. OK, good that you have backups. >> Do you commit frequently and has this index been running for a very, very > l

Re: ERROR: could not read any segments file in directory

2013-07-29 Thread Prakash Chinnakannan
Thanks Mike, we'd restored the index content from the backup. I will also suggest to move the index to local storage. > Do you commit frequently and has this index been running for a very, very long time? Yes, index files would be there for more than 3 yrs and frequent add/update and delete oper