Re: Indexing with Lucene 1.4.3
The only place where you have to specify that you are using the compound index format is on the IndexWriter instance. Nothing needs to be done at search time on the IndexSearcher.

Otis

--- Hetan Shah <[EMAIL PROTECTED]> wrote:
> Another question: do I have to specify somewhere in my code, or in some
> configuration setting, that I am now using the compound file format
> (.cfs file) for the index? I have an application that was working in
> version 1.3-final until I moved to 1.4.3; now I do not get any results
> back from my searches.
>
> I tried using Luke and it shows me the content of the index. I can
> search using Luke, but no success so far with my own application.

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
Re: Indexing with Lucene 1.4.3
Thanks Chuck,

I now understand why I see only one file. Another question: do I have to specify somewhere in my code, or in some configuration setting, that I am now using the compound file format (.cfs file) for the index? I have an application that was working in version 1.3-final until I moved to 1.4.3; now I do not get any results back from my searches.

I tried using Luke and it shows me the content of the index. I can search using Luke, but no success so far with my own application.

Any pointers?

Thanks.
-H

Chuck Williams wrote:
> That looks right to me, assuming you have done an optimize. All of your
> index segments are merged into the one .cfs file (which is large,
> right?). Try searching -- it should work.
>
> Chuck
Re: Indexing with Lucene 1.4.3
> That looks right to me, assuming you have done an optimize. All of your
> index segments are merged into the one .cfs file (which is large,
> right?). Try searching -- it should work.

Chuck is right: the index looks fine and will be searchable. Since Lucene version 1.4, the index is stored by default in the compound file format. The "index" files you are "missing" are merged into one compound file, which has the extension "cfs". You can disable the compound file option with IndexWriter's setUseCompoundFile(false).

Bernhard
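To make the point about the "missing" files concrete, here is a minimal sketch of a check on a directory listing. It is only a heuristic based on the file names mentioned in this thread (a "segments" file plus at least one ".cfs" file); the class and method names are hypothetical and are not part of the Lucene API.

```java
import java.util.List;

// Hypothetical helper, not a Lucene class: guesses from file names alone
// whether an index directory uses the compound file format.
final class IndexLayout {
    private IndexLayout() {}

    // Returns true when the listing contains a "segments" file and at
    // least one file ending in ".cfs", as in the directory described
    // in the original question.
    static boolean looksLikeCompoundIndex(List<String> fileNames) {
        boolean hasSegments = fileNames.contains("segments");
        boolean hasCompoundFile = false;
        for (String name : fileNames) {
            if (name.endsWith(".cfs")) {
                hasCompoundFile = true;
            }
        }
        return hasSegments && hasCompoundFile;
    }
}
```

For a real directory, the listing could come from something like java.util.Arrays.asList(new java.io.File(indexDir).list()), where indexDir is whatever path your indexer wrote to. For the three files reported in the original message the check comes out true.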
RE: Indexing with Lucene 1.4.3
Hi there,

Apologies.

If you are using IndexHTML from the demo.jar package available in Lucene1.4.3.zip, then you had better look at the file extensions of your files; they may be filtered out of the indexing process by this code in IndexHTML.java:

    } else if (file.getPath().endsWith(".html") || // index .html files
               file.getPath().endsWith(".htm") ||  // index .htm files
               file.getPath().endsWith(".txt")) {  // index .txt files

If your files' extensions are among the 'endsWith' options, then you have successfully indexed your 6000 documents. Try the Luke monitoring software available from the Jakarta Lucene web site to check.

[Hint: try using SearchFiles.class from Lucene1.4.3.zip to search the documents you have indexed successfully.]

With regards,
Karthik

-Original Message-
From: Hetan Shah [mailto:[EMAIL PROTECTED]
Sent: Friday, December 17, 2004 12:30 AM
To: Lucene Users List
Subject: Indexing with Lucene 1.4.3

Hello,

I have been trying to index around 6000 documents using IndexHTML from 1.4.3 and at the end of indexing in my index directory I only have 3 files.
segments
deletable and
_5en.cfs

Can someone tell me what is going on and where are the actual index files? How can I resolve this issue?
Thanks.
-H
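If you want to check your own file names against that condition before re-indexing, the same three-suffix test can be pulled out into a small standalone method. This is only a sketch: the class and method names are hypothetical and do not appear in IndexHTML.java, which does the test inline on file.getPath().

```java
// Hypothetical standalone version of the suffix test quoted from
// IndexHTML.java; the real demo performs this check inline.
final class IndexableFilter {
    // The same three suffixes as in the quoted fragment.
    private static final String[] SUFFIXES = {".html", ".htm", ".txt"};

    private IndexableFilter() {}

    // Returns true when the path ends with one of the suffixes.
    // String.endsWith is case-sensitive, so "PAGE.HTML" does not match.
    static boolean isIndexable(String path) {
        for (String suffix : SUFFIXES) {
            if (path.endsWith(suffix)) {
                return true;
            }
        }
        return false;
    }
}
```

Note that because the comparison is case-sensitive, files with upper-case extensions would be skipped by a test written this way, which may be worth checking if fewer documents were indexed than you expected.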
RE: Indexing with Lucene 1.4.3
That looks right to me, assuming you have done an optimize. All of your index segments are merged into the one .cfs file (which is large, right?). Try searching -- it should work.

Chuck

> -Original Message-
> From: Hetan Shah [mailto:[EMAIL PROTECTED]
> Sent: Thursday, December 16, 2004 11:00 AM
> To: Lucene Users List
> Subject: Indexing with Lucene 1.4.3
>
> Hello,
>
> I have been trying to index around 6000 documents using IndexHTML from
> 1.4.3 and at the end of indexing in my index directory I only have 3
> files.
> segments
> deletable and
> _5en.cfs
>
> Can someone tell me what is going on and where are the actual index
> files? How can I resolve this issue?
> Thanks.
> -H
Indexing with Lucene 1.4.3
Hello,

I have been trying to index around 6000 documents using IndexHTML from 1.4.3, and at the end of indexing my index directory contains only 3 files:

segments
deletable
_5en.cfs

Can someone tell me what is going on and where the actual index files are? How can I resolve this issue?

Thanks.
-H