Re: Indexing with Lucene 1.4.3

2004-12-17 Thread Otis Gospodnetic
The only place where you have to specify that you are using the
compound index format is on IndexWriter instance.  Nothing needs to be
done at search time on IndexSearcher.

Otis

--- Hetan Shah <[EMAIL PROTECTED]> wrote:

> Thanks Chuck,
> 
> I now understand why I see only one file. Another question is do I
> have 
> to specify somewhere in my code or some configuration setting that I 
> would now be using a compound file format (.cfs file) for index. I
> have 
> an application that was working in version 1.3-final till I moved to 
> 1.4.3 now I do not get any results back from my searches.
> 
> I tried using Luke and it shows me the content of the index. I can 
> search using Luke but no success so far with my own application.
> 
> Any pointers?
> 
> Thanks.
> -H
> 
> Chuck Williams wrote:
> 
> >That looks right to me, assuming you have done an optimize.  All of
> your
> >index segments are merged into the one .cfs file (which is large,
> >right?).  Try searching -- it should work.
> >
> >Chuck
> >
> >  > -Original Message-
> >  > From: Hetan Shah [mailto:[EMAIL PROTECTED]
> >  > Sent: Thursday, December 16, 2004 11:00 AM
> >  > To: Lucene Users List
> >  > Subject: Indexing with Lucene 1.4.3
> >  > 
> >  > Hello,
> >  > 
> >  > I have been trying to index around 6000 documents using
> IndexHTML
> >from
> >  > 1.4.3 and at the end of indexing in my index directory I only
> have 3
> >  > files.
> >  > segments
> >  > deletable and
> >  > _5en.cfs
> >  > 
> >  > Can someone tell me what is going on and where are the actual
> index
> >  > files? How can I resolve this issue?
> >  > Thanks.
> >  > -H
> >  > 
> >  > 
> >  >
>
>-
> >  > To unsubscribe, e-mail:
> [EMAIL PROTECTED]
> >  > For additional commands, e-mail:
> [EMAIL PROTECTED]
> >
> >
>
>-
> >To unsubscribe, e-mail: [EMAIL PROTECTED]
> >For additional commands, e-mail: [EMAIL PROTECTED]
> >
> >
> 
> 
> 
> -
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
> 


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Indexing with Lucene 1.4.3

2004-12-17 Thread Hetan Shah
Thanks Chuck,
I now understand why I see only one file. Another question is do I have 
to specify somewhere in my code or some configuration setting that I 
would now be using a compound file format (.cfs file) for index. I have 
an application that was working in version 1.3-final till I moved to 
1.4.3 now I do not get any results back from my searches.

I tried using Luke and it shows me the content of the index. I can 
search using Luke but no success so far with my own application.

Any pointers?
Thanks.
-H
Chuck Williams wrote:
That looks right to me, assuming you have done an optimize.  All of your
index segments are merged into the one .cfs file (which is large,
right?).  Try searching -- it should work.
Chuck
 > -Original Message-
 > From: Hetan Shah [mailto:[EMAIL PROTECTED]
 > Sent: Thursday, December 16, 2004 11:00 AM
 > To: Lucene Users List
 > Subject: Indexing with Lucene 1.4.3
 > 
 > Hello,
 > 
 > I have been trying to index around 6000 documents using IndexHTML
from
 > 1.4.3 and at the end of indexing in my index directory I only have 3
 > files.
 > segments
 > deletable and
 > _5en.cfs
 > 
 > Can someone tell me what is going on and where are the actual index
 > files? How can I resolve this issue?
 > Thanks.
 > -H
 > 
 > 
 >
-
 > To unsubscribe, e-mail: [EMAIL PROTECTED]
 > For additional commands, e-mail: [EMAIL PROTECTED]

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


Re: Indexing with Lucene 1.4.3

2004-12-17 Thread Bernhard Messer

That looks right to me, assuming you have done an optimize.  All of your
index segments are merged into the one .cfs file (which is large,
right?).  Try searching -- it should work.
 

Chuck is right, the index looks fine and will be searchable. Since lucene version 1.4, the index is stored per default using the compound file format. The "index" files you are "missing" are merged within one compound file which has the extension "cfs". You can disable the compound file option using 
IndexWriters setUseCompoundFile(false).

Bernhard
 > -Original Message-
 > From: Hetan Shah [mailto:[EMAIL PROTECTED]
 > Sent: Thursday, December 16, 2004 11:00 AM
 > To: Lucene Users List
 > Subject: Indexing with Lucene 1.4.3
 > 
 > Hello,
 > 
 > I have been trying to index around 6000 documents using IndexHTML
from
 > 1.4.3 and at the end of indexing in my index directory I only have 3
 > files.
 > segments
 > deletable and
 > _5en.cfs
 > 
 > Can someone tell me what is going on and where are the actual index
 > files? How can I resolve this issue?
 > Thanks.
 > -H
 > 
 > 
 >
-
 > To unsubscribe, e-mail: [EMAIL PROTECTED]
 > For additional commands, e-mail: [EMAIL PROTECTED]

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
 


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


RE: Indexing with Lucene 1.4.3

2004-12-16 Thread Karthik N S

Hi there

Apologies.



   If u are using the IndexHTML from the demo.jar package which is
abvaliable from Lucene1.4.3.zip

 Then u bettter look at the File Extensions of u'r file's,they may be
filtered out of the indexing process

 due to this code present in IndexHTML.java
 >
 > } else if (file.getPath().endsWith(".html") || // index .html files
 > file.getPath().endsWith(".htm") || // index .htm files
 > file.getPath().endsWith(".txt")) { // index .txt files
 >


It the Extensions u have is within the 'endsWith' options then u have
sucessfully indexed the 6000 Documents of u's

Try to use the Luke Monitering S/f avaliable from the Jakartha Lucene Web
site and check for the same

[Hint Try to use the SearchFiles.class from the Lucene1.4.3.zip to search
onthe documents u have indexed sucessfuly]


with regards
Karthik






-Original Message-
From: Hetan Shah [mailto:[EMAIL PROTECTED]
Sent: Friday, December 17, 2004 12:30 AM
To: Lucene Users List
Subject: Indexing with Lucene 1.4.3


Hello,

I have been trying to index around 6000 documents using IndexHTML from
1.4.3 and at the end of indexing in my index directory I only have 3 files.
segments
deletable and
_5en.cfs

Can someone tell me what is going on and where are the actual index
files? How can I resolve this issue?
Thanks.
-H


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



RE: Indexing with Lucene 1.4.3

2004-12-16 Thread Chuck Williams
That looks right to me, assuming you have done an optimize.  All of your
index segments are merged into the one .cfs file (which is large,
right?).  Try searching -- it should work.

Chuck

  > -Original Message-
  > From: Hetan Shah [mailto:[EMAIL PROTECTED]
  > Sent: Thursday, December 16, 2004 11:00 AM
  > To: Lucene Users List
  > Subject: Indexing with Lucene 1.4.3
  > 
  > Hello,
  > 
  > I have been trying to index around 6000 documents using IndexHTML
from
  > 1.4.3 and at the end of indexing in my index directory I only have 3
  > files.
  > segments
  > deletable and
  > _5en.cfs
  > 
  > Can someone tell me what is going on and where are the actual index
  > files? How can I resolve this issue?
  > Thanks.
  > -H
  > 
  > 
  >
-
  > To unsubscribe, e-mail: [EMAIL PROTECTED]
  > For additional commands, e-mail: [EMAIL PROTECTED]


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Indexing with Lucene 1.4.3

2004-12-16 Thread Hetan Shah
Hello,
I have been trying to index around 6000 documents using IndexHTML from 
1.4.3 and at the end of indexing in my index directory I only have 3 files.
segments
deletable and
_5en.cfs

Can someone tell me what is going on and where are the actual index 
files? How can I resolve this issue?
Thanks.
-H

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]