Daniel Naber wrote:
Hi,
I think there are some small inaccuracies on
http://jakarta.apache.org/lucene/docs/fileformats.html:
-The first sentence refers to Lucene 1.4, but the end of the second
paragraph then refers to 1.3.
-"Term Vectors. For each field in each document, the term vector (sometimes
called document vector) is stored. A term vector consists of the term
text, term frequency and term position." -- I think term vectors don't
contain position information, do they?
Currently they donīt contain position information. I think there was a
version with position info in the beginning, but then positon info was
removed. I donīt remember why.
-"A file named "deletable" contains the names of files that are no longer
used by the index, but which could not be deleted. This is only generated
on Win32, where a file may not be deleted while it is still open." --
Actually the file is also generated on Linux, but it contains only 4 null
bytes.
Correct.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]