dnaber      2004/10/13 14:01:12

  Modified:    xdocs    fileformats.xml
               docs     fileformats.html
  Log:
  update to the file format by Bernhard Messer; adding a link to non-Java versions of Lucene

  Revision  Changes    Path
  1.17      +22 -4     jakarta-lucene/xdocs/fileformats.xml

  Index: fileformats.xml
  ===================================================================
  RCS file: /home/cvs/jakarta-lucene/xdocs/fileformats.xml,v
  retrieving revision 1.16
  retrieving revision 1.17
  diff -u -r1.16 -r1.17
  --- fileformats.xml	7 Aug 2004 11:27:29 -0000	1.16
  +++ fileformats.xml	13 Oct 2004 21:01:12 -0000	1.17
  @@ -19,8 +19,10 @@

           <p>
               Jakarta Lucene is written in Java, but several
  -            efforts are underway to write versions of Lucene in other programming
  -            languages. If these versions are to remain compatible with Jakarta
  +            efforts are underway to write
  +            <a href="http://wiki.apache.org/jakarta-lucene/LuceneImplementations">versions
  +            of Lucene in other programming
  +            languages</a>. If these versions are to remain compatible with Jakarta
               Lucene, then a language-independent definition of the Lucene index
               format is required. This document thus attempts to provide a
               complete and independent definition of the Jakarta Lucene 1.4 file
  @@ -891,16 +893,32 @@

               <p>FieldNum -->
                   VInt
               </p>
  +
  +            <p><b>Lucene <= 1.4:</b></p>
               <p>Bits -->
                   Byte
               </p>
               <p>Value -->
                   String
               </p>
  -            <p>Currently
  -                only the low-order bit is used of Bits is used. It is one for
  +            <p>Only the low-order bit of Bits is used. It is one for
                   tokenized fields, and zero for non-tokenized fields.
               </p>
  +            <p><b>Lucene >= 1.9:</b></p>
  +            <p>Bits -->
  +                Byte
  +            </p>
  +            <p>
  +                <ul>
  +                    <li>low order bit is one for tokenized fields</li>
  +                    <li>second bit is one for fields containing binary data</li>
  +                    <li>third bit is one for fields with compression option enabled</li>
  +                </ul>
  +            </p>
  +            <p>Value -->
  +                String (default) or byte[] when the field value is a binary field
  +            </p>
  +
           </li>
       </ol>



  1.30      +22 -4     jakarta-lucene/docs/fileformats.html

  Index: fileformats.html
  ===================================================================
  RCS file: /home/cvs/jakarta-lucene/docs/fileformats.html,v
  retrieving revision 1.29
  retrieving revision 1.30
  diff -u -r1.29 -r1.30
  --- fileformats.html	7 Aug 2004 11:27:30 -0000	1.29
  +++ fileformats.html	13 Oct 2004 21:01:12 -0000	1.30
  @@ -142,8 +142,10 @@
           </p>
           <p>
               Jakarta Lucene is written in Java, but several
  -            efforts are underway to write versions of Lucene in other programming
  -            languages. If these versions are to remain compatible with Jakarta
  +            efforts are underway to write
  +            <a href="http://wiki.apache.org/jakarta-lucene/LuceneImplementations">versions
  +            of Lucene in other programming
  +            languages</a>. If these versions are to remain compatible with Jakarta
               Lucene, then a language-independent definition of the Lucene index
               format is required. This document thus attempts to provide a
               complete and independent definition of the Jakarta Lucene 1.4 file
  @@ -1307,16 +1309,32 @@
               <p>FieldNum -->
                   VInt
               </p>
  +
  +            <p><b>Lucene <= 1.4:</b></p>
               <p>Bits -->
                   Byte
               </p>
               <p>Value -->
                   String
               </p>
  -            <p>Currently
  -                only the low-order bit is used of Bits is used. It is one for
  +            <p>Only the low-order bit of Bits is used. It is one for
                   tokenized fields, and zero for non-tokenized fields.
               </p>
  +            <p><b>Lucene >= 1.9:</b></p>
  +            <p>Bits -->
  +                Byte
  +            </p>
  +            <p>
  +                <ul>
  +                    <li>low order bit is one for tokenized fields</li>
  +                    <li>second bit is one for fields containing binary data</li>
  +                    <li>third bit is one for fields with compression option enabled</li>
  +                </ul>
  +            </p>
  +            <p>Value -->
  +                String (default) or byte[] when the field value is a binary field
  +            </p>
  +
           </li>
       </ol>
   </blockquote>
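
For readers porting the format to another language, the Bits byte described for Lucene >= 1.9 could be decoded roughly as follows. This is only a sketch and not code taken from Lucene: the class name StoredFieldBits, the constant names and the describe helper are hypothetical, and the masks assume that "second bit" and "third bit" mean 0x02 and 0x04 respectively.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: decodes the stored-field Bits byte
// as described for Lucene >= 1.9 in the file format document.
final class StoredFieldBits {
    // Assumed masks: low-order bit, second bit, third bit.
    static final int TOKENIZED = 0x01;   // field is tokenized
    static final int BINARY = 0x02;      // field value is binary data (byte[])
    static final int COMPRESSED = 0x04;  // compression option enabled

    private StoredFieldBits() {}

    static boolean isTokenized(byte bits) {
        return (bits & TOKENIZED) != 0;
    }

    static boolean isBinary(byte bits) {
        return (bits & BINARY) != 0;
    }

    static boolean isCompressed(byte bits) {
        return (bits & COMPRESSED) != 0;
    }

    // Returns a comma-separated list of the flags that are set, or "none".
    static String describe(byte bits) {
        List<String> flags = new ArrayList<>();
        if (isTokenized(bits)) flags.add("tokenized");
        if (isBinary(bits)) flags.add("binary");
        if (isCompressed(bits)) flags.add("compressed");
        return flags.isEmpty() ? "none" : String.join(", ", flags);
    }
}
```

Under these assumptions, an index written by Lucene <= 1.4 only ever sets the low-order bit, so a reader built this way would report such fields as either tokenized or not tokenized and never as binary or compressed.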