[
https://issues.apache.org/jira/browse/LUCENE-5969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir updated LUCENE-5969:
--------------------------------
Attachment: LUCENE-5969.patch
Here is a difference between trunk and branch as a patch.
* fixed .si to be immutable again, so copySegmentAsIs doesn't rewrite it.
Instead we strip segment prefixes just like .CFS (LUCENE-5412)
* disabled write-access to all old .SI writers, its no longer needed.
* fixed segment/commit unique ID generation (bugs with sign bits). Also changed
this to be a byte[] so it can be efficiently encoded.
* Add CodecUtil.write/checkSegmentHeader, which is a regular header, plus the
ID of the segment. This gives us mismatched files detection.
* beefed up assertingcodec more, with assertinglivedocs
* add lots of safety to .si/.fnm/.del
* moved out cruft to backwards-codecs.
I think this is a good point to merge, and then i will continue on with the
other parts of the index.
> Add Lucene50Codec
> -----------------
>
> Key: LUCENE-5969
> URL: https://issues.apache.org/jira/browse/LUCENE-5969
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Michael McCandless
> Fix For: 5.0, 6.0
>
> Attachments: LUCENE-5969.patch, LUCENE-5969.patch
>
>
> Spinoff from LUCENE-5952:
> * Fix .si to write Version as 3 ints, not a String that requires parsing at
> read time.
> * Lucene42TermVectorsFormat should not use the same codecName as
> Lucene41StoredFieldsFormat
> It would also be nice if we had a "bumpCodecVersion" script so rolling a new
> codec is not so daunting.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]