Some more possible bullets (maybe a little TOO verbose): * Numerous performance improvements: faster exact PhraseQuery; natural segment merging favors segments with deletions; primary key lookup is faster; IndexWriter.addIndexes(Directory[]) uses file copy instead of merging; BufferedIndexInput does fewer bounds checks; compound file is dynamically turned off for large segments; fully deleted segments are dropped on commit; faster snowball analyzers (in contrib); ConcurrentMergeScheduler is more careful about setting priority of merge threads.
* IndexWriter is now configured with a new separate builder API (IndexWriterConfig). * Improved Unicode support: Unicode 4 is supported in core analyzers; StandardAnalyzer now implements the UAX #29 Unicode Text Segmentation standard from Unicode 6.0.0. * IndexWriter.getReader is replaced by IndexReader.open(IndexWriter). In addition you can now specify whether deletes should be resolved when you open an NRT reader. * PayloadProcessorProvider enables your app to customize how payloads are merged. * MultiSearcher is deprecated; ParallelMultiSearcher has been absorbed directly into IndexSearcher * CharTermAttribute replaces TermAttribute (nocommit what else to say?) * On 64bit Windows and Solaris JVMs, MMapDirectory is now the default implementation (returned by FSDirectory.open). MMapDirectory also enables unmapping if the JVM supports it. * New TotalHitCountCollector just counts total number of hits * ReaderFinishedListener API enables external caches to evict entries once a segment is finished * Totally reorganized contrib (nocommit what to say?) Mike http://blog.mikemccandless.com On Thu, Mar 24, 2011 at 4:17 PM, Yonik Seeley <[email protected]> wrote: > I took a look through the "New features" section of CHANGES and > took a shot at what we could put on the lucene website: > > March 2011, Lucene 3.1 available > This release contains numerous bug fixes, optimizations, and > improvements , including: > - Improved Unicode support, including Unicode 4 > - ReusableAnalyzerBase make it easier to reuse TokenStreams correctly > - Protedted words in stemming via KeywordAttribute > - ConstantScoreQuery now allows directly wrapping a Query > - Support for custom ExecutorService in ParallelMultiSearcher > - IndexWriterConfig.setMaxThreadStates for controls of IndexWriter threads > > It's a bit arbitrary, but a starting point at least. Thoughts? > > -Yonik > http://www.lucenerevolution.org -- Lucene/Solr User Conference, May > 25-26, San Francisco > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > > --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
