Hi Everyone, I have released version 1.7.9 of the MMApps Lucenemodule. This release has new features as well as some improvements and bug fixes.
New Features: 1) Extractor plugins Lucenemodule's dependencies on pdf and word content extraction are now handled by Extractors. This makes the module less depending on some external jars. Extractors have to implement an interface and have to be declared in the definition xml file: <extractor mimetype="application/ms-excel" extractorClass="net.sf.mmapps.modules.lucenesearch.extraction.impl.POIExcelExtractor" /> Extractors are executed by their mimetype by the Lucenemodule. You can register the same extractor with different mimetypes if you wish. The previously used dependencies for word and pdf are now included automatically as the default and can be overruled by defining a extractor with the same mimetype. 2) Support for Excel and RTF These formats are now supported as Extractors. 3) StandardCleaningAnalyzers This Analyzer which is available for Dutch and English are useful in most cases where you want to search and index word with special characters. Words like 'één' can then be found in their original form 'één' and base form 'een'. Improvements: 1) Cache cleanup MMBase cache is now cleaned when indexing of nodes takes place to keep the memory usage low, the behaviour of the 1.8 HugeNodeListIterator was simulated here. 2) SearchTag custom queries The custom queries in the search tag are not supported anymore, the taglibs match tag should be used. 3) re-use-index This option in the module xml file will keep the index in tact and only does updates on the index on a restart of MMBase. Use this option in combination with a large interval time to keep the module from re-indexing. 4) Usage of Lucene 1.9 The new version of Lucene is used and the module now uses the new Lucene 2.0 api. Bugfixes: Various bugs were fixed with the help of Alban Hertroys and others. grtz Wouter _______________________________________________ Developers mailing list [email protected] http://lists.mmbase.org/mailman/listinfo/developers
