----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/28247/ -----------------------------------------------------------
Review request for lucene. Repository: lucene Description ------- This patch is a start towards adding a concordance capability to Lucene. It currently relies on converting queries to SpanQueries and then doing the calculations to build concordance windows with sort keys. Once spans are nuked and the positions branch is ready, it should be straightforward to modify this to use positions. There is plenty of room for optimization and for general cleanup. Use of concordances dates back to the 13th century (according to Wikipedia), but this can still be a very useful capability for advanced analysts, linguists and lawyers. Diffs ----- trunk/dev-tools/idea/.idea/ant.xml 1640617 trunk/dev-tools/idea/.idea/modules.xml 1640617 trunk/dev-tools/idea/.idea/workspace.xml 1640617 trunk/dev-tools/maven/lucene/pom.xml.template 1640617 trunk/lucene/build.xml 1640617 trunk/lucene/concordance/build.xml PRE-CREATION trunk/lucene/concordance/ivy.xml PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/AbstractConcordanceWindowCollector.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSearcher.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSearcherUtil.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSortKey.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSortOrder.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSorter.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceWindow.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceWindowCollector.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DedupingConcordanceWindowCollector.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DefaultSortKeyBuilder.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DocIdBuilder.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DocMetadataExtractor.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DocumentOrderSortKey.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/FieldBasedDocIdBuilder.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/IndexIdDocIdBuilder.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/SimpleDocMetadataExtractor.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/SortKeyBuilder.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/WindowBuilder.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/DocTokenOffsets.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/DocTokenOffsetsIterator.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/OffsetLengthStartComparator.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/OffsetStartComparator.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/OffsetUtil.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/RandomAccessCharOffsetContainer.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/ReanalyzingTokenCharOffsetsReader.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/SimpleAnalyzerUtil.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/TargetTokenNotFoundException.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/TokenCharOffsetRequests.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/TokenCharOffsetsReader.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/package.html PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/queries/SpanQueryConverter.java PRE-CREATION trunk/lucene/concordance/src/java/org/apache/lucene/search/spans/SimpleSpanQueryConverter.java PRE-CREATION trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/ConcordanceTestBase.java PRE-CREATION trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/ConcordanceTestUtils.java PRE-CREATION trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/TestConcordanceSearcher.java PRE-CREATION trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/TestSimpleAnalyzerUtil.java PRE-CREATION trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/TestSpanQueryConverter.java PRE-CREATION trunk/lucene/module-build.xml 1640617 Diff: https://reviews.apache.org/r/28247/diff/ Testing ------- Some unit tests. Versions of this code have been deployed in a prototype for two years. More tests welcomed. Thanks, Tim Allison
