-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/28247/
-----------------------------------------------------------

Review request for lucene.


Repository: lucene


Description
-------

This patch is a start towards adding a concordance capability to Lucene.  It 
currently relies on converting queries to SpanQueries and then doing the 
calculations to build concordance windows with sort keys.  Once spans are nuked 
and the positions branch is ready, it should be straightforward to modify this 
to use positions.

There is plenty of room for optimization and for general cleanup.

Use of concordances dates back to the 13th century (according to Wikipedia), 
but this can still be a very useful capability for advanced analysts, linguists 
and lawyers.


Diffs
-----

  trunk/dev-tools/idea/.idea/ant.xml 1640617 
  trunk/dev-tools/idea/.idea/modules.xml 1640617 
  trunk/dev-tools/idea/.idea/workspace.xml 1640617 
  trunk/dev-tools/maven/lucene/pom.xml.template 1640617 
  trunk/lucene/build.xml 1640617 
  trunk/lucene/concordance/build.xml PRE-CREATION 
  trunk/lucene/concordance/ivy.xml PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/AbstractConcordanceWindowCollector.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSearcher.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSearcherUtil.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSortKey.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSortOrder.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceSorter.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceWindow.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/ConcordanceWindowCollector.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DedupingConcordanceWindowCollector.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DefaultSortKeyBuilder.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DocIdBuilder.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DocMetadataExtractor.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/DocumentOrderSortKey.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/FieldBasedDocIdBuilder.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/IndexIdDocIdBuilder.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/SimpleDocMetadataExtractor.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/SortKeyBuilder.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/WindowBuilder.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/DocTokenOffsets.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/DocTokenOffsetsIterator.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/OffsetLengthStartComparator.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/OffsetStartComparator.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/OffsetUtil.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/RandomAccessCharOffsetContainer.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/ReanalyzingTokenCharOffsetsReader.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/SimpleAnalyzerUtil.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/TargetTokenNotFoundException.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/TokenCharOffsetRequests.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/charoffsets/TokenCharOffsetsReader.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/concordance/package.html
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/queries/SpanQueryConverter.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/java/org/apache/lucene/search/spans/SimpleSpanQueryConverter.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/ConcordanceTestBase.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/ConcordanceTestUtils.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/TestConcordanceSearcher.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/TestSimpleAnalyzerUtil.java
 PRE-CREATION 
  
trunk/lucene/concordance/src/test/org/apache/lucene/search/concordance/TestSpanQueryConverter.java
 PRE-CREATION 
  trunk/lucene/module-build.xml 1640617 

Diff: https://reviews.apache.org/r/28247/diff/


Testing
-------

Some unit tests.  Versions of this code have been deployed in a prototype for 
two years. More tests welcomed.


Thanks,

Tim Allison

Reply via email to