[ 
https://issues.apache.org/jira/browse/LUCENE-2084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steven Rowe updated LUCENE-2084:
--------------------------------

    Attachment: collation.benchmark.tar.bz2

Atached collation.benchmark.tar.bz2, which contains stuff to run an 
analysis-only contrib benchmark for the (ICU)CollationKeyAnalyzers over 4 
languages: English, French, German, and Ukrainian.

Included are:

# For each language, a line-doc containing the most frequent 100K words from a 
corresponding Wikipedia dump from November 2009;
# For each language, Java code for a no-argument analyzer callable from a 
benchmark alg, that specializes (ICU)CollationKeyAnalyzer and uses 
PerFieldAnalyzerWrapper to only run it over the line-doc body field
# A script to compile and jarify the above analyzers
# A benchmark alg running 5 iterations of 10 repetitions of analysis only over 
the line-doc for each language
# A script to find the minimum elapsed time for each combination, and output 
the results as a JIRA table
# A script to run the previous two scripts once for each of three JDK versions
# A script to compare the output of the above script before and after applying 
the attached patch removing Char/ByteBuffer wrapping, and output the result as 
a JIRA table


> remove Byte/CharBuffer wrapping for collation key generation
> ------------------------------------------------------------
>
>                 Key: LUCENE-2084
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2084
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1
>
>         Attachments: collation.benchmark.tar.bz2, LUCENE-2084.patch, 
> LUCENE-2084.patch
>
>
> We can remove the overhead of ByteBuffer and CharBuffer wrapping in 
> CollationKeyFilter and ICUCollationKeyFilter.
> this patch moves the logic in IndexableBinaryStringTools into char[],int,int 
> and byte[],int,int based methods, with the previous Byte/CharBuffer methods 
> delegating to these.
> Previously, the Byte/CharBuffer methods required a backing array anyway.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to