Thanks Ian, nice improvement! Could you open a Lucene JIRA issue and attach a patch? (That helps with other things, such as IP tracking.)
-Yonik

On 4/29/07, Ian Boston <[EMAIL PROTECTED]> wrote:
Hi,

We've been using this filter in a project and found it a bit slow, so we've rewritten it. In tests on a 5M string with no accents, the version in trunk took about 200ms and this version takes about 12ms. If there are accents, it's not quite as good, at about 60ms. (Timings on a MacBook Pro.)

Code is at https://saffron.caret.cam.ac.uk/svn/projects/darwincorresp/trunk/darwin-analyzer/src/main/java/uk/ac/cam/caret/darwin/lucene/ISOLatin1AccentFilter.java

BTW, just replacing StringBuffer with StringBuilder in the trunk version got it down to about 80ms for both cases.

Ian
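For readers without access to the linked file, here is a minimal sketch of the kind of change described above. It is not the code at that URL, and the class and method names (AccentFoldSketch, fold, appendFolded) are made up for illustration. It assumes the speedup comes from two things: scanning for the first accented character and returning the input untouched if there is none, and using StringBuilder rather than StringBuffer when a copy is needed. The mapping covers only some of the Latin-1 accented characters.

```java
// Hypothetical sketch, not the ISOLatin1AccentFilter from the message above.
final class AccentFoldSketch {
    private AccentFoldSketch() {}

    /** Folds some Latin-1 accented characters to ASCII; returns the same instance if none are present. */
    static String fold(String input) {
        final int len = input.length();

        // Fast path: find the first character in the accented range U+00C0..U+00FF.
        int i = 0;
        while (i < len) {
            char c = input.charAt(i);
            if (c >= '\u00C0' && c <= '\u00FF') {
                break;
            }
            i++;
        }
        if (i == len) {
            return input; // nothing to fold, so no allocation and no copy
        }

        // Slow path: copy the clean prefix in one call, then fold the rest.
        StringBuilder out = new StringBuilder(len + 8);
        out.append(input, 0, i);
        for (; i < len; i++) {
            appendFolded(out, input.charAt(i));
        }
        return out.toString();
    }

    // Partial mapping (an assumption for this sketch); anything not listed is copied unchanged.
    private static void appendFolded(StringBuilder out, char c) {
        if (c < '\u00C0' || c > '\u00FF') out.append(c);
        else if (c <= '\u00C5') out.append('A');
        else if (c == '\u00C6') out.append("AE");
        else if (c == '\u00C7') out.append('C');
        else if (c >= '\u00C8' && c <= '\u00CB') out.append('E');
        else if (c >= '\u00CC' && c <= '\u00CF') out.append('I');
        else if (c == '\u00D1') out.append('N');
        else if (c >= '\u00D2' && c <= '\u00D6') out.append('O');
        else if (c >= '\u00D9' && c <= '\u00DC') out.append('U');
        else if (c == '\u00DF') out.append("ss");
        else if (c >= '\u00E0' && c <= '\u00E5') out.append('a');
        else if (c == '\u00E6') out.append("ae");
        else if (c == '\u00E7') out.append('c');
        else if (c >= '\u00E8' && c <= '\u00EB') out.append('e');
        else if (c >= '\u00EC' && c <= '\u00EF') out.append('i');
        else if (c == '\u00F1') out.append('n');
        else if (c >= '\u00F2' && c <= '\u00F6') out.append('o');
        else if (c >= '\u00F9' && c <= '\u00FC') out.append('u');
        else out.append(c);
    }
}
```

With this shape, accent-free input costs only a read-only scan, which would be consistent with the no-accent case being the fastest. StringBuilder is the unsynchronized counterpart of StringBuffer (Java 5 and later), so it avoids per-call locking in single-threaded code like this.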
