Re: Documentation Brainstorming

2007-05-25 Thread Bob Carpenter
just a String and deprecate the one with Reader? Or at least provide an alternative for the usual case of not having a reader. Maybe I'm just missing something here, but I don't think it's scaling to streaming input that'd overflow memory. - Bob Carpenter Alias-i - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Refactored FuzzyTermEnum

2006-06-13 Thread Bob Carpenter
happy with the Apache license, so that shouldn't be a problem. I also don't know whether you use tabs or spaces -- I untabified the final version and used your two-space format in emacs. - Bob Carpenter package org.apache.lucene.search; /** * Copyright 2004 The Apache Software Founda

Re: Edit-distance strategy (slicing and one vs. all algorithms)

2006-06-09 Thread Bob Carpenter
gamma-coding, specifically) as reverse-indexes for search engines, and yields a very tight memory representation with good merging properties. I've basically adopted Lucene's strategy of on-disk representations writing out to a mergeable streaming format that can scale on disk (o

Re: Edit-distance strategy (slicing and one vs. all algorithms)

2006-06-08 Thread Bob Carpenter
tching against a dictionary. A bonus is that you get to learn about really cool biology sequencing applications and the *sub-linear* matching possible with exclusion techniques (mind-bogglingly clever algorithms). - Bob Carpenterhttp://www.colloquial.com/carp Alias-i http://