On Tue, Feb 05, 2002 at 08:38:28AM +0000, Nick Ing-Simmons wrote: > Dan Kogai <[EMAIL PROTECTED]> writes:
> Perhaps we make "Build CJK encodings?" a Configure question? > We could determine default based on locale, or (as I once > did for a UK/USA paper size choice) by TZ. > >107853 bytes (112%) saved spotting duplicates > > Probably worth keeping. > > >22801 bytes (23.6%) saved using substrings > > That is where the time goes - there is a loop which uses index() > on all existing strings to see if it can re-use one. > It saves 22K but is that worth while? Then surely this extra searching becomes the configure question? Try harder to compress CJK encodings (this will slow your build considerably)? [no] Unless we find a more efficient algorithm to search for common substrings. Nicholas Clark -- EMCFT http://www.ccl4.org/~nick/CV.html