On Tue, Feb 05, 2002 at 08:38:28AM +0000, Nick Ing-Simmons wrote:
> Dan Kogai <[EMAIL PROTECTED]> writes:

> Perhaps we make "Build CJK encodings?" a Configure question?
> We could determine default based on locale, or (as I once
> did for a UK/USA paper size choice) by TZ.

> >107853 bytes (112%) saved spotting duplicates
> 
> Probably worth keeping.
> 
> >22801 bytes (23.6%) saved using substrings
> 
> That is where the time goes - there is a loop which uses index()
> on all existing strings to see if it can re-use one.
> It saves 22K but is that worth while?

Then surely this extra searching becomes the configure question?

  Try harder to compress CJK encodings (this will slow your build considerably)?
  [no]


Unless we find a more efficient algorithm to search for common substrings.

Nicholas Clark
-- 
EMCFT http://www.ccl4.org/~nick/CV.html

Reply via email to