Allen wrote:

> My current file is over 3 million words and I plan on running at
> least 10 million words overall in trying to determine the entropy

If your intent is to analyze data of that size, then write a PERL or
Python script to extract and analyze the data.

Unless that PERL/Python script is the macro. Even then, I think stand
alone would be more useful.

> usage by Mark Twain over his life, a minor tweak would make it easy to 
> compile of list of words used in each of his books and

The more interesting statistic is the change in frequency of letter
pairs and letter triplets.

xan

jonathon

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to