A user reports a bug in rank.pl. This seems to occur when dealing with
smaller files, for example...

marimba(49): more x
first<>bigram<>1 4.000 1 1
second<>bigram<>2 3.000 2 2
extra<>bigram1<>3 2.000 3 3
third<>bigram<>4 1.000 4 4

marimba(50): more y
second<>bigram<>1 4.000 2 2
extra<>bigram2<>2 3.000 4 4
first<>bigram<>3 2.000 1 1
third<>bigram<>4 1.000 3 3


New version (0.03)
marimba(51): rank.pl x y
Illegal division by zero at /usr/local/bin/rank.pl line 397.

Old version (0.01)
marimba(52): perl ./rank.pl x y
Rank correlation coefficient = 0.5000

There are also cases there rank.pl will report, falsely, that there
are no ngrams in common between the input files. Again, this seems to
occur with smaller files.

We are checking into this, and if you've observed anything similar
please do let us know!

Cordially,
Ted

Reply via email to