>>> I am interested in manually improving the spelling.ssp file. I created
my reference database by importing every pgn I could find into one base,
and then using spellcheck and delete twin games to get rid of duplicates. I
have noticed that spellcheck does not correct some East Asian names (e.g.
Le Quang Liem does not go to Le, Quang Liem) and hyphenated or names with
spaces in them (e.g. DeFirmian, Nick does not got to De Firmian, Nick).
Although I wouldn’t be able to find everything like this, I could correct
the spelling.ssp file when I run into something like this (by finding a
duplicate game not deleted because the names are different).

Yah - improving our spelling correction is on the todo list. But i'm not
sure if the spelling file is too bad. Franz has just released a new version
of it (which i patch a little for release with ScidvsPC).
https://sourceforge.net/projects/scid/files/Player%20Data/Latest%20data/

Re not adding a comma to Asian names, i found this on wikipedia
http://en.wikipedia.org/wiki/Chinese_name
  "According to the Chicago Manual of Style, Chinese names are indexed by
the family name with *no inversion and no comma*"
I am not familiar if (for eg) Vietnamese names follow a similiar convention.

Do you/anyone have any other issues with spelling.ssp ?

We *do* need a list of improvements to be done to improve our spell checker
(I probably would leave chinese names without a comma). Things that come to
mind are names with a comma but no space, and name capitalisation.

> I think I have discovered what seems to be a more important issue. It
seems as though ScidvsMac is not able to correct more than 2000 names at a
time. So, when I run spellcheck player names on my reference database, it
detects 260,000 corrections, but when I hit Make corrections it only
corrects 2000 times.

Yes - this need investigating/addressing. I was not aware of it.

> The spellcheck feature works beautifully, except that it doesn't remove
(wh) and (bl) at the end of the player name, which could be fixed easily by
adding a %Suffix " (wh)" "" line.

Sounds reasonable, though i have never seen these used myself.

Steven
------------------------------------------------------------------------------
_______________________________________________
Scidvspc-users mailing list
Scidvspc-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scidvspc-users

Reply via email to