ash wrote:
Tim Churches wrote:


The license on it allows re-distribution, I think, so it would be possible to make a much smaller, trimmed-of-fat subset for use as a word processor supplemental dictionary available from a more local server. A nice collaborative project - happy to write a few Python scripts to do the trimming if necessary if someone else works out what sort of files are needed. Target supplementary dictionary formats ought to be OpenOffice and MS-Word - no idea what form for files these can import as spelling dictionaries. Plain text files, I expect.

generally if you look at the *.dic files they render in a text editor like this :

Abidjan
Abigail/M
Abilene/M
ability/EISM
Abingdon
abjection/MS
abjectness
abject/Y
abjuration/M
abjure/RGyNSDn
ablate/GSD
ablation

so my guess is a simple text list with carriage returns could be cut & pasted into a document, and then "save to custom dictionary"
a macro to save the 10,000 mouse clicks required would be an asset though

ash

--------------

Such an endeavour would also be valuable as a resource for those de-identifying clinical casenote information. In that the above list could act as an exclusion list to ensure that common medical terms are not removed from a casenote record *if* similar terms are used for naming a person e.g. Abigail from above.

Andre.

_______________________________________________
Gpcg_talk mailing list
[email protected]
http://ozdocit.org/cgi-bin/mailman/listinfo/gpcg_talk

Reply via email to