On Mon, Mar 10, 2014 at 4:59 PM, Alex Aruj <[email protected]> wrote:
> Hi again,
>
...
>
> What are the hashes and global hash refs? Are these the "parameters" for
> character replacement? I see 5 listed at line 287 of "sf.pl", but are there
> others like the two word lists given for training a language pack?

Yes. those hashes contain all the parameters you need for diacritic
restoration.  In fact, the file tr-probs.txt in the charlifter-tr
package is simply a dump of those five hashes as a plain text file.
Between the awesome detailed comments I included for each of those
hashes in the sf.pl source, and examining the tr-probs.txt in a text
editor, it should be easy to figure out what's going on.

The word lists that are read in at training time are effectively
stored in the tableref hash, so we don't need separate data structures
for them.

kps

------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and their
applications. Written by three acclaimed leaders in the field,
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/13534_NeoTech
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to