[ngram] Re: Pb with tokenisation in nsp

2011-01-13 Thread phamhieniol
Hi all, I tried to put the use local; into the .pl file then run with Vietnamese text but the result as follow: cp12 94 40 ,c11 48 94 nng10 62 92 p,10 40 48 vng10 46 92 ngn9 92 62 tng9 76 92 ngc9 92 94 tc9 76 94 ,t7 48 76 nht7 48 76 Bc7 24 94 cv6 94 46 nhng6 48 92 khng6 20 92 cm6

[ngram] Re: Extending huge-count to 3 grams.

2011-01-13 Thread phamhieniol
Hi Cyrus, Can you post the script? I want to give it a try. Best, Hien --- In ngram@yahoogroups.com, Cyrus Shaoul cyrus.shaoul@... wrote: Well, since nobody replied, I just made a trigram counter based on huge-count.pl. It is running now, and seems to work well. If anyone can help me

Re: [ngram] Re: Extending huge-count to 3 grams.

2011-01-13 Thread Ted Pedersen
Hi Hien, I'm happy to report these were included in Text::NSP in the following directory: http://cpansearch.perl.org/src/TPEDERSE/Text-NSP-1.21/bin/utils/contributed/ Please feel free to share any comments or observations you might have. Enjoy! Ted On Wed, Jan 12, 2011 at 10:47 PM,