Ruud,

I have downloaded Dutch from Mozilla and the continuation flags are very complex and hard to understand.

I have also downloaded Galician from Mozilla and it seems a bit easier...

But, in Galician, what does an entry like this mean?
*d\/v* ; it uses both \ and /
Also, I haven't checked the ASCII codes yet, but is the character separating the words from the tags chr 32 or something different?
*da/10 po:contracción*
In Portuguese they used a TAB chr or similar... If it is not a space and I change it to a TAB chr, will it still work?

:-P

Thanks for your time!

Kind regards,
      >Marco A.G.Pinto
        ----------------------


On 15/11/2014 14:14, R.J. Baars wrote:
In the AFFIX file, the default flag is just 1 char.
When the clause FLAG num is in the file, the flags are numbers in the
2-byte range, from 1 up to 65535, separated by commas (1,2,3,4,555).

When the clause FLAG long is in the file, the flags are two chars long,
which also translates into 2 bytes internally (only plain ASCII chars
allowed), but there is no separator.

Flags might look like
Word/AaBbCcDD, which actually means the flags Aa, Bb, etc.

So I guess it might be best to use a double byte internally, translating
all flags to that.
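
A minimal sketch of that translation, in Python. The function name
parse_flags and the mode names "short", "num" and "long" are only
illustrative, and packing the two chars of a long flag as
(first << 8) | second is one possible choice, not necessarily what
Hunspell does internally.

```python
def parse_flags(flag_field: str, mode: str = "short") -> list[int]:
    """Turn the flag part of a .dic entry (the text after '/') into 16-bit ints.

    mode is a hypothetical label: "short" (default, one char per flag),
    "num" (FLAG num) or "long" (FLAG long).
    """
    if mode == "num":
        # Decimal numbers separated by commas, e.g. "1,2,3,4,555".
        flags = [int(part) for part in flag_field.split(",") if part.strip()]
    elif mode == "long":
        # Two ASCII chars per flag, no separator, e.g. "AaBbCcDD".
        if len(flag_field) % 2 != 0:
            raise ValueError("FLAG long needs an even number of chars")
        flags = [
            (ord(flag_field[i]) << 8) | ord(flag_field[i + 1])
            for i in range(0, len(flag_field), 2)
        ]
    else:
        # Default: every single char is one flag.
        flags = [ord(ch) for ch in flag_field]

    for flag in flags:
        if not 1 <= flag <= 65535:
            raise ValueError(f"flag {flag} does not fit in 2 bytes")
    return flags


print(parse_flags("1,2,3,4,555", "num"))
print(parse_flags("AaBbCcDD", "long"))
print(parse_flags("AB"))
```

With this, all three notations end up as plain integers, so the rest of
the tool only has to compare numbers.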

Then there are multiple compounding methods.

One using COMPOUNDRULE (mostly for numbers etc, very systematic patterns)

COMPOUNDRULE 39
COMPOUNDRULE (N1)(n2)           # eenen+zestig[ste]
COMPOUNDRULE (G1)*(Le)          # 1e - 9999e
...

# General compounding, for normal compounding mechanisms. Beware: prefixes
# only apply to 'first' and suffixes to 'last'
COMPOUNDBEGIN Ca
COMPOUNDMIDDLE Cb
COMPOUNDEND Cc
COMPOUNDPERMITFLAG Cp
ONLYINCOMPOUND Cx


The other mechanism is using continuation flags:
SFX CA Y 2
SFX CA 0 /CaCp
SFX CA 0 -/CaCp

(Flag CA creates the option to add words with flag Ca, either with a - in
front of it, or not)

You could reduce some of the complexity by ignoring all 'filters', like
checkcompoundpattern, onlyincompound etc., because after generating the
huge output, you could apply Hunspell with the same dic and affix files with
the command line option -G to get the correct words from the entire list.
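
A rough sketch of that filtering step in Python, assuming the hunspell
binary is on the PATH and that the dictionary is passed with -d as a
base name without extension. The helper names and the example base name
nl_NL are hypothetical, and the input encoding is left to the locale
defaults.

```python
import subprocess


def build_hunspell_filter_command(dict_base: str) -> list[str]:
    """Command line that makes hunspell print only the correct words (-G)."""
    return ["hunspell", "-d", dict_base, "-G"]


def keep_correct_words(words: list[str], dict_base: str) -> list[str]:
    """Feed the over-generated word list to hunspell and keep what it accepts."""
    result = subprocess.run(
        build_hunspell_filter_command(dict_base),
        input="\n".join(words) + "\n",  # one candidate word per line on stdin
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.splitlines()


if __name__ == "__main__":
    # Hypothetical usage: "nl_NL" stands for nl_NL.dic / nl_NL.aff.
    print(build_hunspell_filter_command("nl_NL"))
```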

The Hunspell used had better be > 1.3 then, since 1.2* has a bug,
suggesting mistaken compound words.

Does this help?

Ruud




Hello,

I didn't go to work today because I was not feeling well.

I have decided to dedicate some time today to improving PTG so that it
can also unmunch .DICs with numbers instead of characters.

Daniel and Ruud, could you explain to me in detail how to detect if the
.AFF deals with chrs or numbers?
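
One possible way to check this, as a small Python sketch: look for a
FLAG line in the .AFF and fall back to single chars when there is none.
The function name detect_flag_mode and the return value "short" for the
default case are only illustrative.

```python
def detect_flag_mode(aff_text: str) -> str:
    """Return the FLAG type named in the .aff text, or "short" if none is set.

    "num" means numeric flags, "long" means two-char flags; any other
    value found after FLAG is returned lower-cased as-is.
    """
    for line in aff_text.splitlines():
        # Drop a possible byte-order mark and split on spaces or tabs.
        fields = line.lstrip("\ufeff").split()
        if len(fields) >= 2 and fields[0] == "FLAG":
            return fields[1].lower()
    # No FLAG line: every flag is a single char.
    return "short"


print(detect_flag_mode("SET UTF-8\nFLAG num\nSFX 1 Y 1\n"))
print(detect_flag_mode("SET UTF-8\nTRY abc\n"))
```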

Also, could you provide the troubling dictionary for me to analyse and
test?

I have been swamped with work and will only be on vacation in January,
but I will try to do my best!

Thanks!

Kind regards from your friend,
          >Marco A.G.Pinto
            ----------------------


--
------------------------------------------------------------------------------
Comprehensive Server Monitoring with Site24x7.
Monitor 10 servers for $9/Month.
Get alerted through email, SMS, voice calls or mobile push notifications.
Take corrective actions from your mobile device.
http://pubads.g.doubleclick.net/gampad/clk?id=154624111&iu=/4140/ostg.clktrk
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel


