Those \/ are escaped "/". That is, "d\/v" stands for "d/v" with no suffix
rules, "da/10" stands for "da" with suffix rule number 10.
As for the space, unmunch-wise you can simply ignore anything beyond the
first space/tab of a line. "po:" stands for part of speech (categoría
gramatical), and there are other keys, but they are just supposed to help
Hunspell with suggestions (and for LightProof, the grammar checker by
Hunspell’s author), but they have no effect whatsoever in the form of the
word. So, for unmunching purposes, you can imagine that "da/10
po:contracción" is just "da/10".
2014-11-15 16:32 GMT+01:00 Marco A.G.Pinto <marcoagpi...@mail.telepac.pt>:
> Ruud,
>
> I have downloaded Dutch from Mozilla and the continuation flags are very
> complex and hard to understand.
>
> I have also downloaded Galician from Mozilla and it seems a bit easier...
>
> But, in Galician, what is a:
> *d\/v* ; it uses \ and /
> also, I haven't seen in ASCII yet, but the space separating the words and
> the tags are chr 32 or different?:
> *da/10 po:contracción*
> In Portuguese they used a TAB chr or such... if it is not a space and I
> change to a tab chr, will it still work?
>
> :-P
>
> Thanks for your time!
>
> Kind regards,
> >Marco A.G.Pinto
> ----------------------
>
>
>
> On 15/11/2014 14:14, R.J. Baars wrote:
>
> In the AFFIX file, default flag is just 1 char.
> When the clause FLAG num is in the file, the flags are numbers in the
> 2-byte range, from 1 upt to 65535, separated by a comma (1,2,3,4,555)
>
> When the clause FLAG long is in the file, the flags are two chars long,
> which also translates into 2 bytes internatlly (just plains ascii chars
> allowed), but there is no separation.
>
> Flags might be like
> Word/AaBbCcDD , actually the flags Aa and Bb etc.
>
> So I guess it might be best to use a double byte internally, translating
> all flags to that.
>
> Then there are multiple compounding methods.
>
> One using COMPOUNDRULE (mostly for numbers etc, very systematic patterns)
>
> COMPOUNDRULE 39
> COMPOUNDRULE (N1)(n2) # eenen+zestig[ste]
> COMPOUNDRULE (G1)*(Le) # 1e - 9999e
> ...
>
> # general compounding, for normal compounding mechanisms. Beware: prefixes
> only apply to 'first' and suffices to 'last'
> COMPOUNDBEGIN Ca
> COMPOUNDMIDDLE Cb
> COMPOUNDEND Cc
> COMPOUNDPERMITFLAG Cp
> ONLYINCOMPOUND Cx
>
>
> The other mechanism is using continuations flags:
> SFX CA Y 2
> SFX CA 0 /CaCp
> SFX CA 0 -/CaCp
>
> (Flag CA creates the option to add words with flag Ca, eiterh wth a - in
> front of it, or not)
>
> You could reduce some ot the complexity by ignoring all 'filters, like
> 'checkcompoundpattern, onlyincompound etc', because after generating the
> huge outcom, you could apply Hunspell with the same dic and affixe with
> the command line option -G te get the correct words from the entire list.
>
> The hunspell used had better be > 1.3 then, since 1.2* has a bug,
> suggestion mistaken compound words.
>
> Does this help?
>
> Ruud
>
>
>
>
>
> Hello,
>
> I didn't go to work today because I was not feeling well.
>
> I have decided to dedicate some time today to improve PTG in order for
> it also to unmunch .DICs with numbers instead of characters.
>
> Daniel and Ruud, could you explain to me in detail how to detect if the
> .AFF deals with chrs or numbers?
>
> Also, could you provide the troubling dictionary for me to analyse and
> test?
>
> I have been swamped with work and only in January I will be on vacation,
> but I will try to do my best!
>
> Thanks!
>
> Kind regards from your friend,
> >Marco A.G.Pinto
> ----------------------
>
>
> --
> ------------------------------------------------------------------------------
> Comprehensive Server Monitoring with Site24x7.
> Monitor 10 servers for $9/Month.
> Get alerted through email, SMS, voice calls or mobile push notifications.
> Take corrective actions from your mobile
> device.http://pubads.g.doubleclick.net/gampad/clk?id=154624111&iu=/4140/ostg.clktrk_______________________________________________
> Languagetool-devel mailing
> listLanguagetool-devel@lists.sourceforge.nethttps://lists.sourceforge.net/lists/listinfo/languagetool-devel
>
>
>
> ------------------------------------------------------------------------------
> Comprehensive Server Monitoring with Site24x7.
> Monitor 10 servers for $9/Month.
> Get alerted through email, SMS, voice calls or mobile push notifications.
> Take corrective actions from your mobile
> device.http://pubads.g.doubleclick.net/gampad/clk?id=154624111&iu=/4140/ostg.clktrk
> _______________________________________________
> Languagetool-devel mailing
> listLanguagetool-devel@lists.sourceforge.nethttps://lists.sourceforge.net/lists/listinfo/languagetool-devel
>
>
>
> --
>
>
> ------------------------------------------------------------------------------
> Comprehensive Server Monitoring with Site24x7.
> Monitor 10 servers for $9/Month.
> Get alerted through email, SMS, voice calls or mobile push notifications.
> Take corrective actions from your mobile device.
>
> http://pubads.g.doubleclick.net/gampad/clk?id=154624111&iu=/4140/ostg.clktrk
> _______________________________________________
> Languagetool-devel mailing list
> Languagetool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel
>
>
------------------------------------------------------------------------------
Comprehensive Server Monitoring with Site24x7.
Monitor 10 servers for $9/Month.
Get alerted through email, SMS, voice calls or mobile push notifications.
Take corrective actions from your mobile device.
http://pubads.g.doubleclick.net/gampad/clk?id=154624111&iu=/4140/ostg.clktrk
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel