Regarding regular expressions, I am not sure it makes any practical
difference for the pairs in question (Nordic languages) whether the new
Java 7 Unicode support is available or not.
If it does make a difference then, well, Java is GPL, so we can copy code
as we please :-)
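To make the question concrete, here is a small sketch of where a difference could show up. By default \w in java.util.regex only covers ASCII letters, digits and underscore, so a word containing å, æ or ø does not match \w+ unless Pattern.UNICODE_CHARACTER_CLASS (added in Java 7) is set. \p{L} also works on older Java versions, so patterns written that way would not be affected. The class and method names are made up for illustration, and I have not checked which form our patterns actually use.

```java
import java.util.regex.Pattern;

// Hypothetical helper class, only meant to illustrate the difference.
class NordicRegexCheck {

    // Default \w is ASCII-only: [a-zA-Z_0-9].
    static boolean asciiWord(String s) {
        return Pattern.compile("\\w+").matcher(s).matches();
    }

    // Java 7+: \w follows the Unicode definition of a word character.
    static boolean unicodeWord(String s) {
        return Pattern.compile("\\w+", Pattern.UNICODE_CHARACTER_CLASS)
                      .matcher(s).matches();
    }

    // Also works before Java 7: match any Unicode letter explicitly.
    static boolean letterProperty(String s) {
        return Pattern.compile("\\p{L}+").matcher(s).matches();
    }
}
```

For a word like "blåbær", asciiWord rejects it while unicodeWord and letterProperty accept it, so the flag only matters where a pattern relies on \w and similar shorthand classes.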
2012/8/11 Per Tunedal <[email protected]>
> Hi,
> Yes, the update works OK. I didn't believe my eyes when the da-sv
> alternative had disappeared!
>
Yes.
da-sv doesn't work:
Dansk til svensk virker ikke
^Dansk/Dansk<adj><pst><un><sg><ind>/Dansk<n><nt><sg><ind><nom>$
^til/til<adv>/til<pr>/til<cnjadv>$
^svensk/svensk<adj><pst><un><sg><ind>/svensk<n><nt><sg><ind><nom>$
^virker/virke<vblex><pres><actv>$ ^ikke/ikke<adv>$
^Dansk<n><nt><sg><ind><nom>$ ^til<pr>$ ^svensk<n><nt><sg><ind><nom>$
^virke<vblex><pres><actv>$ ^ikke<adv>$
^Danska<n><ut><sg><ind><nom>$ ^till<pr>$ ^svenska<n><ut><sg><ind><nom>$
^@virke<vblex><pres><actv>$ ^icke<adv>/inte<adv>$
Danska till svenska \@virke #icke
but sv-da does:
Svensk till danska fungerar bra
^Svensk/Svensk<adj><pst><ut><sg><ind>$ ^till/till<cnjadv>/till<pr>$
^danska/dansk<adj><pst><un><pl><ind>/dansk<adj><pst><un><sp><def>/danska<n><ut><sg><ind><nom>$
^fungerar/fungera<vblex><pres><actv>$ ^bra/bra<adv>$
^Svensk<adj><pst><ut><sg><ind>$ ^till<pr>$ ^danska<n><ut><sg><ind><nom>$
^fungera<vblex><pres><actv>$ ^bra<adv>$
^Svensk<adj><pst><un><sg><ind>$ ^til<pr>$ ^dansk<n><nt><sg><ind><nom>$
^fungere<vblex><pres><actv>$ ^godt<adv>$
Svensk til dansk fungerer godt
>
> Well, anyhow I might have some use for it, when testing similarities
> between the Scandinavian languages. I guess I can use that translation
> direction some other way?
>
That being said, even if a language pair doesn't work satisfactorily enough
for a release, that doesn't mean that people won't find it useful to play
around with.
It would be kind of nice to have da->sv available, but with a proper warning
that it's unreleased (perhaps show a beta sign beside it). Users would need
to enable unreleased pairs in the settings before they appeared.
And with Mikel's new support for external programs, I feel that we need to
rethink the language pair list configuration file (/builds/language-pairs)
a little.
I think there should be a column with keywords, where 'unreleased' would
mean that it is an unreleased direction (like da->sv) and 'cg' would mean
that the pair depends on CG being available. We could add keywords to this
column as we encounter new requirements, like HFST.
Clients should discard lines with keywords they don't support.
Note that this would require duplicating some lines. For example, after
apertium-sv-da
https://apertium.svn.sourceforge.net/svnroot/apertium/builds/apertium-sv-da/apertium-sv-da.jar
file:apertium-sv-da-0.5.0.tar.gz sv-da
we would have an extra line with
apertium-sv-da
https://apertium.svn.sourceforge.net/svnroot/apertium/builds/apertium-sv-da/apertium-sv-da.jar
file:apertium-sv-da-0.5.0.tar.gz
*da-sv unreleased*
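As a rough sketch of the client side, assuming the fields are separated by whitespace and that everything after the fourth field is a keyword (both are assumptions, since the format is only a proposal so far), a client could filter the list like this. The class, field and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch of keyword filtering for the language pair list.
class PairListFilter {

    // One parsed line: package, jar URL, source archive, mode, keywords.
    static final class Entry {
        final String pkg, jarUrl, source, mode;
        final Set<String> keywords;

        Entry(String pkg, String jarUrl, String source, String mode,
              Set<String> keywords) {
            this.pkg = pkg;
            this.jarUrl = jarUrl;
            this.source = source;
            this.mode = mode;
            this.keywords = keywords;
        }
    }

    // Returns null for blank or malformed lines (fewer than four fields).
    static Entry parseLine(String line) {
        String[] f = line.trim().split("\\s+");
        if (f.length < 4) {
            return null;
        }
        Set<String> kw =
            new HashSet<String>(Arrays.asList(f).subList(4, f.length));
        return new Entry(f[0], f[1], f[2], f[3], kw);
    }

    // Keeps only entries whose keywords are all supported by this client.
    static List<Entry> usableEntries(List<String> lines,
                                     Set<String> supported) {
        List<Entry> out = new ArrayList<Entry>();
        for (String line : lines) {
            Entry e = parseLine(line);
            if (e != null && supported.containsAll(e.keywords)) {
                out.add(e);
            }
        }
        return out;
    }
}
```

A client that knows nothing about keywords passes an empty supported set and only sees lines without keywords, such as the sv-da line. A client where the user has enabled unreleased pairs adds "unreleased" to the set and then also gets the da-sv line.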
Mikel, this is just an idea. Implement as you please :-)
--
Jacob Nordfalk <http://profiles.google.com/jacob.nordfalk>
javabog.dk
Android developer and instructor at
IHK <http://cv.ihk.dk/diplomuddannelser/itd/vf/MAU> and
Lund&Bendsen <https://www.lundogbendsen.dk/undervisning/beskrivelse/LB1809/>
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff