Gang,
great stuff; I haven't checked it exhaustively but as far as I am
testing it seems to behave as expected.
Now it is time to move on to preparing your application. For that, you
will have to study the current .tsx format and make sense of it, as your
tagger will use exactly that format.
Forbid rules can be applied to the input text before actually training
or running the tagger. You will also need to find a good way to store
probabilities or turn them into rules which can be read and perhaps
edited using linguistic knowledge.
Please do not hesitate to ask any questions to me or to the list.
Best,
Mikel
Al 04/21/2013 05:36 PM, En/na Gang Chen ha escrit:
Hi, Mikel,
Sorry for the inconvenience. I found there was a little include bug in
that version. No wonder you couldn't compile it.
I have fixed the bug, implemented the second task *roundtrip
converter*, added a Makefile and a detailed README to the project, so
it would be easier to test the code:)
The code files are here:
https://github.com/elephantgcc/CodingChallenge
Best,
Gang
2013/4/21 Mikel Forcada <[email protected] <mailto:[email protected]>>
Gang,
could you give a Makefile for it? I cannot just simply compile it
with a g++ command — I'd need to know the libraries, and I don't
want to work that hard ;-)
It would be really really nice if you could provide also code for
a roundtrip converter reading the output of this one and
regenerating the input, as this is something you would have to
deal with anyway when you write the tagger.
All the best
Mikel
--
Mikel L. Forcada (http://www.dlsi.ua.es/~mlf/ <http://www.dlsi.ua.es/%7Emlf/>)
Departament de Llenguatges i Sistemes Informàtics
Universitat d'Alacant
E-03071 Alacant, Spain
Phone:+34 96 590 9776 <tel:%2B34%2096%20590%209776>
Fax:+34 96 590 9326 <tel:%2B34%2096%20590%209326>
a
--
Mikel L. Forcada (http://www.dlsi.ua.es/~mlf/)
Departament de Llenguatges i Sistemes Informàtics
Universitat d'Alacant
E-03071 Alacant, Spain
Phone: +34 96 590 9776
Fax: +34 96 590 9326
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff