Hi Gang,
your code seems to work correctly, at least in the few tests I have performed. There is only one thing I didn't like: the program exits silently unless one of the options -r/-f is given. It should report an error instead.
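
To make the point concrete, here is a minimal sketch of the behaviour I have in mind. It assumes the script parses its options with argparse, which I have not checked; only the flag names -r and -f come from your program, and the function names and error message are hypothetical.

```python
import argparse


def build_parser():
    # Only the flag names -r and -f come from the script under discussion;
    # their meaning is not assumed here, so they are plain on/off switches.
    parser = argparse.ArgumentParser(prog="ApertiumFilter.py")
    parser.add_argument("-r", action="store_true", help="first mode (placeholder)")
    parser.add_argument("-f", action="store_true", help="second mode (placeholder)")
    return parser


def parse_args(argv=None):
    """Parse the command line and refuse to continue without -r or -f."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if not (args.r or args.f):
        # parser.error() prints the usage line plus this message to stderr
        # and exits with status 2, so the user is told what went wrong.
        parser.error("one of the options -r/-f is required")
    return args
```

Calling parse_args() at the start of the program would then turn the silent exit into a usage message and a non-zero exit status.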

As Jimmy O'Regan says in his message, and even though it is not a requirement, we do need to see whether you could prepare a C++ version of the coding challenge, as this is the language that will be used for the sliding-window part-of-speech tagger. Can you do that, Gang?

As to the intuitive idea, well, the sliding-window PoS tagger is different from an HMM PoS tagger, but I would not say that it can use "more information". In fact, if both the previous and the following word are ambiguous, an HMM PoS tagger is actually using a wider context, as it will not make a decision until a nonambiguous word appears.

It is just a different way of tagging. We suspect it uses more parameters, but it can easily be turned into a finite-state transducer after training.
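
As a rough illustration of the tagging step only (not of the training, and not the actual Apertium implementation), the sketch below picks the tag of a word by looking up the ambiguity classes of its left and right neighbours in a table. The function name, the tag names and all the scores are made up for the example, and the window is assumed to be one word on each side.

```python
def tag_window(left_class, word_class, right_class, scores):
    """Choose a tag for the middle word of a three-word window.

    Each *_class argument is an ambiguity class: the frozenset of tags a
    word may take. `scores` maps (left class, right class) to a dict of
    tag -> score. The decision is a pure table lookup on a fixed window.
    """
    context_scores = scores.get((left_class, right_class), {})
    # Only tags allowed by the word's own ambiguity class are candidates;
    # sorting makes ties (and unseen contexts) resolve deterministically.
    return max(sorted(word_class), key=lambda tag: context_scores.get(tag, 0.0))


# Toy ambiguity classes and scores, invented for illustration.
DET = frozenset({"det"})
VERB = frozenset({"verb"})
NOUN_VERB = frozenset({"noun", "verb"})

scores = {
    (DET, VERB): {"noun": 0.9, "verb": 0.1},
}

chosen = tag_window(DET, NOUN_VERB, VERB, scores)  # "noun" with these toy scores
```

Because the choice depends only on a bounded window of ambiguity classes, a table like this is the kind of thing that can be compiled into a finite-state transducer. Note that when a neighbour is itself ambiguous, the lookup still sees only its class, which is the sense in which an HMM may end up using a wider context.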

We are also interested in the way you can introduce restrictions (FORBID) in the tagger.

I look forward to hearing from you.

Best

Mikel


On 04/20/2013 05:37 PM, Gang Chen wrote:
hi, Mlforcada, Fran,

I am Gang Chen. I have a great interest in the Apertium GSOC-2013 project "Sliding-window part-of-speech tagger". After talking to Fran and other mentors these days and viewing the wiki pages in Apertium, I think I have a better understanding of the platform and the project background:)

I've been reading the recommended paper recently. I think the idea is based on the strong intuition that a sliding window can use much more information (both left and right context) than a traditional HMM tagger, which helps disambiguation. And unsupervised training allows it to be applied even more widely.

I've done the coding challenge for this idea, with the code here:
https://github.com/elephantgcc/gsoc-2013/blob/master/ApertiumFilter.py
I am not sure whether I have fully understood the coding challenge, and I look forward to your comments :)

Thank you.


Best wishes,
Gang Chen




_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff


--
Mikel L. Forcada (http://www.dlsi.ua.es/~mlf/)
Departament de Llenguatges i Sistemes Informàtics
Universitat d'Alacant
E-03071 Alacant, Spain
Phone: +34 96 590 9776
Fax: +34 96 590 9326
