Hi Gang,
your code seems to work correctly, at least in the few tests I have
performed. There is only one thing that I didn't like: the program
silently exits unless one of the options -r/-f is given. It should
report an error instead.
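One way this could be done is sketched below, assuming the script uses Python's standard argparse module and that -r and -f are plain on/off flags (I have not checked how the script actually parses them, and the function name parse_options and the help texts are only placeholders). A required mutually exclusive group makes argparse print a usage message and exit with status 2 when neither option is given:

```python
import argparse


def parse_options(argv):
    """Parse command-line options, failing loudly if neither -r nor -f is given.

    Assumption: -r and -f are boolean flags and only one may be used at a time.
    """
    parser = argparse.ArgumentParser(description="Coding challenge filter (sketch)")
    # required=True makes argparse report an error instead of exiting silently.
    mode = parser.add_mutually_exclusive_group(required=True)
    mode.add_argument("-r", action="store_true", help="placeholder help for -r")
    mode.add_argument("-f", action="store_true", help="placeholder help for -f")
    return parser.parse_args(argv)
```

Calling parse_options(sys.argv[1:]) at the start of the script would then be enough; if both options should be allowed together, an explicit check followed by parser.error("...") would serve the same purpose.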
As Jimmy O'Regan says in his message, and even if it is not a
requirement, we do need to see if you could prepare a C++ version of the
coding challenge, as this is the language that is going to be used for
the sliding-window part-of-speech tagger. Can you do that, Gang?
As to the intuitive idea, well, the sliding-window PoS tagger is
different from an HMM PoS tagger, but I would not say that it can use
"more information". In fact, if both the previous and the following word
are ambiguous, an HMM PoS tagger is actually using a wider context, as
it will not make a decision until a nonambiguous word appears.
It is just a different way of tagging. We suspect it uses more
parameters, but it can easily be turned into a finite-state transducer
after training.
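To make the fixed-context point concrete, here is a minimal sketch of a sliding-window decision, assuming a window of one word on each side. The function name, the tag names, the boundary marker and the scores are all hypothetical; real scores would come from training, and this is not taken from any Apertium code. Each word is represented by its ambiguity class (the set of tags it may take), and the tag is chosen by looking only at the ambiguity classes of its immediate neighbours:

```python
def tag_sentence(ambiguity_classes, scores):
    """Pick one tag per word using only the neighbouring ambiguity classes.

    ambiguity_classes: list of frozensets of candidate tags, one per word.
    scores: dict mapping (left_class, tag, right_class) to a number;
            unseen contexts are treated as 0.0 (an assumption of this sketch).
    """
    boundary = frozenset({"<s>"})  # hypothetical sentence-boundary class
    padded = [boundary] + list(ambiguity_classes) + [boundary]
    result = []
    for i in range(1, len(padded) - 1):
        left, current, right = padded[i - 1], padded[i], padded[i + 1]
        # sorted() only makes tie-breaking deterministic.
        best = max(sorted(current), key=lambda tag: scores.get((left, tag, right), 0.0))
        result.append(best)
    return result


# Toy example with made-up scores.
DET = frozenset({"DET"})
NOUN_OR_VERB = frozenset({"NOUN", "VERB"})
VERB = frozenset({"VERB"})
toy_scores = {
    (DET, "NOUN", VERB): 0.9,
    (DET, "VERB", VERB): 0.1,
}
tags = tag_sentence([DET, NOUN_OR_VERB, VERB], toy_scores)  # ["DET", "NOUN", "VERB"]
```

Note that the decision for each word never looks beyond its two neighbours, even when they are themselves ambiguous, which is where it differs from the HMM case described above.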
We are also interested in the way you can introduce restrictions
(FORBID) in the tagger.
I look forward to hearing from you.
Best
Mikel
On 04/20/2013 05:37 PM, Gang Chen wrote:
hi, Mlforcada, Fran,
I am Gang Chen. I have a great interest in the Apertium GSOC-2013
project "Sliding-window part-of-speech tagger".
After talking to Fran and other mentors these days and viewing the
wiki pages in Apertium, I think I have a better understanding of the
platform and the project background:)
I've been reading the recommended paper recently. I think the idea
is based on the strong intuition that a sliding window can employ much
more information (both left and right), which helps disambiguation,
than a traditional HMM tagger. And unsupervised training allows for
even wider application.
I've done the coding challenge for this idea, with the code here:
https://github.com/elephantgcc/gsoc-2013/blob/master/ApertiumFilter.py
I am not sure whether I have fully understood the coding
challenge, and I look forward to your comments:)
Thank you.
Best wishes,
Gang Chen
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
--
Mikel L. Forcada (http://www.dlsi.ua.es/~mlf/)
Departament de Llenguatges i Sistemes Informàtics
Universitat d'Alacant
E-03071 Alacant, Spain
Phone: +34 96 590 9776
Fax: +34 96 590 9326