Hi, Fran.
Thanks for replying!
>> Among the ideas listed on the idea page, I am mostly attracted by:
>> 1."Corpus-based lexicalised feature transfer"
>> 2."Sliding-window part-of-speech tagger"
> Cool, have you looked into doing the coding challenges for either of
those ?
I have looked into the the coding challenge for both of them.
Considering the time limited, I would prefer the 2-nd idea, "Sliding-window
part-of-speech tagger", for it is relatively more independent. The coding
challenge for this idea is to write a filter dealing with the formats.
I will try that [1] and come back later.
Btw, it seems the filter can be done with "sed", or a "apertium-filter" C++
binary is expected ?
[1]
http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Sliding-window_part-of-speech_tagger
>> Also I had Apertium installed, but there
>> seeems to be
>> a PCRE problem as you guys discussed about these days.
> Did you manage to fix it ?
Yes, the advice from Unhammer woks! The PCRE *configure* should take
options "--enable-utf --enable-unicode-properties".
echo "hello" | apertium -d /usr/local/share/apertium/ en-es
hola
haha:)
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff