Hi, Fran.

Thanks for replying!

>> Among the ideas listed on the idea page, I am mostly attracted by:
>> 1."Corpus-based lexicalised feature transfer"
>> 2."Sliding-window part-of-speech tagger"

> Cool, have you looked into doing the coding challenges for either of
those ?

I have looked into the the coding challenge for both of them.
Considering the time limited, I would prefer the 2-nd idea, "Sliding-window
part-of-speech tagger", for it is relatively more independent. The coding
challenge for this idea is to write a filter dealing with the formats.
I will try that [1] and come back later.
Btw, it seems the filter can be done with "sed", or a "apertium-filter" C++
binary is expected ?

[1]
http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Sliding-window_part-of-speech_tagger




>> Also I had Apertium installed, but there
>> seeems to be
>> a PCRE problem as you guys discussed about these days.

> Did you manage to fix it ?

Yes, the advice from Unhammer woks! The PCRE *configure* should take
options "--enable-utf  --enable-unicode-properties".

echo "hello" | apertium -d /usr/local/share/apertium/ en-es
hola

haha:)
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to