On Tue, 18 Mar 2014 at 02:24 -0300, Roque Lopez wrote:
> Hello Apertium team,
> 
> First, congratulations on the great work on the Apertium project. I
> hope to be part of your team this summer.
> 
> I am Roque López, a master's student in Computer Science at São Paulo
> University, Brazil (USP). I have a strong interest in Natural Language
> Processing, which is why I am doing my master's on Opinion
> Summarization at NILC (Núcleo Interinstitucional de Linguística
> Computacional, http://www.nilc.icmc.usp.br/nilc/index.php).
> 
> Of the many ideas listed on the GSoC page, I am very interested in
> "Improving support for non-standard text input" because it is
> directly related to my research topic.
> 
> I have already talked to Francis on the IRC channel and, thanks to his
> advice, I have finished the Coding Challenge, which is in my GitHub
> repo:
> https://github.com/rlopezc27/Coding_challenge_Apertium2014
> 
> In the next few days (today or tomorrow) I will be sending my proposal
> and I would appreciate any suggestions.

$ echo "@laaura_95 mírala a eeeella, toda curioooosa! Jajajja así da gusto volver! " | apertium es-en
@*laaura_95 look it to *eeeella, all *curioooosa! *Jajajja Like this gives taste go back! 

I had to make some changes to get it to work properly:

    import sys

    # Read stdin line by line and emit one candidate list per token.
    for line in sys.stdin:
        for word in line.split(' '):
            candidates = create_candidates(word)
            print(output_format(word, candidates))
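
Judging by the candidate lists shown further down in this message, `create_candidates` seems to lowercase each word and collapse every run of a repeated letter to either two letters or one. A minimal sketch of that idea follows. It is my guess at the behaviour, not the code in the repository, and the rule of leaving `@` mentions and `#` hashtags untouched is an assumption:

```python
from itertools import groupby, product


def create_candidates(word):
    """Collapse each run of a repeated letter to two letters or one.

    Hypothetical reconstruction, not the repository's implementation.
    """
    # Assumption: mentions and hashtags are passed through unchanged.
    if word.startswith(("@", "#")):
        return [word]

    options = []
    for char, run in groupby(word.lower()):
        length = len(list(run))
        if length > 1 and char.isalpha():
            # A repeated letter may stand for a double or a single letter.
            options.append((char * 2, char))
        else:
            # Single characters and non-letters are kept as they are.
            options.append((char * length,))

    # Every combination of choices gives one candidate spelling.
    return ["".join(parts) for parts in product(*options)]
```

The number of candidates doubles with every repeated-letter run, so a real generator would probably want to filter against a word list early.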

The selector program makes a bit of a mess of the generator's output:

$ echo "@laaura_95 mírala a eeeella, toda curioooosa! Jajajja así da gusto volver! " | python3 candidates_generator.py | python3 candidates_selector.py
^@laaura_95/['@laaura_95']$
^mírala/['mírala']$
^a/['a']$
^eeeella,/['eella,', 'eela,', 'ella,', 'ela,']$
^toda/['toda']$
^curioooosa!/['curioosa!', 'curiosa!']$
^Jajajja/['jajajja', 'jajaja']$
^así/['así']$
^da/['da']$
^gusto/['gusto']$
^volver!/['volver!']$
^
/['\n']$

It needs to read the generator's output back in and print the original
text with the corrections applied. The expected output is:

@laaura_95 mírala a ella, toda curiosa! Jajaja así da gusto volver! 
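
To make that concrete, here is a rough sketch of a selector that parses the `^word/[candidates]$` lines and rebuilds the sentence. The `VOCAB` set, the function names and the "first candidate found in the vocabulary wins" rule are all assumptions for illustration; a real selector would consult a proper word list or the analyser:

```python
import ast
import re

# Hypothetical vocabulary, just large enough for this example.
VOCAB = {"ella", "curiosa", "jajaja"}

# One token per line: ^original/['cand1', 'cand2']$
TOKEN = re.compile(r"^\^(.*)/(\[.*\])\$$")


def select(original, candidates, vocab=VOCAB):
    """Return the first candidate found in vocab, else the original word."""
    for cand in candidates:
        if cand.strip(",.!?").lower() in vocab:
            # The candidates are lowercased, so restore an initial capital.
            if original[:1].isupper():
                return cand[:1].upper() + cand[1:]
            return cand
    return original


def rebuild(lines, vocab=VOCAB):
    """Join the selected words back into one space-separated line."""
    words = []
    for line in lines:
        match = TOKEN.match(line.strip())
        if not match:
            continue  # skip fragments such as the split newline token
        original = match.group(1)
        candidates = ast.literal_eval(match.group(2))
        words.append(select(original, candidates, vocab))
    return " ".join(words)
```

Calling `rebuild(sys.stdin)` after `import sys` would let it sit at the end of the pipeline. Words with no candidate in the vocabulary fall back to the original form, so unknown tokens such as `@laaura_95` pass through untouched.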

Fran


_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff