[R] Help retrieving only Portuguese words from a file

2013-05-28 Thread Júlia Zara
Hello everyone, I´ve been using R in my doctoral reserach and would really appreciate it if any of you could help me find a solution for a problem. I have a file with texts in 3 languages - Portuguese, Spanish, and English - but I am interested only in the Portuguese words. Is there a function I

Re: [R] Help retrieving only Portuguese words from a file

2013-05-28 Thread Rui Barradas
Hello, I'm portuguese and I don't know of any such function. Maybe a dictionary based search? Rui Barradas Em 28-05-2013 11:31, Júlia Zara escreveu: Hello everyone, I´ve been using R in my doctoral reserach and would really appreciate it if any of you could help me find a solution for a

Re: [R] Help retrieving only Portuguese words from a file

2013-05-28 Thread Rui Barradas
Hello, And some words exist in Portuguese, Spanish and English, the three languages of the problem. For instance, animal. I don't think this problem can be solved, but a dictionary search would tell if it is a Portuguese word, which it is. Rui Barradas Em 28-05-2013 16:17, Marcus Nunes

Re: [R] Help retrieving only Portuguese words from a file

2013-05-28 Thread Barry Rowlingson
On Tue, May 28, 2013 at 5:02 PM, Rui Barradas ruipbarra...@sapo.pt wrote: Hello, And some words exist in Portuguese, Spanish and English, the three languages of the problem. For instance, animal. I don't think this problem can be solved, but a dictionary search would tell if it is a

Re: [R] Help retrieving only Portuguese words from a file

2013-05-28 Thread Marcus Nunes
I'm Brazilian I don't know a function like this either, but I can think of a problem using dictionary search. How can this search, if it exists, can distinguish between espada, in Portuguese, and espada, in Spanish? I know they have the same meaning in both languages, but it may cause some

Re: [R] Help retrieving only Portuguese words from a file

2013-05-28 Thread Júlia Zara
Dear Marcus and Rui, Thanks for answering my email. Even though the Portuguese/Spanish distinction may not happen, I think the dictrionary serach would get rid of most words I do not want to include in my research. I have never done this kind of research, but I will read something about it. Best