Hi to all,
I need to clean a tagged text (each line= a word, a part of speech, a
lemma). Actually the corpus shows some errors like this one:
< NOM <unknown>
anth NOM <unknown>
> NOM <unknown>
I wanto to replace these three lines with a single line like this one:
<anth>
well, I tried the command:
perl -pi -e "s/< NOM <unknown>.anth NOM <unknown>.>
NOM <unknown>/<anth>/gis" corpus.txt
but it doesn't work.
Someone can tell me why (and, maybe, how to make it work ;) )?
Thaks at all,
alladr
|^|_|^|_|^| |^|_|^|_|^|
| | | |
| | | |
| |*\_/*\_/*\_/*\_/*\_/* | |
| |
| |
| |
| http://www.e-allora.net |
| |
| |
**************************************
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/