Hello,

There is also the "stringdist" package, which comes quite close to what you asked about.
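To make the suggestion concrete, here is a minimal sketch of how the package might be used for similarity search. The word lists are hypothetical, made up for illustration, and the choice of methods and of `maxDist = 2` is only an assumption about what would suit your data.

```r
# install.packages("stringdist")  # one-time install from CRAN
library(stringdist)

# Hypothetical example data, not taken from the original question
reference <- c("processing", "similarity", "package")
queries   <- c("procesing", "similarty", "text")

# Levenshtein distance: number of single-character edits between two strings
d_lv <- stringdist("kitten", "sitting", method = "lv")  # 3

# Jaro-Winkler distance, a heuristic metric that lies between 0 and 1
d_jw <- stringdist(queries[1], reference[1], method = "jw", p = 0.1)

# Full distance matrix: one row per query, one column per reference string
d_mat <- stringdistmatrix(queries, reference, method = "osa")

# Approximate version of match(): index of the closest reference string
# within maxDist edits, or NA when nothing is close enough
idx <- amatch(queries, reference, maxDist = 2)  # 1 2 NA
matched <- reference[idx]
```

Here `amatch()` plays the role of base R's `match()` but tolerates small typos, so "procesing" is mapped to "processing" while "text" stays unmatched.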
http://cran.r-project.org/web/packages/stringdist/index.html

stringdist: Approximate String Matching and String Distance Functions

Implements an approximate string matching version of R's native 'match' function. Can calculate various string distances based on edits (damerau-levenshtein, hamming, levenshtein, optimal string alignment), qgrams (q-gram, cosine, jaccard distance) or heuristic metrics (jaro, jaro-winkler). An implementation of soundex is provided as well.

For the variety of functions and analysis options it offers, you can also look at "qdap":

http://cran.r-project.org/web/packages/qdap/index.html

qdap: Bridging the Gap Between Qualitative Data and Quantitative Analysis

Automates many of the tasks associated with quantitative discourse analysis of transcripts containing discourse including frequency counts of sentence types, words, sentences, turns of talk, syllables and other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions enable the user to aggregate data by any number of grouping variables, providing analysis and seamless integration with other R packages that undertake higher level analysis and visualization of text. This affords the user a more efficient and targeted analysis. 'qdap' is designed for transcript analysis; however, many functions are applicable to other areas of Text Mining/Natural Language Processing.

Regards,
Carlos Ortega
www.qualityexcellence.es

On 7 July 2015 at 11:14, MªLuz Morales <[email protected]> wrote:

> Good morning,
>
> I would like to know whether there is an R package for text processing,
> similarity search and that kind of thing. I have been searching but have
> not found anything on the subject.
>
> Thanks
> Regards
>
> _______________________________________________
> R-help-es mailing list
> [email protected]
> https://stat.ethz.ch/mailman/listinfo/r-help-es
