To comment on the following update, log in, then open the issue: http://www.openoffice.org/issues/show_bug.cgi?id=91198
------- Additional comments from [email protected] Tue Feb 3 17:10:01 +0000 2009 ------- michel_w->tl: I created it myself using text_cat (http://odur.let.rug.nl/~vannoord/TextCat/). Text_cat is based on this paper: http://citeseer.ist.psu.edu/68861.html (Figure 3 explains the algorithm quite well). The numbers in the second column represent the number of times the n-grams appears in the original sample. They are not used by the algorithm and can thus be safely removed AFAIK (which is what I did). --------------------------------------------------------------------- Please do not reply to this automatically generated notification from Issue Tracker. Please log onto the website and enter your comments. http://qa.openoffice.org/issue_handling/project_issues.html#notification --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
