To comment on the following update, log in, then open the issue:
http://www.openoffice.org/issues/show_bug.cgi?id=91198





------- Additional comments from [email protected] Tue Feb  3 17:10:01 +0000 2009 -------
michel_w->tl: I created it myself using text_cat
(http://odur.let.rug.nl/~vannoord/TextCat/). Text_cat is based on this paper:
http://citeseer.ist.psu.edu/68861.html (Figure 3 explains the algorithm quite 
well).
The numbers in the second column are the number of times each n-gram
appears in the original sample. As far as I know, the algorithm does not use
them, so they can safely be removed (which is what I did).
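
To illustrate why the counts should not matter, here is a rough Python sketch
of the rank-based "out-of-place" comparison described in the paper. It is not
text_cat's actual code. The function names, the "_" word padding, the limit of
400 n-grams, and the fingerprint layout (one n-gram per line, optional count in
a second column) are assumptions made for this example.

```python
from collections import Counter


def ngram_profile(text, max_n=5, top=400):
    """Return the most frequent n-grams (n = 1..max_n), most frequent first.

    Assumption: words are padded with "_" on both sides, and only the
    top `top` n-grams are kept.
    """
    counts = Counter()
    for word in text.lower().split():
        padded = "_" + word + "_"
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    # Only the order survives; the counts are discarded here.
    return [gram for gram, _ in ranked[:top]]


def parse_fingerprint(lines):
    """Read fingerprint lines, keeping only the first column (the n-gram).

    Assumption: one n-gram per line, optionally followed by a count.
    """
    return [line.split()[0] for line in lines if line.strip()]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences between a document and a language profile.

    An n-gram missing from the language profile gets a fixed maximum
    penalty (here: the length of the language profile).
    """
    rank = {gram: i for i, gram in enumerate(lang_profile)}
    max_penalty = len(lang_profile)
    return sum(
        abs(i - rank[gram]) if gram in rank else max_penalty
        for i, gram in enumerate(doc_profile)
    )
```

In this sketch the distance depends only on the position of each n-gram in the
list, so a fingerprint with the second column stripped gives the same result as
one that still has it, as long as the line order is unchanged.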

---------------------------------------------------------------------
Please do not reply to this automatically generated notification from
Issue Tracker. Please log onto the website and enter your comments.
http://qa.openoffice.org/issue_handling/project_issues.html#notification

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]


