Re: [Scikit-learn-general] Document classifier with Arabic textual data

2012-04-16 Thread Olivier Grisel
On 16 April 2012 05:28, Lars Buitinck wrote:
> On 16 April 2012 13:57, Fahd S. Alotaibi wrote:
>> I keep getting the following error message and I don't know exactly
>> where the problem is. Would scikit-learn work fine for Arabic letters (using
>> Unicode)? If not

Re: [Scikit-learn-general] Document classifier with Arabic textual data

2012-04-16 Thread Lars Buitinck
On 16 April 2012 13:57, Fahd S. Alotaibi wrote:
> I keep getting the following error message and I don't know exactly
> where the problem is. Would scikit-learn work fine for Arabic letters (using
> Unicode)? If not, how can I do that?

The problem is that Vectorizer by defaul

[Scikit-learn-general] Document classifier with Arabic textual data

2012-04-16 Thread Fahd S. Alotaibi
Hi everyone,

I'm trying to investigate how effective scikit-learn is for classifying Arabic documents. I could successfully work through the English tutorial (20 newsgroups dataset), but once I modified the code (by using load_files()) to import Arabic text instead, I got errors (see bel
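
For readers following the thread: the question of whether Arabic text "works" usually comes down to two steps, decoding the raw bytes with the right encoding and then tokenizing with a Unicode-aware pattern. The sketch below illustrates those two steps with the Python standard library only. It is not the code from this thread and not scikit-learn itself. The function name `bag_of_words` and the sample sentence are made up for illustration, the files are assumed to be UTF-8 encoded, and the regular expression is the one current scikit-learn documents as the default `token_pattern` of `CountVectorizer`.

```python
import re
from collections import Counter

# Unicode-aware word pattern: tokens of two or more word characters.
# (Assumption: same pattern string as CountVectorizer's documented default.)
TOKEN_PATTERN = re.compile(r"(?u)\b\w\w+\b")


def bag_of_words(raw: bytes, encoding: str = "utf-8") -> Counter:
    """Decode raw file bytes, then count word tokens (hypothetical helper)."""
    # Decoding explicitly raises UnicodeDecodeError if the encoding is wrong,
    # which surfaces the problem early instead of producing garbled tokens.
    text = raw.decode(encoding)
    return Counter(TOKEN_PATTERN.findall(text))


# Made-up sample document, stored as UTF-8 bytes the way a file on disk would be.
raw_doc = "اللغة العربية لغة جميلة، اللغة حية".encode("utf-8")
counts = bag_of_words(raw_doc)

for token, n in counts.most_common():
    print(token, n)
```

If the decode step fails on real data, the files are probably not in the assumed encoding; passing the actual encoding (for example `cp1256` for some Arabic Windows text) to `bag_of_words` is the analogous fix to telling a vectorizer which encoding to use.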