Mahout in Action was written against the 0.5 release of Mahout. If you
are using the current trunk, a number of things have changed since then.

You may do better if you start with one of the classify examples and
walk through it. There's a lot going on, and I found it easier to run
the scripts first and then drill down into the individual steps.
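
As a rough illustration of the data format Vikas describes in the quoted
message below (one document per line: the target label, a tab, then the
text), here is a minimal Python sketch. The function names are made up
for illustration and are not part of Mahout, and I am only assuming that
the text must stay on one line with no embedded tabs.

```python
import re


def to_training_line(label, text):
    """Format one document as 'label<TAB>text' on a single line."""
    # Collapse tabs, newlines and runs of spaces so that the only tab
    # left on the line is the label/text separator.
    flat = re.sub(r"\s+", " ", text).strip()
    return f"{label}\t{flat}"


def parse_training_line(line):
    """Split a 'label<TAB>text' line back into (label, text)."""
    # partition() splits at the first tab only, which is the separator.
    label, _, text = line.rstrip("\n").partition("\t")
    return label, text


if __name__ == "__main__":
    line = to_training_line("alt.atheism", "some text\nto classify")
    print(line)
    print(parse_training_line(line))
```

The same helper works for both the training and the test files, since
both carry the label in the first column according to the message below.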

On Mon, Apr 9, 2012 at 1:31 PM, Ahmed Abdeen Hamed
<[email protected]> wrote:
> I actually tried with a much simpler file structure and am still getting
> a FileNotFoundException. Of course the files exist and have the right
> permissions. Something is strange with this example.
>
> -Ahmed
>
> On Mon, Apr 9, 2012 at 12:50 PM, Ahmed Abdeen Hamed <[email protected]> wrote:
>
>> Hello Vikas,
>> I am actually having the same problem with the 20-newsgroups example. I
>> will reduce the directory structure to see if that works. Let me know if
>> you come up with a better solution.
>>
>> Thanks,
>> -Ahmed
>>
>>
>> On Mon, Apr 9, 2012 at 11:13 AM, Vikas <[email protected]> wrote:
>>
>>> Hi All,
>>>
>>> I am new to Mahout, so the question might sound a bit stupid.
>>> Even so, please humor me.
>>> I am trying to build a Bayes classifier along the lines of a spam
>>> filter.
>>> But it has more than 2 classes, and will contain a network later.
>>> I plan to use Mahout for this.
>>>
>>> I tried to follow the 20-newsgroups example implementation.
>>> But its format differs from the explanation in "Mahout in Action", Figure
>>> 13.2.
>>> In the figure, the new examples have predictor variables only, and no
>>> target variables.
>>>
>>> But the example contains target variables in both the training and test
>>> sets.
>>> One example is alt.atheism.txt, in both the training and test data.
>>> It contains rows of data in the format
>>> "alt.atheism<tab>some_text_to_classify".
>>> The result is a matrix showing the Naive Bayes probability of
>>> classification.
>>>
>>> Am I missing something?
>>> Or does the Naive Bayes implementation using Mahout require this data
>>> format?
>>>
>>> I have been looking for other implementations, but to no avail.
>>>
>>> Any help is greatly appreciated.
>>>
>>> Thank you,
>>> Vikas
>>>
>>>
>>



-- 
Lance Norskog
[email protected]
