Re: Extracting data from websites

David Rose Mon, 30 Jul 2012 05:58:53 -0700

The clustering and classification is something that we want to use.  We would 
want to grab news from sites we input on specific industries or companies, and 
then have them classified based on relevance.


On Jul 30, 2012, at 8:26 AM, Sean Owen wrote:

> Extract as in web crawl? No it's nothing to do with that.
> Extract as in entity extraction? I don't think there are relevant
> implementations here either, though that begins to border on machine
> learning.
> This is more about clustering and classification of documents than anything
> else.
> 
> On Mon, Jul 30, 2012 at 1:22 PM, David Rose <[email protected]> wrote:
> 
>> Hi all,
>> 
>> I  apologize for how basic my question is, but I am very new to all of
>> this, machine learning, writing code, all of it.  I was finally able to get
>> Mahout downloaded, installed, and running.  I was assigned a project at my
>> work to try to use Mahout to extract data from websites that we input.  Is
>> this possible? Can anyone help me with suggestions or instructions on how
>> to do so? I appreciate any help on this, as I have only two more weeks to
>> finish this project.
>> 
>> Thanks,
>> 
>> David Rose

Re: Extracting data from websites

Reply via email to