Re: Need Tutorial on Nutch

2018-03-07 Thread Eric Valencia
s > > > -Original message- > > From:Eric Valencia <ericlvalen...@gmail.com> > > Sent: Wednesday 7th March 2018 21:51 > > To: user@nutch.apache.org > > Subject: Re: Need Tutorial on Nutch > > > > How about using nutch with a headless browser lik

RE: Need Tutorial on Nutch

2018-03-07 Thread Markus Jelsma
lvalen...@gmail.com> > Sent: Wednesday 7th March 2018 21:51 > To: user@nutch.apache.org > Subject: Re: Need Tutorial on Nutch > > How about using nutch with a headless browser like CasperJS? Will this > work? Have any of you tried this? > > On Tue, Mar 6, 2018 at

Re: Need Tutorial on Nutch

2018-03-07 Thread Eric Valencia
Markus > > -Original message- > > From:Eric Valencia <ericlvalen...@gmail.com> > > Sent: Tuesday 6th March 2018 21:17 > > To: user@nutch.apache.org > > Subject: Re: Need Tutorial on Nutch > > > > Yash, well, I want to monitor the price for every ite

RE: Need Tutorial on Nutch

2018-03-06 Thread Markus Jelsma
luck Markus -Original message- > From:Eric Valencia <ericlvalen...@gmail.com> > Sent: Tuesday 6th March 2018 21:17 > To: user@nutch.apache.org > Subject: Re: Need Tutorial on Nutch > > Yash, well, I want to monitor the price for every item in the top 500 > reta

Re: Need Tutorial on Nutch

2018-03-06 Thread Eric Valencia
Hadoop. If you dont I do > > > recomend to read "Hadoop. The definitive guide", because, well, Nutch > is > > > Hadoop. > > > > > > Here we are, no pain, no gain. > > > > > > > > > > > > Sent: Tuesday, March 06, 2018 at 7:42 PM

Re: Need Tutorial on Nutch

2018-03-06 Thread Yash Thenuan Thenuan
is > > Hadoop. > > > > Here we are, no pain, no gain. > > > > > > > > Sent: Tuesday, March 06, 2018 at 7:42 PM > > From: "Eric Valencia" <ericlvalen...@gmail.com> > > To: user@nutch.apache.org > > Subject: Re: Need Tutorial on Nutch &g

Re: Need Tutorial on Nutch

2018-03-06 Thread Eric Valencia
in. > > > > Sent: Tuesday, March 06, 2018 at 7:42 PM > From: "Eric Valencia" <ericlvalen...@gmail.com> > To: user@nutch.apache.org > Subject: Re: Need Tutorial on Nutch > Thank you kindly Yash. Yes, I did try some of the tutorials actually but > they seem to b

Re: Need Tutorial on Nutch

2018-03-06 Thread Semyon Semyonov
ell, Nutch is Hadoop. Here we are, no pain, no gain.     Sent: Tuesday, March 06, 2018 at 7:42 PM From: "Eric Valencia" <ericlvalen...@gmail.com> To: user@nutch.apache.org Subject: Re: Need Tutorial on Nutch Thank you kindly Yash. Yes, I did try some of the tutorials actually b

Re: Need Tutorial on Nutch

2018-03-06 Thread Yash Thenuan Thenuan
Start with nutch 1.x if you are getting some trouble. Its easier to configure and by following nutch 1.x tutorial you will be able to crawl your first website easily. On 7 Mar 2018 00:13, "Eric Valencia" wrote: > Thank you kindly Yash. Yes, I did try some of the

Re: Need Tutorial on Nutch

2018-03-06 Thread Eric Valencia
Thank you kindly Yash. Yes, I did try some of the tutorials actually but they seem to be missing the complete amount of steps required to successfully scrape in nutch. On Tue, Mar 6, 2018 at 10:37 AM Yash Thenuan Thenuan wrote: > I would suggest to start with the

Re: Need Tutorial on Nutch

2018-03-06 Thread Yash Thenuan Thenuan
I would suggest to start with the documentation on nutch's website. You can get a Idea about how to start crawling and all. Apart from that there are no proper tutorials as such. Just start crawling if you got stuck somewhere try to find something related to that on Google and nutch mailing list

Need Tutorial on Nutch

2018-03-06 Thread Eric Valencia
I'm a beginner in Nutch and need the best tutorials to get started. Can you guys let me know how you would advise yourselves if starting today (like me)? Eric