RE: no nutch script file under bin directory

2007-07-18 Thread Tsengtan A Shuy
/[EMAIL PROTECTED]/msg08621.html - Original Message From: Tsengtan A Shuy [EMAIL PROTECTED] To: Tsengtan A Shuy [EMAIL PROTECTED]; nutch-dev@lucene.apache.org Sent: Tuesday, July 17, 2007 12:32:49 PM Subject: RE: no nutch script file under bin directory BTW, I just found out there is only one

RE: no nutch script file under bin directory

2007-07-18 Thread Tsengtan A Shuy
a 'nutch' directory or a 'nutch0.9' directory, running svn, and see if it creates another subdirectory under that, then moves things to where you want. - Original Message From: Tsengtan A Shuy [EMAIL PROTECTED] To: nutch-dev@lucene.apache.org Sent: Tuesday, July 17, 2007 5:30:18 PM Subject: RE

ready for the first assignment

2007-07-18 Thread Tsengtan A Shuy
I download the nightly build #153, and eclipse ant the whole package. I also do the whole-web crawl with the resultant org folder, it looks good. I will get the book lucene in action in a week. So I think I am ready for the first bug assignment. Adam Shuy, President ePacific Web Design Hosting

no nutch script file under bin directory

2007-07-17 Thread Tsengtan A Shuy
I follow the msg06571.html to check out the trunk. Then I found there is no nutch script file under the bin directory. How do you crawl the multiple websites without this nutch script file? Adam Shuy, President ePacific Web Design Hosting Professional Web/Software developer TEL: 408-272-6946

RE: no nutch script file under bin directory

2007-07-17 Thread Tsengtan A Shuy
: Tsengtan A Shuy [mailto:[EMAIL PROTECTED] Sent: Tuesday, July 17, 2007 12:23 PM To: 'nutch-dev@lucene.apache.org' Subject: no nutch script file under bin directory I follow the msg06571.html to check out the trunk. Then I found there is no nutch script file under the bin directory. How do you crawl

RE: no nutch script file under bin directory

2007-07-17 Thread Tsengtan A Shuy
developer TEL: 408-272-6946 www.epacificweb.com -Original Message- From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED] Sent: Tuesday, July 17, 2007 12:33 PM To: 'Tsengtan A Shuy'; nutch-dev@lucene.apache.org Subject: RE: no nutch script file under bin directory BTW, I just found out there is only

RE: OOM error during parsing with nekohtml

2007-07-16 Thread Tsengtan A Shuy
I successfully run the whole-web crawl with the my new ubuntu OS, and I am ready to fix the bug. I need someone to guide me to get the most updated source code and the bug assignment. Thank you in advance!! Adam Shuy, President ePacific Web Design Hosting Professional Web/Software developer

inject command fail on whole-web run

2007-07-14 Thread Tsengtan A Shuy
I am running the bin/nutch inject crawl/crawldb dmoz command on my ubuntu OS by following the nutch-0.8.x tutorial. But I got the following error message: 2007-07-14 11:38:35,238 WARN mapred.LocalJobRunner (LocalJobRunner.java:run(120)) - job_ij0atx java.lang.NoClassDefFoundError:

mozdex as a backend search engine.

2007-07-07 Thread Tsengtan A Shuy
I successfully implemented the web search menu in my www.epacificweb.com website. This menu uses mozdex.com as the backend search engine. Adam Shuy, President ePacific Web Design Hosting Professional Web/Software developer TEL: 408-272-6946 www.epacificweb.com

problem running bin/nutch crawl urls -dir crawl -depth 3 -topN 50 command

2007-06-29 Thread Tsengtan A Shuy
I follow the nutch-0.8.x tutorial and run the bin/nutch crawl urls -dir crawl -depth 3 -topN 50 command in my cygwin DOS prompt. I got the following message: crawl started in: crawl rootUrlDir = urls threads = 10 depth = 3 topN = 50 Injector: starting Injector: crawlDb: crawl/crawldb

RE: problem running bin/nutch crawl urls -dir crawl -depth 3 -topN 50 command

2007-06-29 Thread Tsengtan A Shuy
I did the same thing in my eClipse, it ran successfully. So from now on I will use eclipse to run the crawl. Adam Shuy President ePacific Web Design Hosting Professional Web/Software developer TEL: 408-272-6946 www.epacificweb.com -Original Message- From: Tsengtan A Shuy [mailto:[EMAIL

problem with nutch 0.8.1 compile

2007-06-28 Thread Tsengtan A Shuy
Where can I find the library for import com.etranslate.tm.processing.rtf.ParseException; java source code.

RE: problem with nutch 0.8.1 compile

2007-06-28 Thread Tsengtan A Shuy
I found the jar file. I like to join the nutch developer team. Where shall I get start? Adam Shuy President ePacific Web Design Hosting Professional Web/Software developer TEL: 408-272-6946 www.epacificweb.com -Original Message- From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED] Sent

RE: problem with nutch 0.8.1 compile

2007-06-28 Thread Tsengtan A Shuy
developer TEL: 408-272-6946 www.epacificweb.com -Original Message- From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED] Sent: Thursday, June 28, 2007 4:19 PM To: nutch-dev@lucene.apache.org Subject: RE: problem with nutch 0.8.1 compile I found the jar file. I like to join the nutch developer team