/[EMAIL PROTECTED]/msg08621.html
- Original Message
From: Tsengtan A Shuy [EMAIL PROTECTED]
To: Tsengtan A Shuy [EMAIL PROTECTED]; nutch-dev@lucene.apache.org
Sent: Tuesday, July 17, 2007 12:32:49 PM
Subject: RE: no nutch script file under bin directory
BTW, I just found out there is only one
a 'nutch' directory or a 'nutch0.9'
directory, running svn, and see if it creates another subdirectory under
that, then moves things to where you want.
- Original Message
From: Tsengtan A Shuy [EMAIL PROTECTED]
To: nutch-dev@lucene.apache.org
Sent: Tuesday, July 17, 2007 5:30:18 PM
Subject: RE
I download the nightly build #153, and eclipse ant the whole package.
I also do the whole-web crawl with the resultant org folder, it looks good.
I will get the book lucene in action in a week.
So I think I am ready for the first bug assignment.
Adam Shuy, President
ePacific Web Design Hosting
I follow the msg06571.html to check out the trunk.
Then I found there is no nutch script file under the bin directory.
How do you crawl the multiple websites without this nutch script file?
Adam Shuy, President
ePacific Web Design Hosting
Professional Web/Software developer
TEL: 408-272-6946
: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
Sent: Tuesday, July 17, 2007 12:23 PM
To: 'nutch-dev@lucene.apache.org'
Subject: no nutch script file under bin directory
I follow the msg06571.html to check out the trunk.
Then I found there is no nutch script file under the bin directory.
How do you crawl
developer
TEL: 408-272-6946
www.epacificweb.com
-Original Message-
From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
Sent: Tuesday, July 17, 2007 12:33 PM
To: 'Tsengtan A Shuy'; nutch-dev@lucene.apache.org
Subject: RE: no nutch script file under bin directory
BTW, I just found out there is only
I successfully run the whole-web crawl with the my new ubuntu OS, and I am
ready to fix the bug. I need someone to guide me to get the most updated
source code and the bug assignment.
Thank you in advance!!
Adam Shuy, President
ePacific Web Design Hosting
Professional Web/Software developer
I am running the bin/nutch inject crawl/crawldb dmoz command on my ubuntu
OS by following the nutch-0.8.x tutorial. But I got the following error
message:
2007-07-14 11:38:35,238 WARN mapred.LocalJobRunner
(LocalJobRunner.java:run(120)) - job_ij0atx
java.lang.NoClassDefFoundError:
I successfully implemented the web search menu in my www.epacificweb.com
website.
This menu uses mozdex.com as the backend search engine.
Adam Shuy, President
ePacific Web Design Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
I follow the nutch-0.8.x tutorial and run the bin/nutch crawl urls -dir
crawl -depth 3 -topN 50 command in my cygwin DOS prompt. I got the
following message:
crawl started in: crawl
rootUrlDir = urls
threads = 10
depth = 3
topN = 50
Injector: starting
Injector: crawlDb: crawl/crawldb
I did the same thing in my eClipse, it ran successfully.
So from now on I will use eclipse to run the crawl.
Adam Shuy
President
ePacific Web Design Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-Original Message-
From: Tsengtan A Shuy [mailto:[EMAIL
Where can I find the library for import
com.etranslate.tm.processing.rtf.ParseException; java source code.
I found the jar file.
I like to join the nutch developer team.
Where shall I get start?
Adam Shuy
President
ePacific Web Design Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-Original Message-
From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
Sent
developer
TEL: 408-272-6946
www.epacificweb.com
-Original Message-
From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
Sent: Thursday, June 28, 2007 4:19 PM
To: nutch-dev@lucene.apache.org
Subject: RE: problem with nutch 0.8.1 compile
I found the jar file.
I like to join the nutch developer team
14 matches
Mail list logo