http://www.mozdex.com/developer/#opensearch

Tsengtan A Shuy wrote:
Is there a Nutch search engine which provides the RSS feed?
Then I can use my php front end to publish the search result, and use it as
back end.
Adam Shuy, President
ePacific Web Design & Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf
Of Jérôme Charron
Sent: Saturday, June 30, 2007 6:52 AM
To: [email protected]
Subject: Re: integrate Nutch into my php front page

What not using the Lucene PHP version provided by Zend?
File formats are compatibles, so that you can crawl with Java based nutch
and use Zend Lucene PHP for your frontend.

Jérôme

On 6/30/07, Tsengtan A Shuy <[EMAIL PROTECTED]> wrote:
I think I only can purchase the private tomcat hosting plan not a shared
tomcat hosting plan. Because nutch-0.9 only running under the root
directory
of tomcat.  Am I right?

Adam Shuy, President
ePacific Web Design & Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-----Original Message-----
From: Enzo Michelangeli [mailto:[EMAIL PROTECTED]
Sent: Friday, June 29, 2007 6:58 PM
To: [email protected]
Subject: Re: integrate Nutch into my php front page

Ah, excellent, thanks: I had missed it!

Enzo

----- Original Message -----
From: "Roger Dunk" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Saturday, June 30, 2007 9:18 AM
Subject: Re: integrate Nutch into my php front page


It's called OpenSearch (http://en.wikipedia.org/wiki/OpenSearch). Nutch
has supported this for a long time already, and many people make good
use
of it.

-Roger

----- Original Message -----
From: "Enzo Michelangeli" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Saturday, June 30, 2007 11:08 AM
Subject: Re: integrate Nutch into my php front page


Another way would be to rewrite a search.jsp so that it return XML or
JSON
rather than HTML, and then have the PHP code place a GET to that page
and
parse the results (the SOLR approach, so to speak). The JVM (and
Tomcat)
should obviously be run, but that could be done on a different machine.

(BTW, has anybody written such JSP page already?)

Enzo

----- Original Message -----
From: "Matthew A. Bockol" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Saturday, June 30, 2007 7:51 AM
Subject: Re: integrate Nutch into my php front page

Hi Tsengtan,

I've written a php wrapper class for nutch that permits querying the
jsp.
It works reasonably well, but you still need to run the JVM. Let me
know
if you're interested.

Matt


----- Original Message -----
From: "Tsengtan A Shuy" <[EMAIL PROTECTED]>
To: [email protected]
Sent: Friday, June 29, 2007 5:34:45 PM (GMT-0600) America/Chicago
Subject: integrate Nutch into my php front page

I like to integrate the nutch search engine into my php front page of
my
website www.epacificweb.com . Is there a way to do it without running
on
JVM?

Adam Shuy, President
ePacific Web Design & Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-----Original Message-----
From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
Sent: Friday, June 29, 2007 2:55 PM
To: [email protected]
Subject: RE: windows eclipse run

Please ignore my last email.

I run both nutch-0.8.1 and nutch-0.9 with my windows eclipse
environment.
I got all the result folders: crawldb, index, indexs, linkdb and
segments
from nutch-0.8.1, but I only got crawldb folder from the nutch-0.9.
Am I getting the right result?
Any feedback will be much appreciated.

Adam Shuy, President
ePacific Web Design & Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com

-----Original Message-----
From: Tsengtan A Shuy [mailto:[EMAIL PROTECTED]
Sent: Friday, June 29, 2007 2:44 PM
To: '[email protected]'
Subject: RE: NoRouteToHostException

I got the same error when I ran in my cygwin environment.
So I ran it in the windows eclipse environment, it ran OK but I still
have
some other nutch-0.9 issue to deal with.
Please read the following web page:
http://wiki.apache.org/nutch/RunNutchInEclipse, and
http://lucene.apache.org/nutch/tutorial8.html
Then ran it again.

Adam Shuy, President
ePacific Web Design & Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-----Original Message-----
From: DANIEL CLARK [mailto:[EMAIL PROTECTED]
Sent: Friday, June 29, 2007 1:07 PM
To: Nutch List
Subject: NoRouteToHostException

I'm running 0.8.1 and I'm getting the following exception.  Any help
would
be appreciated.

$ bin/nutch crawl urls -dir crawl -depth 3
crawl started in: crawl
rootUrlDir = urls
threads = 10
depth = 3
Injector: starting
Injector: crawlDb: crawl/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.net.NoRouteToHostException: No route
to
host
       at java.net.PlainSocketImpl.socketConnect(Native Method)
       at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
       at
java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
       at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
       at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
       at java.net.Socket.connect(Socket.java:519)
       at java.net.Socket.connect(Socket.java:469)
       at java.net.Socket.<init>(Socket.java:366)
       at java.net.Socket.<init>(Socket.java:208)
       at
org.apache.hadoop.ipc.Client$Connection.<init>(Client.java:113)
       at org.apache.hadoop.ipc.Client.getConnection(Client.java:359)
       at org.apache.hadoop.ipc.Client.call(Client.java:297)
       at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:150)
       at org.apache.hadoop.mapred.$Proxy1.getFilesystemName(Unknown
Source)
       at org.apache.hadoop.mapred.JobClient.getFs(JobClient.java:214)
       at
org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:248)
       at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java
:327)
       at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
       at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)


~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Daniel Clark, President
DAC Systems, Inc.
5209 Nanticoke Court
Centreville, VA  20120
Cell - (703) 403-0340
Email - [EMAIL PROTECTED]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~





--
Glenn Cooke
Insurance Squared Inc.
1-866-662-5433

Agent discussion forum: http://www.americaninsurancebroker.com
Free US broker directory: http://directory.americaninsurancebroker.com
Free Canadian broker directory: http://www.canadianinsurancebroker.com
Search engine optimization
Directory submission: http://www.insurancesquared.com/directorysubmission
Advertise on blogs: http://www.insurancesquared.com/blogs
Presell pages: http://www.insurancesquared.com/insurancepresellpages


Reply via email to