Re: opensearchservlet - HowTo ?

Philip Brown Tue, 29 Aug 2006 05:33:12 -0700

Andrzej Bialecki wrote:

Michael Wechner wrote:
Sandy Polanski wrote:
I second that. Is there anyone that can give us some tips on how touse the OpenSearchServlet? I'd really like to see a standalone Javaprogram that would allow me to see the results in RSS format that Ican call from the "./bin/nutch" executable.
I guess the bin/nutch resp. some other program (maybe based onNutchBean) should return a RSS feed which then can be pulled/parsedby the OpenSearchServlet. The question is does something like thisalready exist within Nutch and if not is somebody writing somethinglike this(for instance myself ;-) but I would rather wait if somebody mightanswer the "exist" question ...
Folks,
As the name itself suggests, the servlet needs a servlet container torun. If you build a standard WAR you will get among others theOpenSearchServlet included in the WAR, under <contextPath>/opensearch.Deploy this WAR to your favorite servlet container, e.g. Tomcat, andyou are ready to go.
This is a REST-type service, which means that you send it requests asstandard HTTP GET-s with parameters in the URL, and as a response youget an XML document.
Example request:

 http://localhost:8081/nutch/opensearch?query=cnn

Example response:

<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:nutch="http://www.nutch.org/opensearchrss/1.0/";xmlns:opensearch="http://a9.com/-/spec/opensearchrss/1.0/"; version="2.0">
<channel>
<title>Nutch: cnn</title>
<description>Nutch search results for query: cnn</description>
<link>http://localhost:8081/nutch/search.jsp?query=cnn&start=0&hitsPerDup=2&hitsPerPage=10</link>
<opensearch:totalResults>1</opensearch:totalResults>
<opensearch:startIndex>0</opensearch:startIndex>
<opensearch:itemsPerPage>10</opensearch:itemsPerPage>

<nutch:query>cnn</nutch:query>
<item>
<title>CNN.com - Breaking News, U.S., World, Weather, Entertainment& Video News</title><description> ... the worldInstant Access CNNInternational Live newscasts and ...Pipeline Overnight Live feeds from <spanclass="highlight">CNN and its global<spanclass="ellipsis"> ... </description>
<link>http://www.cnn.com/</link>
<nutch:site>www.cnn.com</nutch:site>
<nutch:cache>http://localhost:8081/nutch/cached.jsp?idx=0&id=0</nutch:cache><nutch:explain>http://localhost:8081/nutch/explain.jsp?idx=0&id=0&query=cnn&lang=null</nutch:explain>
<nutch:segment>20060817135307</nutch:segment>
<nutch:digest>6e5e1ede359a88f11fc564cf22f79305</nutch:digest>
<nutch:boost>2.5735338</nutch:boost>

</item>
</channel>
</rss>

Thanks for reply,

I did not want to drop whole .war into my already running web ap. Justlooking at dropping this nutch.searcher package in with the twodependent imports needed ie. hadoop.conf.Configuration; andnutch.util.NutchConfiguraion; it will access the index undertomcatServerRoot/nutch-0.8/crawl-result/index...

so I see as you run the query and servlet outputs xml ... does itinclude a xslt stylesheet with it to format the page.

Unfortunately I don't have time to start experimenting and seeing my ownresults for the next few days, so I am just trying to garner info, topoint me in the right direction. I appreciate all the replys.


Thanks

Re: opensearchservlet - HowTo ?

Reply via email to