Guys

Please join "Nutch-39 issue" thread in nutch-dev maillist discussion. Thanks

/Jack 

On Apr 12, 2005 8:28 AM, zhang jin <[EMAIL PROTECTED]> wrote:
> Thanks very much,that's very good!
> 
> On Apr 12, 2005 12:56 AM, Orlando Tempobono - AtlasVision <
> [EMAIL PROTECTED]> wrote:
> >
> > Hi,
> >
> > We are working in a network of search websites here in Brazil called
> > www.sitedebusca.com <http://www.sitedebusca.com> the complete list are in
> > http://www.servicodebusca.com/sitesdebusca.php and we add some patchs on
> > search.jsp to show the results in a
> > simple XML format, to read in your own application actually write in PHP.
> > We already are using nutch in a "beta" environment. We have plans to use
> > only nutch on a network of more than
> > 50 regional search web sites.
> > The code of search.jsp are in the next lines, i hope you can understand
> > my email and i hope this code are useful for
> > you.
> >
> > Regards,
> > AtlasVision - Team
> >
> > <%@ page
> > contentType="text/xml; charset=ISO-8859-1"
> > pageEncoding="ISO-8859-1"
> >
> > import="javax.servlet.*"
> > import="javax.servlet.http.*"
> > import="java.io.*"
> > import="java.util.*"
> > import="java.net.*"
> >
> > import="net.nutch.html.Entities"
> > import="net.nutch.searcher.*"
> > %><%
> >
> > NutchBean bean = NutchBean.get(application);
> >
> > // set the character encoding to use when interpreting request values
> > request.setCharacterEncoding("ISO-8859-1");
> >
> > bean.LOG.info("query request from " + request.getRemoteAddr());
> >
> > // get query from request
> > String queryString = request.getParameter("query");
> > if (queryString == null) queryString = "";
> >
> > // first hit to display
> > int start = 0;
> > String startString = request.getParameter("start");
> > if (startString != null) start = Integer.parseInt(startString);
> >
> > // number of hits to display
> > int hitsPerPage = 10;
> > String hitsString = request.getParameter("hitsPerPage");
> > if (hitsString != null) hitsPerPage = Integer.parseInt(hitsString);
> >
> > // max hits per site
> > int hitsPerSite = 2;
> > String hitsPerSiteString = request.getParameter("hitsPerSite");
> > if (hitsPerSiteString != null) hitsPerSite =
> > Integer.parseInt(hitsPerSiteString);
> >
> > Query query = Query.parse(queryString);
> > bean.LOG.info("query: " + queryString);
> >
> > // perform query
> > // Hits hits = bean.search(query, start + 1000, hitsPerSite); // FIXME
> > esta
> > linha estava provocando erros na query: linux
> > Hits hits = bean.search(query, start + hitsPerPage, hitsPerSite);
> >
> > // Last hit in the page
> > int end = start + hitsPerPage - 1;
> > if (end > hits.getLength() - 1) end = hits.getLength() - 1;
> >
> > // Total length in the page
> > int length = 0;
> >
> > if (start < end)
> > length = end - start + 1;
> >
> > bean.LOG.info("total hits: " + hits.getTotal());
> >
> > %><?xml version="1.0" encoding="ISO-8859-1"?>
> > <%
> > // To prevent the character encoding declared with 'contentType' page
> > // directive from being overriden by JSTL (apache i18n), we freeze it
> > // by flushing the output buffer.
> > // see
> > http://java.sun.com/developer/technicalArticles/Intl/MultilingualJSP/
> > out.flush();
> > %>
> > <nutchSearch>
> > <querystring><%=Entities.encode(queryString)%></querystring>
> >
> > <hitsInfo>
> > <hitsPerPage><%=hitsPerPage%></hitsPerPage>
> > <hitsPerSite><%=hitsPerSite%></hitsPerSite>
> > <start><%=new Long(start)%></start>
> > <end><%=new Long(end)%></end>
> > <total><%=new Long(hits.getTotal())%></total>
> > <totalIsExact><%=new Boolean(hits.totalIsExact())%></totalIsExact>
> > <length><%=new Integer(hits.getLength())%></length>
> > <lengthInPage><%=length%></lengthInPage>
> > </hitsInfo>
> >
> > <%
> > if (length > 0) {
> > %>
> > <hitsData>
> > <%
> > Hit[] show = hits.getHits(start, length);
> > HitDetails[] details = bean.getDetails(show);
> > String[] summaries = bean.getSummary(details, query);
> >
> > // display the hits
> > for (int i = 0; i < length; i++) {
> >
> > Hit hit = show[i];
> > HitDetails detail = details[i];
> > String title = detail.getValue("title");
> > String url = detail.getValue("url");
> > String summary = summaries[i].replaceAll("([ \t\n\r]|&nbsp;){2,}", "
> > ");
> > String id = "idx=" + hit.getIndexNo() + "&id=" +
> > hit.getIndexDocNo();
> >
> > // use url for docs w/o title
> > if (title == null || title.equals("")) title = url;
> > %>
> > <hit>
> > <title><![CDATA[<%=title%>]]></title>
> > <summary><![CDATA[<%=summary%>]]></summary>
> > <url><![CDATA[<%=url%>]]></url>
> > <indexNo><%=hit.getIndexNo()%></indexNo>
> > <docNo><%=hit.getIndexDocNo()%></docNo>
> > <moreFromSite><%=(hit.moreFromSiteExcluded())%></moreFromSite>
> > <site><![CDATA[<%=hit.getSite()%>]]></site>
> > </hit>
> > <%
> > }
> > %>
> > </hitsData>
> > <%
> > }
> > %>
> >
> > </nutchSearch>
> >
> >
> > ----- Original Message -----
> > From: <[EMAIL PROTECTED]>
> > To: <[email protected]>
> > Sent: Sunday, April 10, 2005 12:06 PM
> > Subject: XML OUTPUT
> >
> > Hi!
> >
> > Does anybody knows how to output search results in XML format?
> > I would like to provide my data like Google/Yahoo do with their API's.
> >
> > Thanks!
> >
> >
> 
>


-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to