To index the web pages, you can use Nutch with Solr; together they take care of scraping the pages and indexing them. To find similar documents or pages, you can use MoreLikeThis (http://wiki.apache.org/solr/MoreLikeThis): query for the document in question (by id or by search terms), and Solr returns similar documents from the index along with the result.
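As a rough sketch of what such a MoreLikeThis request could look like, the Python snippet below only builds the query URL and does not contact any server. The Solr address, the field names `title` and `content`, and the assumption that the page URL is stored as the document `id` are placeholders for illustration; adjust them to your own schema. The `mlt`, `mlt.fl`, `mlt.mintf`, `mlt.mindf` and `mlt.count` parameters are the ones described on the wiki page above.

```python
from urllib.parse import urlencode


def build_mlt_url(solr_select_url, doc_id, fields, count=5):
    """Build a Solr select URL that asks for documents similar to doc_id.

    solr_select_url, doc_id and fields are illustrative placeholders;
    they depend on your own Solr setup and schema.
    """
    # Escape backslashes and quotes so the id can be used as a quoted phrase.
    escaped_id = doc_id.replace("\\", "\\\\").replace('"', '\\"')
    params = {
        "q": f'id:"{escaped_id}"',    # look up the already indexed page
        "mlt": "true",                # enable the MoreLikeThis component
        "mlt.fl": ",".join(fields),   # fields used to measure similarity
        "mlt.mintf": 1,               # minimum term frequency in the source doc
        "mlt.mindf": 1,               # minimum document frequency in the index
        "mlt.count": count,           # number of similar documents to return
        "wt": "json",
    }
    return f"{solr_select_url}?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical local Solr instance and field names.
    print(build_mlt_url(
        "http://localhost:8983/solr/select",
        "http://example.com/",
        ["title", "content"],
    ))
```

Because the query refers to the indexed page by id rather than passing the whole scraped text as `q`, the request stays short no matter how large the page is.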
Regards,
Jayendra

On Tue, Aug 30, 2011 at 8:23 AM, Sheetal <rituzprad...@gmail.com> wrote:
> Hi,
>
> Is it possible to give the URL address of a site and solr search server
> reads the contents of the given site and recommends similar projects to
> that. I did scrapped the web contents from the given URL address and now
> have the plain text format of the contents in URL. But when I pass that
> scrapped text as query into Solr. It doesn't work as query being too
> large(depends on the given contents of URL).
>
> I read it somewhere that its possible , Given the URL address and outputs
> you the relevant projects to it. But I don't remember whether its using Solr
> search or other search engine.
>
> Does anyone have any ideas or suggestions for this..Would highly appreciate
> your comments....
>
> Thank you in advance..
>
> -----
> Sheetal
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Search-the-contents-of-given-URL-in-Solr-tp3294376p3294376.html
> Sent from the Solr - User mailing list archive at Nabble.com.