Hi Mark,

Not sure if this is exactly what you're looking for, but maybe try the whitelist_blacklist_plugin from NUTCH-585:
https://issues.apache.org/jira/browse/NUTCH-585

Best,
Olle

On Nov 11, 2013, at 7:01 PM, "Reyes, Mark" <[email protected]> wrote:

> Hi:
>
> I’m using Nutch 1.7 to crawl/index the pages of my domain to Solr and
> JavaScript library AJAX Solr to capture that index as JSON, which would then
> print that to the front-end.
>
> My question is, if it’s possible to have specific content return (i.e. An H2
> tag and a p tag) on the search results page versus all contents of that page?
>
> Thank you,
> Mark

