Hi Julien,

Firstly I need to apologise for my failure to differentiate between
cashing and caching (of course the latter being correct). Sorry about
that.

On Fri, Aug 10, 2012 at 8:30 AM, Julien Nioche
<[email protected]> wrote:
> Could this be for the html meta directives?
>

Well in o.a.n.metadata.Nutch some brief Javadoc's for the caching
fields mention the following

 static String  CACHING_FORBIDDEN_ALL
          Don't show either original forbidden content or summaries.
static String   CACHING_FORBIDDEN_CONTENT
          Don't show original forbidden content, but show summaries.
static String   CACHING_FORBIDDEN_KEY
          Sites may request that search engines don't provide access
to cached documents.
static org.apache.avro.util.Utf8        CACHING_FORBIDDEN_KEY_UTF8

static String   CACHING_FORBIDDEN_NONE
          Show both original forbidden content and summaries (default).

I understand that caching data is held within and concerns metadata
(in trunk it is parse.getData().getMeta())but I still have no idea the
characteristics of the cache data, why this would be valuable for an
index. I personally have never queried for it before in my index.

Thanks

Reply via email to