Well, it was a crawl command first and then a solrindex command. The crawl
does invertlinks as well right?

Thanks
Chethan

On Mon, Oct 8, 2012 at 3:16 PM, Markus Jelsma <[email protected]>wrote:

> Hi - did you run the invertlinks program over your segments before
> indexing?
>
> -----Original message-----
> > From:chethan <[email protected]>
> > Sent: Mon 08-Oct-2012 04:28
> > To: [email protected]
> > Subject: Anchor text of current URL
> >
> > Hi,
> >
> > In an indexing filter, is there a way to figure out the Anchor text from
> > which the current URL/document originated from? I tried the inlinks but
> > that seems to be null.
> >
> > public NutchDocument filter(NutchDocument doc, Parse parse, Text url,
> > CrawlDatum datum, Inlinks inlinks) IndexingException {
> >
> > *    //Need to know the anchor text from which the current document
> > originated from at this point*
> >
> > }
> >
> > Thanks
> > Chethan
> >
>

Reply via email to