Which indexFilter plugin does Nutch use out-of-the-box? Or how do I find out? I'm trying to figure out how the following fields are being indexed.
anchor boost content digest host segment site title tstamp url
Which indexFilter plugin does Nutch use out-of-the-box? Or how do I find out? I'm trying to figure out how the following fields are being indexed.
anchor boost content digest host segment site title tstamp url