[ https://issues.apache.org/jira/browse/NUTCH-3115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18004909#comment-18004909 ]
ASF GitHub Bot commented on NUTCH-3115: --------------------------------------- CatChullain commented on PR #856: URL: https://github.com/apache/nutch/pull/856#issuecomment-3066331857 Still banging on this today. If we still have time, I'd like to do a bit more work so the new flag for all fields access can apply to each field/class individually. That'll be easier to update an existing nutch-site to start using it without interfering with previous index-arbitrary defs. > Allow POJO in Arbitrary Indexer to access indexing objects in filter > constrctor > ------------------------------------------------------------------------------- > > Key: NUTCH-3115 > URL: https://issues.apache.org/jira/browse/NUTCH-3115 > Project: Nutch > Issue Type: Improvement > Components: indexer > Affects Versions: 1.20 > Reporter: Joe Gilvary > Assignee: Joe Gilvary > Priority: Minor > Fix For: 1.21 > > Attachments: NUTCH-3115.patch > > > The POJO used with Arbitrary Indexer doesn't have access to NutchDocument, > Inlinks, etc. Granting this read/write access to the POJO enables additional > data modification/clean-up before the indexing process. -- This message was sent by Atlassian Jira (v8.20.10#820010)