Hello everybody,

If more work is needed on NUTCH-1740, I'll be glad to help. I developed this plugin because I was among the people who didn't want to create a new plugin just for a few metadata extraction tasks ;)
2014-11-01 19:47 GMT+01:00 Lewis John McGibbney (JIRA) <j...@apache.org>:

> [ https://issues.apache.org/jira/browse/NUTCH-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
>
> Lewis John McGibbney updated NUTCH-1644:
> ----------------------------------------
>     Fix Version/s:     (was: 2.3)
>                        2.4
>
>> Should have a parser that uses xpath
>> ------------------------------------
>>
>>                 Key: NUTCH-1644
>>                 URL: https://issues.apache.org/jira/browse/NUTCH-1644
>>             Project: Nutch
>>          Issue Type: New Feature
>>          Components: parser
>>    Affects Versions: 2.2.1
>>            Reporter: cihad güzel
>>            Assignee: Lewis John McGibbney
>>              Labels: parser, xpath
>>             Fix For: 2.4
>>
>>         Attachments: NUTCH-1644.patch
>>
>>
>> May want to parse some url via xpath. May be blog or news web sites. Should
>> be a plugin using xpath parse.
>
> --
> This message was sent by Atlassian JIRA
> (v6.3.4#6332)