[jira] [Updated] (NUTCH-1917) index.parse.md, index.content.md and index.db.md should support wildcard

2019-10-01 Thread Sebastian Nagel (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-1917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel updated NUTCH-1917:
---
Fix Version/s: (was: 1.16)
   1.17

> index.parse.md, index.content.md and index.db.md should support wildcard
> 
>
> Key: NUTCH-1917
> URL: https://issues.apache.org/jira/browse/NUTCH-1917
> Project: Nutch
>  Issue Type: Bug
>  Components: indexer
>Affects Versions: 1.9
>Reporter: Lewis John McGibbney
>Priority: Major
> Fix For: 1.17
>
> Attachments: MetadataIndexer.java.patch
>
>
> Right now metatags.names supports the '*' character for a catch all.
> I believe that the above index properties should also support catch all as a 
> mechanism for quickly building augmented data models from crawl data. 
> Individual identification and manual inclusion of tags one by one is error 
> prone and time consuming.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (NUTCH-1917) index.parse.md, index.content.md and index.db.md should support wildcard

2018-07-02 Thread Sebastian Nagel (JIRA)


 [ 
https://issues.apache.org/jira/browse/NUTCH-1917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel updated NUTCH-1917:
---
Fix Version/s: (was: 1.15)
   1.16

> index.parse.md, index.content.md and index.db.md should support wildcard
> 
>
> Key: NUTCH-1917
> URL: https://issues.apache.org/jira/browse/NUTCH-1917
> Project: Nutch
>  Issue Type: Bug
>  Components: indexer
>Affects Versions: 1.9
>Reporter: Lewis John McGibbney
>Priority: Major
> Fix For: 1.16
>
> Attachments: MetadataIndexer.java.patch
>
>
> Right now metatags.names supports the '*' character for a catch all.
> I believe that the above index properties should also support catch all as a 
> mechanism for quickly building augmented data models from crawl data. 
> Individual identification and manual inclusion of tags one by one is error 
> prone and time consuming.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (NUTCH-1917) index.parse.md, index.content.md and index.db.md should support wildcard

2017-12-17 Thread Sebastian Nagel (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-1917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel updated NUTCH-1917:
---
Fix Version/s: (was: 1.14)
   1.15

> index.parse.md, index.content.md and index.db.md should support wildcard
> 
>
> Key: NUTCH-1917
> URL: https://issues.apache.org/jira/browse/NUTCH-1917
> Project: Nutch
>  Issue Type: Bug
>  Components: indexer
>Affects Versions: 1.9
>Reporter: Lewis John McGibbney
> Fix For: 1.15
>
> Attachments: MetadataIndexer.java.patch
>
>
> Right now metatags.names supports the '*' character for a catch all.
> I believe that the above index properties should also support catch all as a 
> mechanism for quickly building augmented data models from crawl data. 
> Individual identification and manual inclusion of tags one by one is error 
> prone and time consuming.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Updated] (NUTCH-1917) index.parse.md, index.content.md and index.db.md should support wildcard

2016-10-18 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-1917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated NUTCH-1917:

Fix Version/s: 1.13

> index.parse.md, index.content.md and index.db.md should support wildcard
> 
>
> Key: NUTCH-1917
> URL: https://issues.apache.org/jira/browse/NUTCH-1917
> Project: Nutch
>  Issue Type: Bug
>  Components: indexer
>Affects Versions: 1.9
>Reporter: Lewis John McGibbney
> Fix For: 1.13
>
> Attachments: MetadataIndexer.java.patch
>
>
> Right now metatags.names supports the '*' character for a catch all.
> I believe that the above index properties should also support catch all as a 
> mechanism for quickly building augmented data models from crawl data. 
> Individual identification and manual inclusion of tags one by one is error 
> prone and time consuming.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (NUTCH-1917) index.parse.md, index.content.md and index.db.md should support wildcard

2016-10-18 Thread David Johnson (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-1917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Johnson updated NUTCH-1917:
-
Attachment: MetadataIndexer.java.patch

> index.parse.md, index.content.md and index.db.md should support wildcard
> 
>
> Key: NUTCH-1917
> URL: https://issues.apache.org/jira/browse/NUTCH-1917
> Project: Nutch
>  Issue Type: Bug
>  Components: indexer
>Affects Versions: 1.9
>Reporter: Lewis John McGibbney
> Attachments: MetadataIndexer.java.patch
>
>
> Right now metatags.names supports the '*' character for a catch all.
> I believe that the above index properties should also support catch all as a 
> mechanism for quickly building augmented data models from crawl data. 
> Individual identification and manual inclusion of tags one by one is error 
> prone and time consuming.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)