[jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions

2019-11-15 Thread Sebastian Nagel (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastian Nagel updated NUTCH-2481:
---
Affects Version/s: 1.15

> HostDatum deltas(previous step statistics) and Metadata expressions
> ---
>
> Key: NUTCH-2481
> URL: https://issues.apache.org/jira/browse/NUTCH-2481
> Project: Nutch
>  Issue Type: Improvement
>  Components: hostdb
>Affects Versions: 1.15
>Reporter: Semyon Semyonov
>Priority: Minor
>
> Allow the use of previous-step statistics (deltas of fetched, unfetched,
> etc.) in hostdb. The motivation is to use these statistics in generate with
> maxCount expressions.
>  
> The solution fills in the metadata of a HostDatum from a custom JEXL
> expression that uses two HostDatum objects: one from before the update
> (previousHostDatum) and one from after the update (currentHostDatum).
> For example, to record the difference in the fetched count between rounds t
> and t-1, we can use the following expression:
> 
>  hostdb.deltaExpression
>  {return new ("javafx.util.Pair","FetchedDelta", 
> currentHostDatum.fetched - previousHostDatum.fetched);}
> 
> A pull request will be provided shortly.
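
To make the intent of the example expression concrete, here is a minimal
sketch in plain Java of the same computation: a key/value pair whose value is
the fetched count after the update minus the fetched count before it. The
class HostDeltaSketch, the HostStats type, and the fetchedDelta method are
hypothetical names used only for illustration; they are not Nutch's HostDatum
API, and the sketch does not evaluate JEXL.

```java
import java.util.AbstractMap;
import java.util.HashMap;
import java.util.Map;

public class HostDeltaSketch {

    // Hypothetical stand-in for per-host counters; NOT Nutch's HostDatum class.
    public static final class HostStats {
        public final long fetched;
        public final long unfetched;

        public HostStats(long fetched, long unfetched) {
            this.fetched = fetched;
            this.unfetched = unfetched;
        }
    }

    // Mirrors the example expression: key "FetchedDelta",
    // value = current fetched count minus previous fetched count.
    public static Map.Entry<String, Long> fetchedDelta(HostStats previous, HostStats current) {
        return new AbstractMap.SimpleImmutableEntry<>(
                "FetchedDelta", current.fetched - previous.fetched);
    }

    public static void main(String[] args) {
        HostStats previous = new HostStats(10, 40); // round t-1
        HostStats current = new HostStats(25, 30);  // round t

        // Store the resulting pair in a metadata map, as the proposal describes.
        Map<String, Long> metadata = new HashMap<>();
        Map.Entry<String, Long> delta = fetchedDelta(previous, current);
        metadata.put(delta.getKey(), delta.getValue());

        System.out.println(metadata); // prints {FetchedDelta=15}
    }
}
```

A negative value would simply mean the fetched count dropped between rounds;
how such a value should be treated by maxCount expressions is left open here.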



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions

2018-01-17 Thread Semyon Semyonov (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Semyon Semyonov updated NUTCH-2481:
---
Component/s: (was: generator)






[jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions

2018-01-17 Thread Semyon Semyonov (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Semyon Semyonov updated NUTCH-2481:
---
Description: 
Allow the use of previous-step statistics (deltas of fetched, unfetched, etc.) 
in hostdb. The motivation is to use these statistics in generate with maxCount 
expressions.

 

The solution fills in the metadata of a HostDatum from a custom JEXL 
expression that uses two HostDatum objects: one from before the update 
(previousHostDatum) and one from after the update (currentHostDatum).

For example, to record the difference in the fetched count between rounds t 
and t-1, we can use the following expression:


 hostdb.deltaExpression
 {return new ("javafx.util.Pair","FetchedDelta", 
currentHostDatum.fetched - previousHostDatum.fetched);}


A pull request will be provided shortly.

  was:
Allow the use of previous-step statistics (deltas of fetched, unfetched, etc.) 
in hostdb. The motivation is to use these statistics in generate with maxCount 
expressions.

 

The 







[jira] [Updated] (NUTCH-2481) HostDatum deltas(previous step statistics) and Metadata expressions

2018-01-17 Thread Semyon Semyonov (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Semyon Semyonov updated NUTCH-2481:
---
Summary: HostDatum deltas(previous step statistics) and Metadata 
expressions  (was: HostDatum deltas(previous step statistics))

> HostDatum deltas(previous step statistics) and Metadata expressions
> ---
>
> Key: NUTCH-2481
> URL: https://issues.apache.org/jira/browse/NUTCH-2481
> Project: Nutch
>  Issue Type: Improvement
>  Components: generator, hostdb
>Reporter: Semyon Semyonov
>Priority: Minor
>
> Allow the use of previous-step statistics (deltas of fetched, unfetched,
> etc.) in hostdb. The motivation is to use these statistics in generate with
> maxCount expressions.
>  
> The 


