[ 
https://issues.apache.org/jira/browse/CRUNCH-69?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13461181#comment-13461181
 ] 

Matthias Friedrich commented on CRUNCH-69:
------------------------------------------

I think we have to anonymize these logs before committing them, or preferably 
make up artificial data. I don't know about US laws, but in Germany IP 
addresses are considered private data, it would be illegal to store them for 
longer than a few days, much less publish them.
                
> it would be useful to include sample data for AverageBytesByIP and 
> TotalBytesByIP examples
> ------------------------------------------------------------------------------------------
>
>                 Key: CRUNCH-69
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-69
>             Project: Crunch
>          Issue Type: Improvement
>          Components: Core
>    Affects Versions: 0.3.0
>            Reporter: Roman Shaposhnik
>            Assignee: Josh Wills
>            Priority: Minor
>         Attachments: access_log.zip
>
>
> Currently one has to wonder what kind of input to give those examples. It 
> would be very nice if there existed a canonical set of input files as part of 
> example's resources. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to