[
https://issues.apache.org/jira/browse/CRUNCH-69?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13461181#comment-13461181
]
Matthias Friedrich commented on CRUNCH-69:
------------------------------------------
I think we have to anonymize these logs before committing them, or preferably
make up artificial data. I don't know about US laws, but in Germany IP
addresses are considered private data, it would be illegal to store them for
longer than a few days, much less publish them.
> it would be useful to include sample data for AverageBytesByIP and
> TotalBytesByIP examples
> ------------------------------------------------------------------------------------------
>
> Key: CRUNCH-69
> URL: https://issues.apache.org/jira/browse/CRUNCH-69
> Project: Crunch
> Issue Type: Improvement
> Components: Core
> Affects Versions: 0.3.0
> Reporter: Roman Shaposhnik
> Assignee: Josh Wills
> Priority: Minor
> Attachments: access_log.zip
>
>
> Currently one has to wonder what kind of input to give those examples. It
> would be very nice if there existed a canonical set of input files as part of
> example's resources.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira