[ https://issues.apache.org/jira/browse/ACCUMULO-1417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13658584#comment-13658584 ]
Hudson commented on ACCUMULO-1417:
----------------------------------
Integrated in Accumulo-1.5-Hadoop-2.0 #118 (See
[https://builds.apache.org/job/Accumulo-1.5-Hadoop-2.0/118/])
ACCUMULO-1417 ngram ingester (Revision 1482896)
Result = SUCCESS
ecn :
Files :
* /accumulo/branches/1.5/examples/simple/src/main/java/org/apache/accumulo/examples/simple/mapreduce/NGramIngest.java
> data storage efficiency
> -----------------------
>
> Key: ACCUMULO-1417
> URL: https://issues.apache.org/jira/browse/ACCUMULO-1417
> Project: Accumulo
> Issue Type: Task
> Reporter: Eric Newton
> Assignee: Eric Newton
>
> David Medinets wrote to the user's list:
> {quote}
> Are there any published numbers for the amount of disk space used by
> Accumulo versus other products? I'm thinking some dataset like dbpedia
> or something from http://books.google.com/ngrams/datasets. If there is
> not such a comparison, what comparisons would you like to see? What
> about WordNet stored in CSV, MySQL, Cassandra, HBase, and Accumulo?
> WordNet is just a large set of CSV files so it would be a good
> candidate for this concept, I think.
> {quote}
> Good idea.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira