[
https://issues.apache.org/jira/browse/MAHOUT-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072492#comment-14072492
]
ASF GitHub Bot commented on MAHOUT-1500:
----------------------------------------
Github user avati commented on the pull request:
https://github.com/apache/mahout/pull/21#issuecomment-49946698
@cliffclick I have updated with review comments. Note that even though I
did away with Tuple2, I am using a new H2ODrm in place. Having the optional row
in the same Frame made things very confusing for a reviewer to instantly
identify if a given Frame was with row labels or without. H2ODrm has potential
future uses (extra members) as well.
I have also made the drmfromHdfs() api fall back to H2O parser (csv etc) if
a given file is not a sequence file format. So this opens up the possibility to
tweak the job pipeline to use csv files instead of seqfiles and gain in
performance and compression.
> H2O integration
> ---------------
>
> Key: MAHOUT-1500
> URL: https://issues.apache.org/jira/browse/MAHOUT-1500
> Project: Mahout
> Issue Type: Improvement
> Reporter: Anand Avati
> Fix For: 1.0
>
>
> Provide H2O backend for the Mahout DSL
--
This message was sent by Atlassian JIRA
(v6.2#6252)