[ 
https://issues.apache.org/jira/browse/MAHOUT-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14072492#comment-14072492
 ] 

ASF GitHub Bot commented on MAHOUT-1500:
----------------------------------------

Github user avati commented on the pull request:

    https://github.com/apache/mahout/pull/21#issuecomment-49946698
  
    @cliffclick I have updated with review comments. Note that even though I 
did away with Tuple2, I am using a new H2ODrm in place. Having the optional row 
in the same Frame made things very confusing for a reviewer to instantly 
identify if a given Frame was with row labels or without. H2ODrm has potential 
future uses (extra members) as well.
    
    I have also made the drmfromHdfs() api fall back to H2O parser (csv etc) if 
a given file is not a sequence file format. So this opens up the possibility to 
tweak the job pipeline to use csv files instead of seqfiles and gain in 
performance and compression.


> H2O integration
> ---------------
>
>                 Key: MAHOUT-1500
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1500
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Anand Avati
>             Fix For: 1.0
>
>
> Provide H2O backend for the Mahout DSL



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to