[ 
https://issues.apache.org/jira/browse/MAHOUT-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14059110#comment-14059110
 ] 

ASF GitHub Bot commented on MAHOUT-1500:
----------------------------------------

Github user avati commented on the pull request:

    https://github.com/apache/mahout/pull/21#issuecomment-48762753
  
    On Fri, Jul 11, 2014 at 10:46 AM, Pat Ferrel <[email protected]>
    wrote:
    
    > Exactly, thanks. I see you've done the same for CF also great.
    >
    > But this illustrates the problem. I need to change 50% of the tests in CF
    > cooccurrence because they were not catching a bug. Now the tests live in
    > two places h2o and spark. And unless I change the tests in both places the
    > build will break. The files look virtually identical except for the
    > imports, which is good. If that's true, I wonder if we could we use a 
Scala
    > macro to keep the code all in one file? We might be able to take the same
    > code and produce two artifacts that are both run at build time. That would
    > reduce the load on devs for this kind of thing.
    >
    As we discussed on another email thread, I'm independently working on how
    to move tests back into math-scala. That effort should address this concern
    I think?
    
     However currently almost all IO code is spark specific. You must have
    > re-implemented drm.writeDrm for h2o. Until this is *not* a
    > re-implementation but is engine neutral we are going to have a growing
    > problem.
    >
    Why is this a problem? drm.writeDrm() accepts an engine neutral path, like
    "hdfs://.." or "file://..." and the content of what gets written is the
    well defined sequencefile format no matter what the runtime backend is. And
    as long as the path and file content are engine neutral, why should
    pipeline code worry how the IO implementation is done? Again, am I missing
    something?


> H2O integration
> ---------------
>
>                 Key: MAHOUT-1500
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1500
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Anand Avati
>             Fix For: 1.0
>
>
> Provide H2O backend for the Mahout DSL



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to