[ 
https://issues.apache.org/jira/browse/TINKERPOP-1027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15046009#comment-15046009
 ] 

ASF GitHub Bot commented on TINKERPOP-1027:
-------------------------------------------

Github user okram commented on the pull request:

    
https://github.com/apache/incubator-tinkerpop/pull/172#issuecomment-162709345
  
    I made it so that SparkGremlin works like Spark JobServer 
(https://github.com/spark-jobserver/spark-jobserver/). It ensures that RDDs are 
not garbage collected by maintaining a static `Spark` class that holds a 
`ConcurrentHashMap` of RDDs. Thus, Spark is like a "file system" in that RDDs 
can be `ls()`, `rm()`, etc. This was necessary to get "slow" `mvn clean 
install` to build correctly where RDDs are NOT GC'd by Spark context cleaner.


> Merge view prior to writing graphRDD to output format/rdd
> ---------------------------------------------------------
>
>                 Key: TINKERPOP-1027
>                 URL: https://issues.apache.org/jira/browse/TINKERPOP-1027
>             Project: TinkerPop
>          Issue Type: Bug
>          Components: hadoop
>    Affects Versions: 3.1.0-incubating
>            Reporter: Marko A. Rodriguez
>            Assignee: Marko A. Rodriguez
>             Fix For: 3.1.1-incubating
>
>
> [~dalaro] noted that DSEGraph was not happy with the current {{graphRDD}} 
> model when it comes to writing. To make it happy, the view merge needs to 
> happen prior to {{graphRDD}} output. Thus, move the {{mapReduceRDD}} view 
> merge to before {{graphRDD}} writing.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to