Re: Hi, guys, does anyone use Spark in finance market?

2016-09-01 Thread Adam Roberts
Hi, yes, there's definitely a market for Apache Spark and financial institutions, I can't provide specific details but to answer your survey: "yes" and "more than a few GB!" Here are a couple of examples showing Spark with financial data, full disclosure that I work for IBM, I'm sure there are

Re: Spark 2.0.0 - Java vs Scala performance difference

2016-09-01 Thread Adam Roberts
On Java vs Scala: Sean's right that behind the scenes you'll be calling JVM based APIs anyway (e.g. sun.misc.unsafe for Tungsten) and that the vast majority of Apache Spark's important logic is written in Scala. Would be an interesting experiment to write the same functioning program using the

Re: Spark build 1.6.2 error

2016-08-31 Thread Adam Roberts
Looks familiar, got the zinc server running and using a shared dev box? ps -ef | grep "com.typesafe zinc.Nailgun", look for the zinc server process, kill it and try again, Spark branch-1.6 builds great here from scratch, had plenty of problems thanks to running the zinc server here (started wit

Re: Memory allocation error with Spark 1.5, HashJoinCompatibilitySuite

2015-08-24 Thread Adam Roberts
Hi, I'm regularly hitting "Unable to acquire memory" problems only when trying to use overflow pages when running the full set of Spark tests across different platforms. The machines I'm using all have well over 10 GB of RAM and I'm running without any changes to the pom.xml file. Standard 3 GB Jav