Hi, I think we should extend the GoraCI benchmarks. I would like to benchmark:
* Hadoop Map/Reduce
* Spark
* Hadoop Map/Reduce via Gora
* Spark via Gora

To that end, I would like to work on two types of dataset:

1) Data-intensive
2) CPU-intensive

It would be nice to benchmark them via GoraCI. Data-intensive and CPU-intensive algorithms are two different kinds of algorithms, which makes it easier to compare performance results. It would also be nice to compare performance with and without Gora (this is not a must).

By the way, we should improve the documentation. For example, it says "*Currently these are updated to jackson-core-asl-1.4.2.jar and jackson-mapper-asl-1.4.2.jar. For details see HADOOP-6945*", but I think this is not a problem with the current Avro version in Gora.

I'm ready to help improve GoraCI.

Kind Regards,
Furkan KAMACI

On Thu, Oct 29, 2015 at 11:37 PM, Lewis John Mcgibbney <[email protected]> wrote:

> Hi Folks,
> OK, so I've been chatting with a few folk recently. Namely Furkan, Namrata
> and Sujen (CC'd) regarding kicking off GoraCI properly on the Infra
> Rackspace sub account that Gora was given some time ago.
> It seems our login credentials require renewal. I hope that this will be
> resolved soon.
> https://issues.apache.org/jira/browse/INFRA-10683
> In the meantime, lets take this opportunity to kick back off discussion on
> GoraCI e.g. what we need to do to get it up and running, what use cases are
> out there, etc.
> Once we can get a login for the Rackspace account we will be a step closer
> however we still need to advance the software provisioning side of GoraCI
> which is documented here
> https://issues.apache.org/jira/browse/GORA-379
> Thanks
> Lewis
>
> --
> *Lewis*

