Hi,

I think that we should extend the benchmark of GoraCI. I would like to make
a benchmark of:

* Hadoop Map/Reduce
* Spark
* Hadoop Map/Reduce via Gora
* Spark via Gora

For that aim, I would like to work on two types of dataset:

1) Data-intensive
2) CPU-intensive

It could be nice to benchmark them via GoraCI. Data-intensive and
CPU-intensive algorithms are two different kind of algorithms which makes
easier to compare performance results. On the other hand it could be nice to
compare the performance result of using Gora or not (this is not a must).

By the way, we should improve documentation i.e. it says "*Currently these
are updated to jackson-core-asl-1.4.2.jar and jackson-mapper-asl-1.4.2.jar.
For details see HADOOP-6945*"  but I think that it is not a problem with
current Avro version at Gora.

I'm ready to help improving GoraCI.

Kind Regards,
Furkan KAMACI

On Thu, Oct 29, 2015 at 11:37 PM, Lewis John Mcgibbney <
[email protected]> wrote:

> Hi Folks,
> OK, so I've been chatting with a few folk recently. Namely Furkan, Namrata
> and Sujen (CC'd) regarding kicking off GoraCI properly on the Infra
> Rackspace sub account that Gora was given some time ago.
> It seems our login credentials require renewal. I hope that this will be
> resolved soon.
> https://issues.apache.org/jira/browse/INFRA-10683
> In the meantime, lets take this opportunity to kick back off discussion on
> GoraCI e.g. what we need to do to get it up and running, what use cases are
> out there, etc.
> Once we can get a login for the Rackspace account we will be a step closer
> however we still need to advance the software provisioning side of GoraCI
> which is documented here
> https://issues.apache.org/jira/browse/GORA-379
> Thanks
> Lewis
>
>
> --
> *Lewis*
>

Reply via email to