Re: Apache Gora and jOOQ

2013-11-18 Thread Henry Saputra
Hi Lukas,

Thanks so much for your email and your followup =)

I personally would prefer to go with option one for now. It would give
us immediate support for SQL module back for Gora even for non
commercial databases.

As Renato has mentioned too, we need to send VOTE message to dev
community and user to make sure Gora community agree about the
approach and make their opinion heard.

CC user list

- Henry


On Mon, Nov 18, 2013 at 6:33 AM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Hey Lukas,

 N.B. Including dev@gora list here to keep everyone in the loop.

 Thanks for keeping this thread alive. I personally would like to apologize
 for not dropping in on this one earlier. My resources have been focused
 elsewhere as of late and I have not had the JOOQ initiative at the top of
 my TODO list.
 Regarding your points/options please see below


 On Fri, Nov 1, 2013 at 7:42 PM, Lukas Eder lukas.e...@datageekery.comwrote:

 1. Apache Gora links and embeds only the jOOQ Open Source Edition, which
 is available and will continue to be available from Maven Central under the
 terms of the ASL 2.0. For Apache Gora, there are no additional license
 terms. For your end users, there aren't any additional terms either, if
 they're using Apache Gora with an Open Source database. If end users wish
 to use Apache Gora with SQL Server, for instance, they would need to
 purchase a license from Data Geekery and replace jOOQ Open Source Edition
 binaries with jOOQ Professional Edition binaries. In this case, Data
 Geekery would welcome but not require a backlink indicating that Apache
 Gora is based on jOOQ.


 This sounds like the most realistic option IMHO. At the end of the day we
 (Gora) as an Apache top level project are in no position to change the
 terms and/or add to the Apache License v2.0 in any way. What is slightly
 annoying right now is that a good re-write of the gora-sql module to
 implement JOOQ is extremely attractive for us at Gora but it appears the
 development cycles are not there right now!



 2. In addition to the above, Data Geekery and Apache Gora could make a
 separate agreement, which grants Apache Gora a perpetual license to use the
 jOOQ Enterprise Edition for integration testing Apache Gora against SQL
 Server and other commercial databases, but not to distribute, sublicense or
 make available the jOOQ Enterprise Edition to end users. End users may
 again use Apache Gora with the jOOQ Open Source Edition along with Open
 Source databases, or they may purchase a commercial license from Data
 Geekery. In this case, Data Geekery would require a prominent backlink
 indicating that Apache Gora is based on jOOQ. This is how IntelliJ or
 YourKit Profiler handle free commercial licenses for non-commercial OSS
 projects as well.


 This is also very attractive as it would enable us to verify gora
 compliance with commercial RDBMS... always a bonus to promote Gora to more
 people.



 3. Data Geekery and Apache Gora will make a separate agreement, which
 grants Apache Gora a perpetual license to use and distribute the jOOQ
 Enterprise Edition, but not to sublicense jOOQ or to make the jOOQ API or
 binaries available to end users (e.g. by embedding the jOOQ jar file). Of
 course, there are technical ways to circumvent this restriction and
 extract jOOQ from Apache Gora. Hence, there would need to be additional
 terms to YOUR license, clearly indicating that only Apache Gora code is ASL
 2.0 licensed, whereas jOOQ binaries are jOOQ-licensed. I suspect that this
 would make Apache Gora dual-licensed, as well...? This case would be based
 on a discounted license paid by Apache Gora backers. Your end users would
 not need to license jOOQ.

 Personally, I feel that 1) or 2) is the easiest for you guys to handle.


 I agree with you here Lukas... I think that the first two options are most
 likely at this stage. We would pull in the ASL v2.0 licensed JOOQ maven
 artifacts as part of our gora-sql pom.xml, use the open source code and
 leave it down to module users if they wish to pursue a separate agreement
 with DataGeekery.



 Let me know what you think,
 Lukas

 From my point of view this is something which sounds like a positive thing
 to progress with but as I said above my time and commitments are elsewhere
 right now. I am not in a position to begin writing the SQL module back in
 to Gora right now :(


Patch available flag in JIRA or ping for review on patches

2013-12-03 Thread Henry Saputra
Hi Gora contributors,

If you have a patch for Jira tickets please set the patch available flag or
ping the dev@ list if there is no response for some time (like 72 hours).

Sometimes people are busy and get tardy a bit.
Thank you again for your awesome support and contributions to Apache Gora.

- Henry


Re: [ANNOUNCE] Dublin NoSQL Meetup – Apache Gora and the Oracle NoSQL database

2014-01-01 Thread Henry Saputra
Hey Lewis, how was the presentation going?

On Thursday, December 12, 2013, Lewis John Mcgibbney wrote:

 Hi Folks,
 A quick post here to promote an event Apostolos Giannakidis (Apache Gora's
 GSoC student this year) and myself with be speaking at in Dublin this
 coming Monday.
 Event info and registration can be found below

 http://tcubedublin.com/events/dublin-nosql-meetup-apache-gora-and-the-oracle-nosql-database-customize/
 Thanks and if you are able to attend... see you there.
 Best

 --
 *Lewis*



Re: Slides about ApacheCon 2014

2014-04-15 Thread Henry Saputra
Wow! Thanks for sharing the slides, guys

On Mon, Apr 14, 2014 at 7:07 AM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Thanks v much Renato they are great slides and it was an excellent
 turnout... MUCH better than mine :)
 Please see
 https://twitter.com/hectorMcSpector/status/453654190938537984/photo/1
 I also updated the Gora site with both our slides so check them out if you
 get the time folks.

 FYI folks, we will be holding ApacheCon Europe 2014 in Budapest, Hungary,
 November 17th - 21st, 2014. It would be REALLY excellent to have the project
 represented there again this year.
 All for now

 On Mon, Apr 14, 2014 at 12:22 PM, user-digest-h...@gora.apache.org wrote:



 From: Renato Marroquín Mogrovejo renatoj.marroq...@gmail.com
 To: Gora Dev d...@gora.apache.org, Gora User user@gora.apache.org
 Cc:
 Date: Mon, 14 Apr 2014 13:22:13 +0200
 Subject: Slides about ApacheCon 2014

 Hi everyone,

 Here are the slides I presented @ApacheCon 2014.
 It was actually a good talk with quite a few people interested in the
 project (around 30 people). This was really interesting as there were people
 from different projects interested on seeing how we could integrate Gora
 with other projects or simply about the possibility of using Gora.
 There are some specific issues that we should discuss to make Gora better
 and a successful project, but I will write another email for that :)
 Enjoy.






Re: Slides about ApacheCon 2014

2014-04-15 Thread Henry Saputra
Renato, just small comment that you should probably use PMC instead of
PPMC since Gora is now top level ASF project =)

- Henry

On Mon, Apr 14, 2014 at 4:22 AM, Renato Marroquín Mogrovejo
renatoj.marroq...@gmail.com wrote:
 Hi everyone,

 Here are the slides I presented @ApacheCon 2014.
 It was actually a good talk with quite a few people interested in the
 project (around 30 people). This was really interesting as there were
 people from different projects interested on seeing how we could integrate
 Gora with other projects or simply about the possibility of using Gora.
 There are some specific issues that we should discuss to make Gora better
 and a successful project, but I will write another email for that :)
 Enjoy.


 Renato M.

 [1] http://www.slideshare.net/RenatoJavierMarroqun/giraphgora-in-apachecon14


Re: [VOTE] Release Apache Gora 0.4

2014-04-16 Thread Henry Saputra
Hi Lewis,

Thanks so much for driving the release.

Looks like the signatures are generated also for the hash files. I am seeing

gora-cassandra-0.4-sources.jar.asc.asc
gora-cassandra-0.4-sources.jar.asc.asc.md5
gora-cassandra-0.4-sources.jar.asc.asc.sha1

and also somehow the signature file has accompanying hash files:

gora-cassandra-0.4-sources.jar.asc.md5
gora-cassandra-0.4-sources.jar.asc.sha1

Extra files generated I suppose?

- Henry


On Wed, Apr 16, 2014 at 1:12 PM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Hi Folks,

 I am really glad to be opening a VOTE to draw a line under the 0.4
 development drive and to release the 0.4 artifacts.
 We solved a whole bunch of issues: http://s.apache.org/4lx

 SVN source tag (r1588038):
 https://svn.apache.org/repos/asf/gora/tags/apache-gora-0.4/

 Staging repo:
 https://repository.apache.org/content/repositories/orgapachegora-1000/

 PGP release keys (signed using 48BAEBF6):
 https://svn.apache.org/repos/asf/gora/trunk/KEYS

 Release Artifacts: https://dist.apache.org/repos/dist/dev/gora/

 A note on VOTE'ing.

1. The code you are VOTE'ing to release is the SVN source tag r1588038.
2. The staging repos publishes the container artifacts are consumable
Maven artifacts on Maven central.
3. Finally, the release artifacts are hosted on the Apache distribution
server and cloned by mirror servers.

 Vote will be open for 72 hours. Again I would like to thank everyone that
 contributed to Gora. here's to plenty more releases.
 Best
 Lewis

 [ ] +1, let's get it released!!!
 [ ] +/-0, fine, but consider to fix few issues before...
 [ ] -1, nope, because... (and please explain why)

 --
 *Lewis*


Re: [VOTE] Release Apache Gora 0.4

2014-04-16 Thread Henry Saputra
Ah ok, I will just verify the signatures for main source files

Thanks for the explanation, Lewis


- Henry

On Wed, Apr 16, 2014 at 2:22 PM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Hi Henry,
 These signatures are auto generated by maven sign plugin and I am not too
 keen on them being present in the staging repos.
 There is however, very little I can do about them as they are pushed to the
 staging repos and I cannot intervene in that process.
 hth
 Lewis


 On Wed, Apr 16, 2014 at 10:18 PM, Henry Saputra henry.sapu...@gmail.com
 wrote:

 Hi Lewis,

 Thanks so much for driving the release.

 Looks like the signatures are generated also for the hash files. I am
 seeing

 gora-cassandra-0.4-sources.jar.asc.asc
 gora-cassandra-0.4-sources.jar.asc.asc.md5
 gora-cassandra-0.4-sources.jar.asc.asc.sha1

 and also somehow the signature file has accompanying hash files:

 gora-cassandra-0.4-sources.jar.asc.md5
 gora-cassandra-0.4-sources.jar.asc.sha1

 Extra files generated I suppose?

 - Henry


 On Wed, Apr 16, 2014 at 1:12 PM, Lewis John Mcgibbney
 lewis.mcgibb...@gmail.com wrote:
  Hi Folks,
 
  I am really glad to be opening a VOTE to draw a line under the 0.4
  development drive and to release the 0.4 artifacts.
  We solved a whole bunch of issues: http://s.apache.org/4lx
 
  SVN source tag (r1588038):
  https://svn.apache.org/repos/asf/gora/tags/apache-gora-0.4/
 
  Staging repo:
  https://repository.apache.org/content/repositories/orgapachegora-1000/
 
  PGP release keys (signed using 48BAEBF6):
  https://svn.apache.org/repos/asf/gora/trunk/KEYS
 
  Release Artifacts: https://dist.apache.org/repos/dist/dev/gora/
 
  A note on VOTE'ing.
 
 1. The code you are VOTE'ing to release is the SVN source tag
  r1588038.
 2. The staging repos publishes the container artifacts are consumable
 Maven artifacts on Maven central.
 3. Finally, the release artifacts are hosted on the Apache
  distribution
 server and cloned by mirror servers.
 
  Vote will be open for 72 hours. Again I would like to thank everyone
  that
  contributed to Gora. here's to plenty more releases.
  Best
  Lewis
 
  [ ] +1, let's get it released!!!
  [ ] +/-0, fine, but consider to fix few issues before...
  [ ] -1, nope, because... (and please explain why)
 
  --
  *Lewis*




 --
 Lewis


Re: [VOTE] Release Apache Gora 0.4

2014-04-19 Thread Henry Saputra
Just realized Lewis already put the artifacts in the staging [1] directory

Sorry guys for the confusion, mental lapse from my side, will validate
the release this weekend.

- Henry

[1] https://dist.apache.org/repos/dist/dev/gora/


On Fri, Apr 18, 2014 at 8:12 AM, Henry Saputra henry.sapu...@gmail.com wrote:
 Thanks for the clarifications, Lewis.

 Just to confirm, just like our prev releases we will just publish [1]
 and [2] to the ASF dist folders? I am planning to verify sig and hash
 for those files and build/ run the binary from [1].

 - Henry

 On Thu, Apr 17, 2014 at 4:24 AM, Lewis John Mcgibbney
 lewis.mcgibb...@gmail.com wrote:
 Hi Henry,


 On Thu, Apr 17, 2014 at 4:48 AM, Henry Saputra henry.sapu...@gmail.com
 wrote:

 Lewis,

 What is the diff between gora-0.4-source-release.zip and
 gora-0.4-src.tar.gz files?


 So just to clarify what Henry is asking here, we have a number of artifacts
 within the staging repos which we will be released as Maven artifacts.
 Namely gora-0.4-source-release.zip [0], gora-0.4-src.tar.gz [1],
 gora-0.4-src.zip [2] and individual module -javadoc.jar, -sources.jar,
 src.tar.gz, src.zip, tests.jar, .jar and finally .pom (along with
 corresponding signatures for each file.)

 The difference between [0] [1] and [2] is trivial but important to note
 none-the-less.

 [0] is an aggregation of the sources for all modules bundled into an
 aggregate .zip project. Effectively this is what is a bundle of all sources
 included in the .sources.jar Maven artifacts for each module. Such artifacts
 allows us to link and assimilate project sources against code in order to
 see 3rd party/dependency source within Eclipse or another IDE. This can be
 done by linking the sources.jar to the given artifact in the 'project build'
 --  'sources' tab of Eclipse. To be openly honest with you Henry we most
 likely don't need to publish this artifact so maybe we can remove it in
 future builds.
 Both [1] and [2] are the actual release artifacts which we refer to (along
 with their relevant signatures) on our Downloads page. We publish these on
 Maven central for convenience. These are the only two artifacts that we
 actually host on the Apache Distribution server and mirror to 3rd party
 Mirror Servers.

 For other who wish to review the candidate, here is a really helpful link to
 incubator guidelines
 http://incubator.apache.org/guides/releasemanagement.html#check-list


 Thanks, I hope that this has cleared up a bit of the release artifact puzzle
 ;)
 Lewis

 [0]
 https://repository.apache.org/content/repositories/orgapachegora-1000/org/apache/gora/gora/0.4/gora-0.4-source-release.zip
 [1]
 https://repository.apache.org/content/repositories/orgapachegora-1000/org/apache/gora/gora/0.4/gora-0.4-src.tar.gz
 [2]
 https://repository.apache.org/content/repositories/orgapachegora-1000/org/apache/gora/gora/0.4/gora-0.4-src.zip


Re: Gora on Black Duck Softwares New Blog

2014-05-06 Thread Henry Saputra
Great job Lewis, I like the blog.

Will definitely share the blog to the world :)

On Tuesday, May 6, 2014, Lewis John Mcgibbney lewis.mcgibb...@gmail.com
wrote:

 Hi Folks,
 We were successful in publicising Gora and a bit about our community on
 Black Duck Softwares new Open Source Delivers blog :)

 http://osdelivers.blackducksoftware.com/2014/05/06/what-does-it-really-take-to-build-a-community-around-code/
 If your able please send this out to social media...
 We are the first Apache project to write a guest post here so this is a
 pretty big deal.
 Thanks
 Lewis



Re: PersistentDatum{Reader,Writer} ?

2014-05-20 Thread Henry Saputra
Cross post this to dev@ list since some people involve directly in
GORA_94 would be able to help.

- Henry

On Tue, May 20, 2014 at 6:38 AM, Damien Raude-Morvan
draz...@drazzib.com wrote:
 Hi all,

 It seems that PersistentDatumReader and PersistentDatumReaderWriter are now
 gone in master branch. As I've find in git log, they have been dropped in
 GORA-321 (GORA_94 branch merge to master).

 Those classes were really useful to me to serialize Gora persistent objects
 with all their dirty information (for instance to send them over the wire on
 AMQP).

 Is there issues with those classes and Avro 1.7.x ? Is there a way to
 restore this feature ?

 Regards,
 --
 Damien


Re: PersistentDatum{Reader,Writer} ?

2014-05-20 Thread Henry Saputra
Sorry I had a typo with your name in my last reply, I meant Damien

- Henry

On Tue, May 20, 2014 at 8:29 AM, Henry Saputra henry.sapu...@gmail.com wrote:
 Hi Damine,

 I believe the dirty bit info should be used only internally by Gora to
 manage the data, is there particular reason you need to include it to
 the wire to external component?

 - Henry

 On Tue, May 20, 2014 at 6:38 AM, Damien Raude-Morvan
 draz...@drazzib.com wrote:
 Hi all,

 It seems that PersistentDatumReader and PersistentDatumReaderWriter are now
 gone in master branch. As I've find in git log, they have been dropped in
 GORA-321 (GORA_94 branch merge to master).

 Those classes were really useful to me to serialize Gora persistent objects
 with all their dirty information (for instance to send them over the wire on
 AMQP).

 Is there issues with those classes and Avro 1.7.x ? Is there a way to
 restore this feature ?

 Regards,
 --
 Damien


Re: PersistentDatum{Reader,Writer} ?

2014-05-20 Thread Henry Saputra
Hi Damine,

I believe the dirty bit info should be used only internally by Gora to
manage the data, is there particular reason you need to include it to
the wire to external component?

- Henry

On Tue, May 20, 2014 at 6:38 AM, Damien Raude-Morvan
draz...@drazzib.com wrote:
 Hi all,

 It seems that PersistentDatumReader and PersistentDatumReaderWriter are now
 gone in master branch. As I've find in git log, they have been dropped in
 GORA-321 (GORA_94 branch merge to master).

 Those classes were really useful to me to serialize Gora persistent objects
 with all their dirty information (for instance to send them over the wire on
 AMQP).

 Is there issues with those classes and Avro 1.7.x ? Is there a way to
 restore this feature ?

 Regards,
 --
 Damien


Re: Apache Camel now using Gora

2014-05-29 Thread Henry Saputra
Awesome job Ioannis!

- Henry

On Thu, May 29, 2014 at 2:46 PM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Hi Folks,

 Since the long time that Ioannis contacted us and proposed his Camel-Gora
 work they have been working on it and recently announced that Camel trunk
 now has a Gora components [0].
 I am going to update the Gora site with this news as it is pretty huge...
 from what I understand Camel is an extremely busy community with a large
 codebase... Gora integration is excellent.
 Thank you Ioannis for your hard work and congratulations.
 Lewis

 [0] https://issues.apache.org/jira/browse/CAMEL-4817

 --
 Lewis


Re: Gora mongodb tutorial is not find in here.

2014-06-23 Thread Henry Saputra
HI Feng Lu,

Thanks for reporting it.

Seemed like we are missing the doc page for the mongodb data store.
I know there is a JIRA about adding more info about extra params to
configure mongodb.

But we need to page shim for mongodb data store to avoid 404 error.

If no one beats me to it I will the shim page by EOD today.

- Henry

CCing dev@ list for FYI

- Henry

On Mon, Jun 23, 2014 at 12:52 AM, feng lu amuseme...@gmail.com wrote:
 Hi All

 I can not open ths gora mongodb tutorial. Does this page exist or not?

 http://gora.apache.org/current/gora-mongodb.html

 Thanks.
 --
 Don't Grow Old, Grow Up... :-)


Re: Gora mongodb tutorial is not find in here.

2014-06-23 Thread Henry Saputra
I added bare page for mongodb so at least it wont return 404.

The issue I believe is https://issues.apache.org/jira/browse/GORA-335

- Henry

On Mon, Jun 23, 2014 at 10:56 AM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Hi folks.  This documentation is not completed. I assigned the JIRA issue to
 Damien (drazzib) for Karma.

 On Jun 23, 2014 10:10 AM, Henry Saputra henry.sapu...@gmail.com wrote:

 HI Feng Lu,

 Thanks for reporting it.

 Seemed like we are missing the doc page for the mongodb data store.
 I know there is a JIRA about adding more info about extra params to
 configure mongodb.

 But we need to page shim for mongodb data store to avoid 404 error.

 If no one beats me to it I will the shim page by EOD today.

 - Henry

 CCing dev@ list for FYI

 - Henry

 On Mon, Jun 23, 2014 at 12:52 AM, feng lu amuseme...@gmail.com wrote:
  Hi All
 
  I can not open ths gora mongodb tutorial. Does this page exist or not?
 
  http://gora.apache.org/current/gora-mongodb.html
 
  Thanks.
  --
  Don't Grow Old, Grow Up... :-)


Re: Gora Support for MongoDB

2014-07-25 Thread Henry Saputra
I believe the MongoDB support is available in next release 0.5.

But you can build from source to try it out.

As for doc to use MongoDB store, unfortunately it is still pending and
being tracked by this JIRA [1]

CC dev@ list for FYI

- Henry

[1] https://issues.apache.org/jira/browse/GORA-335

On Thu, Jul 24, 2014 at 8:16 PM, Muhamad Muchlis tru3@gmail.com wrote:
 Hi All,

 Since 04 june 2014, Gora Support for mongodb. So What release version gora
 have to download ?

 Is there anything that can be shared to me  tutorial Gora and MongoDB?

 Advice me please.


 as the information: I'm trying to integrate Nutch with MongoDB


 Best Regard's


 M.Muchlis


Re: Info on GORA-346 (Nutch 2 on CDH5, Hadoop 2)

2014-09-06 Thread Henry Saputra
Cross-posting this to dev@ list since it may interest @lewis and @renato.

I have reviewed the patch and seemed like Lewis is working on merging
the fix soon [1]

Thanks for nudging this Eddy

- Henry

[1] https://issues.apache.org/jira/browse/GORA-346

On Thu, Sep 4, 2014 at 3:58 AM, Edoardo Causarano
edoardo.causar...@gmail.com wrote:
 Hi All,

 I'm building a Nutch 2 installation on top of CDH5 and need to rebuild the 
 tool and Gora against Hadoop 2. I found some patches in GORA-346 and tried 
 applying them to master and gora 0.4 but get some patch failures.

 Any indication of when will CORA-346 be merged into master or if there's 
 already a branch I can checkout?


 TIA,
 Eddy


Re: [VOTE] Apache Gora 0.5 Release Candidate #1

2014-09-17 Thread Henry Saputra
Sig files look good
Hash files look good
NOTICE, LICENSE files are good
Source artifacts compiled and tests were passed.
Tutorial looks good

+1

Thx for managing the release, Lewis

- Henry

On Tue, Sep 16, 2014 at 8:00 AM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Hi Folks,
 I am very happy to get a VOTE out for Apache Gora 0.5 Release Candidate.

 We solved 44 issues: http://s.apache.org/0.5report

 Git source tag (c2d58dd1440b4e2c66c1f40a4b6d4169d79bb6d3):
 http://s.apache.org/Eyv

 Staging repo:
 https://repository.apache.org/content/repositories/orgapachegora-1001

 Source Release Artifacts: https://dist.apache.org/repos/dist/dev/gora/0.5/

 PGP release keys (signed using 48BAEBF6): http://gora.apache.org/dist/KEYS

 Vote will be open for 72 hours.
 Thank you to everyone that is able to VOTE as well as everyone that
 contributed to Apache Gora 0.5.

 [ ] +1, let's get it released!!!
 [ ] +/-0, fine, but consider to fix few issues before...
 [ ] -1, nope, because... (and please explain why)

 --
 Lewis


Re: [ANNOUNCEMENT] New PMC Memebr and Committer Talat Uyarer

2015-01-28 Thread Henry Saputra
Congrats Talat!

On Mon, Jan 26, 2015 at 10:45 AM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Hi Everyone,
 The Gora PMC are very pleased to announce Talat as a new PMC member and
 committer. This comes after a number of contributions from Talat to the Gora
 project.
 @Talat, please feel free to introduce yourself further.
 Thanks, have a great day folks.
 Lewis


 --
 Lewis


Re: [VOTE] Apache Gora 0.6 Release Candidate

2015-02-13 Thread Henry Saputra
Signature file looks good
Hash file looks good
NOTICE.txt file looks good
LICENSE.txt file looks good
Source compiled
Tests passed
No 3rd party

+1

- Henry

On Thu, Feb 12, 2015 at 11:34 PM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Hi Folks,
 I am very happy to get a VOTE out for Apache Gora 0.6 Release Candidate.

 We solved 17 issues: http://s.apache.org/gora-0.6

 Git source tag (68ce474b6813e9e5e8b021e3d255f90ddd7a3eb3):
 http://s.apache.org/gora0.6tag

 Staging repo:
 https://repository.apache.org/content/repositories/orgapachegora-1002

 Source Release Artifacts: https://dist.apache.org/repos/dist/dev/gora/0.6/

 PGP release keys (signed using 48BAEBF6): http://gora.apache.org/dist/KEYS

 Vote will be open for 72 hours.
 Thank you to everyone that is able to VOTE as well as everyone that
 contributed to Apache Gora 0.6.

 [ ] +1, let's get it released!!!
 [ ] +/-0, fine, but consider to fix few issues before...
 [ ] -1, nope, because... (and please explain why)

 p.s. Here's my +1


 --
 Lewis


Re: Apply request for Gsoc

2015-03-10 Thread Henry Saputra
Thanks for the helping out fellow GSoC, Furkan.

Looking forward seeing your ideas and work with Apache Gora.

- Henry

On Tue, Mar 10, 2015 at 7:11 AM, Furkan KAMACI furkankam...@gmail.com wrote:
 Hi Nurettin,

 As a GSoC 2014 student for another Apache project and as an engineer who
 wants to contribute to Apache Gora project and wants to apply for GSoC 2015
 for Apache Gora let me explain you something.

 First of all, I suggest you check the issues listed at here:
 https://issues.apache.org/jira/browse/GORA

 You can filter unresolved of them and you can check whether there is an
 issue which has a label indicates that it is a GSoC issue (such kind of
 labels are not mandatory for GSoC issues but some other Apache projects use
 that convention).

 On the other hand you can suggest a feature for Apache Gora as a GSoC task
 but it seems that this is not your case.

 When you find an issue it becomes easy to apply for GSoC. You can start to
 discuss about it and get some ideas before you prepare a suggestion for your
 GSoC roadmap.

 Last choice is you can kindly ask for Jira tickets which are available and
 suitable for GSoC. You can also communicate with commiters to discuss about
 the issues.

 When idea part becomes clear in your mind you can apply to GSoC from Google
 Melange's website:
 https://www.google-melange.com/gsoc/homepage/google/gsoc2015

 Apache user mail lists are for asking questions, sharing knowledge, and
 discussing issues (you are welcome to discuss about GSoC). Google Melange is
 the website for applying GSoC.

 Kind Regards,
 Furkan KAMACI



 On Tue, Mar 10, 2015 at 1:08 PM, Nurettin Bakkal nurettinbak...@gmail.com
 wrote:

 Hi,

 I want to apply to GSOC.

 Kind Regards,
 Nurettin BAKKAL




Re: Gora Spark Backend Support (GORA-386) and Apache Crunch

2015-03-24 Thread Henry Saputra
HI Furkan,

Yes, you are right. In the code execution for Spark or Flink, Gora
should be part of the data ingest and storing.

So, is  the idea is to make data store in Spark to access Gora instead
of default store options?

- Henry

On Mon, Mar 23, 2015 at 11:34 AM, Furkan KAMACI furkankam...@gmail.com wrote:
 Hi Henry,

 So, as far as I understand instead of wrapping Apache Spark within Gora with
 full functionality, I have to wrap its functionality of storing and
 accessing data. I mean one will use Gora input/output format  and at the
 backend it will me mapped to RDD and will able to run Map/Reduce via Apache
 Spark etc. over Gora.

 Kind Regards,
 Furkan KAMACI

 On Mon, Mar 23, 2015 at 8:21 PM, Henry Saputra henry.sapu...@gmail.com
 wrote:

 Integration with Gora will mostly in the data ingest portion of the flow.

 Distributed processing frameworks like Spark, or Flink, already
 support Hadoop input format as data sources so Gora should be able to
 be used directly with Gor input format.

 The interesting portion is probably tighter integration such as custom
 RDD or custom Data Manager to store and get data from Gora directly.

 - Henry

 On Sat, Mar 21, 2015 at 1:42 PM, Lewis John Mcgibbney
 lewis.mcgibb...@gmail.com wrote:
  Henry mentored Crunch through incubation... Maybe he can tell you more
  context.
  For me, Gora is essentially an extremely easy storage abstraction
  framework.
  I do not currently use the Query API meaning that the analysis of data
  is
  delegated to Gora data store.
  This is my current usage of the code base.
 
 
  On Saturday, March 21, 2015, Furkan KAMACI furkankam...@gmail.com
  wrote:
 
  Hi Lewis,
 
  I am talking in context of GORA-418 and GORA-386, we can say GSoC. I've
  talked with Talat about design of that implementation. I just wanted to
  check other projects for does any of them such kind of feature.
 
  Here is what is in my mind for Apache Gora for Spark supoort:
  developing a
  layer which abstracts functionality of Spark, Tez, etc (GORA-418).
  There
  will be implementations for each of them (and Spark will be one of
  them:
  GORA-386)
 
  i.e. you will write a word count example as Gora style, you will use
  one
  of implementation and run it (as like storing data at Solr or Mongo via
  Gora).
 
  When I check Crunch I realize that:
 
  Every Crunch job begins with a Pipeline instance that manages the
  execution lifecycle of your data pipeline. As of the 0.9.0 release,
  there
  are three implementations of the Pipeline interface:
 
  MRPipeline: Executes a pipeline as a series of MapReduce jobs that can
  run
  locally or on a Hadoop cluster.
  MemPipeline: Executes a pipeline in-memory on the client.
  SparkPipeline: Executes a pipeline by running a series of Apache Spark
  jobs, either locally or on a Hadoop cluster.
 
  So, I am curious about that supporting Crunch may help us what we want
  with Spark support at Gora? Actually, I am new to such projects, I want
  to
  learn what should be achieved with GORA-386 and not to be get lost
  because
  of overthinking :) I see that you can use Gora for storing your data
  with
  Gora-style, running jobs with Gora-style but have a flexibility of
  using
  either HDFS, Solr, MongoDB, etc. or MaprReduce, Spark, Tez, etc.
 
  PS: I know there is a similar issue at Apache Gora for Cascading
  support:
  https://issues.apache.org/jira/browse/GORA-112
 
  Kind Regards,
  Furkan KAMACI
 
  On Sat, Mar 21, 2015 at 8:14 PM, Lewis John Mcgibbney
  lewis.mcgibb...@gmail.com wrote:
 
  Hi Furkan,
  In what context are we talking here?
  GSoC or Just development?
  I am very keen to essentially work towards what we can release as Gora
  1.0
  Thank you Furkan
 
 
  On Saturday, March 21, 2015, Furkan KAMACI furkankam...@gmail.com
  wrote:
 
  As you know that there is an issue for integration Apache Spark and
  Apache Gora [1]. Apache Spark is a popular project and in contrast to
  Hadoop's two-stage disk-based MapReduce paradigm, Spark's in-memory
  primitives provide performance up to 100 times faster for certain
  applications [2]. There are also some alternatives to Apache Spark,
  i.e.
  Apache Tez [3].
 
  When implementing an integration for Spark, it should be considered
  to
  have an abstraction for such kind of projects as an architectural
  design and
  there is a related issue for it: [4].
 
  There is another Apache project which aims to provide a framework
  named
  as Apache Crunch [5] for writing, testing, and running MapReduce
  pipelines.
  Its goal is to make pipelines that are composed of many user-defined
  functions simple to write, easy to test, and efficient to run. It is
  an
  high-level tool for writing data pipelines, as opposed to developing
  against
  the MapReduce, Spark, Tez APIs or etc. directly [6].
 
  I would like to learn how Apache Crunch fits with creating a multi
  execution engine for Gora [4]? What kind of benefits we can get with
  integrating Apache Gora and Apache

ApacheCon 2015 is coming to Austin, Texas, USA

2015-03-24 Thread Henry Saputra
Dear Apache Gora enthusiast,

In just a few weeks, we'll be holding ApacheCon in Austin, Texas, and
we'd love to have you in attendance. You can save $300 on admission by
registering NOW, since the early bird price ends on the 21st.

Register at http://s.apache.org/acna2015-reg

ApacheCon this year celebrates the 20th birthday of the Apache HTTP
Server, and we'll have Brian Behlendorf, who started this whole thing,
keynoting for us, and you'll have a chance to meet some of the
original Apache Group, who will be there to celebrate with us.

We've got 7 tracks of great talks, as well as BOFs, the Apache
BarCamp, project-specific hack events, and evening events where you
can deepen your connection with the larger Apache community. See the
full schedule at http://apacheconna2015.sched.org/

And if you have any questions, comments, or just want to hang out with
us before and during the event, follow us on Twitter - @apachecon - or
drop by #apachecon on the Freenode IRC network.

Hope to see you in Austin!

- Henry


Re: Gora Spark Backend Support (GORA-386) and Apache Crunch

2015-03-23 Thread Henry Saputra
Integration with Gora will mostly in the data ingest portion of the flow.

Distributed processing frameworks like Spark, or Flink, already
support Hadoop input format as data sources so Gora should be able to
be used directly with Gor input format.

The interesting portion is probably tighter integration such as custom
RDD or custom Data Manager to store and get data from Gora directly.

- Henry

On Sat, Mar 21, 2015 at 1:42 PM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Henry mentored Crunch through incubation... Maybe he can tell you more
 context.
 For me, Gora is essentially an extremely easy storage abstraction framework.
 I do not currently use the Query API meaning that the analysis of data is
 delegated to Gora data store.
 This is my current usage of the code base.


 On Saturday, March 21, 2015, Furkan KAMACI furkankam...@gmail.com wrote:

 Hi Lewis,

 I am talking in context of GORA-418 and GORA-386, we can say GSoC. I've
 talked with Talat about design of that implementation. I just wanted to
 check other projects for does any of them such kind of feature.

 Here is what is in my mind for Apache Gora for Spark supoort: developing a
 layer which abstracts functionality of Spark, Tez, etc (GORA-418). There
 will be implementations for each of them (and Spark will be one of them:
 GORA-386)

 i.e. you will write a word count example as Gora style, you will use one
 of implementation and run it (as like storing data at Solr or Mongo via
 Gora).

 When I check Crunch I realize that:

 Every Crunch job begins with a Pipeline instance that manages the
 execution lifecycle of your data pipeline. As of the 0.9.0 release, there
 are three implementations of the Pipeline interface:

 MRPipeline: Executes a pipeline as a series of MapReduce jobs that can run
 locally or on a Hadoop cluster.
 MemPipeline: Executes a pipeline in-memory on the client.
 SparkPipeline: Executes a pipeline by running a series of Apache Spark
 jobs, either locally or on a Hadoop cluster.

 So, I am curious about that supporting Crunch may help us what we want
 with Spark support at Gora? Actually, I am new to such projects, I want to
 learn what should be achieved with GORA-386 and not to be get lost because
 of overthinking :) I see that you can use Gora for storing your data with
 Gora-style, running jobs with Gora-style but have a flexibility of using
 either HDFS, Solr, MongoDB, etc. or MaprReduce, Spark, Tez, etc.

 PS: I know there is a similar issue at Apache Gora for Cascading support:
 https://issues.apache.org/jira/browse/GORA-112

 Kind Regards,
 Furkan KAMACI

 On Sat, Mar 21, 2015 at 8:14 PM, Lewis John Mcgibbney
 lewis.mcgibb...@gmail.com wrote:

 Hi Furkan,
 In what context are we talking here?
 GSoC or Just development?
 I am very keen to essentially work towards what we can release as Gora
 1.0
 Thank you Furkan


 On Saturday, March 21, 2015, Furkan KAMACI furkankam...@gmail.com
 wrote:

 As you know that there is an issue for integration Apache Spark and
 Apache Gora [1]. Apache Spark is a popular project and in contrast to
 Hadoop's two-stage disk-based MapReduce paradigm, Spark's in-memory
 primitives provide performance up to 100 times faster for certain
 applications [2]. There are also some alternatives to Apache Spark, i.e.
 Apache Tez [3].

 When implementing an integration for Spark, it should be considered to
 have an abstraction for such kind of projects as an architectural design 
 and
 there is a related issue for it: [4].

 There is another Apache project which aims to provide a framework named
 as Apache Crunch [5] for writing, testing, and running MapReduce pipelines.
 Its goal is to make pipelines that are composed of many user-defined
 functions simple to write, easy to test, and efficient to run. It is an
 high-level tool for writing data pipelines, as opposed to developing 
 against
 the MapReduce, Spark, Tez APIs or etc. directly [6].

 I would like to learn how Apache Crunch fits with creating a multi
 execution engine for Gora [4]? What kind of benefits we can get with
 integrating Apache Gora and Apache Crunch and what kind of gaps we still 
 can
 have instead of developing a custom engine for our purpose?

 Kind Regards,
 Furkan KAMACI

 [1] https://issues.apache.org/jira/browse/GORA-386
 [2] Xin, Reynold; Rosen, Josh; Zaharia, Matei; Franklin, Michael;
 Shenker, Scott; Stoica, Ion (June 2013).
 [3] http://tez.apache.org/
 [4] https://issues.apache.org/jira/browse/GORA-418
 [5] https://crunch.apache.org/
 [6] https://crunch.apache.org/user-guide.html#motivation



 --
 Lewis




 --
 Lewis



Re: Spark Backend for Apache Gora

2015-09-04 Thread Henry Saputra
Congrats Furkan, Apahe Gora is lucky to have you as contributors :)

On Friday, September 4, 2015, Furkan KAMACI  wrote:

> Hi All,
>
> Btw, my blog post for "Spark Backend for Apache Gora" is published at
> both DZone (https://dzone.com/articles/spark-backend-for-apache-gora
> )
> and Java Code Geeks: (
> http://www.javacodegeeks.com/2015/09/spark-backend-for-apache-gora.html
> ).
> I hope it'll let more people to introduce with Apache Gora.
>
> Kind Regards,
> Furkan KAMACI
>
> On Wed, Sep 2, 2015 at 12:06 AM, Furkan KAMACI  > wrote:
>
>> Hi All,
>>
>> I've also prepared an extra report named as *Final Report *and attached
>> it to Gora wiki:
>> https://cwiki.apache.org/confluence/display/GORA/Report+9
>>
>> Kind Regards,
>> Furkan KAMACI
>>
>> On Mon, Aug 31, 2015 at 2:28 AM, Lewis John Mcgibbney <
>> lewis.mcgibb...@gmail.com
>> > wrote:
>>
>>> Brilliant Furkan absolutely brilliant. We are nearly ready to merge your
>>> patch and then we can begin proper benchmarking .
>>> Congratulations
>>>
>>>
>>> On Sunday, August 30, 2015, Furkan KAMACI >> > wrote:
>>>
 Hi All,

 It's been announced that I've successfully completed my GSoC 2015
 journey :) First of all, I would like to thank my mentors (Talat and Lewis)
 and whole Gora community.

 During my GSoC period, I've implemented a Spark Backend for Apache
 Gora. Gora is a powerful project and I think that it will be much more
 powerful with its Spark backend.

 I've written a blog post which explains in a detailed manner how to use
 it: http://furkankamaci.com/spark-backend-for-apache-gora/

 I'm planning to write a blog post for performance analysis of
 MapReduce, Spark, MapReduce with Gora and Spark with Gora. Also, another
 blog post for people who wants to participate at GSoC next year.

 I'll keep up contributing to Gora and happy to work with its pretty
 nice community!

 Kind Regards,
 Furkan KAMACI

>>>
>>>
>>> --
>>> *Lewis*
>>>
>>>
>>
>