Re: [VOTE] Release apache-singa-incubating-2.0.0 (RC1)

2019-04-19 Thread Thejas Nair
+1 binding


On Wed, Apr 17, 2019 at 8:24 PM Justin Mclean 
wrote:

> Hi,
>
> +1 (binding) See other email for what I reviewed.
>
> Thanks,
> Justin
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: Podlings not following ASF release policy

2019-02-19 Thread Thejas Nair
Quoting from the document -
Unique Enough Names

The name needs to be unique enough to avoid confusion with software that
already exists. For the community to be able to protect its reputation for
quality and openness, its name needs to be unique enough to have potential as
a trademark.

But this isn’t only about being able to register trademark protection.
Ethics also plays a role. Even where a name may offer enough protection,
existing adoption of the name by an active community may mean that the
choice needs to be eliminated on ethical grounds. There is some judgment
involved in this decision. So, involve the wider Incubator community if a
name is already used.


On Tue, Feb 19, 2019 at 6:47 AM Moaz Reyad  wrote:

> Hi Justin,
>
> The download page of SINGA is (
> https://singa.incubator.apache.org/en/downloads.html ). The page that you
> mentioned is the index page. The official ASF releases are in the downloads
> page.
>
> The index page is the landing page of the website and it contains all the
> different ways to get started with SINGA. The index page encourages new
> users to start using the system on pre-installed environment such as AWS or
> Docker.
>
> We will create a ticket to move the docker image under Apache and add
> missing files to it.
>
> Thank you,
> Moaz
>
> On Tue, Feb 19, 2019 at 9:03 AM Justin Mclean 
> wrote:
>
> > Hi,
> >
> > > Can you please elaborate on release policy issues with Singa ?
> > > I checked a few things but couldn't find the issue.
> >
> >
> > Nothing too major. If you look at the download page [1], it’s encouraging
> > people to use docker [2] and AWS [3]. Both pages probably have some minor
> > branding/trademark issues, but I’m more concerned about the docker one, as
> > it’s not under the apache docker username and the download page points
> > directly to it without any indication that it’s 3rd party. It seems to be
> > encouraging people to use the develop/latest version rather than the
> > official releases, although I’m not 100% sure. I had a quick look at the
> > docker content but it wasn’t clear, and it also seemed to be missing
> > important files like LICENSE, NOTICE, etc.
> >
> > Thanks,
> > Justin
> >
> > 1. https://singa.incubator.apache.org/en/index.html
> > 2. https://hub.docker.com/r/nusdbsystem/singa/
> >
> >
>


Re: Podlings not following ASF release policy

2019-02-18 Thread Thejas Nair
Hi Justin,
Can you please elaborate on release policy issues with Singa ?
I checked a few things but couldn't find the issue.

Thanks,
Thejas





On Fri, Feb 8, 2019 at 5:51 PM Justin Mclean 
wrote:

> Hi,
>
> So I just checked the next bunch of podlings to report to see if we have
> any other issues with ASF release policy, and sadly we do, and again it is
> larger than I expected. Some of these are minor issues and easily fixed,
> some are not. In a couple of cases I may not have looked deeply enough into
> the issue and it may actually be fine; if so, apologies in advance for
> listing you here. In a couple of cases (e.g. Zipkin) I can see it’s been
> discussed on the list but there’s more to do.
>
> Podlings having one or more issues with ASF release policy include:
> - Crail
> - Daffodil
> - Dlab
> - Druid
> - Dubbo
> - Hivemall
> - Marvin-ai
> - Memo
> - Omid
> - Openwhisk
> - Pinot
> - Ponymail
> - Singa
> - Skywalking
> - Zipkin
>
> Projects that I will be following up with include Dlab, Druid, Dubbo,
> Openwhisk, Singa, Skywalking and Zipkin.
>
> If you are the mentors of these projects please take a look and see what
> you can do to improve the situation and educate your podling on proper
> release policy. If you can’t find the issue, ping me and I’ll send an email
> with what I think it is to your private list. There are probably things I’ve
> missed as well; often where there’s one issue there are others.
>
> Please include something in the next podling report on this.
>
> Thanks,
> Justin
>
>


Re: [VOTE] Graduate Apache HAWQ (incubating)

2018-07-31 Thread Thejas Nair
+1

On Tue, Jul 31, 2018 at 11:28 AM, Ed Espino  wrote:
> +1 Woohoo!!
>
> -=e
>
> On Fri, Jul 27, 2018 at 7:13 PM Roman Shaposhnik  wrote:
>
>> Hi!
>>
>> after a very positive discussion in the HAWQ community
>> and at the IPMC level:
>>
>> https://lists.apache.org/thread.html/67a2d52ef29cbf9e93d8050ed0193cc110a919962dd92f8436b343b7@%3Cdev.hawq.apache.org%3E
>>
>> https://lists.apache.org/thread.html/3a142d758ef5ae119e421071893615992ea5ee937b5d02007f5e@%3Cgeneral.incubator.apache.org%3E
>>
>> I'd like to bring the following resolution for a formal vote.
>>
>> Please vote on the resolution pasted below to graduate
>> Apache HAWQ from the incubator to top level project.
>>
>> [ ] +1 Graduate Apache HAWQ from the Incubator.
>> [ ] +0 Don't care.
>> [ ] -1 Don't graduate Apache HAWQ from the Incubator because...
>>
>> This vote will be open for at least 72 hours.
>>
>> Many thanks to our mentors and everyone else for the support,
>> Roman (on behalf of the Apache HAWQ PPMC).
>>
>> ## Resolution to create a TLP from graduating Incubator podling
>>
>> X. Establish the Apache HAWQ Project
>>
>>WHEREAS, the Board of Directors deems it to be in the best
>>interests of the Foundation and consistent with the
>>Foundation's purpose to establish a Project Management
>>Committee charged with the creation and maintenance of
>>open-source software, for distribution at no charge to
>>the public, related to Hadoop native SQL query engine that
>>combines the key technological advantages of MPP database
>>with the scalability and convenience of Hadoop.
>>
>>NOW, THEREFORE, BE IT RESOLVED, that a Project Management
>>Committee (PMC), to be known as the "Apache HAWQ Project",
>>be and hereby is established pursuant to Bylaws of the
>>Foundation; and be it further
>>
>>RESOLVED, that the Apache HAWQ Project be and hereby is
>>responsible for the creation and maintenance of software
>>related to Hadoop native SQL query engine that
>>combines the key technological advantages of MPP database
>>with the scalability and convenience of Hadoop;
>>and be it further
>>
>>RESOLVED, that the office of "Vice President, Apache HAWQ" be
>>and hereby is created, the person holding such office to
>>serve at the direction of the Board of Directors as the chair
>>of the Apache HAWQ Project, and to have primary responsibility
>>for management of the projects within the scope of
>>responsibility of the Apache HAWQ Project; and be it further
>>
>>RESOLVED, that the persons listed immediately below be and
>>hereby are appointed to serve as the initial members of the
>>Apache HAWQ Project:
>>
>> * Alan Gates   
>> * Alexander Denissov   
>> * Amy Bai  
>> * Atri Sharma  
>> * Bhuvnesh Chaudhary   
>> * Bosco
>> * Chunling Wang
>> * David Yozie  
>> * Ed Espino
>> * Entong Shen  
>> * Foyzur Rahman
>> * Goden Yao
>> * Gregory Chase
>> * Hong Wu  
>> * Hongxu Ma
>> * Hubert Zhang 
>> * Ivan Weng
>> * Jesse Zhang  
>> * Jiali Yao
>> * Jun Aoki 
>> * Kavinder Dhaliwal
>> * Lav Jain 
>> * Lei Chang
>> * Lili Ma  
>> * Lirong Jian  
>> * Lisa Owen
>> * Ming Li  
>> * Mohamed Soliman  
>> * Newton Alex  
>> * Noa Horn 
>> * Oleksandr Diachenko  
>> * Paul Guo 
>> * Radar Da Lei 
>> * Roman Shaposhnik 
>> * Ruilong Huo  
>> * Shivram Mani 
>> * Shubham Sharma   
>> * Tushar Pednekar  
>> * Venkatesh Raghavan   
>> * Vineet Goel  
>> * Wen Lin  
>> * Xiang Sheng  
>> * Yi Jin   
>> * Zhanwei Wang 
>> * Zhenglin Tao 
>>
>>NOW, THEREFORE, BE IT FURTHER RESOLVED, that Lei Chang
>>be appointed to the office of Vice President, Apache HAWQ, to
>>serve in accordance with and subject to the direction of the
>>Board of Directors and the Bylaws of the Foundation until
>>death, resignation, retirement, removal or disqualification,
>>or until a successor is appointed; and be it further
>>
>>RESOLVED, that the initial Apache HAWQ PMC be and hereby is
>>tasked with the creation of a set of bylaws intended to
>>encourage open development and increased 

Re: [VOTE] Release apache-singa-incubating-1.2.0 (RC1)

2018-06-03 Thread Thejas Nair
+1


On Fri, May 25, 2018 at 7:38 PM, Justin Mclean  wrote:
> Hi,
>
> As noted by others please remove the MD5 hash.
>
> +1 binding
>
> I checked:
> - incubating in name
> - signatures and hashes good
> - DISCLAIMER exists
> - NOTICE year needs updating
> - LICENSE is OK
> - Files have ASF headers. This file may be missing one. [1]
> - No unexpected binary files
> - Didn’t compile as I don’t have the right setup
>
> LICENSE may be missing license for:
> - ALv2 file [2] copyright TensorFlow authors
>
> Thanks,
> Justin
>
> 1.  incubator-singa/doc/en/docs/notebook/utils.py
> 2.  incubator-singa/examples/imagenet/inception/convert.py
>
>
>




Re: [VOTE] Release Apache HAWQ 2.1.0.0-incubating (RC4)

2017-02-27 Thread Thejas Nair
+1
Verified signature and checksums
Reviewed LICENSE, DISCLAIMER and README

On Sun, Feb 26, 2017 at 7:21 PM, Ed Espino  wrote:

> RVS,
>
> Again, thank you for filing HAWQ-1351. The tweets.tar.gz binary file has
> been removed. The fix has been pushed into master and will be available in
> our next release.
>
> Regards,
> -=e
>
> On Thu, Feb 23, 2017 at 12:09 AM, Ed Espino  wrote:
>
> > Roman,
> >
> > PR #1142 addresses the issue (HAWQ-1351) you raised during your Apache
> > HAWQ 2.1.0.0-incubating RC4 review. The PR removes the binary tarball
> > (pxf/pxf-json/src/test/resources/tweets.tar.gz) from the git repository.
> > The test tarball will be dynamically generated when the pxf-json gradle
> > test task is executed.
> >
> > Regards,
> > -=e
> >
> >
> >
> > On Wed, Feb 22, 2017 at 12:49 PM, Ed Espino  wrote:
> >
> >> Roman,
> >>
> >> I have updated the description of HAWQ-1351 to reflect that we will be
> >> removing the binary test tarball (tweets.tar.gz) from the source code
> >> base and generating it dynamically.
> >>
> >> -=e
> >>
> >> On Wed, Feb 22, 2017 at 11:59 AM, Ed Espino  wrote:
> >>
> >>> Roman,
> >>>
> >>> Thank you for filing HAWQ-1351.
> >>>
> >>> I have added the following comment in the Jira: "On further inspection,
> >>> we will be removing this binary tarball and create it dynamically
> >>> during the test process from existing source test files (namely
> >>> tweets-pp.json)." I hope to have this file removed from our master
> >>> branch today. This will keep future reviewers from potentially tripping
> >>> over it. We appreciate your guidance and mentorship through this
> >>> incubation period.
> >>>
> >>> Regards,
> >>> -=e
> >>>
> >>> On Wed, Feb 22, 2017 at 11:19 AM, Roman Shaposhnik <ro...@shaposhnik.org>
> >>> wrote:
> >>>
>  On Tue, Feb 21, 2017 at 5:54 PM, Ed Espino  wrote:
>  > Hello Incubator PMC (IPMC),
>  >
>  > The Apache HAWQ community has voted on and approved a proposal to
>  > release Apache HAWQ 2.1.0.0-incubating (source only release).
>  >
>  > We kindly request that the IPMC members review and vote on this
>  > incubator release.
>  >
>  > The PPMC VOTE thread is here:
>  > https://lists.apache.org/thread.html/b641ddf4519feba01d3d4c55180be842a27e75bdaef640175b623e12@%3Cdev.hawq.apache.org%3E
>  >
>  > The PPMC VOTE RESULT is here:
>  > https://lists.apache.org/thread.html/9d3025c12dc032437d1317d662f0e4434754c00258ca1abdd5c0ab9f@%3Cdev.hawq.apache.org%3E
>  >
>  > All JIRAs completed for this release are tagged with:
>  >   fixVersion = 2.1.0.0-incubating
>  >
>  > A complete JIRA list can be reviewed here:
>  > *
>  > https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12318826=12339640
>  >
>  > The tag to be voted on: 2.1.0.0-incubating-rc4
>  > (f5033eaa3c7c1d9f85bbcc56e9d921d96337831a), located here:
>  > *
>  > https://git-wip-us.apache.org/repos/asf?p=incubator-hawq.git;a=commit;h=12c7df017551f1c3b0deb38c7243db3e018ef62c
>  >
>  > Git release branch:
>  > *
>  > https://git-wip-us.apache.org/repos/asf?p=incubator-hawq.git;a=shortlog;h=refs/heads/2.1.0.0-incubating
>  >
>  > Source release package:
>  > *
>  > https://dist.apache.org/repos/dist/dev/incubator/hawq/2.1.0.0-incubating.RC4/apache-hawq-src-2.1.0.0-incubating.tar.gz
>  >
>  > Source release verification:
>  > * PGP Signature:
>  >
>  > https://dist.apache.org/repos/dist/dev/incubator/hawq/2.1.0.0-incubating.RC4/apache-hawq-src-2.1.0.0-incubating.tar.gz.asc
>  > * SHA256/MD5 Hash:
>  >
>  > https://dist.apache.org/repos/dist/dev/incubator/hawq/2.1.0.0-incubating.RC4/apache-hawq-src-2.1.0.0-incubating.tar.gz.sha256
>  >
>  > https://dist.apache.org/repos/dist/dev/incubator/hawq/2.1.0.0-incubating.RC4/apache-hawq-src-2.1.0.0-incubating.tar.gz.md5
>  >
>  > Keys to verify the signature of the release artifact are available at:
>  > * https://dist.apache.org/repos/dist/dev/incubator/hawq/KEYS
>  >
>  > The artifact(s) has been signed with Key ID: 57325522 ("esp...@apache.org")
>  >
>  > Build instructions are included in the project's wiki:
>  > https://cwiki.apache.org/confluence/display/HAWQ/Build+and+Install
>  >
>  > When voting, please list the actions taken to verify the release. To
>  > facilitate Apache license review and conformance, an Apache Release
>  > Audit Tool (RAT) pom.xml file is included in the source root
>  > directory.
>  >
>  > The vote will be open for at least 72 hours.
>  >
>  > 

Re: [VOTE] Release apache-singa-incubating-1.1.0 (RC1)

2017-02-08 Thread Thejas Nair
+1 (binding)
Verified signature and checksum
Checked updates to LICENSE, NOTICE, and RELEASE_NOTES files


On Tue, Feb 7, 2017 at 5:40 PM, Wang Wei  wrote:

> Hi John,
>
> Could you revise your vote as the glog issue is resolved?
> Thanks.
>
> Best,
> Wei
>
> On Tue, Feb 7, 2017 at 9:25 AM, John D. Ament 
> wrote:
>
> > Fair enough.
> >
> > On Mon, Feb 6, 2017 at 6:48 PM Niclas Hedhman 
> wrote:
> >
> > > https://github.com/google/glog/blob/master/COPYING
> > >
> > > On Mon, Feb 6, 2017 at 7:44 PM, John D. Ament 
> > > wrote:
> > >
> > > > Niclas,
> > > >
> > > > So I'll point out a couple of things.
> > > >
> > > > 1. -1's on releases aren't vetoes, so if someone else (e.g. you) voted a
> > > > +1 my -1 would be moot.
> > > >
> > > > 2. I mentioned in my response that the main issue is that I can't find a
> > > > listed license for glog and I was choosing GPL because I found a source
> > > > file with a GPL header.  If the first file I looked at was another
> > > > license, I would have assumed that license.  It has nothing to do with
> > > > the build chain. If you have a link that can show that glog is BSD
> > > > licensed, that would settle this.  Note that this issue [1] exists.
> > > >
> > > > John
> > > >
> > > > [1]: https://github.com/google/glog/issues/118
> > > >
> > > >
> > > > On Mon, Feb 6, 2017 at 12:04 AM Niclas Hedhman 
> > > wrote:
> > > >
> > > > > Sure, but in this case it is;
> > > > >   1. Singa depends on Glog
> > > > >   2. Glog is BSD licensed
> > > > >   3. Glog uses a build tool chain that is GPL'd and includes a build
> > > > >      script to compensate for missing toolchain tools.
> > > > >   4. Singa doesn't use Glog's build toolchain
> > > > >
> > > > > Your (John) argument is that Glog is incorrectly licensed and should
> > > > > have been GPL'd. I think that reasoning is incorrect, and that Glog is
> > > > > licensed correctly and hence it is not relevant whether it is optional
> > > > > or not for Singa.
> > > > >
> > > > > Given that we have both belt and suspenders for this, I think the -1
> > > > > can be withdrawn regarding the Glog dependency.
> > > > >
> > > > > Cheers
> > > > > Niclas
> > > > >
> > > > > On Mon, Feb 6, 2017 at 10:49 AM, John D. Ament <johndam...@apache.org>
> > > > > wrote:
> > > > >
> > > > > > We actually just had a discussion recently on legal-discuss on this
> > > > > > type of topic.  Specifically, Cat-X and optional vs required
> > > > > > dependencies. Henri and I settled on the wording you'll find at [1]
> > > > > > as the final result. Basically, you can rely on Cat-X but only for
> > > > > > optional features.
> > > > > >
> > > > > > We can probably follow up with legal on whether this does fall into
> > > > > > the GPL bucket though.
> > > > > >
> > > > > > John
> > > > > >
> > > > > > [1]: https://www.apache.org/legal/resolved.html#optional
> > > > > >
> > > > > > On Sun, Feb 5, 2017 at 9:45 PM Niclas Hedhman <nic...@hedhman.org>
> > > > > > wrote:
> > > > > >
> > > > > > > I think that ends up being a build time dependency in GLOG, i.e.
> > > > > > > the equivalent of Systems Requirement, and not in itself viral to
> > > > > > > the ASF software. I assume that Google is much more worried about
> > > > > > > this and may even have checked with their Legal team...
> > > > > > >
> > > > > > > Want to check with legal-discuss@ ? Is my memory failing me, or
> > > > > > > hasn't FSF stated that the build output is not bound by GPL of the
> > > > > > > build toolchain? (otherwise they can't release their own LGPL stuff)
> > > > > > >
> > > > > > > Cheers
> > > > > > > Niclas
> > > > > > >
> > > > > > > On Mon, Feb 6, 2017 at 10:27 AM, John D. Ament <johndam...@apache.org>
> > > > > > > wrote:
> > > > > > >
> > > > > > > > -1 at least I think there's an issue.
> > > > > > > >
> > > > > > > > While the source code all looks good, the resulting binary is not
> > > > > > > > valid. There's no how to build doc, so I looked at your
> > > > > > > > .travis.yml. It confirmed what I suspected for make, but then I
> > > > > > > > started looking at your required packages.
> > > > > > > >
> > > > > > > > You require glog [1], which I can't find a license for. However,
> > > > > > > > glog includes [2] which is GPL, which makes glog GPL and as a
> > > > > > > > result, your code.
> > > > > > > >
> > > > > > > > [1]: https://github.com/google/glog
> > > > > > > > [2]: https://github.com/google/glog/blob/master/missing
> > > > > > > >
> > > > > > > > - John
> > > > > > > >
> > > > > > > > On Sun, Jan 29, 2017 at 8:24 PM Wang Wei  >
> > > > wrote:
> > > > > > > >
> > > > 

[DISCUSS] publishing docker image for podling

2017-01-04 Thread Thejas Nair
As per Greg Stein's comment in
https://issues.apache.org/jira/browse/INFRA-13156, we haven't had any
podling request for a docker image (aka a "convenience binary") to be
published within Apache's namespace in hub.docker.com .

Starting this thread to see whether we should hold a vote on this or whether
we can get Incubator VP approval for it.

Thanks,
Thejas


Re: [VOTE] Release apache-singa-incubating-1.0.0 (RC2)

2016-09-07 Thread Thejas Nair
+1
Checked signatures and checksums, DISCLAIMER, LICENSE, and NOTICE files


On Tue, Sep 6, 2016 at 11:25 AM, Alan Gates  wrote:

> +1.  Checked signatures, DISCLAIMER, LICENSE, and NOTICE files, and
> assured there were no binary files in the distribution.
>
> Alan.
>
> > On Sep 3, 2016, at 02:33, WANG Sheng  wrote:
> >
> > Hi all,
> >
> > The SINGA community has voted on and approved a proposal to release
> > Apache SINGA 1.0.0 (incubating).
> >
> > The vote thread is at:
> > http://mail-archives.apache.org/mod_mbox/singa-dev/201609.mbox/%3CCAELsgRf0T371qTSVo20HjTGiXYgW-br7YgnffUHgAxTooqW9qQ%40mail.gmail.com%3E
> >
> > and the result is at:
> > http://mail-archives.apache.org/mod_mbox/singa-dev/201609.mbox/%3CCAELsgRcyMcB%3DF5XgtOv1P9zuNPbU7-qWmnCNbDpRrJu4ReP9ug%40mail.gmail.com%3E
> >
> > We ask the IPMC to vote on this release.
> >
> > The artifacts to be voted on are located at:
> > https://dist.apache.org/repos/dist/dev/incubator/singa/1.0.0/
> >
> > The hashes of the artifacts are as follows:
> > MD5: AE 01 1F 67 D7 F0 F6 23  FF 26 F9 A9 F0 00 1C C3
> >
> > Release artifacts are signed with the following key:
> > https://people.apache.org/keys/committer/dinhtta.asc
> >
> > and the signature file is:
> > https://dist.apache.org/repos/dist/dev/incubator/singa/1.0.0/apache-singa-incubating-1.0.0-RC2.tar.gz.asc
> >
> > The Github tag is at:
> > https://github.com/apache/incubator-singa/releases/tag/v1.0.0-rc2
> >
> > with commit ID: 0416a0fdfa63cb1b4f54e438ff5b33d7eb4d8df0
> >
> > To check the license, you can use the Apache Rat tool as follows:
> > 1. download & decompress apache rat from
> > http://creadur.apache.org/rat/download_rat.cgi
> > 2. run the following command under singa folder:
> >    java -jar /PATH/TO/RAT/apache-rat-0.11.jar -E rat-excludes -d . > rat_check
> > 3. check the results in file named "rat_check"
> >
> > The vote is open for at least 72 hours, or until the necessary number of
> > votes (3 +1) is reached.
> >
> > [ ] +1 Release this package as Apache SINGA 1.0.0-incubating
> > [ ]  0 I don't feel strongly about it, but I'm okay with the release
> > [ ] -1 Do not release this package because...
> >
> > Regards,
> > Sheng
>
>
>
>


Re: [VOTE] Release apache-singa-incubating-0.3.0 (RC3)

2016-04-19 Thread Thejas Nair
+1
http://compliance.rocks/result.html?1e40bc34
Looks like there are a few files missing license headers. Please
update them before the next release.
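
A rough way to spot files without a license header before running a full
Apache Rat check is sketched below. This is only an illustration and not a
replacement for Rat: the function name files_missing_header, the list of
file extensions, and the 30-line window are all assumptions made for the
example, and the marker string is simply the opening phrase of the standard
ASF source header.

```python
import os

# Opening phrase of the standard ASF source header.
HEADER_MARKER = "Licensed to the Apache Software Foundation"


def files_missing_header(root, extensions=(".py", ".cc", ".h"), max_lines=30):
    """Return paths (relative to root) whose first lines lack the marker.

    Hypothetical helper: the extensions and the line window are arbitrary
    choices for this sketch, not project policy.
    """
    missing = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if not name.endswith(extensions):
                continue
            path = os.path.join(dirpath, name)
            with open(path, "r", encoding="utf-8", errors="replace") as f:
                # Read at most max_lines lines from the top of the file.
                head = "".join(line for _, line in zip(range(max_lines), f))
            if HEADER_MARKER not in head:
                missing.append(os.path.relpath(path, root))
    return sorted(missing)
```

Calling files_missing_header(".") from the root of an unpacked source
release lists candidate files to review by hand; Rat remains the
authoritative check.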


On Mon, Apr 18, 2016 at 10:22 AM, Alan Gates  wrote:
> +1.  LICENSE, NOTICE, DISCLAIMER and rat check look good.  The signatures 
> look good as well.  I didn’t find any binary files in the distribution.
>
> Alan.
>
>> On Apr 17, 2016, at 20:55, Anh Dinh  wrote:
>>
>> Hi all,
>>
>> The SINGA community has voted on and approved a proposal to release Apache
>> SINGA 0.3.0 (incubating).
>>
>> This release candidate (RC3) addresses the issue with the previous release
>> candidate, which contained a Creative Commons licensed file:
>> https://issues.apache.org/jira/browse/SINGA-159
>>
>>
>> The vote thread is at:
>> http://mail-archives.apache.org/mod_mbox/singa-dev/201604.mbox/%3CCAAbkU4QROZh4qh1qorR0XMNGvwg5sMad-9JqggByO6vUVnxmtQ%40mail.gmail.com%3E
>>
>> and the result is at:
>> http://mail-archives.apache.org/mod_mbox/singa-dev/201604.mbox/%3CCAAbkU4S_G-BmQ3jzjFbz%2BBtGQC-az_CjKwPM7usCWwBdBp8%2B0w%40mail.gmail.com%3E
>>
>> We ask the IPMC to vote on this release.
>>
>> The artifacts to be voted on are located at:
>> https://dist.apache.org/repos/dist/dev/incubator/singa/0.3.0/
>>
>> The hashes of the artifacts are as follows:
>> MD5: 45 4C 7A BB 17 C7 D6 47  77 85 92 58 59 DF B7 F5
>>
>> Release artifacts are signed with the following key:
>> https://people.apache.org/keys/committer/dinhtta.asc
>>
>> and the signature file is:
>> https://dist.apache.org/repos/dist/dev/incubator/singa/0.3.0/apache-singa-incubating-0.3.0-RC3.tar.gz.asc
>>
>> The Github tag is at:
>> https://github.com/apache/incubator-singa/releases/tag/v0.3.0-rc3
>>
>> with commit ID: d547a861068973db7d8fc82f9c733e95307a40f1
>>
>> To check the license, you can use the Apache Rat tool as follows:
>> ```
>> ./configure
>> make rat
>> ```
>> The result is in the rat_check file.
>>
>> The vote is open for at least 72 hours, or until the necessary number of
>> votes (3 +1) is reached.
>>
>> [ ] +1 Release this package as Apache SINGA 0.3.0-incubating
>> [ ]  0 I don't feel strongly about it, but I'm okay with the release
>> [ ] -1 Do not release this package because...
>>
>> Regards,
>> Anh.
>
>
>




Re: [VOTE] Release apache-singa-incubating-0.2.0 (RC2)

2016-01-13 Thread Thejas Nair
+1 (binding)
Examined the LICENSE, NOTICE, README, and DISCLAIMER files. Checked
the signatures and checksum.

As in the previous release, the sha256 checksum matches what is in the
email; however, the .sha256 file appears to be a different, binary file.
The .md5 hash is the only hash most projects seem to publish, so that
might be sufficient in this case as well.
The sha256 file for the 0.1.0 release on the download page
also seems to have this issue. Can you please check that as well?
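
For anyone repeating this check, comparing the artifact against the digest
pasted in a vote email can be sketched in Python as below. This is a
minimal sketch, not the procedure actually used for this vote; the helper
names normalize_hex, sha256_of_file and matches_announced are made up for
the example. The only assumption about the email is that the digest is
hex, possibly upper-case and split into space-separated groups across
lines.

```python
import hashlib


def normalize_hex(digest_text):
    """Drop all whitespace from a pasted digest and lowercase it."""
    return "".join(digest_text.split()).lower()


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA-256 of the file at path, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def matches_announced(path, announced):
    """True if the file's SHA-256 equals the digest quoted in the email."""
    return sha256_of_file(path) == normalize_hex(announced)
```

Usage would be along the lines of
matches_announced("apache-singa-incubating-0.2.0-RC2.tar.gz", "<digest from the email>"),
with the digest copied from the vote message as-is.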


Also, for future vote threads, please create a git tag and include
it, along with the commit hash, in the VOTE threads.




On Tue, Jan 12, 2016 at 11:05 PM, Jean-Baptiste Onofré  
wrote:
> +1 (binding)
>
> Regards
> JB
>
>
> On 01/11/2016 02:12 PM, ooibc wrote:
>>
>> Hi all,
>>
>> The SINGA community has voted on and approved a proposal to release Apache
>> SINGA 0.2.0 (incubating). The license issues in RC1 have been fixed.
>>
>> The vote thread is at:
>>
>> http://mail-archives.apache.org/mod_mbox/singa-dev/201601.mbox/%3CCAJz0iLtvBC3krxJp8=7Jb2suhGpwFbUtB=dtgpysg-ycndr...@mail.gmail.com%3E
>>
>> and the result is at:
>>
>> http://mail-archives.apache.org/mod_mbox/singa-dev/201601.mbox/%3CCAJz0iLuwLeze=HtSA8TraoqzkokPewYxdQT5kqgq=X6Zgrg=o...@mail.gmail.com%3E
>>
>>
>> We ask the IPMC to vote on this release.
>>
>> The artifacts to be voted on are located here:
>> https://dist.apache.org/repos/dist/dev/incubator/singa/0.2.0/
>>
>> The hashes of the artifacts are as follows:
>> apache-singa-incubating-0.2.0-RC2.tar.gz.md5: 41 4F 39 EE B0 25 68 38 C1
>> 3A F0 9F 02 82 B2 9D
>> apache-singa-incubating-0.2.0-RC2.tar.gz.sha256:  42C13D9D D23C6179
>> 7C1D50CA B11947E0 D260342D D37C1B61 7818C9ED B3964BEF
>>
>> Release artifacts are signed with the following key:
>> https://people.apache.org/keys/committer/dinhtta.asc
>>
>> and the signature file is:
>> apache-singa-incubating-0.2.0-RC2.tar.gz.asc
>>
>>
>> To check the license, you can use the Apache Rat tool as follows:
>> ```
>> ./configure
>> make rat
>> ```
>> The result is in the rat_check file.
>> To install and test the new features, please read the README file and
>> refer to the SINGA website http://singa.apache.org/docs/index.html
>>
>>
>> The vote is open for at least 72 hours, or until the necessary number of
>>   votes (3 +1) is reached.
>>
>>   [ ] +1 Release this package as Apache SINGA 0.2.0-incubating
>>   [ ]  0 I don't feel strongly about it, but I'm okay with the release
>>   [ ] -1 Do not release this package because...
>>
>> Thanks.
>>
>> Regards,
>> Beng Chin Ooi
>> www.comp.nus.edu.sg/~ooibc
>>
>>
>
> --
> Jean-Baptiste Onofré
> jbono...@apache.org
> http://blog.nanthrax.net
> Talend - http://www.talend.com
>
>
>




Re: [VOTE] Release Apache SINGA 0.1.0 (incubating)

2015-10-06 Thread Thejas Nair
+1
Examined the LICENSE, NOTICE, README, and DISCLAIMER files. Checked the
signatures and checksum.

The sha256 checksum matches what is in the email; however, the .sha256 file
appears to be a different, binary file. The .md5 hash is the only hash most
projects seem to publish, so that might be sufficient in this case as well.
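
A quick way to tell whether a .sha256 file holds a readable hex digest or
something else is sketched below. It is an illustration only: the function
name classify_digest_file and its three labels are invented for the
example, and the "raw-binary" case rests on the assumption (not confirmed
anywhere in this thread) that such a file might contain the 32 raw digest
bytes instead of their hex form.

```python
import re

# A SHA-256 digest written as hex is 64 hex characters long.
HEX64 = re.compile(r"[0-9a-fA-F]{64}")


def classify_digest_file(data):
    """Guess how a .sha256 file stores its digest.

    Returns 'hex-text', 'raw-binary' or 'unknown'. Hypothetical helper.
    """
    try:
        text = data.decode("ascii")
    except UnicodeDecodeError:
        # Not plain ASCII: a raw SHA-256 digest is exactly 32 bytes.
        return "raw-binary" if len(data) == 32 else "unknown"
    # Hex digests are often split into space-separated groups, so remove
    # whitespace before looking for a run of 64 hex characters.
    compact = "".join(text.split())
    return "hex-text" if HEX64.search(compact) else "unknown"
```

It would be called as classify_digest_file(open(path, "rb").read()). A
result of "hex-text" means the file can be compared directly with the
digest in the vote email; anything else is worth a closer look.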



On Tue, Oct 6, 2015 at 10:25 AM, Alan Gates  wrote:

> +1
>
> I looked at the signatures, the LICENSE, NOTICE, and DISCLAIMER files,
> checked for any binary files in the distro, and checked that source files
> have the appropriate license header.
>
> Alan.
>
> Wang Wei 
> September 29, 2015 at 19:47
> Hi all,
>
> The SINGA community has voted on and approved a proposal to release Apache
> SINGA 0.1.0 (incubating).
>
> The vote thread is at:
>
> http://mail-archives.apache.org/mod_mbox/singa-dev/201509.mbox/%3CCAJz0iLsZRgSuPyrMitpt5EdXvaqf4%2BDiF00Xyg2SLb3YEqkDMw%40mail.gmail.com%3E
>
> and the result is at:
>
> http://mail-archives.apache.org/mod_mbox/singa-dev/201509.mbox/%3CCAJz0iLtGNDT8O31V9QOhrapzFf8Nt0XSjckGR%2BxFHrfrFKraPA%40mail.gmail.com%3E
>
> We ask the IPMC to vote on this release.
>
> The artifacts to be voted on are located here:
> https://dist.apache.org/repos/dist/dev/incubator/singa/
>
> The hashes of the artifacts are as follows:
> apache-singa-incubating-0.1.0-RC2.tar.gz.md5: 63 0F DF E0 74 E0 E1 1F 89
> F6 0E DF 9E 66 50 73
> apache-singa-incubating-0.1.0-RC2.tar.gz.sha256: EE7CF820 70DB46F5 20FC39A1
> F85B5B73 865503BD 36280917 5369EB9F 5FA7199E
>
> Release artifacts are signed with the following key:
> https://people.apache.org/keys/committer/dinhtta.asc
>
> and the signature file is:
> apache-singa-incubating-0.1.0-RC2.tar.gz.asc
>
>
> The vote is open for at least 72 hours, or until the necessary number of
> votes (3 +1) is reached.
>
> [ ] +1 Release this package as Apache SINGA 0.1.0-incubating
> [ ] 0 I don't feel strongly about it, but I'm okay with the release
> [ ] -1 Do not release this package because...
>
> Thanks.
>
> Regards,
> Wei Wang
>
>


Re: [VOTE] Accept HAWQ into the Apache Incubator

2015-08-31 Thread Thejas Nair
 transfer of source code to the
>> > ASF.
>> >
>> > == External Dependencies ==
>> >
>> > Runtime dependencies:
>> >   * gimli (BSD)
>> >   * openldap (The OpenLDAP Public License)
>> >   * openssl (OpenSSL License and the Original SSLeay License, BSD style)
>> >   * proj (MIT)
>> >   * yaml (Creative Commons Attribution 2.0 License)
>> >   * python (Python Software Foundation License Version 2)
>> >   * apr-util (Apache Version 2.0)
>> >   * bzip2 (BSD-style License)
>> >   * curl (MIT/X Derivate License)
>> >   * gperf (GPL Version 3)
>> >   * protobuf (Google)
>> >   * libevent (BSD)
>> >   * json-c (https://github.com/json-c/json-c/blob/master/COPYING)
>> >   * krb5 (MIT)
>> >   * pcre (BSD)
>> >   * libedit (BSD)
>> >   * libxml2 (MIT)
>> >   * zlib (Permissive Free Software License)
>> >   * libgsasl (LGPL Version 2.1)
>> >   * thrift (Apache Version 2.0)
>> >   * snappy (Apache Version 2.0 (up to 1.0.1)/New BSD)
>> >   * libuuid-2.26 (LGPL Version 2)
>> >   * apache hadoop (Apache Version 2.0)
>> >   * apache avro (Apache Version 2.0)
>> >   * glog (BSD)
>> >   * googlemock (BSD)
>> >
>> > Build only dependencies:
>> >   * ant (Apache Version 2.0)
>> >   * maven (Apache Version 2.0)
>> >   * cmake (BSD)
>> >
>> > Test only dependencies:
>> >   * googletest (BSD)
>> >
>> > Cryptography N/A
>> >
>> > == Required Resources ==
>> >
>> > === Mailing lists ===
>> >   * priv...@hawq.incubator.apache.org (moderated subscriptions)
>> >   * comm...@hawq.incubator.apache.org
>> >   * d...@hawq.incubator.apache.org
>> >   * iss...@hawq.incubator.apache.org
>> >   * u...@hawq.incubator.apache.org
>> >
>> > === Git Repository ===
>> > https://git-wip-us.apache.org/repos/asf/incubator-hawq.git
>> >
>> > === Issue Tracking ===
>> > JIRA Project HAWQ (HAWQ)
>> >
>> > === Other Resources ===
>> >
>> > Means of setting up regular builds for HAWQ on builds.apache.org will
>> > require integration with Docker support.
>> >
>> > == Initial Committers ==
>> >   * Lirong Jian
>> >   * Hubert Huan Zhang
>> >   * Radar Da Lei
>> >   * Ivan Yanqing Weng
>> >   * Zhanwei Wang
>> >   * Yi Jin
>> >   * Lili Ma
>> >   * Jiali Yao
>> >   * Zhenglin Tao
>> >   * Ruilong Huo
>> >   * Ming Li
>> >   * Wen Lin
>> >   * Lei Chang
>> >   * Alexander V Denissov
>> >   * Newton Alex
>> >   * Oleksandr Diachenko
>> >   * Jun Aoki
>> >   * Bhuvnesh Chaudhary
>> >   * Vineet Goel
>> >   * Shivram Mani
>> >   * Noa Horn
>> >   * Sujeet S Varakhedi
>> >   * Junwei (Jimmy) Da
>> >   * Ting (Goden) Yao
>> >   * Mohammad F (Foyzur) Rahman
>> >   * Entong Shen
>> >   * George C Caragea
>> >   * Amr El-Helw
>> >   * Mohamed F Soliman
>> >   * Venkatesh (Venky) Raghavan
>> >   * Carlos Garcia
>> >   * Zixi (Jesse) Zhang
>> >   * Michael P Schubert
>> >   * C.J. Jameson
>> >   * Jacob Frank
>> >   * Ben Calegari
>> >   * Shoabe Shariff
>> >   * Rob Day-Reynolds
>> >   * Mel S Kiyama
>> >   * Charles Alan Litzell
>> >   * David Yozie
>> >   * Ed Espino
>> >   * Caleb Welton
>> >   * Parham Parvizi
>> >   * Dan Baskette
>> >   * Christian Tzolov
>> >   * Tushar Pednekar
>> >   * Greg Chase
>> >   * Chloe Jackson
>> >   * Michael Nixon
>> >   * Roman Shaposhnik
>> >   * Alan Gates
>> >   * Owen O'Malley
>> >   * Thejas Nair
>> >   * Don Bosco Durai
>> >   * Konstantin Boudnik
>> >   * Sergey Soldatov
>> >   * Atri Sharma
>> >
>> > == Affiliations ==
>> >   * Barclays:  Atri Sharma
>> >   * Bloomberg: Justin Erenkrantz
>> >   * Hortonworks: Alan Gates, Owen O'Malley, Thejas Nair, Don Bosco Durai
>> >   * WANDisco: Konstantin Boudnik, Sergey Soldatov
>> >   * Pivotal: everyone else on this proposal
>> >
>> > == Sponsors ==
>> >
>> > === Champion ===
>> > Roman Shaposhnik
>> >
>> > === Nominated Mentors ===
>> >
>> > The initial mentors are listed below:
>> >   * Alan Gates - Apache Member, Hortonworks
>> >   * Owen O'Malley - Apache Member, Hortonworks
>> >   * Thejas Nair - Apache Member, Hortonworks
>> >   * Konstantin Boudnik - Apache Member, WANDisco
>> >   * Roman Shaposhnik - Apache Member, Pivotal
>> >   * Justin Erenkrantz - Apache Member, Bloomberg
>> >
>> > === Sponsoring Entity ===
>> > We would like to propose Apache incubator to sponsor this project.
>> >
>>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[RESULT] [VOTE] Accept Apache Singa as incubator project

2015-03-17 Thread Thejas Nair
After a week-long voting period, the VOTE for accepting Singa into the
Apache Incubator has passed with 7 binding +1s and 1 binding +0.

+1 (binding)
Konstantin I Boudnik
Ted Dunning
Alan Gates
Thejas Nair
Konstantin I Boudnik
Alan Cabrera
Daniel Dai

+0 (binding)
Jan I

Thanks for voting!

I will work on getting the infra JIRAs created.

Thanks,
Thejas


-- Forwarded message --
From: Thejas Nair thejas.n...@gmail.com
Date: Tue, Mar 10, 2015 at 7:30 AM
Subject: [VOTE] Accept Apache Singa as incubator project
To:
Cc: oo...@comp.nus.edu.sg


The Singa Incubator Proposal document has been updated based on
feedback in the proposal thread.

This vote is proposing the inclusion of Apache Singa as incubator project.
The vote will run for at least 72 hours.

[ ] +1 Accept Apache Singa into the Incubator
[ ] +0 Don’t care.
[ ] -1 Don’t accept Apache Singa into the Incubator because..

Please vote !

Here is my +1 .

Link to version of proposal being voted on :
https://wiki.apache.org/incubator/SingaProposal?action=recallrev=10

The text is below
--

= Singa Incubator Proposal =
== Abstract ==
SINGA is a distributed deep learning platform.

== Proposal ==
SINGA is an efficient, scalable and easy-to-use distributed platform
for training deep learning models, e.g., Deep Convolutional Neural Network and
Deep Belief Network. It parallelizes the computation (i.e., training) onto a
cluster of nodes by distributing the training data and model automatically to
speed up the training. Built-in training algorithms like Back-Propagation and
Contrastive Divergence are implemented based on common abstractions of deep
learning models. Users can train their own deep learning models by simply
customizing these abstractions like implementing the Mapper and
Reducer in Hadoop.

== Background ==
Deep learning refers to a set of feature (or representation) learning models
that consist of multiple (non-linear) layers, where different layers learn
different levels of abstractions (representations) of the raw input data.
Larger (in terms of model parameters) and deeper (in terms of number of layers)
models have shown better performance, e.g., lower image classification error in
the Large Scale Visual Recognition Challenge. However, a larger model requires
more memory and more training data to reduce over-fitting. Complex numeric
operations make the training computationally intensive. In practice, training
large deep learning models takes weeks or months on a single node (even with a
GPU).

== Rationale ==
Deep learning has gained a lot of traction in both academia and industry due to
its success in a wide range of areas such as computer vision and speech
recognition. However, training such models is computationally expensive,
especially for large and deep models (e.g., with billions of parameters and more
than 10 layers). Both Google and Microsoft have developed distributed deep
learning systems to make training more efficient by distributing the
computation across a cluster of nodes. However, these systems are closed-source
software. Our goal is to leverage the community of open source developers to
make SINGA efficient, scalable and easy to use. SINGA is a full-fledged
distributed platform that could benefit the community and also benefit from the
community's involvement in further work in this area. We believe the nature of
SINGA and our vision for the system fit naturally with Apache's philosophy and
development framework.

== Initial Goals ==
We have developed a system for SINGA running on a commodity computer
cluster. The initial goals include,
 * improving the system in terms of scalability and efficiency, e.g.,
using Infiniband for network communication and multi-threading for one
node computation. We would consider extending SINGA to GPU clusters
later.
 * benchmarking with larger datasets (hundreds of millions of training
instances) and models (billions of parameters).
 * adding more built-in deep learning models. Users can train the
built-in models on their datasets directly.


== Current Status ==
=== Meritocracy ===
We would like to follow ASF meritocratic principles to encourage more developers
to contribute to this project. We know that only active and excellent developers
can make SINGA a successful project. The committer list and PMC will be updated
based on developers' performance and commitment. We are also improving the
documentation and code to help new developers get started quickly.

=== Community ===
SINGA is currently being developed in the Database System Research Lab at the
National University of Singapore (NUS) in collaboration with Zhejiang
University in China.
Our lab has extensive experience in building database related systems, including
distributed systems. Six PhD students and research assistants (Jinyang Gao,
Kaiping Zheng, Sheng Wang, Wei Wang, Zhaojing Luo and Zhongle Xie) , a research
fellow (Anh Dinh) and three

Re: [VOTE] Accept Apache Singa as incubator project

2015-03-12 Thread Thejas Nair
Thanks for the feedback Bertrand!
Yes, I agree it makes sense to start with a single user and dev
mailing list. I got this feedback during the proposal phase, but I
forgot to update the proposal.


On Wed, Mar 11, 2015 at 2:42 AM, Bertrand Delacretaz
bdelacre...@apache.org wrote:
 On Tue, Mar 10, 2015 at 11:52 PM, Olemis Lang ole...@gmail.com wrote:
 ...I do not know if this matters at all but JFYI , singa is considered
 as an obscene word by native Spanish speakers in quite a few regions 

 It does matter in terms of marketing IMO.

 Also, dunno if that's been discussed already and it's just a detail
 but in general I recommend starting without a user mailing list, and
 creating only if dev list traffic becomes a problem.

 Apart from that +1 to incubation.

 -Bertrand



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Apache Singa as incubator project

2015-03-11 Thread Thejas Nair
Thanks for the input on the name. Naming is always tricky!
We can look into that as the project goes through incubation and
before it graduates.
Apache guideline on naming - http://www.apache.org/foundation/marks/naming.html

On Tue, Mar 10, 2015 at 3:52 PM, Olemis Lang ole...@gmail.com wrote:
 On 3/10/15, Thejas Nair thejas.n...@gmail.com wrote:
 The Singa Incubator Proposal document has been updated based on
 feedback in the proposal thread.


 I do not know if this matters at all but JFYI , singa is considered
 as an obscene word by native Spanish speakers in quite a few regions .

 [...]

 --
 Regards,

 Olemis - @olemislc

 Apache(tm) Bloodhound contributor
 http://issues.apache.org/bloodhound
 http://blood-hound.net

 Blog ES: http://simelo-es.blogspot.com/
 Blog EN: http://simelo-en.blogspot.com/



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE] Accept Apache Singa as incubator project

2015-03-10 Thread Thejas Nair
 be implemented on these two platforms as
well. However, there are differences in training efficiency, scalability and
usability. Mahout and Spark MLlib follow models where their nodes run
synchronously. This is the fundamental difference from Singa, which follows the
parameter server framework (like Google Brain and Microsoft Adam). Singa can
run synchronously or asynchronously. The asynchronous mode is superior to the
synchronous mode in terms of scalability. In addition, Singa has some
optimizations for deep learning models (e.g., model parallelism, data
parallelism and hybrid parallelism) which make Singa more efficient. We also
provide an easy-to-use programming model for deep learning algorithms.
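
To make the synchronous/asynchronous distinction above concrete, here is a
minimal single-process sketch of a parameter server. It is not SINGA code: the
ParameterServer class, the toy one-weight model y = w * x, and all function
names are assumptions made up for illustration. Workers are simulated in a
loop rather than run on separate nodes.

```python
import random


class ParameterServer:
    """Hypothetical server holding the shared model: one weight w for y = w * x."""

    def __init__(self, lr=0.05):
        self.w = 0.0
        self.lr = lr

    def pull(self):
        return self.w

    def push(self, grad):
        self.w -= self.lr * grad


def make_shards(num_workers=4, true_w=3.0):
    """Split a tiny synthetic dataset (y = true_w * x) evenly across workers."""
    data = [(i / 4, true_w * i / 4) for i in range(1, 9)]
    size = len(data) // num_workers
    return [data[i * size:(i + 1) * size] for i in range(num_workers)]


def gradient(w, shard):
    """Mean gradient of 0.5 * (w * x - y)**2 over one worker's shard."""
    return sum((w * x - y) * x for x, y in shard) / len(shard)


def train_sync(shards, steps=300):
    """Synchronous mode: every worker uses the same w, one update per round."""
    ps = ParameterServer()
    for _ in range(steps):
        w = ps.pull()
        grads = [gradient(w, s) for s in shards]
        ps.push(sum(grads) / len(grads))  # acts as a barrier across workers
    return ps.w


def train_async(shards, steps=300, seed=0):
    """Asynchronous mode: workers push gradients computed on stale copies."""
    rng = random.Random(seed)
    ps = ParameterServer()
    local = [ps.pull() for _ in shards]  # each worker's possibly stale copy
    for _ in range(steps * len(shards)):
        k = rng.randrange(len(shards))  # whichever worker finishes next
        ps.push(gradient(local[k], shards[k]))
        local[k] = ps.pull()  # only this worker refreshes its copy
    return ps.w
```

In the synchronous loop no worker can proceed until all gradients for the round
are in, which is where the scalability cost comes from. In the asynchronous
loop no worker waits, at the price of applying gradients computed from stale
parameters.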

There are also plans for integration with Apache Hadoop's HDFS as
storage, to  handle large training data.
Specifically, we store the training data (e.g., images or raw features of
images) in HDFS, then (pre-)fetch them online.
We will also explore integration with Hadoop's Yarn and Apache Mesos
to do resource management.
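
The "(pre-)fetch them online" idea can be sketched as a background thread that
fills a bounded buffer while training consumes from it. This is only an
illustration under stated assumptions: read_batches is a hypothetical stand-in
for reading records from HDFS, and prefetch is not an API of SINGA or Hadoop.

```python
import queue
import threading


def prefetch(batches, depth=2):
    """Yield items from `batches`, loading up to `depth` of them ahead of time."""
    buf = queue.Queue(maxsize=depth)
    done = object()  # sentinel marking the end of the stream

    def producer():
        for batch in batches:
            buf.put(batch)  # blocks while the buffer is full
        buf.put(done)

    threading.Thread(target=producer, daemon=True).start()
    while True:
        item = buf.get()
        if item is done:
            return
        yield item


def read_batches(num_batches, batch_size):
    """Hypothetical stand-in for reading training records from HDFS."""
    for i in range(num_batches):
        yield [i * batch_size + j for j in range(batch_size)]
```

A training loop would iterate over prefetch(read_batches(...)), so the next
batch is being read while the current one is processed. The sketch does not
propagate reader errors to the consumer; a real loader would need that.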


== Documentation ==
The project is hosted at
http://www.comp.nus.edu.sg/~dbsystem/project/singa.html.
Documentation can be found at the GitHub wiki page:
https://github.com/nusinga/singa/wiki.
We continue to refine and improve the documentation.

== Initial Source ==
We use Github to maintain our source code, https://github.com/nusinga/singa

== Source and Intellectual Property Submission Plan ==
We plan to release our code base under the Apache License, Version 2.0.

== External Dependencies ==
 * required by the core code base: glog, gflags, google protobuf,
open-blas, mpich, armci-mpi.
 * required by data preparation and preprocessing: opencv, hdfs, python.

== Cryptography ==
Not Applicable

== Required Resources ==
=== Mailing Lists ===
Currently, we use a Google Group for internal discussion. The mailing address is
nusi...@googlegroup.com. We will migrate the content to the Apache mailing
lists in the future.

 * singa-dev
 * singa-user
 * singa-commits
 * singa-private (for private discussion within the PMC)

=== Git Repository ===
We want to continue using git for version control. Hence, a git repo
is required.

=== Issue Tracking ===
JIRA Singa (SINGA)

== Initial Committers ==
 * Beng Chin Ooi (ooibc @comp.nus.edu.sg)
 * Kian Lee Tan (tankl @comp.nus.edu.sg)
 * Gang Chen (cg @zju.edu.cn)
 * Wei Wang (wangwei @comp.nus.edu.sg)
 * Dinh Tien Tuan Anh (dinhtta @comp.nus.edu.sg)
 * Jinyang Gao (jinyang.gao @comp.nus.edu.sg)
 * Sheng Wang (wangsh @comp.nus.edu.sg)
 * Kaiping Zheng (kaiping @comp.nus.edu.sg)
 * Zhaojing Luo (zhaojing @comp.nus.edu.sg)
 * Zhongle Xie (zhongle @comp.nus.edu.sg)

== Affiliations ==
 * Beng Chin Ooi, National University of Singapore
 * Kian Lee Tan, National University of Singapore
 * Gang Chen, Zhejiang University
 * Wei Wang, National University of Singapore
 * Dinh Tien Tuan Anh, National University of Singapore
 * Jinyang Gao, National University of Singapore
 * Sheng Wang, National University of Singapore
 * Kaiping Zheng, National University of Singapore
 * Zhaojing Luo, National University of Singapore
 * Zhongle Xie, National University of Singapore

== Sponsors ==
===  Champion ===
Thejas Nair (thejas at apache.org)

=== Nominated Mentors ===
 * Thejas Nair (thejas at apache.org)
 * Alan Gates (gates at apache dot org)
 * Daniel Dai (daijy at apache dot org)
 * Ted Dunning (tdunning at apache dot org)

=== Sponsoring Entity ===
We are requesting the Incubator to sponsor this project.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Apache Singa as incubator project

2015-03-10 Thread Thejas Nair
Thanks for raising this issue. I agree that committer diversity is
important for the long-term success of a project. I think that should be a
criterion for graduation from the incubator.
I think it is going to be easier to find new contributors as an Apache
incubator project.


On Tue, Mar 10, 2015 at 9:09 AM, jan i j...@apache.org wrote:


 +0 I am really concerned about the diversity of the initial committers,
 what happens if the university pulls the plug. I know we all say it will
 never happen, but it could happen.

 rgds
 jan i.


 On 10 March 2015 at 16:20, Alan Gates alanfga...@gmail.com wrote:

 +1

 Alan.

   Thejas Nair thejas.n...@gmail.com
  March 10, 2015 at 7:33
 The Singa Incubator Proposal document has been updated based on
 feedback in the proposal thread.

 This vote is proposing the inclusion of Apache Singa as incubator project.
 The vote will run for at least 72 hours.

 [ ] +1 Accept Apache Singa into the Incubator
 [ ] +0 Don’t care.
 [ ] -1 Don’t accept Apache Singa into the Incubator because..

 Please vote !

 Here is my +1 .

 Link to version of proposal being voted on :
 https://wiki.apache.org/incubator/SingaProposal?action=recallrev=10

 The text is below
 --

 [...]

Re: [Fwd: Re: [DISCUSS] [PROPOSAL] Singa for Apache Incubator]

2015-03-03 Thread Thejas Nair
I have added Ted as a mentor, so we now have some diversity in mentor
affiliations. (Thanks Ted!)
I also reached out to a few other people in the Mahout community who I
thought might be interested, but I didn't hear from them.

I am planning to put this to a vote in 2 days. Meanwhile, please let
me know if anybody else would be willing to join as a mentor.

Thanks,
Thejas


On Fri, Feb 27, 2015 at 3:44 PM, Thejas Nair thejas.n...@gmail.com wrote:
 Thanks Ted. That helps a lot !
 I have also reached out to few other folks in Mahout community to see
 if they might also be interested.


 On Fri, Feb 27, 2015 at 8:06 AM, Ted Dunning ted.dunn...@gmail.com wrote:
 Thejas,

 Please add me as a mentor if it helps to have diversity.  I have enormous
 trust based on previous experience with him that Alan Gates would act as a
 highly impartial and effective mentor, but would be happy to help if there
 is a concern that could be addressed by having another mentor from a
 different company.



 On Thu, Feb 26, 2015 at 6:12 PM, Thejas Nair thejas.n...@gmail.com wrote:

 The incubator proposal has been updated with the feedback so far.
 We have 3 mentors now, but I think it would be good to have additional
 mentors. Please let me know if anyone is able to help mentor this
 project.

 I am planning to start a vote on the proposal in a day or two.


 On Fri, Feb 6, 2015 at 5:21 PM,  oo...@comp.nus.edu.sg wrote:
 
  [...]

Re: [DISCUSS] [PROPOSAL] Singa for Apache Incubator

2015-02-27 Thread Thejas Nair
Thanks for your inputs Henry.
I did send personal emails to two folks (outside of Hortonworks) who
seemed to be interested in the project, but that didn't help.  I have
also been soliciting more mentors in this thread as well. I will try
reaching out to folks who are in the intersection of incubator and
mahout (or spark-ml) to see if they might be interested (hopefully
people working on related projects are more likely to join in).
Any other suggestions for soliciting a more diverse set of mentors are
also welcome.

Regarding the diversity of initial set of committers, growing that
should be easier once the project is an apache incubator project.  I
see a strong desire to grow the community in the people who are
currently working on the project.



On Thu, Feb 26, 2015 at 11:42 PM, Henry Saputra henry.sapu...@gmail.com wrote:
 I was not actually talking about a requirement, but for the sake of
 the podling itself.

 If all initial mentors come from the same company, the risk of all of
 them being absent is greater because all will be subject to the same schedule
 and priorities from their daytime employers. Especially for release
 VOTEs. Three initial mentors won't be enough for this project, I think.

 Not too worried about initial committers coming from the same org, but I
 have seen that a podling that does not have an initial community will
 struggle to thrive.

 Just my 2 cents from my experience in the incubator.

 - Henry

 On Thu, Feb 26, 2015 at 11:37 PM, jan i j...@apache.org wrote:
 On Friday, February 27, 2015, Henry Saputra henry.sapu...@gmail.com wrote:

 I strongly suggest you solicit more (diverse) mentors before starting the
 VOTE.

 All initial committers are from the same org and all initial mentors are
 from the same company (HW).

 We do have a requirement for diversity; for me, all initial committers being
 from the same company is just as big a problem as the mentors. When everyone
 involved is from the same company, that signals a serious problem
 which should be addressed before starting a vote.

 rgds
 jan i


 I am not sure this is a good start for Apache podling.


 - Henry

 On Thu, Feb 26, 2015 at 9:12 AM, Thejas Nair thejas.n...@gmail.com wrote:
  The incubator proposal has been updated with the feedback so far.
  We have 3 mentors now, but I think it would be good to have additional
  mentors. Please let me know if anyone is able to help mentor this
  project.
 
  I am planning to start a vote on the proposal in a day or two.
 
 
  On Fri, Feb 6, 2015 at 5:21 PM,  oo...@comp.nus.edu.sg wrote:
  [...]

Re: [Fwd: Re: [DISCUSS] [PROPOSAL] Singa for Apache Incubator]

2015-02-27 Thread Thejas Nair
Thanks Ted. That helps a lot!
I have also reached out to a few other folks in the Mahout community to see
if they might also be interested.


On Fri, Feb 27, 2015 at 8:06 AM, Ted Dunning ted.dunn...@gmail.com wrote:
 Thejas,

 Please add me as a mentor if it helps to have diversity.  I have enormous
 trust based on previous experience with him that Alan Gates would act as a
 highly impartial and effective mentor, but would be happy to help if there
 is a concern that could be addressed by having another mentor from a
 different company.



 On Thu, Feb 26, 2015 at 6:12 PM, Thejas Nair thejas.n...@gmail.com wrote:

 The incubator proposal has been updated with the feedback so far.
 We have 3 mentors now, but I think it would be good to have additional
 mentors. Please let me know if anyone is able to help mentor this
 project.

 I am planning to start a vote on the proposal in a day or two.


 On Fri, Feb 6, 2015 at 5:21 PM,  oo...@comp.nus.edu.sg wrote:
 
  Regarding the number of users using this project -- at this moment, the
  community is not big.  A few local start-ups have been trying to use it
  (mainly due to the announcement in our seminar list), e.g., one is using it
  for image recognition (given a photo snapped by a user, it wants to return
  the same product, and a list of similar products, such as a luxury bag
  on a passerby).  Researchers from outside of NUS may have been using it
  since we published an application paper on cross domain/modal retrieval in
  VLDB 2014.
 
  We have not announced the project to the outside community yet -- we would
  announce it in dbworld etc. in due course.
 
  Thanks and have a good weekend.
 
  regards
  beng chin
 
 
  Thanks for the comments and suggestions.
  With permission from Thejas, I would like to respond to point 2.
 
  We have a huge team down at NUS (National University of Singapore) --
  we have about seven database/data mining data professors (not including
  those in systems, networking, and machine learning).
  I myself have nine PhD students in a steady state, and I have a few
 large
  grants, with a total budget of about 15 million S$ (~12 million USD),
 that
  allows me to hire a number of research fellows and research assistants
 for
  the next few years.  In a constant state, I have about 20 people (PhD
  students/RA/RF) working with me alone.  Other professors have their own
  grants (unlike other countries, it is relatively easy to get large
 grants
  in Singapore; many overseas Universities, including UIUC, MIT, ETH etc
  have research labs funded by Singapore Research Foundation [equivalent
 of
  NSF]).
 
  SINGA is a long term project for us -- while it is a platform as it is,
 we
  are using it for healthcare predictive analytics (by working with a
  hospital associated with the University).  Therefore, we will be working
  on SINGA, not solely as a distributed DL platform, but as a tool that
 will
  enable us to do data analytics on some business domains (eg. healthcase,
  consumer etc)
 
  For the initial set of committers, three are tenured professors, five are
  students, with 2-5 years to go before they complete their PhD.  Quite
  often, some would stay back as a research fellow for a couple of years
  before they start looking for a job outside.  We will work with mentors
  and new developers (from outside of NUS or Zhejiang University) in
  enhancing the system.
 
  The project should survive in that sense.
 
  (I have an on-going project CIIDAA that has been around since 2008; it was
  started as another project, epiC, with a different grant, and then we
  continue the development with a new grant for CIIDAA --
  http://www.comp.nus.edu.sg/~ciidaa/
  )
 
  Thanks.
 
  regards
  beng chin
  ps: i am not sure if my email will get through to the group.
 
 
   Original Message
 
  Subject: Re: [DISCUSS] [PROPOSAL] Singa for Apache Incubator
  From:Henry Saputra henry.sapu...@gmail.com
  Date:Thu, February 5, 2015 2:57 pm
  To:  general@incubator.apache.org general@incubator.apache.org
  Cc:  oo...@comp.nus.edu.sg
 
 --
 
  Several comments:
  -) How many users are already using this project? I would recommend
  dropping the request for a singa-user list at the beginning.
  -) All the initial committers come from a university, and it seems like
  some of them are already about to leave the university. I am not too sure
  this project will survive if all of the initial committers are from
  the university as students.
  -) Need to solicit more mentors if this project ever gets into the Apache
  incubator.
 
  - Henry
 
  On Tue, Feb 3, 2015 at 3:58 PM, Thejas Nair thejas.n...@gmail.com wrote:
  The Relationship with Other Apache Products section has been
  updated. The reference to H2O in that section has been removed, and
  other projects have been added.
   Thanks for the feedback!
 
 
  On Wed, Jan 28, 2015 at 10

Re: [Fwd: Re: [DISCUSS] [PROPOSAL] Singa for Apache Incubator]

2015-02-26 Thread Thejas Nair
The incubator proposal has been updated with the feedback so far.
We have 3 mentors now, but I think it would be good to have additional
mentors. Please let me know if anyone is able to help mentor this
project.

I am planning to start a vote on the proposal in a day or two.


On Fri, Feb 6, 2015 at 5:21 PM,  oo...@comp.nus.edu.sg wrote:

 Regarding the number of users using this project -- at this moment, the
 community is not big.  A few local start-ups have been trying to use it
 (mainly due to announcement in our seminar list), eg. one is using it for
 image recognition (given a photo snapped by a user, it wants to return
 the same product, and a list of similar products, such as a luxury bag
 on a passerby).  Researchers from outside of NUS may have been using it
 since we published an application paper on cross domain/modal retrieval in
 VLDB 2014.

 We have not announced the project to the outside community yet -- we would
 announce it in dbworld etc in due course.

 Thanks and have a good weekend.

 regards
 beng chin


 Thanks for the comments and suggestions.
 With permission from Thejas, I would like to respond to point 2.

 We have a huge team down at NUS (National University of Singapore) --
 we have about seven database/data mining professors (not including
 those in systems, networking, and machine learning).
 I myself have nine PhD students in a steady state, and I have a few large
 grants, with a total budget of about 15 million S$ (~12 million USD), that
 allows me to hire a number of research fellows and research assistants for
 the next few years.  In a constant state, I have about 20 people (PhD
 students/RA/RF) working with me alone.  Other professors have their own
 grants (unlike other countries, it is relatively easy to get large grants
 in Singapore; many overseas Universities, including UIUC, MIT, ETH etc
 have research labs funded by Singapore Research Foundation [equivalent of
 NSF]).

 SINGA is a long term project for us -- while it is a platform as it is, we
 are using it for healthcare predictive analytics (by working with a
 hospital associated with the University).  Therefore, we will be working
 on SINGA, not solely as a distributed DL platform, but as a tool that will
 enable us to do data analytics on some business domains (eg. healthcare,
 consumer etc)

 For the initial set of committers, three are tenured professors, five are
 students, with 2-5 years to go before they complete their PhD.  Quite
 often, some would stay back as a research fellow for a couple of years
 before they start looking for a job outside.  We will work with mentors
 and new developers (from outside of NUS or Zhejiang University) in
 enhancing the system.

 The project should survive in that sense.

 (I have an on-going project CIIDAA that has been around since 2008; it was
 started as another project, epiC,  with a different grant, and then we
 continue the development with a new grant for CIIDAA --
 http://www.comp.nus.edu.sg/~ciidaa/
 )

 Thanks.

 regards
 beng chin
 ps: i am not sure if my email will get through to the group.


  Original Message 
 Subject: Re: [DISCUSS] [PROPOSAL] Singa for Apache Incubator
 From:Henry Saputra henry.sapu...@gmail.com
 Date:Thu, February 5, 2015 2:57 pm
 To:  general@incubator.apache.org general@incubator.apache.org
 Cc:  oo...@comp.nus.edu.sg
 --

 Several comments:
 -) How many users are already using this project? I would recommend
 dropping the request for a singa-user list at the beginning.
 -) All the initial committers come from a university, and it seems like
 some of them are already about to leave the university. I am not too sure
 this project will survive if all of the initial committers are from
 the university as students.
 -) Need to solicit more mentors if this project ever gets into the Apache
 incubator.

 - Henry

 On Tue, Feb 3, 2015 at 3:58 PM, Thejas Nair thejas.n...@gmail.com wrote:
 The Relationship with Other Apache Products section has been
 updated. The reference to H2O in that section has been removed, and
 other projects have been added.
  Thanks for the feedback!


 On Wed, Jan 28, 2015 at 10:27 AM, Thejas Nair thejas.n...@gmail.com
 wrote:
 Thanks for pointing that out Henry! Yes, looks like H2O is not an
 Apache project, I should have verified that.
 I will edit that, and revisit that section along with the folks in
 Singa community.


 On Tue, Jan 27, 2015 at 6:55 PM, Henry Saputra
 henry.sapu...@gmail.com wrote:
 Quick immediate comment that Apache H2O is not really an Apache
 project.

 I assume you are referring to https://github.com/h2oai/h2o (or
 https://github.com/h2oai/h2o-dev) ?

 - Henry

 On Tue, Jan 27, 2015 at 5:29 PM, Thejas Nair thejas.n...@gmail.com
 wrote:
 Hello everyone,

 I would like to propose the inclusion of Singa as an Apache Incubator
 project.

 Here is the proposal -
 https

Re: [DISCUSS] [PROPOSAL] Singa for Apache Incubator

2015-02-03 Thread Thejas Nair
The Relationship with Other Apache Products section has been
updated. The reference to H2O in that section has been removed, and
other projects have been added.
 Thanks for the feedback!


On Wed, Jan 28, 2015 at 10:27 AM, Thejas Nair thejas.n...@gmail.com wrote:
 Thanks for pointing that out Henry! Yes, looks like H2O is not an
 Apache project, I should have verified that.
 I will edit that, and revisit that section along with the folks in
 Singa community.


 On Tue, Jan 27, 2015 at 6:55 PM, Henry Saputra henry.sapu...@gmail.com 
 wrote:
 Quick immediate comment that Apache H2O is not really an Apache project.

 I assume you are referring to https://github.com/h2oai/h2o (or
 https://github.com/h2oai/h2o-dev) ?

 - Henry

 On Tue, Jan 27, 2015 at 5:29 PM, Thejas Nair thejas.n...@gmail.com wrote:
 Hello everyone,

 I would like to propose the inclusion of Singa as an Apache Incubator 
 project.

 Here is the proposal - https://wiki.apache.org/incubator/SingaProposal

 Please review the proposal and give feedback. I am planning to start a
 vote after 7 days if the proposal looks good.
 We are also seeking additional Apache mentors for the project.

 Thanks,
 Thejas
 ==
 Singa Incubator Proposal

 Abstract

 SINGA is a distributed deep learning platform.

 Proposal

 SINGA is an efficient, scalable and easy-to-use distributed platform
 for training deep learning models, e.g., Deep Convolutional Neural
 Network and Deep Belief Network. It parallelizes the computation
 (i.e., training) onto a cluster of nodes by distributing the training
 data and model automatically to speed up the training. Built-in
 training algorithms like Back-Propagation and Contrastive Divergence
 are implemented based on common abstractions of deep learning models.
 Users can train their own deep learning models by simply customizing
 these abstractions like implementing the Mapper and Reducer in Hadoop.
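
 As a rough illustration of what "customizing these abstractions" could
 look like, here is a minimal Python sketch. It is not SINGA's actual API:
 the names Layer, ReLU, Scale and backprop are hypothetical and chosen only
 for this example. The point is that the training routine is written once
 against the base class, and users supply only the per-layer logic, much as
 a Hadoop user supplies only a Mapper and a Reducer.

```python
# Hypothetical sketch: none of these names come from SINGA's actual API.
from typing import List


class Layer:
    """Base abstraction: subclasses override forward() and backward()."""

    def forward(self, x: List[float]) -> List[float]:
        raise NotImplementedError

    def backward(self, grad_out: List[float]) -> List[float]:
        raise NotImplementedError


class ReLU(Layer):
    """User-defined layer: element-wise max(0, x)."""

    def forward(self, x):
        # Remember which inputs were positive for the backward pass.
        self._mask = [v > 0 for v in x]
        return [v if m else 0.0 for v, m in zip(x, self._mask)]

    def backward(self, grad_out):
        # Gradient flows only through the positions that were positive.
        return [g if m else 0.0 for g, m in zip(grad_out, self._mask)]


class Scale(Layer):
    """User-defined layer with one trainable parameter: y = w * x."""

    def __init__(self, w: float):
        self.w = w
        self.grad_w = 0.0

    def forward(self, x):
        self._x = x
        return [self.w * v for v in x]

    def backward(self, grad_out):
        # Gradient with respect to the parameter w, then to the input x.
        self.grad_w = sum(g * v for g, v in zip(grad_out, self._x))
        return [self.w * g for g in grad_out]


def backprop(layers: List[Layer], x: List[float],
             grad_loss: List[float]) -> List[float]:
    """Generic back-propagation written only against the Layer interface."""
    for layer in layers:
        x = layer.forward(x)
    grad = grad_loss
    for layer in reversed(layers):
        grad = layer.backward(grad)
    return x


net = [Scale(2.0), ReLU()]
output = backprop(net, [1.0, -3.0], [1.0, 1.0])
```

 Adding a new kind of layer in this sketch means writing one more subclass;
 backprop() itself does not change.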

 Background

 Deep learning refers to a set of feature (or representation) learning
 models that consist of multiple (non-linear) layers, where different
 layers learn different levels of abstractions (representations) of the
 raw input data. Larger (in terms of model parameters) and deeper (in
 terms of number of layers) models have shown better performance, e.g.,
 lower image classification error in Large Scale Visual Recognition
 Challenge. However, a larger model requires more memory and larger
 training data to reduce over-fitting. Complex numeric operations make
 the training computation intensive. In practice, training large deep
 learning models takes weeks or months on a single node (even with
 GPU).

 Rationale

 Deep learning has gained a lot of attention in both academia and
 industry due to its success in a wide range of areas such as computer
 vision and speech recognition. However, training of such models is
 computationally expensive, especially for large and deep models (e.g.,
 with billions of parameters and more than 10 layers). Both Google and
 Microsoft have developed distributed deep learning systems to make the
 training more efficient by distributing the computations within a
 cluster of nodes. However, these systems are closed-source software.
 Our goal is to leverage the community of open source developers to
 make SINGA efficient, scalable and easy to use. SINGA is a
 full-fledged distributed platform that could benefit the community and,
 in turn, benefit from the community's contributions to further work
 in this area. We believe the nature of SINGA and our vision for the
 system fit naturally with Apache's philosophy and development framework.

 Initial Goals

 We have developed a system for SINGA running on a commodity computer
 cluster. The initial goals include:
 * improving the system in terms of scalability and efficiency, e.g.,
   using Infiniband for network communication and multi-threading for
   one node computation. We would consider extending SINGA to GPU
   clusters later.
 * benchmarking with larger datasets (hundreds of millions of training
   instances) and models (billions of parameters).
 * adding more built-in deep learning models. Users can train the
   built-in models on their datasets directly.

 Current Status

 Meritocracy

 We would like to follow ASF meritocratic principles to encourage more
 developers to contribute to this project. We know that only active and
 excellent developers can make SINGA a successful project. The
 committer list and PMC will be updated based on developers'
 performance and commitment. We are also improving the documentation
 and code to help new developers get started quickly.

 Community

 SINGA is currently being developed in the Database System Research Lab
 at the National University of Singapore (NUS) in collaboration with
 Zhejiang University in China. Our lab has extensive experience in
 building database related systems, including distributed systems. Six
 PhD students

[DISCUSS] [PROPOSAL] Singa for Apache Incubator

2015-01-27 Thread Thejas Nair
 are averaged as the final model parameters. This
training algorithm is different from the distributed training
algorithm used by DistBelief, Adam and SINGA, which frequently
synchronizes the parameters trained from different nodes. SINGA adopts
the parameter server framework to support a wide range of distributed
training algorithms and parallelization methods (e.g., data
parallelism, model parallelism and hybrid parallelism; H2O only
supports data parallelism). Second, in H2O, users are restricted to
using the two built-in models. In SINGA, we provide a simple programming
model to let users implement their own deep learning models. A new
deep learning model can be implemented by customizing the base Layer
class for each layer involved in the model. It is similar to writing
Hadoop programs where users only need to override the base Mapper and
Reducer. We also provide built-in models for users to use directly.
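
To make the contrast between the two training styles concrete, here is a
small Python sketch on a toy one-parameter model (y = w * x). It is an
illustration under stated assumptions, not code from SINGA, H2O,
DistBelief or Adam: the ParameterServer class, its pull/push methods and
the helper functions are hypothetical names, and the workers are simulated
sequentially in one process. Both variants are data parallel, since each
worker sees only its own shard.

```python
# Illustrative sketch only; names and the toy problem are assumptions,
# not code from SINGA, H2O, DistBelief or Adam.
from typing import List, Tuple

Shard = List[Tuple[float, float]]  # (x, y) pairs held by one worker


def gradient(w: float, shard: Shard) -> float:
    """Gradient of the mean squared error for the model y = w * x."""
    return sum(2 * (w * x - y) * x for x, y in shard) / len(shard)


def train_by_averaging(shards: List[Shard], steps: int, lr: float) -> float:
    """Each worker trains alone; parameters are averaged once at the end."""
    finals = []
    for shard in shards:
        w = 0.0
        for _ in range(steps):
            w -= lr * gradient(w, shard)
        finals.append(w)
    return sum(finals) / len(finals)


class ParameterServer:
    """Holds the shared parameter and applies gradients pushed by workers."""

    def __init__(self, w: float, lr: float):
        self.w = w
        self.lr = lr

    def pull(self) -> float:
        return self.w

    def push(self, grad: float) -> None:
        self.w -= self.lr * grad


def train_with_server(shards: List[Shard], steps: int, lr: float) -> float:
    """Workers pull the latest parameter and push a gradient every step."""
    server = ParameterServer(0.0, lr)
    for _ in range(steps):
        for shard in shards:  # workers simulated one after another
            server.push(gradient(server.pull(), shard))
    return server.pull()


# Two workers, each holding a shard of data generated from y = 3 * x.
shards = [[(1.0, 3.0), (2.0, 6.0)], [(3.0, 9.0), (4.0, 12.0)]]
w_avg = train_by_averaging(shards, steps=200, lr=0.01)
w_ps = train_with_server(shards, steps=200, lr=0.01)
```

On this toy problem both approaches end up close to w = 3 because the
shards agree with each other; the difference is in when the workers
exchange parameters, once at the end versus on every step.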

Documentation

The project is hosted at
http://www.comp.nus.edu.sg/~dbsystem/project/singa.html.
Documentation can be found at the GitHub wiki page:
https://github.com/nusinga/singa/wiki. We continue to refine and
improve the documentation.

Initial Source

We use GitHub to maintain our source code: https://github.com/nusinga/singa

Source and Intellectual Property Submission Plan

We plan to release our code base under the Apache License, Version 2.0.

External Dependencies

required by the core code base: glog, gflags, google protobuf,
open-blas, mpich, armci-mpi.
required by data preparation and preprocessing: opencv, hdfs, python.

Cryptography

Not Applicable

Required Resources

Mailing Lists

Currently, we use a Google group for internal discussion. The mailing
address is nusi...@googlegroup.com. We will migrate the content to the
Apache mailing lists in the future.

singa-dev
singa-user
singa-commits
singa-private (for private discussion within the PMC)

Git Repository

We want to continue using git for version control. Hence, a git repo
is required.

Issue Tracking

JIRA Singa (SINGA)

Initial Committers

Beng Chin Ooi (ooibc @comp.nus.edu.sg)
Kian Lee Tan (tankl @comp.nus.edu.sg)
Gang Chen (cg @zju.edu.cn)
Wei Wang (wangwei @comp.nus.edu.sg)
Dinh Tien Tuan Anh (dinhtta @comp.nus.edu.sg)
Jinyang Gao (jinyang.gao @comp.nus.edu.sg)
Sheng Wang (wangsh @comp.nus.edu.sg)
Kaiping Zheng (kaiping @comp.nus.edu.sg)
Zhaojing Luo (zhaojing @comp.nus.edu.sg)
Zhongle Xie (zhongle @comp.nus.edu.sg)

Affiliations

Beng Chin Ooi, National University of Singapore
Kian Lee Tan, National University of Singapore
Gang Chen, Zhejiang University
Wei Wang, National University of Singapore
Dinh Tien Tuan Anh, National University of Singapore
Jinyang Gao, National University of Singapore
Sheng Wang, National University of Singapore
Kaiping Zheng, National University of Singapore
Zhaojing Luo, National University of Singapore
Zhongle Xie, National University of Singapore

Sponsors

Champion

Thejas Nair (thejas at apache.org) - Hortonworks

Nominated Mentors

Thejas Nair (thejas at apache.org) - Hortonworks
Alan Gates (gates at apache dot org) - Hortonworks
(Seeking more volunteers!)

Sponsoring Entity

We are requesting the Incubator to sponsor this project.




Wiki edit permissions for ThejasNair

2014-12-16 Thread Thejas Nair
Please grant incubator wiki edit permissions for my username - ThejasNair .
I am championing an incubator proposal and would like to create the
proposal wiki page.

Thanks,
Thejas

