Re: [VOTE] Accept Apache AsterixDB in to the Incubator

2015-02-23 Thread Ate Douma

+1 (binding)

Very interesting.
And if you would still like or need another mentor, I'd be willing to help out.

Ate

On 2015-02-20 06:38, Mattmann, Chris A (3980) wrote:

Hi Everyone,

OK, discussion has died down on this thread. I was originally
suggesting that the pTLP option may be best for this community,
but after some discussions with the existing community of
AsterixDB’ers proposing to bring the project here to the ASF,
AsterixDB would like to move forward independent of whatever
comes of the pTLP discussions.

That said, I would like to propose Apache AsterixDB as an
Incubator project. I am now calling a VOTE to accept AsterixDB
into the Apache Incubator. This VOTE will run for at least 72 hours.

[ ] +1 Accept Apache AsterixDB into the Incubator
[ ] +0 Don’t care.
[ ] -1 Don’t accept Apache AsterixDB into the Incubator because..

Thanks for the feedback so far and looking forward to the VOTE!

You can count my binding +1.

Cheers,
Chris

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++

-----Original Message-----
From: Mattmann, Chris Mattmann chris.a.mattm...@jpl.nasa.gov
Date: Wednesday, January 14, 2015 at 6:20 PM
To: general@incubator.apache.org general@incubator.apache.org
Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till
Westmann t...@westmann.org
Subject: [PROPOSAL] Apache AsterixDB Incubator


Hi Folks,

I am pleased to bring forth the Apache AsterixDB proposal to the
Apache Incubator as Champion, working in collaboration with the
team. Please find the wiki proposal here:

https://wiki.apache.org/incubator/AsterixDBProposal


Full text of the proposal is below. Please discuss and enjoy. I’ll
leave the discussion open for a week, and then look to call a VOTE
hopefully end of next week if all is well.

Cheers!
Chris Mattmann

=
Apache AsterixDB Proposal

Abstract

Apache AsterixDB is a scalable big data management system (BDMS) that
provides storage, management, and query capabilities for large
collections of semi-structured data.

Proposal

AsterixDB is a big data management system (BDMS) whose feature set
makes it well-suited to needs such as web data warehousing and social
data storage and analysis. Feature-wise, AsterixDB has:

* A NoSQL-style data model (ADM) based on extending JSON with object
  database concepts.
* An expressive and declarative query language (AQL) for querying
  semi-structured data.
* A runtime query execution engine, Hyracks, for partitioned-parallel
  execution of query plans.
* Partitioned LSM-based data storage and indexing for efficient
  ingestion of newly arriving data.
* Support for querying and indexing external data (e.g., in HDFS) as
  well as data stored within AsterixDB.
* A rich set of primitive data types, including support for spatial,
  temporal, and textual data.
* Indexing options that include B+ trees, R trees, and inverted
  keyword index support.
* Basic transactional (concurrency and recovery) capabilities akin to
  those of a NoSQL store.


Background and Rationale

In the world of relational databases, the need to tackle data volumes
that exceed the capabilities of a single server led to the
development of “shared-nothing” parallel database systems several
decades ago. These systems spread data over a cluster based on a
partitioning strategy, such as hash partitioning, and queries are
processed by employing partitioned-parallel divide-and-conquer
techniques. Since these systems are fronted by a high-level,
declarative language (SQL), their users are shielded from the
complexities of parallel programming. Parallel database systems have
been an extremely successful application of parallel computing, and
quite a number of commercial products exist today.

In the distributed systems world, the Web brought a need to index and
query its huge content. SQL and relational databases were not the
answer, though shared-nothing clusters again emerged as the hardware
platform of choice. Google developed the Google File System (GFS) and the
MapReduce programming model to allow programmers to store and process
Big Data by writing a few user-defined functions. The MapReduce
framework applies these functions in parallel to data instances in
distributed files (map) and to sorted groups of instances sharing a
common key (reduce) -- not unlike the partitioned parallelism in
parallel database systems. Apache's Hadoop MapReduce platform is the
most prominent implementation of this paradigm for the rest

Re: [VOTE] Accept Apache AsterixDB in to the Incubator

2015-02-23 Thread Mattmann, Chris A (3980)
Thank you Ate! I have added you as a mentor on the proposal!

https://wiki.apache.org/incubator/AsterixDBProposal

Cheers,
Chris

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++

-----Original Message-----
From: Ate Douma a...@douma.nu
Reply-To: general@incubator.apache.org general@incubator.apache.org
Date: Monday, February 23, 2015 at 6:47 AM
To: general@incubator.apache.org general@incubator.apache.org
Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till
Westmann t...@westmann.org
Subject: Re: [VOTE] Accept Apache AsterixDB in to the Incubator

+1 (binding)

Very interesting.
And if you would still like or need another mentor, I'd be willing to help out.

Ate


Re: [VOTE] Accept Apache AsterixDB in to the Incubator

2015-02-23 Thread Mike Carey

Thx!

On 2/23/15 4:26 PM, Mattmann, Chris A (3980) wrote:

Thank you Ate! I have added you as a mentor on the proposal!

https://wiki.apache.org/incubator/AsterixDBProposal

Cheers,
Chris


Re: [VOTE] Accept Apache AsterixDB in to the Incubator

2015-02-22 Thread Alan D. Cabrera
+1 binding


Regards,
Alan

 On Feb 19, 2015, at 9:38 PM, Mattmann, Chris A (3980) 
 chris.a.mattm...@jpl.nasa.gov wrote:
 
 That said, I would like to propose Apache AsterixDB as an
 Incubator project. I am now calling a VOTE to accept AsterixDB
 into the Apache Incubator. This VOTE will run for at least 72 hours.
 
 [ ] +1 Accept Apache AsterixDB into the Incubator
 [ ] +0 Don’t care.
 [ ] -1 Don’t accept Apache AsterixDB into the Incubator because..



Re: [VOTE] Accept Apache AsterixDB in to the Incubator

2015-02-22 Thread Timothy Chen
+1 (non-binding)

Tim

 On Feb 22, 2015, at 10:59 AM, Alan D. Cabrera l...@toolazydogs.com wrote:
 
 +1 binding
 
 
 Regards,
 Alan
 
 

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Apache AsterixDB in to the Incubator

2015-02-21 Thread Chris Douglas
+1 (binding) -C


Re: [VOTE] Accept Apache AsterixDB in to the Incubator

2015-02-21 Thread John D. Ament
+1 happy to see you guys come on board!


Re: [VOTE] Accept Apache AsterixDB in to the Incubator

2015-02-21 Thread Henry Saputra
+1  (binding)

- Henry
