Re: [VOTE] Accept Apache AsterixDB in to the Incubator
+1 (binding) Very interesting. And if you still like or need another mentor, I'd be willing to help out. Ate On 2015-02-20 06:38, Mattmann, Chris A (3980) wrote: Hi Everyone, OK, discussion has died down on this thread. I was originally suggesting that the pTLP option may be best for this community, but after some discussions with the existing community of AsterixDB’ers proposing to bring the project here to the ASF, AsterixDB would like to move forward independent of whatever comes of the pTLP discussions. That said, I would like to propose Apache AsterixDB as an Incubator project. I am now calling a VOTE to accept AsterixDB into the Apache Incubator. This VOTE will run for at least 72 hours. [ ] +1 Accept Apache AsterixDB into the Incubator [ ] +0 Don’t care. [ ] -1 Don’t accept Apache AsterixDB into the Incubator because.. Thanks for the feedback so far and looking forward to the VOTE! You can count my binding +1. Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Mattmann, Chris Mattmann chris.a.mattm...@jpl.nasa.gov Date: Wednesday, January 14, 2015 at 6:20 PM To: general@incubator.apache.org general@incubator.apache.org Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till Westmann t...@westmann.org Subject: [PROPOSAL] Apache AsterixDB Incubator Hi Folks, I am pleased to bring forth the Apache AsterixDB proposal to the Apache Incubator as Champion, working in collaboration with the team. Please find the wiki proposal here: https://wiki.apache.org/incubator/AsterixDBProposal Full text of the proposal is below. Please discuss and enjoy. I’ll leave the discussion open for a week, and then look to call a VOTE hopefully end of next week if all is well. Cheers! Chris Mattmann = Apache AsterixDB Proposal Abstract Apache AsterixDB is a scalable big data management system (BDMS) that provides storage, management, and query capabilities for large collections of semi-structured data. Proposal AsterixDB is a big data management system (BDMS) that makes it well-suited to needs such as web data warehousing and social data storage and analysis. Feature-wise, AsterixDB has: * A NoSQL style data model (ADM) based on extending JSON with object database concepts. * An expressive and declarative query language (AQL) for querying semi-structured data. * A runtime query execution engine, Hyracks, for partitioned-parallel execution of query plans. * Partitioned LSM-based data storage and indexing for efficient ingestion of newly arriving data. * Support for querying and indexing external data (e.g., in HDFS) as well as data stored within AsterixDB. * A rich set of primitive data types, including support for spatial, temporal, and textual data. * Indexing options that include B+ trees, R trees, and inverted keyword index support. * Basic transactional (concurrency and recovery) capabilities akin to those of a NoSQL store. Background and Rationale In the world of relational databases, the need to tackle data volumes that exceed the capabilities of a single server led to the development of “shared-nothing” parallel database systems several decades ago. These systems spread data over a cluster based on a partitioning strategy, such as hash partitioning, and queries are processed by employing partitioned-parallel divide-and-conquer techniques. Since these systems are fronted by a high-level, declarative language (SQL), their users are shielded from the complexities of parallel programming. Parallel database systems have been an extremely successful application of parallel computing, and quite a number of commercial products exist today. In the distributed systems world, the Web brought a need to index and query its huge content. SQL and relational databases were not the answer, though shared-nothing clusters again emerged as the hardware platform of choice. Google developed the Google File System (GFS) and MapReduce programming model to allow programmers to store and process Big Data by writing a few user-defined functions. The MapReduce framework applies these functions in parallel to data instances in distributed files (map) and to sorted groups of instances sharing a common key (reduce) -- not unlike the partitioned parallelism in parallel database systems. Apache's Hadoop MapReduce platform is the most prominent implementation of this
Re: [VOTE] Accept Apache AsterixDB in to the Incubator
Thank you Ate! I have added you as a mentor on the proposal! https://wiki.apache.org/incubator/AsterixDBProposal Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Ate Douma a...@douma.nu Reply-To: general@incubator.apache.org general@incubator.apache.org Date: Monday, February 23, 2015 at 6:47 AM To: general@incubator.apache.org general@incubator.apache.org Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till Westmann t...@westmann.org Subject: Re: [VOTE] Accept Apache AsterixDB in to the Incubator +1 (binding) Very interesting. And if you still like or need another mentor, I'd be willing to help out. Ate On 2015-02-20 06:38, Mattmann, Chris A (3980) wrote: Hi Everyone, OK, discussion has died down on this thread. I was originally suggesting that the pTLP option may be best for this community, but after some discussions with the existing community of AsterixDB’ers proposing to bring the project here to the ASF, AsterixDB would like to move forward independent of whatever comes of the pTLP discussions. That said, I would like to propose Apache AsterixDB as an Incubator project. I am now calling a VOTE to accept AsterixDB into the Apache Incubator. This VOTE will run for at least 72 hours. [ ] +1 Accept Apache AsterixDB into the Incubator [ ] +0 Don’t care. [ ] -1 Don’t accept Apache AsterixDB into the Incubator because.. Thanks for the feedback so far and looking forward to the VOTE! You can count my binding +1. Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Mattmann, Chris Mattmann chris.a.mattm...@jpl.nasa.gov Date: Wednesday, January 14, 2015 at 6:20 PM To: general@incubator.apache.org general@incubator.apache.org Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till Westmann t...@westmann.org Subject: [PROPOSAL] Apache AsterixDB Incubator Hi Folks, I am pleased to bring forth the Apache AsterixDB proposal to the Apache Incubator as Champion, working in collaboration with the team. Please find the wiki proposal here: https://wiki.apache.org/incubator/AsterixDBProposal Full text of the proposal is below. Please discuss and enjoy. I’ll leave the discussion open for a week, and then look to call a VOTE hopefully end of next week if all is well. Cheers! Chris Mattmann = Apache AsterixDB Proposal Abstract Apache AsterixDB is a scalable big data management system (BDMS) that provides storage, management, and query capabilities for large collections of semi-structured data. Proposal AsterixDB is a big data management system (BDMS) that makes it well-suited to needs such as web data warehousing and social data storage and analysis. Feature-wise, AsterixDB has: * A NoSQL style data model (ADM) based on extending JSON with object database concepts. * An expressive and declarative query language (AQL) for querying semi-structured data. * A runtime query execution engine, Hyracks, for partitioned-parallel execution of query plans. * Partitioned LSM-based data storage and indexing for efficient ingestion of newly arriving data. * Support for querying and indexing external data (e.g., in HDFS) as well as data stored within AsterixDB. * A rich set of primitive data types, including support for spatial, temporal, and textual data. * Indexing options that include B+ trees, R trees, and inverted keyword index support. * Basic transactional (concurrency and recovery) capabilities akin to those of a NoSQL store. Background and Rationale In the world of relational databases, the need to tackle data volumes that exceed the capabilities of a single server led to the development of “shared-nothing” parallel database systems several decades ago. These systems spread data over
Re: [VOTE] Accept Apache AsterixDB in to the Incubator
Thx! On 2/23/15 4:26 PM, Mattmann, Chris A (3980) wrote: Thank you Ate! I have added you as a mentor on the proposal! https://wiki.apache.org/incubator/AsterixDBProposal Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Ate Douma a...@douma.nu Reply-To: general@incubator.apache.org general@incubator.apache.org Date: Monday, February 23, 2015 at 6:47 AM To: general@incubator.apache.org general@incubator.apache.org Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till Westmann t...@westmann.org Subject: Re: [VOTE] Accept Apache AsterixDB in to the Incubator +1 (binding) Very interesting. And if you still like or need another mentor, I'd be willing to help out. Ate On 2015-02-20 06:38, Mattmann, Chris A (3980) wrote: Hi Everyone, OK, discussion has died down on this thread. I was originally suggesting that the pTLP option may be best for this community, but after some discussions with the existing community of AsterixDB’ers proposing to bring the project here to the ASF, AsterixDB would like to move forward independent of whatever comes of the pTLP discussions. That said, I would like to propose Apache AsterixDB as an Incubator project. I am now calling a VOTE to accept AsterixDB into the Apache Incubator. This VOTE will run for at least 72 hours. [ ] +1 Accept Apache AsterixDB into the Incubator [ ] +0 Don’t care. [ ] -1 Don’t accept Apache AsterixDB into the Incubator because.. Thanks for the feedback so far and looking forward to the VOTE! You can count my binding +1. Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Mattmann, Chris Mattmann chris.a.mattm...@jpl.nasa.gov Date: Wednesday, January 14, 2015 at 6:20 PM To: general@incubator.apache.org general@incubator.apache.org Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till Westmann t...@westmann.org Subject: [PROPOSAL] Apache AsterixDB Incubator Hi Folks, I am pleased to bring forth the Apache AsterixDB proposal to the Apache Incubator as Champion, working in collaboration with the team. Please find the wiki proposal here: https://wiki.apache.org/incubator/AsterixDBProposal Full text of the proposal is below. Please discuss and enjoy. I’ll leave the discussion open for a week, and then look to call a VOTE hopefully end of next week if all is well. Cheers! Chris Mattmann = Apache AsterixDB Proposal Abstract Apache AsterixDB is a scalable big data management system (BDMS) that provides storage, management, and query capabilities for large collections of semi-structured data. Proposal AsterixDB is a big data management system (BDMS) that makes it well-suited to needs such as web data warehousing and social data storage and analysis. Feature-wise, AsterixDB has: * A NoSQL style data model (ADM) based on extending JSON with object database concepts. * An expressive and declarative query language (AQL) for querying semi-structured data. * A runtime query execution engine, Hyracks, for partitioned-parallel execution of query plans. * Partitioned LSM-based data storage and indexing for efficient ingestion of newly arriving data. * Support for querying and indexing external data (e.g., in HDFS) as well as data stored within AsterixDB. * A rich set of primitive data types, including support for spatial, temporal, and textual data. * Indexing options that include B+ trees, R trees, and inverted keyword index support. * Basic transactional (concurrency and recovery) capabilities akin to those of a NoSQL store. Background and Rationale In the world of relational databases, the need to tackle data volumes that exceed the capabilities of a single server led to the development of “shared-nothing” parallel database systems several decades ago. These systems spread data over a cluster
Re: [VOTE] Accept Apache AsterixDB in to the Incubator
+1 binding Regards, Alan On Feb 19, 2015, at 9:38 PM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: That said, I would like to propose Apache AsterixDB as an Incubator project. I am now calling a VOTE to accept AsterixDB into the Apache Incubator. This VOTE will run for at least 72 hours. [ ] +1 Accept Apache AsterixDB into the Incubator [ ] +0 Don’t care. [ ] -1 Don’t accept Apache AsterixDB into the Incubator because..
Re: [VOTE] Accept Apache AsterixDB in to the Incubator
+1 (non-binding) Tim On Feb 22, 2015, at 10:59 AM, Alan D. Cabrera l...@toolazydogs.com wrote: +1 binding Regards, Alan On Feb 19, 2015, at 9:38 PM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: That said, I would like to propose Apache AsterixDB as an Incubator project. I am now calling a VOTE to accept AsterixDB into the Apache Incubator. This VOTE will run for at least 72 hours. [ ] +1 Accept Apache AsterixDB into the Incubator [ ] +0 Don’t care. [ ] -1 Don’t accept Apache AsterixDB into the Incubator because.. - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [VOTE] Accept Apache AsterixDB in to the Incubator
+1 (binding) -C On Thu, Feb 19, 2015 at 9:38 PM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: Hi Everyone, OK, discussion has died down on this thread. I was originally suggesting that the pTLP option may be best for this community, but after some discussions with the existing community of AsterixDB’ers proposing to bring the project here to the ASF, AsterixDB would like to move forward independent of whatever comes of the pTLP discussions. That said, I would like to propose Apache AsterixDB as an Incubator project. I am now calling a VOTE to accept AsterixDB into the Apache Incubator. This VOTE will run for at least 72 hours. [ ] +1 Accept Apache AsterixDB into the Incubator [ ] +0 Don’t care. [ ] -1 Don’t accept Apache AsterixDB into the Incubator because.. Thanks for the feedback so far and looking forward to the VOTE! You can count my binding +1. Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Mattmann, Chris Mattmann chris.a.mattm...@jpl.nasa.gov Date: Wednesday, January 14, 2015 at 6:20 PM To: general@incubator.apache.org general@incubator.apache.org Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till Westmann t...@westmann.org Subject: [PROPOSAL] Apache AsterixDB Incubator Hi Folks, I am pleased to bring forth the Apache AsterixDB proposal to the Apache Incubator as Champion, working in collaboration with the team. Please find the wiki proposal here: https://wiki.apache.org/incubator/AsterixDBProposal Full text of the proposal is below. Please discuss and enjoy. I’ll leave the discussion open for a week, and then look to call a VOTE hopefully end of next week if all is well. Cheers! Chris Mattmann = Apache AsterixDB Proposal Abstract Apache AsterixDB is a scalable big data management system (BDMS) that provides storage, management, and query capabilities for large collections of semi-structured data. Proposal AsterixDB is a big data management system (BDMS) that makes it well-suited to needs such as web data warehousing and social data storage and analysis. Feature-wise, AsterixDB has: * A NoSQL style data model (ADM) based on extending JSON with object database concepts. * An expressive and declarative query language (AQL) for querying semi-structured data. * A runtime query execution engine, Hyracks, for partitioned-parallel execution of query plans. * Partitioned LSM-based data storage and indexing for efficient ingestion of newly arriving data. * Support for querying and indexing external data (e.g., in HDFS) as well as data stored within AsterixDB. * A rich set of primitive data types, including support for spatial, temporal, and textual data. * Indexing options that include B+ trees, R trees, and inverted keyword index support. * Basic transactional (concurrency and recovery) capabilities akin to those of a NoSQL store. Background and Rationale In the world of relational databases, the need to tackle data volumes that exceed the capabilities of a single server led to the development of “shared-nothing” parallel database systems several decades ago. These systems spread data over a cluster based on a partitioning strategy, such as hash partitioning, and queries are processed by employing partitioned-parallel divide-and-conquer techniques. Since these systems are fronted by a high-level, declarative language (SQL), their users are shielded from the complexities of parallel programming. Parallel database systems have been an extremely successful application of parallel computing, and quite a number of commercial products exist today. In the distributed systems world, the Web brought a need to index and query its huge content. SQL and relational databases were not the answer, though shared-nothing clusters again emerged as the hardware platform of choice. Google developed the Google File System (GFS) and MapReduce programming model to allow programmers to store and process Big Data by writing a few user-defined functions. The MapReduce framework applies these functions in parallel to data instances in distributed files (map) and to sorted groups of instances sharing a common key (reduce) -- not unlike the partitioned parallelism in parallel database systems. Apache's Hadoop MapReduce platform is the most prominent implementation of this paradigm for the rest
Re: [VOTE] Accept Apache AsterixDB in to the Incubator
+1 happy to see you guys come on board! On Fri Feb 20 2015 at 12:40:42 AM Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: Hi Everyone, OK, discussion has died down on this thread. I was originally suggesting that the pTLP option may be best for this community, but after some discussions with the existing community of AsterixDB’ers proposing to bring the project here to the ASF, AsterixDB would like to move forward independent of whatever comes of the pTLP discussions. That said, I would like to propose Apache AsterixDB as an Incubator project. I am now calling a VOTE to accept AsterixDB into the Apache Incubator. This VOTE will run for at least 72 hours. [ ] +1 Accept Apache AsterixDB into the Incubator [ ] +0 Don’t care. [ ] -1 Don’t accept Apache AsterixDB into the Incubator because.. Thanks for the feedback so far and looking forward to the VOTE! You can count my binding +1. Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Mattmann, Chris Mattmann chris.a.mattm...@jpl.nasa.gov Date: Wednesday, January 14, 2015 at 6:20 PM To: general@incubator.apache.org general@incubator.apache.org Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till Westmann t...@westmann.org Subject: [PROPOSAL] Apache AsterixDB Incubator Hi Folks, I am pleased to bring forth the Apache AsterixDB proposal to the Apache Incubator as Champion, working in collaboration with the team. Please find the wiki proposal here: https://wiki.apache.org/incubator/AsterixDBProposal Full text of the proposal is below. Please discuss and enjoy. I’ll leave the discussion open for a week, and then look to call a VOTE hopefully end of next week if all is well. Cheers! Chris Mattmann = Apache AsterixDB Proposal Abstract Apache AsterixDB is a scalable big data management system (BDMS) that provides storage, management, and query capabilities for large collections of semi-structured data. Proposal AsterixDB is a big data management system (BDMS) that makes it well-suited to needs such as web data warehousing and social data storage and analysis. Feature-wise, AsterixDB has: * A NoSQL style data model (ADM) based on extending JSON with object database concepts. * An expressive and declarative query language (AQL) for querying semi-structured data. * A runtime query execution engine, Hyracks, for partitioned-parallel execution of query plans. * Partitioned LSM-based data storage and indexing for efficient ingestion of newly arriving data. * Support for querying and indexing external data (e.g., in HDFS) as well as data stored within AsterixDB. * A rich set of primitive data types, including support for spatial, temporal, and textual data. * Indexing options that include B+ trees, R trees, and inverted keyword index support. * Basic transactional (concurrency and recovery) capabilities akin to those of a NoSQL store. Background and Rationale In the world of relational databases, the need to tackle data volumes that exceed the capabilities of a single server led to the development of “shared-nothing” parallel database systems several decades ago. These systems spread data over a cluster based on a partitioning strategy, such as hash partitioning, and queries are processed by employing partitioned-parallel divide-and-conquer techniques. Since these systems are fronted by a high-level, declarative language (SQL), their users are shielded from the complexities of parallel programming. Parallel database systems have been an extremely successful application of parallel computing, and quite a number of commercial products exist today. In the distributed systems world, the Web brought a need to index and query its huge content. SQL and relational databases were not the answer, though shared-nothing clusters again emerged as the hardware platform of choice. Google developed the Google File System (GFS) and MapReduce programming model to allow programmers to store and process Big Data by writing a few user-defined functions. The MapReduce framework applies these functions in parallel to data instances in distributed files (map) and to sorted groups of instances sharing a common key (reduce) -- not unlike the partitioned parallelism in parallel database systems.
Re: [VOTE] Accept Apache AsterixDB in to the Incubator
+1 (binding) - Henry On Thu, Feb 19, 2015 at 9:38 PM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: Hi Everyone, OK, discussion has died down on this thread. I was originally suggesting that the pTLP option may be best for this community, but after some discussions with the existing community of AsterixDB’ers proposing to bring the project here to the ASF, AsterixDB would like to move forward independent of whatever comes of the pTLP discussions. That said, I would like to propose Apache AsterixDB as an Incubator project. I am now calling a VOTE to accept AsterixDB into the Apache Incubator. This VOTE will run for at least 72 hours. [ ] +1 Accept Apache AsterixDB into the Incubator [ ] +0 Don’t care. [ ] -1 Don’t accept Apache AsterixDB into the Incubator because.. Thanks for the feedback so far and looking forward to the VOTE! You can count my binding +1. Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Mattmann, Chris Mattmann chris.a.mattm...@jpl.nasa.gov Date: Wednesday, January 14, 2015 at 6:20 PM To: general@incubator.apache.org general@incubator.apache.org Cc: Michael Carey dtab...@gmail.com, Ian Maxon ima...@uci.edu, Till Westmann t...@westmann.org Subject: [PROPOSAL] Apache AsterixDB Incubator Hi Folks, I am pleased to bring forth the Apache AsterixDB proposal to the Apache Incubator as Champion, working in collaboration with the team. Please find the wiki proposal here: https://wiki.apache.org/incubator/AsterixDBProposal Full text of the proposal is below. Please discuss and enjoy. I’ll leave the discussion open for a week, and then look to call a VOTE hopefully end of next week if all is well. Cheers! Chris Mattmann = Apache AsterixDB Proposal Abstract Apache AsterixDB is a scalable big data management system (BDMS) that provides storage, management, and query capabilities for large collections of semi-structured data. Proposal AsterixDB is a big data management system (BDMS) that makes it well-suited to needs such as web data warehousing and social data storage and analysis. Feature-wise, AsterixDB has: * A NoSQL style data model (ADM) based on extending JSON with object database concepts. * An expressive and declarative query language (AQL) for querying semi-structured data. * A runtime query execution engine, Hyracks, for partitioned-parallel execution of query plans. * Partitioned LSM-based data storage and indexing for efficient ingestion of newly arriving data. * Support for querying and indexing external data (e.g., in HDFS) as well as data stored within AsterixDB. * A rich set of primitive data types, including support for spatial, temporal, and textual data. * Indexing options that include B+ trees, R trees, and inverted keyword index support. * Basic transactional (concurrency and recovery) capabilities akin to those of a NoSQL store. Background and Rationale In the world of relational databases, the need to tackle data volumes that exceed the capabilities of a single server led to the development of “shared-nothing” parallel database systems several decades ago. These systems spread data over a cluster based on a partitioning strategy, such as hash partitioning, and queries are processed by employing partitioned-parallel divide-and-conquer techniques. Since these systems are fronted by a high-level, declarative language (SQL), their users are shielded from the complexities of parallel programming. Parallel database systems have been an extremely successful application of parallel computing, and quite a number of commercial products exist today. In the distributed systems world, the Web brought a need to index and query its huge content. SQL and relational databases were not the answer, though shared-nothing clusters again emerged as the hardware platform of choice. Google developed the Google File System (GFS) and MapReduce programming model to allow programmers to store and process Big Data by writing a few user-defined functions. The MapReduce framework applies these functions in parallel to data instances in distributed files (map) and to sorted groups of instances sharing a common key (reduce) -- not unlike the partitioned parallelism in parallel database systems. Apache's Hadoop MapReduce platform is the most prominent implementation of this paradigm for the