David started this thread yesterday. Since then I have read everything, I think, and like what I see. I still have a question. To populate an Accumulo database, using the D4M schema, it would appear one would do so using Mutation objects et. al. just as if one were not using D4M schema. Am I correct? All the examples appear to focus on the analytic side of things.
Thanks On Mon, Apr 28, 2014 at 9:12 PM, Kepner, Jeremy - 0553 - MITLL < [email protected]> wrote: > D4M is two things: > > (1) A set of software for doing analytics. > (2) A schema for ingesting and indexing diverse data into a NoSQL database > like Accumulo > > It hits two parts of the Common Big Data Architecture. > The CBDA is merely a restating of the obvious components a system needs to > effective at processing Big Data. > It can be implemented with a variety of technologies. > > Regards. -Jeremy > > > On Apr 28, 2014, at 9:08 PM, Chris Bennight <[email protected]> > wrote: > > I'm not getting what exactly the "Common Big Data Architecture" is? > > Is it just a term that describes any system that has the 7 components > Jeremy mentioned (fs, ingest, DB, analytics, web services, resource > scheduler, elastic compute)? If so, what's the significance of naming > this collection? > > And how exactly is D4M related to this? (I understand it (D4M) hits a > subset of those features, but don't think it encompasses all of those) > > Apologies if these are obtuse questions, I just feel like I"m not > comprehending what information is trying to be conveyed? > > > > > On Mon, Apr 28, 2014 at 8:51 PM, Jeremy Kepner <[email protected]> wrote: > >> No problem. I am glad to start getting the definitions out there. >> Great work on the page. I think helps clarify things a lot. >> >> On Mon, Apr 28, 2014 at 08:45:47PM -0400, David Medinets wrote: >> > Sorry for my misunderstanding. I've updated the github project and >> moved >> > it to [1]https://github.com/medined/D4M_Schema. >> > >> > On Mon, Apr 28, 2014 at 5:36 PM, Jeremy Kepner <[2][email protected] >> > >> > wrote: >> > >> > David's well written example is illustrating the D4M Schema >> > ([3] >> http://ieee-hpec.org/2013/index_htm_files/11-Kepner-D4Mschema-IEEE-HPEC.pdf >> ). >> > >> > The Common Big Data Architecture is a broad description that >> encompasses >> > many >> > big data systems and consists of 7 components: filesystem, ingest >> > processes, >> > databases, analytic processes, web services, resource scheduler, >> and >> > elastic computing. A reference will most likely appear in IEEE >> HPEC >> > 2014. >> > >> > Accumulo is the database of choice in many CBDA systems. >> > >> > The D4M schema is used in many Accumulo systems. >> > >> > On Mon, Apr 28, 2014 at 05:23:00PM -0400, David Medinets wrote: >> > > [1][4]https://github.com/medined/Common-Big-Data-Architecture- >> > This project >> > > provides simple examples of the CBDA which is used by the D4M >> 2.0 >> > > software. >> > > >> > > References >> > > >> > > Visible links >> > > 1. [5]https://github.com/medined/Common-Big-Data-Architecture >> > >> > References >> > >> > Visible links >> > 1. https://github.com/medined/D4M_Schema >> > 2. mailto:[email protected] >> > 3. >> http://ieee-hpec.org/2013/index_htm_files/11-Kepner-D4Mschema-IEEE-HPEC.pdf >> > 4. https://github.com/medined/Common-Big-Data-Architecture >> > 5. https://github.com/medined/Common-Big-Data-Architecture >> > > > -- There are ways and there are ways, Geoffry Roberts
