Hi Jonathan

Thanks for your reply. I had gone through the URL that you have specified.
Let me put my problem statement with a clear statement:

We have a RDBMS table that contains Category ID, Section ID, Article,
IS_Published column. Now the application that we currently have uses SQL and
gets the data in various forms e.g. get all the articles that belong to a
section, get all the articles that belong a specific category, specific
section and which is published and so on.

With your example, I understand that it is possible for me to have multiple
columnfamilies and store the same data e.g:

keyspace.category[WORLDNEWS][SECTION] = HOCKEY
keyspace.category[WORLDNEWS][ARTICLE] = World cup hockey matches begin...
keyspace.category[WORLDNEWS][IS_PUBLISHED] = TRUE

and another set as
keyspace.section[HOCKEY][CATEGORY] = WORLDNEWS
keyspace.section[HOCKEY][ARTICLE] = World cup hockey matches begin...
keyspace.section[HOCKEY][IS_PUBLISHED] = TRUE

Now, if the above example is correct then I have following questions:

   1. This would lead to enourmous amount of duplication of data, in short
   if I now want to view the data from IS_PUBLISHED dimenstion then my database
   size would scale up tremendously.
   2. Above way of reprensting the data would suffice if I want to retrieve
   something like, get me all the articles whose category is WORLDNEWS. But
   what if I want to find out something like: Get me all the articles whose
   Section is BASEBALL and Category is WORLDNEWS. For addressing queries taht
   depend on multiple parameter how do we do it? Hope I am clear with my
   problem statement :(

Please help me out in understanding this basic difference between
interpreting data in RDBMS world v/s NRDBMS world.


On Fri, Jan 29, 2010 at 8:00 PM, Jonathan Ellis <jbel...@gmail.com> wrote:

> Cassandra does not support ad-hoc queries the way SQL does.  If you
> want to ask "what rows have a column X containing value Y" then you
> need to create a columnfamily whose keys are the values of X, and
> whose columns are the keys of your original CF.
>
> Read http://arin.me/blog/wtf-is-a-supercolumn-cassandra-data-model if
> you haven't yet.
>
> On Fri, Jan 29, 2010 at 6:16 AM, Mehar Chaitanya
> <meharchaita...@gmail.com> wrote:
> > Hi All
> >
> > I am a J2EE programmer only i had knowledge related to queries i will
> query
> > the sql where i can found the result.
> >
> > How can i use cassandra for my requirement is it possible?
> >
> > Below is my scenario
> >
> >   - I have a table which contains columns like
> >   Category_name,Section_name,article,is_published_by  with  multiple
> records
> >   in the table.
> >   - I want to retrieve a query based on condition like belongs some
> >   category_name 'X' .
> >   - Same will be applied to other 3 ,condition based on Section and
> >   is_published_by
> >
> >
> > Please let me know if it would be possible.
> >
> > Thanks&Regards,
> > Mehar Chaitanya Bandaru,
> > Software Engineer,
> > S cubes IT Solutions India Pvt. Ltd.,
> > http://www.scubian.com
> > (W) +91 4040307821,
> > (Cell) +91 9440 999 262,
> > #4-1-319, 2nd Floor, Abids Road, Hyderabad - 01.
> >
>

Reply via email to