Re: Planning to Use Cassandra
Thank you so much, Paul. This is extremely useful information for me. The new content will come from websites (brief program information i.e. just titles and times) that will be copy/paste into a text file. I've thought about other alternatives but I'd rather have a clean database and put in the work than anything else. Thank you so much for all of your help. Melinda On Saturday, July 9, 2016 2:10 PM, Paul Fifewrote: Hello Melinda, 1. My recommendation is to script your database creation instead of copying database files around. You could use code executing statements via a driver, or simply write a CQL script and execute with CQLSH to create your schema and possibly some minimal data set if needed. This is a good practice regardless of what database you use - you'll have an easy way to create a known environment for repeatable testing. If you have a large amount of data to load into a cluster after creating the schema you can use sstableloader, otherwise known as Cassandra bulk loader, to quickly stream large amounts of data into an existing database. sstableloader bulk loads sstables (the Cassandra data file format) into an existing data via network connection, and is pretty flexible in that you can take files from a test system and load into a production database that has completely different replication settings. 2. Since you haven't said where your new content is coming from it is hard to make a decent recommendation on how to do your daily process. If it's something as simple as preparing new data and wanting it loaded the sstableloader might work. If there's something more complicated happening, you could be better off writing a tool/application to do this. Either way, definitely automate as much as possible since it's going to be done regularly! Paul On Sat, Jul 9, 2016 at 10:15 AM, Melinda Zoe wrote: Hello Group, I’m ready tohave a go at Cassandra. I have two questions as I plan everything out: 1. If I create the database on Mac OS, what is theprocess for migrating it to another environment like Windows? This concerns mebecause I’m not sure who will host me later. Is it as simple as copying andpasting the database and moving all of the files over or will the database needto be converted? 2. I want to use Cassandra for a website that willrely heavily on content that I provide the user over new profiles and/or newaccounts that the user creates. I plan on uploading content daily. Have any ofyou created a template for uploading records into Cassandra or have any of you createda script for loading multiple records at a time? Melinda
Re: Planning to Use Cassandra
Hello Melinda, 1. My recommendation is to script your database creation instead of copying database files around. You could use code executing statements via a driver, or simply write a CQL script and execute with CQLSH to create your schema and possibly some minimal data set if needed. This is a good practice regardless of what database you use - you'll have an easy way to create a known environment for repeatable testing. If you have a large amount of data to load into a cluster after creating the schema you can use sstableloader, otherwise known as Cassandra bulk loader, to quickly stream large amounts of data into an existing database. sstableloader bulk loads sstables (the Cassandra data file format) into an existing data via network connection, and is pretty flexible in that you can take files from a test system and load into a production database that has completely different replication settings. 2. Since you haven't said where your new content is coming from it is hard to make a decent recommendation on how to do your daily process. If it's something as simple as preparing new data and wanting it loaded the sstableloader might work. If there's something more complicated happening, you could be better off writing a tool/application to do this. Either way, definitely automate as much as possible since it's going to be done regularly! Paul On Sat, Jul 9, 2016 at 10:15 AM, Melinda Zoewrote: > Hello Group, > > I’m ready to have a go at Cassandra. I have two questions as I plan > everything out: > > 1.If I create the database on Mac OS, what is the process for > migrating it to another environment like Windows? This concerns me because > I’m not sure who will host me later. Is it as simple as copying and pasting > the database and moving all of the files over or will the database need to > be converted? > > 2.I want to use Cassandra for a website that will rely heavily on > content that I provide the user over new profiles and/or new accounts that > the user creates. I plan on uploading content daily. Have any of you > created a template for uploading records into Cassandra or have any of you > created a script for loading multiple records at a time? > > Melinda >
Planning to Use Cassandra
Hello Group, I’m ready tohave a go at Cassandra. I have two questions as I plan everything out: 1. If I create the database on Mac OS, what is theprocess for migrating it to another environment like Windows? This concerns mebecause I’m not sure who will host me later. Is it as simple as copying andpasting the database and moving all of the files over or will the database needto be converted? 2. I want to use Cassandra for a website that willrely heavily on content that I provide the user over new profiles and/or newaccounts that the user creates. I plan on uploading content daily. Have any ofyou created a template for uploading records into Cassandra or have any of you createda script for loading multiple records at a time? Melinda
Re: Blog post on Cassandra's inner workings and performance - feedback welcome
I saw this really useful post a few days ago. I found the organization and presentation quite clear and helpful (I often struggle trying to do high level comparisons of Hadoop and Cass). Thank you! If there was sections I would like to see your clear thoughts appear within, it would be around: - (1) why networks need to be clean (the impact of "dirty"/erratic networks); - (2) the impact of java (off heap, stop the world garbage collection, why more memory makes things worse; - (3) table design decisions (read mostly, write mostly, mixed read/write, etc.) A really great writeup, thank you! *...* *Daemeon C.M. ReiydelleUSA (+1) 415.501.0198London (+44) (0) 20 8144 9872* On Fri, Jul 8, 2016 at 11:59 PM, Manuel Kiessling < kiessling.man...@gmail.com> wrote: > Yes, the joke's on me. It was a copy error, and I've since posted > the correct URL (journeymonitor.com > :4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/). > > Substantial feedback regarding the actual post still very much welcome. > > Regards, > Manuel > > Am 09.07.2016 um 03:32 schrieb daemeon reiydelle: > > Localhost is a special network address that never leaves the operating > system. It only goes "half way" down the IP stack. Thanks for your efforts! > > > *...* > > > > *Daemeon C.M. ReiydelleUSA (+1) 415.501.0198 > <%28%2B1%29%20415.501.0198>London (+44) (0) 20 8144 9872 > <%28%2B44%29%20%280%29%2020%208144%209872>* > > On Fri, Jul 8, 2016 at 5:53 PM, Joaquin Alzola > wrote: > >> Hi Manuel >> >> >> >> I think localhost will not work for people on the internet. >> >> >> >> BR >> >> >> >> Joaquin >> >> >> >> *From:* kiessling.man...@gmail.com [mailto:kiessling.man...@gmail.com] *On >> Behalf Of *Manuel Kiessling >> *Sent:* 07 July 2016 14:12 >> *To:* user@cassandra.apache.org >> *Subject:* Blog post on Cassandra's inner workings and performance - >> feedback welcome >> >> >> >> Hi all, >> >> I'm currently in the process of understanding the inner workings of >> Cassandra with regards to network and local storage mechanisms and >> operations. In order to do so, I've written a blog post about it which is >> now in a "first final" version. >> >> Any feedback, especially corrections regarding misunderstandings on my >> side, would be highly appreciated. The post really represents my very >> subjective view on how Cassandra works under the hood, which makes it prone >> to errors of course. >> >> You can access the current version at >> http://localhost:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/ >> >> >> >> Thanks, >> >> -- >> >> Manuel >> This email is confidential and may be subject to privilege. If you are >> not the intended recipient, please do not copy or disclose its content but >> contact the sender immediately upon receipt. >> > >
How I can ignore clustering key order and read latests writes first?
Hello everyone! I have asked a question in SO and I've not got response (Of curse someone tried to help me, but his/her answer wan't satisfied me)!: http://stackoverflow.com/questions/38247368/as-i-know-in-range-queries-cassandra-reterives-result-ordered-by-culstring-key I want to ignore the clustering keys orders in my query? Can I do that? -- رضا سامعی / http://samee.blog.ir
Re: Blog post on Cassandra's inner workings and performance - feedback welcome
Yes, the joke's on me. It was a copy error, and I've since posted the correct URL (journeymonitor.com:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/). Substantial feedback regarding the actual post still very much welcome. Regards, Manuel > Am 09.07.2016 um 02:53 schrieb Joaquin Alzola: > > Hi Manuel > > I think localhost will not work for people on the internet. > > BR > > Joaquin > > From: kiessling.man...@gmail.com [mailto:kiessling.man...@gmail.com] On > Behalf Of Manuel Kiessling > Sent: 07 July 2016 14:12 > To: user@cassandra.apache.org > Subject: Blog post on Cassandra's inner workings and performance - feedback > welcome > > Hi all, > > I'm currently in the process of understanding the inner workings of Cassandra > with regards to network and local storage mechanisms and operations. In order > to do so, I've written a blog post about it which is now in a "first final" > version. > > Any feedback, especially corrections regarding misunderstandings on my side, > would be highly appreciated. The post really represents my very subjective > view on how Cassandra works under the hood, which makes it prone to errors of > course. > > You can access the current version at > http://localhost:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/ > > Thanks, > -- > Manuel > > This email is confidential and may be subject to privilege. If you are not > the intended recipient, please do not copy or disclose its content but > contact the sender immediately upon receipt.
Re: Blog post on Cassandra's inner workings and performance - feedback welcome
Yes, the joke's on me. It was a copy error, and I've since posted the correct URL (journeymonitor.com:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/). Substantial feedback regarding the actual post still very much welcome. Regards, Manuel > Am 09.07.2016 um 03:32 schrieb daemeon reiydelle: > > Localhost is a special network address that never leaves the operating > system. It only goes "half way" down the IP stack. Thanks for your efforts! > > > ... > > Daemeon C.M. Reiydelle > USA (+1) 415.501.0198 > London (+44) (0) 20 8144 9872 > >> On Fri, Jul 8, 2016 at 5:53 PM, Joaquin Alzola >> wrote: >> Hi Manuel >> >> >> >> I think localhost will not work for people on the internet. >> >> >> >> BR >> >> >> >> Joaquin >> >> >> >> From: kiessling.man...@gmail.com [mailto:kiessling.man...@gmail.com] On >> Behalf Of Manuel Kiessling >> Sent: 07 July 2016 14:12 >> To: user@cassandra.apache.org >> Subject: Blog post on Cassandra's inner workings and performance - feedback >> welcome >> >> >> >> Hi all, >> >> I'm currently in the process of understanding the inner workings of >> Cassandra with regards to network and local storage mechanisms and >> operations. In order to do so, I've written a blog post about it which is >> now in a "first final" version. >> >> Any feedback, especially corrections regarding misunderstandings on my side, >> would be highly appreciated. The post really represents my very subjective >> view on how Cassandra works under the hood, which makes it prone to errors >> of course. >> >> You can access the current version at >> http://localhost:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/ >> >> >> >> Thanks, >> >> -- >> >> Manuel >> >> This email is confidential and may be subject to privilege. If you are not >> the intended recipient, please do not copy or disclose its content but >> contact the sender immediately upon receipt. >