Re: Planning to Use Cassandra

2016-07-09 Thread Melinda Zoe
Thank you so much, Paul. This is extremely useful information for me. The new 
content will come from websites (brief program information i.e. just titles and 
times) that will be copy/paste into a text file. I've thought about other 
alternatives but I'd rather have a clean database and put in the work than 
anything else. Thank you so much for all of your help.

Melinda
 

On Saturday, July 9, 2016 2:10 PM, Paul Fife  wrote:
 

 Hello Melinda,
1. My recommendation is to script your database creation instead of copying 
database files around. You could use code executing statements via a driver, or 
simply write a CQL script and execute with CQLSH to create your schema and 
possibly some minimal data set if needed. This is a good practice regardless of 
what database you use - you'll have an easy way to create a known environment 
for repeatable testing. If you have a large amount of data to load into a 
cluster after creating the schema you can use sstableloader, otherwise known as 
Cassandra bulk loader, to quickly stream large amounts of data into an existing 
database. sstableloader bulk loads sstables (the Cassandra data file format) 
into an existing data via network connection, and is pretty flexible in that 
you can take files from a test system and load into a production database that 
has completely different replication settings.
2. Since you haven't said where your new content is coming from it is hard to 
make a decent recommendation on how to do your daily process. If it's something 
as simple as preparing new data and wanting it loaded the sstableloader might 
work. If there's something more complicated happening, you could be better off 
writing a tool/application to do this. Either way, definitely automate as much 
as possible since it's going to be done regularly!
Paul
On Sat, Jul 9, 2016 at 10:15 AM, Melinda Zoe  wrote:

Hello Group,
I’m ready tohave a go at Cassandra. I have two questions as I plan everything 
out: 1.   If I create the database on Mac OS, what is theprocess for migrating 
it to another environment like Windows? This concerns mebecause I’m not sure 
who will host me later. Is it as simple as copying andpasting the database and 
moving all of the files over or will the database needto be converted?
2.   I want to use Cassandra for a website that willrely heavily on content 
that I provide the user over new profiles and/or newaccounts that the user 
creates. I plan on uploading content daily. Have any ofyou created a template 
for uploading records into Cassandra or have any of you createda script for 
loading multiple records at a time?  Melinda



  

Re: Planning to Use Cassandra

2016-07-09 Thread Paul Fife
Hello Melinda,

1. My recommendation is to script your database creation instead of copying
database files around. You could use code executing statements via a
driver, or simply write a CQL script and execute with CQLSH to create your
schema and possibly some minimal data set if needed. This is a good
practice regardless of what database you use - you'll have an easy way to
create a known environment for repeatable testing. If you have a large
amount of data to load into a cluster after creating the schema you can use
sstableloader, otherwise known as Cassandra bulk loader, to quickly stream
large amounts of data into an existing database. sstableloader bulk loads
sstables (the Cassandra data file format) into an existing data via network
connection, and is pretty flexible in that you can take files from a test
system and load into a production database that has completely different
replication settings.

2. Since you haven't said where your new content is coming from it is hard
to make a decent recommendation on how to do your daily process. If it's
something as simple as preparing new data and wanting it loaded the
sstableloader might work. If there's something more complicated happening,
you could be better off writing a tool/application to do this. Either way,
definitely automate as much as possible since it's going to be done
regularly!

Paul

On Sat, Jul 9, 2016 at 10:15 AM, Melinda Zoe  wrote:

> Hello Group,
>
> I’m ready to have a go at Cassandra. I have two questions as I plan
> everything out:
>
> 1.If I create the database on Mac OS, what is the process for
> migrating it to another environment like Windows? This concerns me because
> I’m not sure who will host me later. Is it as simple as copying and pasting
> the database and moving all of the files over or will the database need to
> be converted?
>
> 2.I want to use Cassandra for a website that will rely heavily on
> content that I provide the user over new profiles and/or new accounts that
> the user creates. I plan on uploading content daily. Have any of you
> created a template for uploading records into Cassandra or have any of you
> created a script for loading multiple records at a time?
>
> Melinda
>


Planning to Use Cassandra

2016-07-09 Thread Melinda Zoe
Hello Group,
I’m ready tohave a go at Cassandra. I have two questions as I plan everything 
out: 1.   If I create the database on Mac OS, what is theprocess for migrating 
it to another environment like Windows? This concerns mebecause I’m not sure 
who will host me later. Is it as simple as copying andpasting the database and 
moving all of the files over or will the database needto be converted?
2.   I want to use Cassandra for a website that willrely heavily on content 
that I provide the user over new profiles and/or newaccounts that the user 
creates. I plan on uploading content daily. Have any ofyou created a template 
for uploading records into Cassandra or have any of you createda script for 
loading multiple records at a time?  Melinda

Re: Blog post on Cassandra's inner workings and performance - feedback welcome

2016-07-09 Thread daemeon reiydelle
I saw this really useful post a few days ago. I found the organization and
presentation quite clear and helpful (I often struggle trying to do high
level comparisons of Hadoop and Cass). Thank you!

If there was sections I would like to see your clear thoughts appear
within, it would be around:

   - (1) why networks need to be clean (the impact of "dirty"/erratic
   networks);
   - (2) the impact of java (off heap, stop the world garbage collection,
   why more memory makes things worse;
   - (3) table design decisions (read mostly, write mostly, mixed
   read/write, etc.)

A really great writeup, thank you!





*...*



*Daemeon C.M. ReiydelleUSA (+1) 415.501.0198London (+44) (0) 20 8144 9872*

On Fri, Jul 8, 2016 at 11:59 PM, Manuel Kiessling <
kiessling.man...@gmail.com> wrote:

> Yes, the joke's on me. It was a copy error, and I've since posted
> the correct URL (journeymonitor.com
> :4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/).
>
> Substantial feedback regarding the actual post still very much welcome.
>
> Regards,
> Manuel
>
> Am 09.07.2016 um 03:32 schrieb daemeon reiydelle :
>
> Localhost is a special network address that never leaves the operating
> system. It only goes "half way" down the IP stack. Thanks for your efforts!
>
>
> *...*
>
>
>
> *Daemeon C.M. ReiydelleUSA (+1) 415.501.0198
> <%28%2B1%29%20415.501.0198>London (+44) (0) 20 8144 9872
> <%28%2B44%29%20%280%29%2020%208144%209872>*
>
> On Fri, Jul 8, 2016 at 5:53 PM, Joaquin Alzola 
> wrote:
>
>> Hi Manuel
>>
>>
>>
>> I think localhost will not work for people on the internet.
>>
>>
>>
>> BR
>>
>>
>>
>> Joaquin
>>
>>
>>
>> *From:* kiessling.man...@gmail.com [mailto:kiessling.man...@gmail.com] *On
>> Behalf Of *Manuel Kiessling
>> *Sent:* 07 July 2016 14:12
>> *To:* user@cassandra.apache.org
>> *Subject:* Blog post on Cassandra's inner workings and performance -
>> feedback welcome
>>
>>
>>
>> Hi all,
>>
>> I'm currently in the process of understanding the inner workings of
>> Cassandra with regards to network and local storage mechanisms and
>> operations. In order to do so, I've written a blog post about it which is
>> now in a "first final" version.
>>
>> Any feedback, especially corrections regarding misunderstandings on my
>> side, would be highly appreciated. The post really represents my very
>> subjective view on how Cassandra works under the hood, which makes it prone
>> to errors of course.
>>
>> You can access the current version at
>> http://localhost:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/
>>
>>
>>
>> Thanks,
>>
>> --
>>
>>  Manuel
>> This email is confidential and may be subject to privilege. If you are
>> not the intended recipient, please do not copy or disclose its content but
>> contact the sender immediately upon receipt.
>>
>
>


How I can ignore clustering key order and read latests writes first?

2016-07-09 Thread Reza Samee
Hello everyone!

I have asked a question in SO and I've not got response (Of curse someone
tried to help me, but his/her answer wan't satisfied me)!:
http://stackoverflow.com/questions/38247368/as-i-know-in-range-queries-cassandra-reterives-result-ordered-by-culstring-key

I want to ignore the clustering keys orders in my query? Can I do that?

-- 
رضا سامعی / http://samee.blog.ir


Re: Blog post on Cassandra's inner workings and performance - feedback welcome

2016-07-09 Thread Manuel Kiessling
Yes, the joke's on me. It was a copy error, and I've since posted the 
correct URL 
(journeymonitor.com:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/).
 

Substantial feedback regarding the actual post still very much welcome.

Regards,
Manuel


> Am 09.07.2016 um 02:53 schrieb Joaquin Alzola :
> 
> Hi Manuel
>  
> I think localhost will not work for people on the internet.
>  
> BR
>  
> Joaquin
>  
> From: kiessling.man...@gmail.com [mailto:kiessling.man...@gmail.com] On 
> Behalf Of Manuel Kiessling
> Sent: 07 July 2016 14:12
> To: user@cassandra.apache.org
> Subject: Blog post on Cassandra's inner workings and performance - feedback 
> welcome
>  
> Hi all,
> 
> I'm currently in the process of understanding the inner workings of Cassandra 
> with regards to network and local storage mechanisms and operations. In order 
> to do so, I've written a blog post about it which is now in a "first final" 
> version.
> 
> Any feedback, especially corrections regarding misunderstandings on my side, 
> would be highly appreciated. The post really represents my very subjective 
> view on how Cassandra works under the hood, which makes it prone to errors of 
> course.
> 
> You can access the current version at 
> http://localhost:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/
>  
> Thanks,
> --
>  Manuel
> 
> This email is confidential and may be subject to privilege. If you are not 
> the intended recipient, please do not copy or disclose its content but 
> contact the sender immediately upon receipt.


Re: Blog post on Cassandra's inner workings and performance - feedback welcome

2016-07-09 Thread Manuel Kiessling
Yes, the joke's on me. It was a copy error, and I've since posted the 
correct URL 
(journeymonitor.com:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/).
 

Substantial feedback regarding the actual post still very much welcome.

Regards,
Manuel

> Am 09.07.2016 um 03:32 schrieb daemeon reiydelle :
> 
> Localhost is a special network address that never leaves the operating 
> system. It only goes "half way" down the IP stack. Thanks for your efforts!
> 
> 
> ...
> 
> Daemeon C.M. Reiydelle
> USA (+1) 415.501.0198
> London (+44) (0) 20 8144 9872
> 
>> On Fri, Jul 8, 2016 at 5:53 PM, Joaquin Alzola  
>> wrote:
>> Hi Manuel
>> 
>>  
>> 
>> I think localhost will not work for people on the internet.
>> 
>>  
>> 
>> BR
>> 
>>  
>> 
>> Joaquin
>> 
>>  
>> 
>> From: kiessling.man...@gmail.com [mailto:kiessling.man...@gmail.com] On 
>> Behalf Of Manuel Kiessling
>> Sent: 07 July 2016 14:12
>> To: user@cassandra.apache.org
>> Subject: Blog post on Cassandra's inner workings and performance - feedback 
>> welcome
>> 
>>  
>> 
>> Hi all,
>> 
>> I'm currently in the process of understanding the inner workings of 
>> Cassandra with regards to network and local storage mechanisms and 
>> operations. In order to do so, I've written a blog post about it which is 
>> now in a "first final" version.
>> 
>> Any feedback, especially corrections regarding misunderstandings on my side, 
>> would be highly appreciated. The post really represents my very subjective 
>> view on how Cassandra works under the hood, which makes it prone to errors 
>> of course.
>> 
>> You can access the current version at 
>> http://localhost:4000/tutorials/2016/02/29/cassandra-inner-workings-and-how-this-relates-to-performance/
>> 
>>  
>> 
>> Thanks,
>> 
>> --
>> 
>>  Manuel
>> 
>> This email is confidential and may be subject to privilege. If you are not 
>> the intended recipient, please do not copy or disclose its content but 
>> contact the sender immediately upon receipt.
>