RE: shutdown by drain

2011-09-23 Thread Viktor Jevdokimov
What's more, Cassandra 0.8.6 still leaves all commit logs in place under Windows.


Best regards/ Pagarbiai

Viktor Jevdokimov
Senior Developer

Email: viktor.jevdoki...@adform.com
Phone: +370 5 212 3063
Fax: +370 5 261 0453

J. Jasinskio 16C,
LT-01112 Vilnius,
Lithuania



Disclaimer: The information contained in this message and attachments is 
intended solely for the attention and use of the named addressee and may be 
confidential. If you are not the intended recipient, you are reminded that the 
information remains the property of the sender. You must not use, disclose, 
distribute, copy, print or rely on this e-mail. If you have received this 
message in error, please contact the sender immediately and irrevocably delete 
this message and any copies.

-Original Message-
From: Radim Kolar [mailto:h...@sendmail.cz]
Sent: Friday, September 23, 2011 12:04
To: user@cassandra.apache.org
Subject: Re: shutdown by drain

On 10.9.2011 21:48, Chris Goffinet wrote:
 For things like rolling restarts, we do:

 disablethrift
 disablegossip
 (...wait for all nodes to see this node go down..)
 drain

I discovered a problem with this advice.

If I run nodetool drain before killing a node, nodetool returns as soon as
the flush and the disabling of services have finished on the Cassandra node.
But the flush can trigger a compaction, and if you kill the node right after
drain, you interrupt the compaction in progress, which wastes disk space. I
am not sure whether tmp files are cleaned up when Cassandra starts.




How to enable JNA for Cassandra on Windows?

2011-09-22 Thread Viktor Jevdokimov
Hi,

I'm trying, without success, to enable JNA for Cassandra on Windows.

I placed the JNA 3.3.0 libs jna.jar and platform.jar into the Cassandra 0.8.6
lib dir, but I get this in the log:
Unable to link C library. Native methods will be disabled.

What is missing or wrong?

One thing I've found on the internet about JNA and Windows is this sample:

// Library is c for unix and msvcrt for windows
String libName = "c";
if (System.getProperty("os.name").contains("Windows"))
{
  libName = "msvcrt";
}

// Loading dynamically the library
CInterface demo = (CInterface) Native.loadLibrary(libName, CInterface.class);


from http://www.scriptol.com/programming/jna.php

while in Cassandra:

try
{
    Native.register("c");
}
catch (NoClassDefFoundError e)
{
    logger.info("JNA not found. Native methods will be disabled.");
}
catch (UnsatisfiedLinkError e)
{
    logger.info("Unable to link C library. Native methods will be disabled.");
}
catch (NoSuchMethodError e)
{
    logger.warn("Obsolete version of JNA present; unable to register C library. Upgrade to JNA 3.2.7 or later");
}


Is it true that on Windows Cassandra should do something like this?

if (System.getProperty("os.name").contains("Windows"))
{
    Native.register("msvcrt");
}
else
{
    Native.register("c");
}



Thanks
Viktor
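
The OS check proposed in this message can be isolated into a small helper that
can be tried without JNA on the classpath. This is only a sketch:
NativeLibraryName and forOs are hypothetical names, not part of Cassandra or
JNA, and nothing in this thread confirms that registering msvcrt would satisfy
the native methods Cassandra declares.

```java
import java.util.Locale;

public class NativeLibraryName {
    // Hypothetical helper: choose the C runtime library name from the OS name,
    // following the scriptol.com sample ("msvcrt" on Windows, "c" elsewhere).
    public static String forOs(String osName) {
        return osName.toLowerCase(Locale.ROOT).contains("windows") ? "msvcrt" : "c";
    }

    public static void main(String[] args) {
        String osName = System.getProperty("os.name");
        // Prints the detected OS name and the library name that would be used.
        System.out.println(osName + " -> " + forOs(osName));
    }
}
```

The returned name would then be passed to Native.register(...) in place of the
hard-coded "c".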


Re: Cassandra Statistics and Metrics

2011-06-16 Thread Viktor Jevdokimov
It is possible to use a command-line JMX client with the standard Zabbix
agent to query JMX counters without incorporating zapcat into Cassandra or
another Java app.
I'm investigating this right now and will post the results when I finish.

2011/6/15 Viktor Jevdokimov vjevdoki...@gmail.com

 http://www.kjkoster.org/zapcat/Zapcat_JMX_Zabbix_Bridge.html

 2011/6/14 Marcos Ortiz mlor...@uci.cu

  Where can I find the source code?





Re: Cassandra Statistics and Metrics

2011-06-15 Thread Viktor Jevdokimov
http://www.kjkoster.org/zapcat/Zapcat_JMX_Zabbix_Bridge.html

2011/6/14 Marcos Ortiz mlor...@uci.cu

  Where can I find the source code?





Re: possible 'coming back to life' bug with counters

2011-06-15 Thread Viktor Jevdokimov
What if that is acceptable in our case and we need counters with TTL?
For us, both counters and TTL are important. Once a column has expired, it
does not matter what value the counter has.
Scanning millions of rows just to delete the expired ones is not a solution.

2011/6/14 Sylvain Lebresne sylv...@datastax.com

 As listed here: http://wiki.apache.org/cassandra/Counters, counter deletion
 is provided as a convenience for permanent deletion of counters but, because
 of the design of counters, it is never safe to issue an increment on a
 counter that has been deleted (that is, you will sometimes experience "back
 to life" behavior in that case).
 More precisely, you'd have to wait "long enough" after a deletion to start
 incrementing the counter again. But in the worst case, "long enough" is
 something like gc_grace_seconds + major compaction.

 This is *not* something that is likely to change anytime soon (I don't think
 this is fixable with the current design for counters).

 --
 Sylvain

 On Sat, Jun 11, 2011 at 3:54 AM, David Hawthorne dha...@3crowd.com wrote:
  Please take a look at this thread over in the hector-users mailing list:
  http://groups.google.com/group/hector-users/browse_thread/thread/99835159b9ea1766
  It looks as if the deleted columns are coming back to life when they
  shouldn't be.
  I don't want to open a bug on something if it's already got one that I just
  couldn't find when I scanned the list of open bugs.
  I'm using hector 0.8 against cassandra 0.8 release.  I can give you whatever
  logs or files you'd like.



Re: Cassandra Statistics and Metrics

2011-06-14 Thread Viktor Jevdokimov
We're using the open source monitoring solution Zabbix
(http://www.zabbix.com/) with zapcat, not only for Cassandra but for the
whole system.

Since Cassandra supports the MX4J tools plugin, built-in support for zapcat
would be welcome too: currently we have to use a wrapper to start the zapcat
agent.

2011/6/14 Marcos Ortiz mlor...@uci.cu

 Regards to all.
 My team and I here at the University are working on a generic solution for
 Monitoring and Capacity Planning for Open Source Databases, and one of the
 NoSQL databases we chose to support is Cassandra.
 Where can I find all the metrics and statistics of Cassandra? I'm thinking
 of, for example:
 - Available space
 - Number of CFs
 and all kinds of metrics

 For this development we are using Python + Django + Twisted + Orbited +
 jQuery. The idea is to build a Comet-based web application on top of
 these technologies.
 Any advice is welcome

 --
 Marcos Luís Ortíz Valmaseda
  Software Engineer (UCI)
  http://marcosluis2186.posterous.com
  http://twitter.com/marcosluis2186




Re: Reads and memory usage clarification

2011-02-23 Thread Viktor Jevdokimov
Everything is as I thought, thank you!

2011/2/23 Matthew Dennis mden...@datastax.com

 Data is in Memtables from writes before they get flushed (based on first
 threshold of ops/size/time exceeded; all are configurable) to SSTables on
 disk.

 There is a keycache and a rowcache.  The keycache caches offsets into
 SSTables for the rows.  The rowcache caches the entire row.  There is also
 the OS page cache, which is heavily used.

 When a read happens, the keycache is updated with the information for the
 SSTables the row was eventually found in.  If there are too many entries now
 in the keycache, some are ejected.  Overall the keycache uses very little
 memory per entry and can cut your disk IO in half so it's a pretty big win.

 If you read an entire row it goes in the row cache.  Like the keycache,
 this may result in older entries being ejected from the cache.  If you
 insert lots of really large rows in the rowcache you can OOM your JVM.  The
 rowcache is kept up to date with the memtables as writes come in.

 When a read comes in, C* will collect the data from the SSTables and
 Memtables and merge them together but data only goes into Memtables from
 writes.
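
 The last point above (a read merges Memtable and SSTable data, but only
 writes put data into a Memtable) can be sketched as follows. This is an
 illustration under stated assumptions, not Cassandra's implementation:
 ReadMergeSketch, Versioned and read are hypothetical names, each source is
 reduced to a map of column name to value, and "newest timestamp wins" is the
 assumed merge rule.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class ReadMergeSketch {
    // A column value plus the write timestamp used to pick the newest version.
    public static final class Versioned {
        public final String value;
        public final long timestamp;

        public Versioned(String value, long timestamp) {
            this.value = value;
            this.timestamp = timestamp;
        }
    }

    // Merge one row's columns from every source (Memtable, SSTables), keeping
    // the newest version of each column. The sources are only read from; the
    // result goes into a fresh map, so a read never adds data to a Memtable.
    public static Map<String, Versioned> read(List<Map<String, Versioned>> sources) {
        Map<String, Versioned> merged = new HashMap<>();
        for (Map<String, Versioned> source : sources) {
            for (Map.Entry<String, Versioned> e : source.entrySet()) {
                Versioned current = merged.get(e.getKey());
                if (current == null || e.getValue().timestamp > current.timestamp) {
                    merged.put(e.getKey(), e.getValue());
                }
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        Map<String, Versioned> memtable = new HashMap<>();
        memtable.put("email", new Versioned("new@example.com", 20));

        Map<String, Versioned> sstable = new HashMap<>();
        sstable.put("email", new Versioned("old@example.com", 10));
        sstable.put("name", new Versioned("Viktor", 5));

        List<Map<String, Versioned>> sources = new ArrayList<>();
        sources.add(memtable);
        sources.add(sstable);

        Map<String, Versioned> row = read(sources);
        // prints "new@example.com, Viktor"
        System.out.println(row.get("email").value + ", " + row.get("name").value);
    }
}
```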







Reads and memory usage clarification

2011-02-22 Thread Viktor Jevdokimov
Hello,

The write path is well documented in the architecture overview.

I need the read path clarified:

How is memory used
1. When data is in the Memtable
2. When data is in the SSTable

How is the cache used alongside the Memtable?

Are records created in the Memtable from writes only, or from reads as well?

What I need to know is how Cassandra uses memory and Memtables for reads.


Thank you,
Viktor


RE: Do supercolumns have a purpose?

2011-02-09 Thread Viktor Jevdokimov
SCFs are very useful and I hope they live forever. We need them!


-Original Message-
From: norman.mau...@googlemail.com [mailto:norman.mau...@googlemail.com] On 
Behalf Of Norman Maurer
Sent: Wednesday, February 09, 2011 20:59
To: user@cassandra.apache.org
Subject: Re: Do supercolumns have a purpose?

I still think super-columns are useful; you just need to be aware of
the limitations...

Bye,
Norman


2011/2/9 Mike Malone m...@simplegeo.com:
 On Tue, Feb 8, 2011 at 2:03 AM, David Boxenhorn da...@lookin2.com wrote:

 Shaun, I agree with you, but marking them as deprecated is not good enough
 for me. I can't easily stop using supercolumns. I need an upgrade path.

 David,
 Cassandra is open source and community developed. The right thing to do is
 what's best for the community, which sometimes conflicts with what's best
 for individual users. Such strife should be minimized, it will never be
 eliminated. Luckily, because this is an open source, liberal licensed
 project, if you feel strongly about something you should feel free to add
 whatever features you want yourself. I'm sure other people in your situation
 will thank you for it.
 At a minimum I think it would behoove you to re-read some of the comments
 here re: why super columns aren't really needed and take another look at
 your data model and code. I would actually be quite surprised to find a use
 of super columns that could not be trivially converted to normal columns. In
 fact, it should be possible to do at the framework/client library layer -
 you probably wouldn't even need to change any application code.
 Mike

 On Tue, Feb 8, 2011 at 3:53 AM, Shaun Cutts sh...@cuttshome.net wrote:

 I'm a newbie here, but, with apologies for my presumptuousness, I think
 you should deprecate SuperColumns. They are already distracting you, and as
 the years go by the cost of supporting them as you add more and more
 functionality is only likely to get worse. It would be better to concentrate
 on making the core column families better (and I'm sure we can all think
 of lots of things we'd like).
 Just dropping SuperColumns would be bad for your reputation -- and for
 users like David who are currently using them. But if you mark them clearly
 as deprecated and explain why and what to do instead (perhaps putting a bit
 of effort into migration tools... or even a virtual layer supporting
 arbitrary hierarchical data), then you can drop them in a few years (when
 you get to 1.0, say), without people feeling betrayed.

 -- Shaun
 On Feb 6, 2011, at 3:48 AM, David Boxenhorn wrote:

 My main point was to say that I think it is better to create tickets
 for what you want, rather than for something else completely different that
 would, as a by-product, give you what you want.

 Then let me say what I want: I want supercolumn families to have any
 feature that regular column families have.

 My data model is full of supercolumns. I used them, even though I knew I
 didn't *have to*, because they were there, which implied to me that I was
 supposed to use them for some good reason. Now I suspect that they will
 gradually become less and less functional, as features are added to regular
 column families and not supported for supercolumn families.


 On Fri, Feb 4, 2011 at 10:58 AM, Sylvain Lebresne sylv...@datastax.com
 wrote:

 On Fri, Feb 4, 2011 at 12:35 AM, Mike Malone m...@simplegeo.com wrote:

 On Thu, Feb 3, 2011 at 6:44 AM, Sylvain Lebresne sylv...@datastax.com
 wrote:

 On Thu, Feb 3, 2011 at 3:00 PM, David Boxenhorn da...@lookin2.com
 wrote:

 The advantage would be to enable secondary indexes on supercolumn
 families.

 Then I suggest opening a ticket for adding secondary indexes to
 supercolumn families and voting on it. This will be 1 or 2 orders of
 magnitude less work than getting rid of super columns internally, and
 probably a much better solution anyway.

 I realize that this is largely subjective, and on such matters code
 speaks louder than words, but I don't think I agree with you on the issue
 of which alternative is less work, or even which is a better solution.

 You are right, I probably put too much emphasis on that sentence. My main
 point was to say that I think it is better to create tickets for what you
 want, rather than for something else completely different that would, as a
 by-product, give you what you want.

RE: Do you have a site in production environment with Cassandra? What client do you use?

2011-01-16 Thread Viktor Jevdokimov
- Cassandra 0.7 beta1 on virtual Windows Server 2008 64-bit machines (8 total).
- In-house C# client for our .NET app, connecting via Thrift; building our own
  client was worth it.
- 150M transactions/day load and growing.


-Original Message-
From: Ertio Lew [mailto:ertio...@gmail.com]
Sent: Friday, January 14, 2011 19:25
To: user
Subject: Do you have a site in production environment with Cassandra? What 
client do you use?

Hey,

If you have a site in production, or are considering one, which client do
you use to interact with Cassandra? I know that several clients are
available depending on the language you use, but I would love to know which
clients are widely used in production environments and are best to work with
(supporting the features most needed for performance).

Also, preferably, tell us about the technology stack of your applications.

Any suggestions or comments are appreciated.

Thanks
Ertio




RE: about the data directory

2011-01-12 Thread Viktor Jevdokimov
 I have 4 nodes. I create one keyspace (such as FOO) with replication
 factor = 1 and insert some data. Why can I see the directory
 /var/lib/Cassandra/data/FOO on every node? As far as I know, I have just
 one replica.

So why did you install 4 nodes rather than 1?

They are there so that your data is distributed across the 4 nodes, with each
piece of data stored as 1 copy on exactly one of them. In other words, of 100%
of the data, each node holds about 25% (random partitioning).


Viktor.
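
The 25%-per-node picture can be illustrated with a rough sketch. It assumes
that hashing a row key with MD5 and taking the result modulo the node count is
an acceptable stand-in for the token ranges a real partitioner assigns;
PlacementSketch, ownerOf and keysPerNode are hypothetical names. With a
replication factor of 1, every key lands on exactly one node, yet every node
ends up holding part of the keyspace, which is why the FOO directory appears
on all of them.

```java
import java.math.BigInteger;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

public class PlacementSketch {
    // Hash a row key with MD5 and map it to one of nodeCount nodes.
    // Assumption: modulo replaces the per-node token ranges of a real ring.
    public static int ownerOf(String key, int nodeCount) {
        try {
            byte[] digest = MessageDigest.getInstance("MD5")
                    .digest(key.getBytes(StandardCharsets.UTF_8));
            return new BigInteger(1, digest).mod(BigInteger.valueOf(nodeCount)).intValue();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 is unavailable", e);
        }
    }

    // Count how many of the keys "row0".."row(keyCount-1)" each node owns (RF=1).
    public static int[] keysPerNode(int keyCount, int nodeCount) {
        int[] counts = new int[nodeCount];
        for (int i = 0; i < keyCount; i++) {
            counts[ownerOf("row" + i, nodeCount)]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        int[] counts = keysPerNode(10000, 4);
        for (int node = 0; node < counts.length; node++) {
            System.out.println("node " + node + ": " + counts[node] + " keys");
        }
    }
}
```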


RE: [RELEASE] 0.7.0 (and 0.6.9)

2011-01-11 Thread Viktor Jevdokimov
Congratulations!!!


-Original Message-
From: Eric Evans [mailto:eev...@rackspace.com]
Sent: Tuesday, January 11, 2011 19:23
To: user@cassandra.apache.org
Subject: [RELEASE] 0.7.0 (and 0.6.9)


As some of you may already be aware, 0.7.0 has been officially released.
You are free to start your upgrades, though not all at once, you'll
spoil your supper!

I apologize to anyone that might have noticed artifacts published as
early as Sunday and were confused by the lack of announcement, I was
waiting for an Official ASF Press Release and my timing sucks.

https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces9

There is way too much hotness in 0.7.0 to cover here, so instead I will
refer you to the following articles.

http://www.rackspacecloud.com/blog/2010/10/27/new-features-in-cassandra-0-7
http://www.riptano.com/blog/whats-new-cassandra-07-secondary-indexes
http://www.riptano.com/blog/whats-new-cassandra-07-hadoop-output-cassandra
http://www.riptano.com/blog/whats-new-cassandra-07-expiring-columns
http://www.riptano.com/blog/whats-new-cassandra-07-live-schema-updates

And of course, let's not forget the release notes.

http://goo.gl/Bi8LD

As usual you can download 0.7.0 from the website:
http://cassandra.apache.org/download

Users of Debian and derivatives can install from our repository:
http://wiki.apache.org/cassandra/DebianPackaging

New for 0.7.0, Cassandra is also available from Maven Central (thanks
Stephen Connolly).


But wait, there's more! If you're not in a hurry to upgrade, we have a
new 0.6 release as well, 0.6.9.  It's an easy upgrade for anyone running
0.6.8 and contains a number of useful changes (http://goo.gl/6NIPG).

The Debian repository has been extended to support an extra version so
if you're accustomed to installing 0.6 from our repository, then be sure
to change your suite name to 06x in sources.list.  For example:

  deb http://www.apache.org/dist/cassandra/debian 06x main
  deb-src http://www.apache.org/dist/cassandra/debian 06x main


That's it, thanks everyone!

--
Eric Evans
eev...@rackspace.com




RE: Quorum: killing 1 out of 3 server kills the cluster (?)

2010-12-09 Thread Viktor Jevdokimov
With 3 nodes and RF=2 you have 3 key ranges, replicated on N1+N2, N2+N3 and
N3+N1. With N1 killed, only the N2+N3 range has all of its replicas alive. For
RF=2, Quorum means 2 replicas, which is effectively ALL, so requests for the
N1+N2 and N3+N1 ranges (2/3 of the keys) fail.
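
The arithmetic can be checked with a short sketch. It assumes the simple ring
placement described above, where range i is replicated on node i and the
following RF-1 nodes; QuorumSketch and its methods are hypothetical names used
only for illustration.

```java
public class QuorumSketch {
    // Number of replicas that must respond for a quorum: RF/2 + 1 (integer division).
    public static int quorum(int rf) {
        return rf / 2 + 1;
    }

    // Count alive replicas of a range that lives on nodes range, range+1, ... (mod ring size).
    public static int aliveReplicas(int range, int rf, boolean[] alive) {
        int count = 0;
        for (int k = 0; k < rf; k++) {
            if (alive[(range + k) % alive.length]) {
                count++;
            }
        }
        return count;
    }

    // Count the ranges that can still serve requests at QUORUM.
    public static int availableRanges(int rf, boolean[] alive) {
        int available = 0;
        for (int range = 0; range < alive.length; range++) {
            if (aliveReplicas(range, rf, alive) >= quorum(rf)) {
                available++;
            }
        }
        return available;
    }

    public static void main(String[] args) {
        boolean[] n1Down = {false, true, true}; // N1 killed, N2 and N3 alive
        // prints "RF=2: 1 of 3 ranges available at QUORUM"
        System.out.println("RF=2: " + availableRanges(2, n1Down) + " of 3 ranges available at QUORUM");
        // prints "RF=3: 3 of 3 ranges available at QUORUM"
        System.out.println("RF=3: " + availableRanges(3, n1Down) + " of 3 ranges available at QUORUM");
    }
}
```

With RF=3 the quorum is still 2, so one dead node out of three leaves every
range available; that is the case where Quorum actually buys fault tolerance.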

-Original Message-
From: Timo Nentwig [mailto:timo.nent...@toptarif.de] 
Sent: Thursday, December 09, 2010 6:01 PM
To: user@cassandra.apache.org
Subject: Re: Quorum: killing 1 out of 3 server kills the cluster (?)


On Dec 9, 2010, at 16:50, Daniel Lundin wrote:

 Quorum is really only useful when RF > 2, since for a quorum to
 succeed RF/2+1 replicas must be available.

2/2+1==2 and I killed 1 of 3, so... don't get it.

 This means for RF = 2, consistency levels QUORUM and ALL yield the same 
 result.
 
 /d
 
 On Thu, Dec 9, 2010 at 4:40 PM, Timo Nentwig timo.nent...@toptarif.de wrote:
 Hi!
 
 I have 3 servers running (0.7rc1) with a replication_factor of 2 and use
 quorum for writes. But when I shut down one of them, UnavailableExceptions
 are thrown. Why is that? Isn't the point of quorum and a fault-tolerant
 DB that it continues with the remaining 2 nodes and redistributes the data
 to the broken one as soon as it's up again?
 
 What may I be doing wrong?
 
 thx
 tcn





RE: Booting Cassandra v0.7.0 on Windows: rename failed

2010-11-29 Thread Viktor Jevdokimov
This isn't the first time Cassandra has had I/O issues on Windows.

I realize it's not easy to review the source code and eliminate such issues,
but I would like the developers to keep them in mind in the future.

We're also running a Cassandra cluster on Windows, but on 0.7 beta1 (with a
similar issue, but for the commit log), and we're waiting for the 0.7 release
to use it fully in production.


Viktor

-Original Message-
From: Jonathan Ellis [mailto:jbel...@gmail.com] 
Sent: Monday, November 29, 2010 5:09 PM
To: user
Subject: Re: Booting Cassandra v0.7.0 on Windows: rename failed

Please report a bug at https://issues.apache.org/jira/browse/CASSANDRA

On Mon, Nov 29, 2010 at 2:49 AM, Ramon Rockx r.ro...@asknow.nl wrote:
 Hi,

 Recently I downloaded Cassandra v0.7.0 rc1. When I try to run cassandra
 it ends with the following logging:

  INFO 09:17:30,044 Enqueuing flush of memtable-locationi...@839514767(643 bytes, 12 operations)
  INFO 09:17:30,045 Writing memtable-locationi...@839514767(643 bytes, 12 operations)
 ERROR 09:17:30,233 Fatal exception in thread Thread[FlushWriter:1,5,main]
 java.io.IOError: java.io.IOException: rename failed of d:\cassandra\data\system\LocationInfo-e-1-Data.db
  at org.apache.cassandra.io.sstable.SSTableWriter.rename(SSTableWriter.java:214)
  at org.apache.cassandra.io.sstable.SSTableWriter.closeAndOpenReader(SSTableWriter.java:184)
  at org.apache.cassandra.io.sstable.SSTableWriter.closeAndOpenReader(SSTableWriter.java:167)
  at org.apache.cassandra.db.Memtable.writeSortedContents(Memtable.java:161)
  at org.apache.cassandra.db.Memtable.access$000(Memtable.java:49)
  at org.apache.cassandra.db.Memtable$1.runMayThrow(Memtable.java:174)
  at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30)
  at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
  at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
  at java.util.concurrent.FutureTask.run(FutureTask.java:138)
  at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
  at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
  at java.lang.Thread.run(Thread.java:619)
 Caused by: java.io.IOException: rename failed of d:\cassandra\data\system\LocationInfo-e-1-Data.db
  at org.apache.cassandra.utils.FBUtilities.renameWithConfirm(FBUtilities.java:359)
  at org.apache.cassandra.io.sstable.SSTableWriter.rename(SSTableWriter.java:210)
  ... 12 more

 Operating system is Windows 7. Tried it also on Windows 2003 server.
 I only modified a few (necessary) path settings in cassandra.yaml:

 commitlog_directory: d:/cassandra/commitlog
 data_file_directories:
 - d:/cassandra/data
 saved_caches_directory: d:/cassandra/saved_caches

 Does anybody know what I'm doing wrong?

 Regards,
 Ramon




-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com




RE: Preventing an update of a CF row

2010-10-19 Thread Viktor Jevdokimov
Nice and simple!

-Original Message-
From: Oleg Anastasyev [mailto:olega...@gmail.com] 
Sent: Tuesday, October 19, 2010 9:00 AM
To: user@cassandra.apache.org
Subject: Re: Preventing an update of a CF row

kannan chandrasekaran ckannanck at yahoo.com writes:

 Hi All,
 I have a query regarding the insert operation. By default, the insert
 operation inserts a new row or updates an existing row. Is it possible to
 prevent updates and allow only inserts automatically (especially when
 multiple clients are writing to Cassandra)? I was wondering whether there is
 a flag in Cassandra that will validate this for me automatically (something
 like a unique key constraint). If not, is it non-trivial to implement this?
 Any suggestions would be helpful.
 Thanks
 Kannan

Always specify some constant value for the timestamp. Only the first insertion
with that timestamp will succeed. Others will be ignored, because Cassandra
will consider them duplicates.




RE: Preventing an update of a CF row

2010-10-19 Thread Viktor Jevdokimov
Reverse timestamp.

-Original Message-
From: Sylvain Lebresne [mailto:sylv...@yakaz.com] 
Sent: Tuesday, October 19, 2010 10:44 AM
To: user@cassandra.apache.org
Subject: Re: Preventing an update of a CF row

 Always specify some constant value for the timestamp. Only the first insertion
 with that timestamp will succeed. Others will be ignored, because Cassandra
 will consider them duplicates.

Well, that's not entirely true. When Cassandra 'resolves' two columns having
the same timestamp, it compares the values to decide which one to keep (and
it keeps the column whose value is greater by byte comparison).
Concretely, if you insert Column('foo', 'b', 0) and then Column('foo', 'bar',
0), you'll end up with the second column, even though the timestamps are the
same, because 'bar' > 'b'.
So constant timestamps don't work for preventing updates of a given column.

--
Sylvain
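
The tie-breaking rule described in this reply can be sketched as below. It
models only what the message states (newer timestamp wins; on equal
timestamps, the greater value by byte comparison wins) and is not taken from
Cassandra's source. ReconcileSketch, Column and reconcile are hypothetical
names, and treating the bytes as unsigned is an assumption.

```java
import java.nio.charset.StandardCharsets;

public class ReconcileSketch {
    public static final class Column {
        public final String name;
        public final String value;
        public final long timestamp;

        public Column(String name, String value, long timestamp) {
            this.name = name;
            this.value = value;
            this.timestamp = timestamp;
        }
    }

    // Lexicographic comparison of byte arrays, treating bytes as unsigned;
    // a longer array wins when the shorter one is its prefix.
    static int compareBytes(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int cmp = (a[i] & 0xff) - (b[i] & 0xff);
            if (cmp != 0) {
                return cmp;
            }
        }
        return a.length - b.length;
    }

    // Pick the surviving version of a column: the newer timestamp wins, and on
    // a tie the greater value wins, so a constant timestamp does not protect
    // the first insert.
    public static Column reconcile(Column first, Column second) {
        if (first.timestamp != second.timestamp) {
            return first.timestamp > second.timestamp ? first : second;
        }
        byte[] a = first.value.getBytes(StandardCharsets.UTF_8);
        byte[] b = second.value.getBytes(StandardCharsets.UTF_8);
        return compareBytes(a, b) >= 0 ? first : second;
    }

    public static void main(String[] args) {
        Column winner = reconcile(new Column("foo", "b", 0), new Column("foo", "bar", 0));
        // prints "bar"
        System.out.println(winner.value);
    }
}
```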




RE: 0.7 live schema updates

2010-09-16 Thread Viktor Jevdokimov
But you'll lose these settings after a Cassandra restart.

-Original Message-
From: Oleg Anastasyev [mailto:olega...@gmail.com] 
Sent: Thursday, September 16, 2010 11:21 AM
To: user@cassandra.apache.org
Subject: Re: 0.7 live schema updates

You can change these attrs using JMX interface. Take a look at
org.apache.cassandra.tools.NodeProbe setCacheCapacities method.




RE: Cassandra 0.7 logging

2010-09-08 Thread Viktor Jevdokimov
We found that on Windows only log4j.properties works for logging, not
log4j-server.properties.

-Original Message-
From: Alaa Zubaidi [mailto:alaa.zuba...@pdf.com] 
Sent: Wednesday, September 08, 2010 7:34 AM
To: user@cassandra.apache.org
Subject: Re: Cassandra 0.7 logging

I did most of the things you said; I will check the rest and double-check
everything.

Thanks and Regards,
Alaa

On 9/7/2010 9:20 PM, Aaron Morton wrote:
 AFAIK nothing has changed in the logging other than the name of the config
 file. All the logging is done through log4j.

 I don't know about the Windows install, but using the normal (non-package)
 install on Linux, the /var/log/cassandra path was not automatically created.

 It's going to be a config problem on your side. Check the settings again,
 check that all the directories exist, check that the user Cassandra is
 running under has full permissions on the directory, and check that the root
 logger is configured to use the rolling file appender (e.g.
 log4j.rootLogger=INFO,R).

 Aaron

 On 08 Sep, 2010, at 03:42 PM, Alaa Zubaidi alaa.zuba...@pdf.com wrote:

   I modified the log4j-server.properties:

   # Edit the next line to point to your logs directory
   log4j.appender.RFile=E:\var\log\cassandra\Cass07\system.log

   Cassandra is installed on E:

   The var\log\cassandra directory is created in the same place where
   Cassandra is installed, which in my case is E:.
   However, I am NOT sure that 0.7 is responsible for creating the system.log
   under E:\var\log\cassandra\. It was created by the previous release (0.6.5).

   Thanks
   Alaa
 
 
 
   On 9/7/2010 6:42 PM, Aaron Morton wrote:
   Ah, running on Windows.

   Have you included the drive spec in the logging configuration, i.e. is your
   logging configured to use E:\var\log\cassandra\Cass07\system.log?

   When you say Cassandra created /var/log/cassandra/system.log, it's a bit
   confusing; what drive are you talking about?

   Aaron
 
   On 08 Sep, 2010, at 01:22 PM, Alaa Zubaidi alaa.zuba...@pdf.com wrote:

  Thanks Aaron,

  For some reason /var/log/cassandra/system.log does not have the 0.7
  info; it has the old 0.6.5 info.
  I modified the log4j-server.properties and changed the default location
  and still did not get anything.

  I'm running on Windows, by the way. Cassandra created
  /var/log/cassandra/system.log and I changed it to:
  E:\var\log\cassandra\Cass07\system.log

  Regards,

  Alaa
   
  On 9/7/2010 6:04 PM, Aaron Morton wrote:
 Logging is configured in the conf/log4j-server.properties file.

 The default location is /var/log/cassandra/system.log

 It's enabled at the INFO level by default, and logs to both the file and
 stdout by default.

 Aaron
  
  
  
 On 08 Sep, 2010, at 12:44 PM, Alaa Zubaidi alaa.zuba...@pdf.com wrote:

Hi,
I am testing 0.7 and I cannot find the system.log file. Is there anything I
need to do in order to enable the logging?
I read about the changes made to the logging in 0.7 but did not find anything
that needs to be done to enable the logging.

Regards,
Alaa
 
   

-- 
Alaa Zubaidi
PDF Solutions, Inc.
333 West San Carlos Street, Suite 700
San Jose, CA 95110  USA
Tel: 408-283-5639 (or 408-280-7900 x5639)
fax: 408-938-6479
email: alaa.zuba...@pdf.com





Cache capacity set with JConsole is lost after restart

2010-09-03 Thread Viktor Jevdokimov
Hi,

We're not setting cache capacity upon creation of a Column Family, since the type 
and capacity are unknown at that time. By default it is 0.

After the Column Family has enough data and we can decide on the cache type (Row or 
Key) and capacity, we connect with JConsole and set the cache capacity manually on 
every node. But after a Cassandra restart the cache capacity is 0 again.

How can we avoid losing the cache capacity after a restart?


Viktor


RE: Cache capacity set with JConsole is lost after restart

2010-09-03 Thread Viktor Jevdokimov
Forgot to mention the version: 0.7 beta 1

-Original Message-
From: Edward Capriolo [mailto:edlinuxg...@gmail.com] 
Sent: Friday, September 03, 2010 4:59 PM
To: user@cassandra.apache.org
Subject: Re: Cache capacity set with JConsole is lost after restart

On Fri, Sep 3, 2010 at 9:22 AM, Viktor Jevdokimov
viktor.jevdoki...@adform.com wrote:
 Hi,

 We're not setting cache capacity upon creation of a Column Family, since the
 type and capacity are unknown at that time. By default it is 0.

 After the Column Family has enough data and we can decide on the cache type (Row
 or Key) and capacity, we connect with JConsole and set the cache capacity
 manually on every node. But after a Cassandra restart the cache capacity is 0
 again.

 How can we avoid losing the cache capacity after a restart?

 Viktor

Viktor,
I will assume you are using 0.6.x.

In 0.6.x, changes to cache capacity made through JMX are NOT saved. Use the
KeysCached and RowsCached attributes of the ColumnFamily definition instead:

  <ColumnFamily Name="Super2"
                ColumnType="Super"
                CompareWith="UTF8Type"
                CompareSubcolumnsWith="UTF8Type"
                RowsCached="1"
                KeysCached="111"/>

Be warned: the attributes are CaSe SeNSiTiVe!



RE: Running Cassandra as a Windows Service

2010-07-02 Thread Viktor Jevdokimov
I'm very interested in a Windows Service, in both x86 and x64 versions.

Have you had any success starting Cassandra with Procrun alone?

Viktor

From: Kochheiser,Todd W - TOK-DITT-1 [mailto:twkochhei...@bpa.gov]
Sent: Thursday, July 01, 2010 9:49 PM
To: 'user@cassandra.apache.org'
Subject: RE: Running Cassandra as a Windows Service

I've been running it in our development and test environments as a Windows 
Service without any problem.  My configurations have not been very 
sophisticated, but I have been running some simple two-node clusters.  At this 
point nothing has caused me any concern.

I have been working on a contrib package for Cassandra and hope to have 
something submitted for review in the next week or so.  It will include 
everything needed to run Cassandra as a service.  In order to make Cassandra 
work well with Apache's Procrun (http://commons.apache.org/daemon/procrun.html), 
I needed to write a simple WindowsService class.  In addition to the procrun 
executables for 32/64-bit platforms and a batch file to install and remove the 
service, I'll also be including an Ant build file and associated unit tests.

Todd


From: Viktor Jevdokimov [mailto:viktor.jevdoki...@adform.com]
Sent: Thursday, July 01, 2010 6:39 AM
To: user@cassandra.apache.org
Cc: Kochheiser,Todd W - TOK-DITT-1
Subject: RE: Running Cassandra as a Windows Service

Hi,

Can you share your experience with running Cassandra as a Windows Service?

Thank you,
Viktor

From: Kochheiser,Todd W - TO-DITT1 [mailto:twkochhei...@bpa.gov]
Sent: Thursday, June 10, 2010 8:34 PM
To: 'user@cassandra.apache.org'
Subject: Running Cassandra as a Windows Service

For various reasons I am required to deploy systems on Windows.  As such, I 
went looking for information on running Cassandra as a Windows service.  I've 
read some of the user threads regarding running Cassandra as a Windows service, 
such as this one:

http://www.mail-archive.com/user@cassandra.apache.org/msg01656.html

I also found the following JIRA issue:

https://issues.apache.org/jira/browse/CASSANDRA-292

As it didn't look like anyone had contributed a formal solution, and having some 
experience using Apache's Procrun 
(http://commons.apache.org/daemon/procrun.html), I decided to go ahead and 
write a batch script and a simple WindowsService class to accomplish the 
task.  The WindowsService class only makes calls to public methods in 
CassandraDaemon and is fairly simple.  In combination with the batch script, it 
is very easy to install and remove the service.  At this point, I've installed 
Cassandra as a Windows service on XP (32 bit), Windows 7 (64 bit) and Windows 
Server 2008 R1/R2 (64 bit).  It should work fine on other versions of Windows 
(2K, 2K3).

Questions:

1. Has anyone else already done this work?
2. If not, I wouldn't mind sharing the code/script or contributing it back 
to the project.  Is there any interest in this from the Cassandra dev team or 
the user community?

Ideally the WindowsService could be included in the distributed source/binary 
distributions (perhaps in a contrib area) as well as the batch script and 
associated procrun executables.  Or, perhaps it could be posted to a Cassandra 
community site (is there one?).

Todd
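
For readers wondering what such a wrapper might look like, here is a minimal sketch, not Todd's actual class. It assumes Procrun's jvm mode, where the configured start and stop methods are static, return void and take a String[] argument, and where the start method is expected to block until stop is called. The Daemon interface and all method names other than start and stop are hypothetical stand-ins; the real wrapper would call CassandraDaemon's public methods instead.

```java
import java.util.concurrent.CountDownLatch;

final class WindowsServiceSketch {

    /** Hypothetical stand-in for the wrapped daemon; this is NOT Cassandra's real API. */
    interface Daemon {
        void start();
        void stop();
    }

    private final Daemon daemon;
    private final CountDownLatch started = new CountDownLatch(1);
    private final CountDownLatch stopRequested = new CountDownLatch(1);

    WindowsServiceSketch(Daemon daemon) {
        this.daemon = daemon;
    }

    /** Starts the daemon, then blocks until requestStop() has been called. */
    void runUntilStopped() throws InterruptedException {
        daemon.start();
        started.countDown();
        stopRequested.await();
    }

    /** Waits for startup to finish, stops the daemon and releases runUntilStopped(). */
    void requestStop() throws InterruptedException {
        started.await();
        daemon.stop();
        stopRequested.countDown();
    }

    // Single service instance used by the static entry points below.
    private static final WindowsServiceSketch SERVICE = new WindowsServiceSketch(new Daemon() {
        public void start() { System.out.println("daemon started"); }
        public void stop()  { System.out.println("daemon stopped"); }
    });

    /** Entry point a service wrapper would be configured to call on service start. */
    public static void start(String[] args) throws InterruptedException {
        SERVICE.runUntilStopped();
    }

    /** Entry point a service wrapper would be configured to call on service stop. */
    public static void stop(String[] args) throws InterruptedException {
        SERVICE.requestStop();
    }

    /** Simulates the service manager: start on one thread, stop from another. */
    public static void main(String[] args) throws Exception {
        Thread serviceThread = new Thread(() -> {
            try {
                start(new String[0]);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        serviceThread.start();
        stop(new String[0]);
        serviceThread.join();
        System.out.println("service thread exited");
    }
}
```

The latches are only there to make the start call block and to keep stop from racing ahead of startup. A production wrapper would also need to handle a daemon that fails to start, which this sketch does not.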







RE: Extremly slow inserts on LAN

2010-05-09 Thread Viktor Jevdokimov
We had a similar experience.

The problem was using TSocket alone as the transport:

// Unbuffered: every small protocol write goes straight to the socket
var transport = new TSocket("192.168.0.123", 9160);
var protocol = new TBinaryProtocol(transport);
var client = new Cassandra.Client(protocol);

Using TBufferedTransport helped a lot:

// Buffered: writes are collected and sent when the transport is flushed
var transport = new TBufferedTransport(new TSocket("192.168.0.123", 9160));
var protocol = new TBinaryProtocol(transport);
var client = new Cassandra.Client(protocol);

Viktor
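
To illustrate why the buffered transport makes such a difference, here is a small Java sketch that uses no Thrift classes at all. CountingStream is a hypothetical stand-in for a socket that counts how many separate writes reach it, and the message layout (tag, id, value) is made up for the example. A likely explanation for the slowdown, not confirmed in this thread, is that many tiny unbuffered sends over a real network interact badly with Nagle's algorithm and delayed ACKs.

```java
import java.io.BufferedOutputStream;
import java.io.IOException;
import java.io.OutputStream;

final class BufferedTransportDemo {

    /** Stand-in for a socket: counts the write calls that reach it and discards the data. */
    static final class CountingStream extends OutputStream {
        int writeCalls = 0;

        @Override
        public void write(int b) {
            writeCalls++;
        }

        @Override
        public void write(byte[] buf, int off, int len) {
            writeCalls++;
        }
    }

    /** Writes a made-up message as three small pieces per field, then flushes. */
    static void writeMessage(OutputStream out, int fieldCount) throws IOException {
        for (int i = 0; i < fieldCount; i++) {
            out.write(new byte[] {11});                                  // type tag (arbitrary)
            out.write(new byte[] {(byte) (i >>> 8), (byte) i});          // 2-byte field id
            out.write(new byte[] {0, 0, (byte) (i >>> 8), (byte) i});    // 4-byte value
        }
        out.flush();
    }

    /** Number of writes that hit the "socket" when nothing sits in between. */
    static int unbufferedWrites(int fieldCount) throws IOException {
        CountingStream socket = new CountingStream();
        writeMessage(socket, fieldCount);
        return socket.writeCalls;
    }

    /** Number of writes that hit the "socket" when a buffer collects them first. */
    static int bufferedWrites(int fieldCount) throws IOException {
        CountingStream socket = new CountingStream();
        writeMessage(new BufferedOutputStream(socket), fieldCount);
        return socket.writeCalls;
    }

    public static void main(String[] args) throws IOException {
        System.out.println("unbuffered writes: " + unbufferedWrites(10)); // prints 30
        System.out.println("buffered writes:   " + bufferedWrites(10));   // prints 1
    }
}
```

With the buffer in place the whole message reaches the socket in one write at flush time, which is the same idea as wrapping TSocket in TBufferedTransport.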


-Original Message-
From: Arie Keren [mailto:a...@doubleverify.com] 
Sent: Monday, May 10, 2010 8:51 AM
To: user@cassandra.apache.org
Subject: RE: Extremly slow inserts on LAN

No - just Windows.
So I'm going to do some experiments to isolate the cause:
- use java client on windows
- use linux server
- use java client on linux

Thanx


-Original Message-
From: David Strauss [mailto:da...@fourkitchens.com]
Sent: May 09, 2010 5:48 PM
To: user@cassandra.apache.org
Subject: Re: Extremly slow inserts on LAN

From a naive (not caring about Cassandra internals) basis, the first step is 
to isolate whether the problem is on the client or server side.
Have you tried a Linux-based server or a Linux-based client?

On 2010-05-09 14:06, Arie Keren wrote:
 While taking our first steps with Cassandra, we are experiencing slow
 inserts over the LAN.

 Inserting 7000 keys with 1 column family takes about 10 seconds when the
 Cassandra server is running on the same host as the client.

 But when the server runs on a different host on the LAN, the same inserts take
 more than 10 (!) minutes.

 In both cases the Cassandra server contains a single node.

 We use Cassandra version 0.6.0 running on Windows Server 2008.

 The client is a .NET C# application.

--
David Strauss
   | da...@fourkitchens.com
Four Kitchens
   | http://fourkitchens.com
   | +1 512 454 6659 [office]
   | +1 512 870 8453 [direct]






RE: Cassandra on Windows network latency

2010-05-03 Thread Viktor Jevdokimov
Yes, we have already figured that out :)

Thanks!

-Original Message-
From: Carlos Alvarez [mailto:cbalva...@gmail.com] 
Sent: Thursday, April 29, 2010 4:03 PM
To: user@cassandra.apache.org
Subject: Re: Cassandra on Windows network latency

Are you using TSocket in the client? If so, use TBufferedTransport instead.


Carlos

On 4/29/10, Viktor Jevdokimov viktor.jevdoki...@adform.com wrote:
 Thrift C# sources, thrift generated Cassandra sources, test app built with
 C#. Simple connect/write/read operations. No pooling or anything else.

 From: Heath Oderman [mailto:he...@526valley.com]
 Sent: Thursday, April 29, 2010 2:17 PM
 To: user@cassandra.apache.org
 Subject: Re: Cassandra on Windows network latency

 I learned the hard way, that running py_stress in the src/contrib directory
 is a great way to test what kind of speeds you are really getting.

 What tools / client are you using to test to get the 200ms number?

 stu
 On Thu, Apr 29, 2010 at 7:12 AM, Viktor Jevdokimov
 viktor.jevdoki...@adform.com wrote:
 Hi all,

 We have installed Cassandra on Windows and found that with any number of
 Cassandra nodes (single node or 3-node cluster) on Windows Vista or Windows
 Server 2008, 32 or 64 bit, with any load or number of requests, we have:

 When client and server are on the same machine, connect/read/write latencies
 ~0-1ms
 When the client is on another machine, same network, on the same switch,
 connection latency 0-1ms (as a ping), read/write latencies =200ms.

 What causes 200ms latency when accessing Cassandra on Windows over the network?
 Does anybody else experience such behavior?

 Cassandra 0.6.1
 Java SE 6 u20


 Best regards,
 Viktor




-- 
Perhaps there was an error in the spelling. Or in the articulation of the Sacred Name.


Primary and Backup clusters

2010-05-03 Thread Viktor Jevdokimov
Hello,

Our system (not Cassandra) has a backup cluster in a different datacenter in case 
the primary cluster is unavailable, or for software upgrades.
100% of traffic goes to the primary cluster. In the cases above we switch 100% of 
traffic to the backup cluster for a short time; when the issues are resolved, 
traffic is switched back to the primary.

We'd like to have primary and backup Cassandra clusters in different 
datacenters for the same reasons.
We do not want high traffic between the primary and backup datacenters.

Now the questions:

1. How do we sync the Cassandra clusters (backup-primary) with minimal traffic?
2. How should Cassandra be configured in such a case?


Thanks,

Viktor


Cassandra on Windows network latency

2010-04-29 Thread Viktor Jevdokimov
Hi all,

We have installed Cassandra on Windows and found that with any number of 
Cassandra nodes (single node or 3-node cluster) on Windows Vista or Windows 
Server 2008, 32 or 64 bit, with any load or number of requests, we have:

When client and server are on the same machine, connect/read/write latencies 
~0-1ms
When the client is on another machine, same network, on the same switch, 
connection latency 0-1ms (as a ping), read/write latencies =200ms.

What causes 200ms latency when accessing Cassandra on Windows over the network? 
Does anybody else experience such behavior?

Cassandra 0.6.1
Java SE 6 u20


Best regards,
Viktor



RE: Cassandra on Windows network latency

2010-04-29 Thread Viktor Jevdokimov
Thrift C# sources, thrift generated Cassandra sources, test app built with C#. 
Simple connect/write/read operations. No pooling or anything else.

From: Heath Oderman [mailto:he...@526valley.com]
Sent: Thursday, April 29, 2010 2:17 PM
To: user@cassandra.apache.org
Subject: Re: Cassandra on Windows network latency

I learned the hard way, that running py_stress in the src/contrib directory is 
a great way to test what kind of speeds you are really getting.

What tools / client are you using to test to get the 200ms number?

stu
On Thu, Apr 29, 2010 at 7:12 AM, Viktor Jevdokimov 
viktor.jevdoki...@adform.com wrote:
Hi all,

We have installed Cassandra on Windows and found that with any number of 
Cassandra nodes (single node or 3-node cluster) on Windows Vista or Windows 
Server 2008, 32 or 64 bit, with any load or number of requests, we have:

When client and server are on the same machine, connect/read/write latencies 
~0-1ms
When the client is on another machine, same network, on the same switch, 
connection latency 0-1ms (as a ping), read/write latencies =200ms.

What causes 200ms latency when accessing Cassandra on Windows over the network? 
Does anybody else experience such behavior?

Cassandra 0.6.1
Java SE 6 u20


Best regards,
Viktor



