Re: Cassandra and Apache Drill

2012-08-31 Thread Jake Luciani
I don't think Drill has been accepted into the incubator yet or has any code. If/When that happens then it's entirely possible Cassandra could be integrated. On Fri, Aug 31, 2012 at 4:29 PM, John Onusko jonu...@actiance.com wrote: Like a lot of folks, I have a need for Big Data and fast

Re: Combining all CFs into one big one

2011-05-01 Thread Jake Luciani
If you have N column families you need N * memtable size of RAM to support this. If that's not an option you can merge them into one as you suggest but then you will have much larger SSTables, slower compactions, etc. I don't necessarily agree with Tyler that the OS cache will be less

Re: Migrating all rows from 0.6.13 to 0.7.5 over thrift?

2011-05-07 Thread Jake Luciani
I know thrift and python and Unicode don't mix. On May 7, 2011, at 4:21 PM, aaron morton aa...@thelastpickle.com wrote: I've been able to reproduce the fault using python on my mac book see https://github.com/amorton/cassandra-unicode-bug When we try to find the unicode key in the

Re: Forcing Cassandra to free up some space

2011-05-26 Thread Jake Luciani
Is there a way for me to make (or even gently suggest to) Cassandra that it may be a good time to free up some space? Disregarding what's been said and until ref-counting is implemented this is a useful tool to gently suggest cleanup: https://github.com/ceocoder/jmxgc On Thu, May 26, 2011 at

Re: What's the best approach to search in Cassandra

2011-06-03 Thread Jake Luciani
Mark, Check out Solandra. http://github.com/tjake/Solandra On Fri, Jun 3, 2011 at 7:56 PM, Mark Kerzner markkerz...@gmail.com wrote: Hi, I need to store, say, 10M-100M documents, with each document having say 100 fields, like author, creation date, access date, etc., and then I want to

Re: Installing Thrift with Solandra

2011-06-06 Thread Jake Luciani
To access Cassandra in Solandra it's the same as regular cassandra. To access Solr you use one of the Php Solr libraries http://wiki.apache.org/solr/SolPHP On Mon, Jun 6, 2011 at 11:04 PM, Jean-Nicolas Boulay Desjardins jnbdzjn...@gmail.com wrote: I am trying to install Thrift with

Re: Installing Thrift with Solandra

2011-06-07 Thread Jake Luciani
to build a Thrift interface for Cassandra: ./compiler/cpp/thrift -gen php ../PATH-TO-CASSANDRA/interface/cassandra.thrift How do I do this? Where is the interface folder? Again, tjake thanks allot for your time and help. On Mon, Jun 6, 2011 at 11:13 PM, Jake Luciani jak...@gmail.com wrote

Re: Installing Thrift with Solandra

2011-06-07 Thread Jake Luciani
-gen php ../PATH-TO-CASSANDRA/interface/cassandra.thrift How do I do this? Where is the interface folder? Again, tjake thanks allot for your time and help. On Mon, Jun 6, 2011 at 11:13 PM, Jake Luciani jak...@gmail.com wrote: To access Cassandra in Solandra it's the same as regular cassandra

Re: Cassandra HDFS question

2011-06-09 Thread Jake Luciani
Hi JKnight, Yes. The Brisk project adds a HDFS compatible layer for Cassandra see http://github.com/riptano/brisk -Jake On Thu, Jun 9, 2011 at 11:05 PM, JKnight JKnight beukni...@gmail.comwrote: Dear all, Does Cassandra support HDFS storage? Thank a lot for support. -- Best regards,

Re: Is there a way from a running Cassandra node to determine whether or not itself is up?

2011-06-15 Thread Jake Luciani
No force a node down you can use nodetool disablegossip On Wed, Jun 15, 2011 at 6:42 PM, Suan Aik Yeo yeosuan...@gmail.com wrote: Thanks, Aaron, but we determined that adding Java into the equation just brings in too much complexity for something that's called out of an Nginx Perl module.

Re: What's the best approach to search in Cassandra

2011-06-16 Thread Jake Luciani
that if I read all there is on GitHub, I can probably start using it. * Thank you, Mark On Fri, Jun 3, 2011 at 8:07 PM, Jake Luciani jak...@gmail.com wrote: Mark, Check out Solandra. http://github.com/tjake/Solandra On Fri, Jun 3, 2011 at 7:56 PM, Mark Kerzner markkerz

Re: solandra or pig or....?

2011-06-21 Thread Jake Luciani
Solandra can answer the question you used as an example and it's more of a fit for low-latency ad-hoc reporting then PIG. Pig queries will take minutes not seconds. On Tue, Jun 21, 2011 at 12:12 PM, Sasha Dolgy sdo...@gmail.com wrote: Folks, Simple question ... Assuming my current use case

Re: solandra or pig or....?

2011-06-21 Thread Jake Luciani
i had a quick look at https://github.com/tjake/Solandra/wiki/Solandra-Wiki and it wasn't dead obvious to me On Tue, Jun 21, 2011 at 8:19 PM, Jake Luciani jak...@gmail.com wrote: Solandra can answer the question you used as an example and it's more of a fit for low-latency ad-hoc

Re: solandra or pig or....?

2011-06-21 Thread Jake Luciani
Right, Solr will not do anything other than basic aggregations (facets) and range queries. On Tue, Jun 21, 2011 at 3:16 PM, Dan Kuebrich dan.kuebr...@gmail.comwrote: Solandra is indeed distributed search, not distributed number-crunching. As a previous poster said, you could imagine

Re: solandra or pig or....?

2011-06-22 Thread Jake Luciani
be required... -sd On Tue, Jun 21, 2011 at 9:50 PM, Jake Luciani jak...@gmail.com wrote: Right, Solr will not do anything other than basic aggregations (facets) and range queries. On Tue, Jun 21, 2011 at 3:16 PM, Dan Kuebrich dan.kuebr...@gmail.com wrote: Solandra is indeed distributed search

Re: Little problems with Solandra

2011-07-23 Thread Jake Luciani
Sounds like you forgot to start solandra after you built it. cd solandra-app; ./bin/solandra You can verify it's running with jps look for SolandraServer. On Jul 23, 2011, at 10:52 AM, Jean-Nicolas Boulay Desjardins jnbdzjn...@gmail.com wrote: Hi, I have a server on RackSpace and it

Re: Slow Reads

2011-07-26 Thread Jake Luciani
It doesn't read the entire row, but it does read a section of the row from disk... How big is each supercolumn? If you re-read the data does the query time get faster? On Tue, Jul 26, 2011 at 11:59 AM, Philippe watche...@gmail.com wrote: i believe it's because it needs to read the whole row

Re: Slow Reads

2011-07-27 Thread Jake Luciani
The philosophy in no-sql is to store the data as you plan to access it. that means duplicating the data many time possibly. Disk is cheap, writes are fast. On Wed, Jul 27, 2011 at 2:22 PM, Priyanka priya...@gmail.com wrote: Thank you Indra for your suggestion. But the thing is apart from

Re: Cassandra Pig with network topology and data centers.

2011-07-29 Thread Jake Luciani
Yes it's read repair you can lower the read repair chance to tune this. On Jul 29, 2011, at 6:31 PM, Aaron Griffith aaron.c.griff...@gmail.com wrote: I currently have a 9 node cassandra cluster setup as follows: DC1: Six nodes DC2: Three nodes The tokens alternate between the two

Re: Setup Cassandra0.8 in Eclipse

2011-08-08 Thread Jake Luciani
you can simply run: ant generate-eclipse-files then import the project On Sun, Aug 7, 2011 at 5:39 PM, Alvin UW alvi...@gmail.com wrote: Hello, I am trying to Setup Cassandra0.8 in Eclipse following http://wiki.apache.org/cassandra/RunningCassandraInEclipse After right clicking on the

Re: Solandra/Cassandra ring issue

2011-08-11 Thread Jake Luciani
seriously, If you change the cluster name in cassandra.yaml they won't join. On Thu, Aug 11, 2011 at 12:31 PM, Ashley Martens amart...@ngmoco.comwrote: No shared seeds. Downright freaky. -- http://twitter.com/tjake

Re: Solandra multiple schemas

2011-08-15 Thread Jake Luciani
You want the solandra data stored under two keyspaces? Or you just want two different logical indexes. The former requires changing the keyspace name located in solandra.properties but you can only access one per process. The latter would involve creating two different solr cores at different

Re: Solandra distributed search

2011-08-15 Thread Jake Luciani
Solandra manages the shard parameters for you. you don't need to specify anything. On Mon, Aug 15, 2011 at 3:00 PM, Jeremiah Jordan jeremiah.jor...@morningstar.com wrote: When using Solandra, do I need to use the Solr sharding synxtax in my queries? I don't think I do because Cassandra is

Re: Reg File upload to cassandra - connection refused error

2011-08-18 Thread Jake Luciani
What is rpc_address set to in cassandra.yaml? Try setting these to 0.0.0.0 to be sure it's listening to external traffic. On Thu, Aug 18, 2011 at 8:37 AM, Thamizh tceg...@yahoo.co.in wrote: Hi All, This is regarding help to resolve connection refused error on Cassandra client API. I have

Re: Suggested settings for number crunching

2011-08-18 Thread Jake Luciani
Are you writing lots of tiny rows or a few very large rows, are you batching mutations? is the loading disk or cpu or network bound? -Jake On Thu, Aug 18, 2011 at 7:08 AM, Paul Loy ketera...@gmail.com wrote: Hi All, I have a program that crunches through around 3 billion calculations. We

Re: Suggested settings for number crunching

2011-08-18 Thread Jake Luciani
no network traffic so I think it's disk access. Will find out for sure tomorrow after the current test runs. Thanks, Paul. On Thu, Aug 18, 2011 at 2:23 PM, Jake Luciani jak...@gmail.com wrote: Are you writing lots of tiny rows or a few very large rows, are you batching mutations

Re: Solandra error - spaces in search

2011-08-23 Thread Jake Luciani
Thx for the info I'll try to reproduce On Aug 23, 2011, at 9:28 PM, Ashley Martens amart...@ngmoco.com wrote: INFO [769787724@qtp-311722089-9825] 2011-08-23 22:07:53,750 SolrCore.java (line 1370) [users] webapp=/solandra path=/select

Re: Possibility of going OOM using get_count

2011-09-18 Thread Jake Luciani
This is fixed in 1.0 https://issues.apache.org/jira/browse/CASSANDRA-2894 On Sun, Sep 18, 2011 at 2:16 PM, Tharindu Mathew mcclou...@gmail.comwrote: Hi everyone, I noticed this line in the API docs, The method is not O(1). It takes all the columns from disk to calculate the answer. The

Re: Solandra distributed search

2011-10-06 Thread Jake Luciani
the default setting of 4 for this property affect the distribution of data across my nodes? From: Jake Luciani jak...@gmail.com Reply-To: user@cassandra.apache.org user@cassandra.apache.org Date: Mon, 15 Aug 2011 12:03:22 -0700 To: user@cassandra.apache.org user@cassandra.apache.org Subject: Re

Re: Operator on secondary indexes in 0.8.x (GTE/LTE)

2011-10-11 Thread Jake Luciani
This hasn't changed in AFAIK, In Brisk we had the same problem in CFS so we created a sentinel value that all rows shared then it works. CASSANDRA-2915 should fix it. On Tue, Oct 11, 2011 at 4:48 PM, Sasha Dolgy sdo...@gmail.com wrote: I was trying to get a range of rows based on a

Re: Counter Experience (Performance)?

2011-10-27 Thread Jake Luciani
What's your bottleneck? http://spyced.blogspot.com/2010/01/linux-performance-basics.html On Thu, Oct 27, 2011 at 9:37 AM, Joe Stein crypt...@gmail.com wrote: Hey folks, I am interested in what others have seen in regards to their experience in the amount of depth and width (CF, Rows Columns)

Re: 2 node cluster, 1 node down, overall failure

2011-10-27 Thread Jake Luciani
You are unable to connect? or you are getting an UnavailableException? On Thu, Oct 27, 2011 at 11:14 AM, RobinUs2 ro...@us2.nl wrote: I currently run a 2-node cluster with version cassandra 1.0 (stable). With replication factor 2 on the keyspace which I'm testing. When I shutdown node B,

Re: 2 node cluster, 1 node down, overall failure

2011-10-27 Thread Jake Luciani
What consistency level are you using? With RF=2 your only option is CL.ONE when a node is down. On Thu, Oct 27, 2011 at 11:47 AM, RobinUs2 ro...@us2.nl wrote: The error I currently see when I take down node B: Error performing get_indexed_slices on NODE A IP:9160: exception

Re: Reminder: Cassandra Meetup, Thursday Nov. 10th in Vancouver

2011-11-07 Thread Jake Luciani
I'll be there! On Mon, Nov 7, 2011 at 5:23 PM, Eric Evans eev...@acunu.com wrote: Just a reminder; If you're planning to be at ApacheCon, or are otherwise able to be in Vancouver on the 10th, we're having a Cassandra Meetup. There is no cost to attend (you don't even need to be registered

Re: Secondary index issue, unable to query for records that should be there

2011-11-08 Thread Jake Luciani
Hi Nate, Could you try running it with debug enabled on the logs? it will give more insite into what's going on. -Jake On Tue, Nov 8, 2011 at 3:45 PM, Nate Sammons nsamm...@ften.com wrote: This is against a single server, not a cluster. Replication factor for the keyspace is set to 1, CL

Re: Second Cassandra users survey

2011-11-09 Thread Jake Luciani
Hi Todd, Entity Groups : https://issues.apache.org/jira/browse/CASSANDRA-1684 -Jake On Wed, Nov 9, 2011 at 6:44 AM, Todd Burruss bburr...@expedia.com wrote: I believe I heard someone talk at Cassandra SF conference about creating a partitioner that was a derivation of RandomPartitioner. It

Re: Second Cassandra users survey

2011-11-09 Thread Jake Luciani
at the conference that had already implemented what I mentioned. It didn't offer any atomicity, just co-locating a family of data on the same node. From: Jake Luciani jak...@gmail.com Reply-To: user@cassandra.apache.org user@cassandra.apache.org Date: Wed, 9 Nov 2011 02:53:20 -0800 To: user

Cassandra as a LinkedIn Skill

2011-11-14 Thread Jake Luciani
Hi, Sorry for the intrusion. I was speaking to some of the LinkedIn engineers at ApacheCon last week about to see how to get Cassandra into the linkedin skills page [1]. They claim if more people add Cassandra as a skill in their profile then it will show up. So my request is if you use

Re: Second Cassandra users survey

2011-11-14 Thread Jake Luciani
Re Simpler elasticity: Latest opscenter will now rebalance cluster optimally http://www.datastax.com/dev/blog/whats-new-in-opscenter-1-3 /plug -Jake On Mon, Nov 14, 2011 at 7:27 PM, Chris Burroughs chris.burrou...@gmail.comwrote: - It would be super cool if all of that counter work made it

Re: Cassandra not suitable?

2011-12-07 Thread Jake Luciani
Where do you see the timeout exceptions? in the mappers? How many mappers reducers slots are you using? What does your disk setup look like? do you have HDFS on same disk as cassandra data dir? -Jake On Tue, Dec 6, 2011 at 4:50 AM, Patrik Modesto patrik.mode...@gmail.comwrote: Hi, I'm

Re: best practices for simulating transactions in Cassandra

2011-12-12 Thread Jake Luciani
I've written a locking mechanism for Solandra (I refer to it as a reservation system) which basically allows you to acquire a lock. This is used to ensure a node is service unique sequential IDs for lucene. It sounds a bit similar to Dominic's description but I'll explain how the Solandra one

Re: best practices for simulating transactions in Cassandra

2011-12-12 Thread Jake Luciani
algorithm doesn't have the possibility of deadlock or livelock. It does seem that you have covered a lot of the bases though. Thanks for sharing guys :) John On Mon, Dec 12, 2011 at 6:21 AM, Jake Luciani jak...@gmail.com wrote: I've written a locking mechanism for Solandra (I refer

Re: Deploying Cassandra 1.0.7 on EC2 in minutes

2012-01-18 Thread Jake Luciani
Thanks Andrei! On Wed, Jan 18, 2012 at 8:00 AM, Andrei Savu savu.and...@gmail.com wrote: Hi guys, I just want to the let you know that Apache Whirr trunk (the upcoming 0.7.1 release) can deploy Cassandra 1.0.7 on AWS EC2 Rackspace Cloud. You can give it a try by running the following

Re: How much has Cassandra improved from 0.8.6 to 1.0+?

2012-01-30 Thread Jake Luciani
Well as they say Lies, damned lies, and statistics This is a alternate comparison you can review: http://www.cubrid.org/blog/dev-platform/nosql-benchmarking/ YCSB is a known and agreed upon benchmark. The benchmark you link includes no sourcecode to reproduce with and as the author mentions For

Re: Is the wiki outdated regarding Hive support?

2012-04-01 Thread Jake Luciani
Hi Ben. That is still the repo. The code that ships with latest DSE is the hive-0.8.1-merge branch. We will try to get this into the Cassandra trunk asap. Jake On Apr 1, 2012, at 6:39 PM, Ben McCann b...@benmccann.com wrote: The wiki says Hive support is currently a standalone project

Re: Row iteration using RandomPartitioner

2012-04-02 Thread Jake Luciani
Correct. Random partitioner order is md5 token order. If you make no changes you will get the same order On Apr 2, 2012, at 7:53 AM, christopher-t...@ubs.com wrote: Hi, Bit of a silly question, is row iteration using the RandomPartitioner deterministic? I don't particularly care what

Re: Write performance compared to Postgresql

2012-04-03 Thread Jake Luciani
Hi Jeff, Writing serially over one connection will be slower. If you run many threads hitting the server at once you will see throughput improve. Jake On Apr 3, 2012, at 7:08 AM, Jeff Williams je...@wherethebitsroam.com wrote: Hi, I am looking at cassandra for a logging application.

Re: 2 questions DataStax Enterprise

2012-04-03 Thread Jake Luciani
Hi reply inline. On Tue, Apr 3, 2012 at 12:18 PM, Alexandru Sicoe adsi...@gmail.com wrote: Hi guys, I'm trying out DSE and looking for the best way to arrange the cluster. I have 9 nodes: 3 behind a gateway taking in writes from my collectors and 6 outside the gateway that are supposed to

Re: cassandra and .net

2012-04-10 Thread Jake Luciani
You can also look at using a .net client wrapper like https://github.com/managedfusion/fluentcassandra On Tue, Apr 10, 2012 at 8:06 AM, puneet loya puneetl...@gmail.com wrote: thankk :) :) it works :) On Tue, Apr 10, 2012 at 3:07 PM, Henrik Schröder skro...@gmail.comwrote: In your

Re: Is the secondary index re-built under compaction?

2012-04-17 Thread Jake Luciani
No, the indexes are not rebuilt every compaction. Only if you manually rebuild or bootstrap a new node does it use compaction manager to rebuild. On Tue, Apr 17, 2012 at 9:47 AM, Maxim Potekhin potek...@bnl.gov wrote: Thanks Aaaron. Just to be clear, every time I do a compaction, I rebuild

Re: Is the secondary index re-built under compaction?

2012-04-17 Thread Jake Luciani
if that helps. Maxim On 4/17/2012 9:53 AM, Jake Luciani wrote: No, the indexes are not rebuilt every compaction. Only if you manually rebuild or bootstrap a new node does it use compaction manager to rebuild. On Tue, Apr 17, 2012 at 9:47 AM, Maxim Potekhin potek...@bnl.gov wrote

Re: Is the secondary index re-built under compaction?

2012-04-17 Thread Jake Luciani
says it's building the index, not compacting the corresponding CF. Either that's an ambiguous diagnostic, or indeed something is not right with my rig as of late. Maxim On 4/17/2012 10:05 AM, Jake Luciani wrote: Well, the since the secondary indexes are themselves column families

Re: Is the secondary index re-built under compaction?

2012-04-17 Thread Jake Luciani
loading process. Since the nodes have good hardware (24 cores + SSD), the apparent load on each node was nothing remarkable, even at 20kHz insertion rate. But maybe I'm being overoptimistic. Maxim On 4/17/2012 10:12 AM, Jake Luciani wrote: Hmm that does sound fishy. When you run show

Re: 200TB in Cassandra ?

2012-04-20 Thread Jake Luciani
What other solutions are you considering? Any OLTP style access of 200TB of data will require substantial IO. Do you know how big your working dataset will be? -Jake On Fri, Apr 20, 2012 at 3:30 AM, Franc Carter franc.car...@sirca.org.auwrote: On Fri, Apr 20, 2012 at 6:27 AM, aaron morton

Re: RandomPartitioner is providing a very skewed distribution of keys across a 5-node Solandra cluster

2012-06-24 Thread Jake Luciani
Hi Safdar, If you want to get better utilization of the cluster raise the solandra.shards.at.once param in solandra.properties -Jake On Sun, Jun 24, 2012 at 11:00 AM, Safdar Kureishy safdar.kurei...@gmail.com wrote: Hi, I've searched online but was unable to find any leads for the

Re: java.lang.OutOfMemoryError: unable to create new native thread

2012-06-25 Thread Jake Luciani
This means you need to raise the nproc limit for the user you run cassandra with On Mon, Jun 25, 2012 at 8:48 AM, Oli Schacher cassan...@lists.wgwh.chwrote: Hi list I have a small cassandra cluster consisting of three nodes. Every few weeks the whole cluster goes down at the same time. All

Re: DSE solr HA

2012-08-13 Thread Jake Luciani
Going through this page and it looks like indexes are stored locally http://www.datastax.com/dev/blog/cassandra-with-solr-integration-details . My question is what happens if one of the solr nodes crashes? Is the data indexed again on those nodes? Yes the data is indexed again

Re: Cassandra 0.7-beta3 - IndexOutOfBoundsException when create column family

2010-11-16 Thread Jake Luciani
This is a bug in beta3, if you checkout the cassandra-0.7 branch it should work for you. On Tue, Nov 16, 2010 at 3:38 PM, André Fiedler fiedler.an...@googlemail.com wrote: I try to perform the following action after a clean startup. And get the log below. How to fix this?

Re: Facebook messaging and choice of HBase over Cassandra - what can we learn?

2010-11-21 Thread Jake Luciani
+1 Ed On Nov 21, 2010, at 12:13 PM, Edward Capriolo edlinuxg...@gmail.com wrote: On Sun, Nov 21, 2010 at 12:10 PM, André Fiedler fiedler.an...@googlemail.com wrote: Facebook Messaging – HBase Comes of Age http://facility9.com/2010/11/18/facebook-messaging-hbase-comes-of-age

Re: Taking down a node in a 3-node cluster, RF=2

2010-11-28 Thread Jake Luciani
are writing with CL.ANY If you never write with CL.ANY then you can turn off hinted handoff. How do I reconcile this? On Sun, Nov 28, 2010 at 7:11 PM, Jake Luciani jak...@gmail.com wrote: If you read/write data with quorum then you can safely take a node down in this scenario. Subsequent

Re: Taking down a node in a 3-node cluster, RF=2

2010-11-28 Thread Jake Luciani
Right. On Sun, Nov 28, 2010 at 1:03 PM, David Boxenhorn da...@lookin2.com wrote: OK. To sum up: RF=2 and QUORUM are incompatible (if you want to be able to take a node down). Right? On Sun, Nov 28, 2010 at 7:59 PM, Jake Luciani jak...@gmail.com wrote: I was wrong on this scenario

Re: Data Model Question

2010-12-02 Thread Jake Luciani
You can also run Solr with Cassandra as the backend: https://github.com/tjake/Lucandra/tree/solandra /shameless_plug -Jake On Thu, Dec 2, 2010 at 6:27 AM, aaron morton aa...@thelastpickle.comwrote: Have you considered using Solr / lucene for the search? It has a lot more search features,

Re: Re: Re: Cassandra 0.7 beta 3 outOfMemory (OOM)

2010-12-07 Thread Jake Luciani
Max this was a bug fixed recently in 0.7 branch https://issues.apache.org/jira/browse/CASSANDRA-1801 fixed now in RC2 -Jake On Tue, Dec 7, 2010 at 8:11 AM, Max cassan...@ajowa.de wrote: As far as i can see, Lucandra already uses batch_mutations.

Re: Too many open files Exception + java.lang.ArithmeticException: / by zero

2010-12-15 Thread Jake Luciani
http://www.riptano.com/docs/0.6/troubleshooting/index#java-reports-an-error-saying-there-are-too-many-open-files On Wed, Dec 15, 2010 at 11:13 AM, Amin Sakka, Novapost amin.sa...@novapost.fr wrote: *Hello,* *I'm using cassandra 0.7.0 rc1, a single node configuration, replication factor 1,

Re: Too many open files Exception + java.lang.ArithmeticException: / by zero

2010-12-16 Thread Jake Luciani
(SSTable.java:233) * Thanks. * * 2010/12/15 Jake Luciani jak...@gmail.com http://www.riptano.com/docs/0.6/troubleshooting/index#java-reports-an-error-saying-there-are-too-many-open-files On Wed, Dec 15, 2010 at 11:13 AM, Amin Sakka, Novapost amin.sa...@novapost.fr wrote: *Hello,* *I'm

Re: Bootstrapping taking long

2011-01-04 Thread Jake Luciani
In 0.6, locate the node doing anti-compaction and look in the streams subdirectory in the keyspace data dir to monitor the anti-compaction progress (it puts new SSTables for bootstrapping node in there) On Tue, Jan 4, 2011 at 8:01 AM, Ran Tavory ran...@gmail.com wrote: Running nodetool

Re: anyone using Cassandra as an analytics/data warehouse?

2011-01-04 Thread Jake Luciani
Some relevant information here: https://www.cloudkick.com/blog/2010/mar/02/4_months_with_cassandra/ On Tue, Jan 4, 2011 at 10:09 PM, Dave Viner davevi...@gmail.com wrote: Hi Peter, Thanks. These are great ideas. One comment tho. I'm actually not as worried about the logging into the

Re: Bootstrapping taking long

2011-01-05 Thread Jake Luciani
...@gmail.com wrote: I haven't tried repair. Should I? On Jan 5, 2011 3:48 PM, Jake Luciani jak...@gmail.com wrote: Have you tried not bootstrapping but setting the token and manually calling repair? On Wed, Jan 5, 2011 at 7:07 AM, Ran Tavory ran...@gmail.com wrote: My conclusion is lame: I

Re: Welcome committer Jake Luciani

2011-01-13 Thread Jake Luciani
Thanks Jonathan and Cassandra PMC! Happy to help Cassandra take over the world! -Jake On Thu, Jan 13, 2011 at 1:41 PM, Jonathan Ellis jbel...@gmail.com wrote: The Cassandra PMC has voted to add Jake as a committer. (Jake is also a committer on Thrift.) Welcome, Jake, and thanks for the

Re: GeoIndexing in Cassandra, Open Sourced?

2011-01-21 Thread Jake Luciani
One possible open source approach would be to use the Solr 1.4 spatial plugin[1] along with Solandra[2] What kind of spatial searches are you looking for? basic bounding box/radius? [1] https://github.com/outoftime/solr-spatial-light [2] https://github.com/tjake/lucandra On Fri, Jan 21, 2011

Re: Ring up but read fails ...

2011-01-23 Thread Jake Luciani
I've seen this when you leave a socket open and idle for a long time. The connection times out. On Jan 23, 2011, at 8:42 AM, ruslan usifov ruslan.usi...@gmail.com wrote: 2011/1/23 cbert...@libero.it cbert...@libero.it ERROR UserNameCmd:38 - java.net.SocketException: Broken pipe

Re: R: Re: Ring up but read fails ...

2011-01-23 Thread Jake Luciani
Reconnect and try again? On Jan 23, 2011, at 10:47 AM, cbert...@libero.it cbert...@libero.it wrote: I've seen this when you leave a socket open and idle for a long time. The connection times out. It could be the situation ... any idea about the solution? I create the pool once at

Re: Lucandra Limitations

2011-01-27 Thread Jake Luciani
Yes, but that's also the lucene limit http://lucene.apache.org/java/3_0_1/fileformats.html#Limitations Lucene uses a Java int to refer to document numbers, and the index file format uses an Int32 On Thu, Jan 27, 2011 at 1:40 PM, David G. Boney dbon...@semanticartifacts.com wrote: I was

Re: Lucandra Limitations

2011-01-27 Thread Jake Luciani
? Lucene supports the ability to create multiple IndexSearchers and stick them in a MultiSearcher. Is this the right way to view the problem? - Sincerely, David G. Boney dbon...@semanticartifacts.com http://www.semanticartifacts.com On Jan 27, 2011, at 12:45 PM, Jake Luciani

Re: Secondary Index information

2011-01-28 Thread Jake Luciani
http://www.datastax.com/blog/whats-new-cassandra-07-secondary-indexes On Fri, Jan 28, 2011 at 7:15 AM, Sasha Dolgy sasha.do...@gmail.com wrote: Hi there, Where can I find information regarding secondary indexes? Spent the past 2 days looking for some good details.

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Jake Luciani
Are you using a row cache? if so what is it set too? in general it should not be a percentage. On Thu, Jan 27, 2011 at 12:23 PM, Chris Burroughs chris.burrou...@gmail.com wrote: We have a 6 node Cassandra 0.6.8 cluster running on boxes with 4 GB of RAM. Over the course of several weeks

Re: [0.7.1] more exceptions: Illegal mode

2011-02-07 Thread Jake Luciani
This sounds like a possible bug since the BRAF was re-written in 0.7.1. Could you open a ticket? On Mon, Feb 7, 2011 at 10:32 AM, Patrik Modesto patrik.mode...@gmail.comwrote: On Mon, Feb 7, 2011 at 15:42, Thibaut Britz thibaut.br...@trendiction.com wrote: I think this is related to a faulty

Re: [RELEASE] 0.7.1

2011-02-14 Thread Jake Luciani
It can take some time for the files to propagate to the mirrors. It's Eventually Consistent though :) On Mon, Feb 14, 2011 at 4:20 PM, Frank LoVecchio fr...@isidorey.com wrote: Ah, I meant quite a few of the mirror links keep showing up as links to gossip sites and whatnot. On Feb 14, 2011

Re: Possible EOFException regression in 0.7.1

2011-02-15 Thread Jake Luciani
Have you made any changes to the cassandra config? 2011/2/15 Jonas Borgström jonas.borgst...@trioptima.com Hi all, While testing the new 0.7.1 release I got the following exception: ERROR [ReadStage:11] 2011-02-15 16:39:18,105 DebuggableThreadPoolExecutor.java (line 103) Error in

Re: [RELEASE] 0.7.2

2011-02-17 Thread Jake Luciani
https://issues.apache.org/jira/browse/CASSANDRA-2174 Yes, just clear the cache On Thu, Feb 17, 2011 at 1:06 PM, Damick, Jeffrey jeffrey.dam...@neustar.biz wrote: So after upgrade to 0.7.2, I see this on startup – should I just blow away these cache files? WARN [main] 2011-02-17

Re: 0.7.2 slow memtables flushing

2011-02-19 Thread Jake Luciani
Are you running with JNA enabled? If so could you try disabling it? On Sat, Feb 19, 2011 at 11:32 AM, Ivan Georgiev yngw...@bk.ru wrote: On 19.2.2011 г. 16:43 ч., Jonathan Ellis wrote: Flush code didn't change between 0.7.0 and 0.7.2. There must be some other variable here. Memory pressure

Re: Ditching Cassandra

2011-03-29 Thread Jake Luciani
Hi Gregori, What language *were* you using to interact with cassandra? were you unable to find a wrapper API that you found We have discussed adopting the best of client api's in cassandra but we decided it's better for the community to naturally develop them. I think this has also motivated

Re: Endless minor compactions after heavy inserts

2011-04-01 Thread Jake Luciani
nodetool compactionstats On Fri, Apr 1, 2011 at 12:14 PM, mcasandra mohitanch...@gmail.com wrote: Is there a way to monitor the compactions using nodetools? I don't see it in tpstats. -- View this message in context:

Re: Lucandra or some way to query

2010-04-14 Thread Jake Luciani
Hi, What doesn't work with lucandra exactly? Feel free to msg me. -Jake On Wed, Apr 14, 2010 at 9:30 PM, Jesus Ibanez jesusiba...@gmail.com wrote: I will explore Lucandra a little more and if I can't get it to work today, I will go for Option 2. Using SQL will not be efficient in the

Re: Lucandra or some way to query

2010-04-14 Thread Jake Luciani
Lucandra spreads the data randomly by index + field combination so you do get some distribution for free. Otherwise you can use nodetool loadbalance to alter the token ring to alleviate hotspots. On Thu, Apr 15, 2010 at 2:04 AM, HubertChang hui...@gmail.com wrote: If you worked with Lucandra

Re: BinaryMemtable and collisions

2010-05-07 Thread Jake Luciani
by the work of Jake Luciani in Lucandra. I've successfully loaded nearly a million documents over a 3-node cluster, and initial query tests look promising. The problem is that our target use case has hundreds of millions of documents (each document is very small however). Loading time

Re: BinaryMemtable and collisions

2010-05-07 Thread Jake Luciani
at 12:09 AM, Jake Luciani jak...@gmail.com wrote: Any reason why you aren't using Lucandra directly? On Fri, May 7, 2010 at 8:21 PM, Tobias Jungen tobias.jun...@gmail.comwrote: Greetings, Started getting my feet wet with Cassandra in earnest this week. I'm building a custom inverted index

Re: Pooling Question

2010-05-20 Thread Jake Luciani
Look in /contrib it's already there. On May 20, 2010, at 6:23 PM, Mark Robson mar...@gmail.com wrote: On 20 May 2010 23:16, Ryan Daum r...@thimbleware.com wrote: I personally would love to see Cassandra add the concept of a read- only 'proxy' node which acts like the embedded ready only

Re: Thoughts on adding complex queries to Cassandra

2010-05-27 Thread Jake Luciani
I've secretly started working on this but nothing to show yet :( I'm calling it SliceDiceReduce or SliceReduce. The plan is to use the js thrift bindings I've added for 0.3 release of thrift (out very soon?) This will allow the supplied js to access the results like any other thrift

Re: Continuously increasing RAM usage

2010-06-02 Thread Jake Luciani
I've started seeing this issue as well. Running 0.6.2. One interesting thing I happened upon, I explicitly called the GC via jconsole and the heap dropped completely fixing the issue. When you explicitly call System.gc() it does a full sweep. I'm wondering if this issue is to do with the GC

Re: Lucandra issues

2010-06-18 Thread Jake Luciani
Hi Maxim, Lucandra doesn't support numeric queries quite yet. A workaround would be to load your numbers and convert them to strings. I'll eventually add support for this. Please feel free to help out if you can :) Jake On Jun 17, 2010, at 1:16 PM, Maxim Kramarenko

Re: Coke Products at Digg?

2010-07-07 Thread Jake Luciani
Coke sucks! Only drink it if you want to work hard for 20 minutes then crash. I started a new cola that's already way better than Coke and it will solve all your problems. I'm finalizing my results but so far I only need one drink per WEEK! On Jul 7, 2010, at 12:10 PM, Mike Malone

Re: indexing methods

2010-09-03 Thread Jake Luciani
Hi Courtney, You can take a look at lucandra http://github.com/tjake/Lucandra which uses the lucene api to maintain a inverted index in cassandra. There are a couple articles and presentations in the readme that give more info on how this is done. -Jake On Fri, Sep 3, 2010 at 6:26 AM, Courtney

Re: Not overwriting values

2010-09-21 Thread Jake Luciani
In theory you could use timestamps that go back in time for this CF. That way the first write will persist over future writes. On Sep 21, 2010, at 6:58 AM, Christian Decker decker.christ...@gmail.com wrote: Hi all, I have a rather strange problem I'd like to address. As I understand it

Re: Data model for financial time series

2013-06-07 Thread Jake Luciani
We have built a similar system, you can ready about our data model in CQL3 here: http://www.slideshare.net/carlyeks/nyc-big-tech-day-2013 We are going to be presenting a similar talk next week at the cassandra summit. On Fri, Jun 7, 2013 at 12:34 PM, Davide Anastasia

Re: Leveled Compaction, number of SStables growing.

2013-07-09 Thread Jake Luciani
We run with 128mb some run with 256mb. Leveled compaction creates fixed sized sstables by design so this is the only way to lower the file count. On Tue, Jul 9, 2013 at 2:56 PM, PARASHAR, BHASKARJYA JAY bp1...@att.comwrote: Hi, ** ** We recently switched from size tired compaction to

Re: Why does cassandra PoolingSegmentedFile recycle the RandomAccessReader?

2013-07-15 Thread Jake Luciani
Take a look at https://issues.apache.org/jira/browse/CASSANDRA-5661 On Mon, Jul 15, 2013 at 4:18 AM, sulong sulong1...@gmail.com wrote: Thanks for your help. Yes, I will try to increase the sstable size. I hope it can save me. 9000 SSTableReader x 10 RandomAccessReader x 64Kb = 5.6G memory.

Re: HsHa

2013-08-14 Thread Jake Luciani
This is technically a Thrift message not Cassandra, it happens when a client hangs up without closing the socket. You should be able to silence it by raising the class specific log level see log4j-server.properties as an example On Wed, Aug 14, 2013 at 9:59 AM, Alain RODRIGUEZ arodr...@gmail.com

Re: high pending compactions

2014-06-08 Thread Jake Luciani
23 On Sunday, June 8, 2014, S C as...@outlook.com wrote: I am using Cassandra 1.1 (sorry bit old) and I am seeing high pending compaction count. pending tasks: 67 while active compaction tasks are not more than 5. I have a 24CPU machine. Shouldn't I be seeing more compactions? Is this a

Re: Which way to Cassandraville?

2014-07-22 Thread Jake Luciani
Checkout datastax devcenter which is a GUI datamodelling tool for cql3 http://www.datastax.com/what-we-offer/products-services/devcenter On Sun, Jul 20, 2014 at 7:17 PM, jcllings jclli...@gmail.com wrote: So I'm a Java application developer and I'm trying to find entry points for learning to

  1   2   >