Importtable command not working from Java API

2016-12-15 Thread vaibhav thapliyal
Hi, I am trying to import an Accumulo table from an HDFS directory. However, when I use the importtable function in the Java API, I get the following error: org.apache.accumulo.core.client.AccumuloException: Table import directory /backup/Data_records does not exist! I have verified that this

Detecting database changes

2016-11-22 Thread vaibhav thapliyal
Hi, I have a use case where I need to send out notifications based on changes in a table. Are there any listeners that can be used to listen for table change events in Accumulo? How do I go about this use case? Thanks Vaibhav

Re: Data Replication

2016-10-16 Thread vaibhav thapliyal
I think neither of these would contribute much to load balancing. HDFS replication is mostly a safeguard against Single Points of failure in a Hadoop cluster. However, Data center replication would ensure the availability of an Accumulo instance. On 16 October 2016 at 21:02, Yamini Joshi

Re: Indexing Column Values in Accumulo

2016-10-12 Thread vaibhav thapliyal
of values(cqs)). In other words, the filter will have to access all the cf|cqs, right? Best regards, Yamini Joshi On Mon, Oct 10, 2016 at 5:09 AM, vaibhav thapliyal <vaibhav.thapliyal...@gmail.com> wrote: Creating an Inverted Index could serve

Re: Indexing Column Values in Accumulo

2016-10-10 Thread vaibhav thapliyal
Creating an Inverted Index could serve your use case. You can store both the column family and the column qualifier in the row of the index table, separated by a delimiter, for example cf|cq. You can then perform queries on just the row ID to get a low query time. On 29 September 2016 at 11:03, Josh Elser
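
The index layout suggested in this message (column family and column qualifier joined as cf|cq in the row of an index table) can be sketched roughly as follows. This is only an illustration under stated assumptions: it uses a plain in-memory TreeMap to stand in for a sorted index table and does not call the Accumulo API. The class name InvertedIndexSketch, the helpers indexRow, put and lookup, and the sample row IDs are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.TreeMap;

// Illustrative only: an in-memory stand-in for an index table, not Accumulo code.
public class InvertedIndexSketch {
    // Delimiter between column family and column qualifier, as in "cf|cq".
    public static final String DELIM = "|";

    // Index "table": index row (cf|cq) -> row IDs of the main table.
    // A TreeMap keeps keys sorted, loosely mirroring a sorted key-value store.
    private final TreeMap<String, List<String>> index = new TreeMap<>();

    // Build the row of the index table from the column family and qualifier.
    public static String indexRow(String cf, String cq) {
        return cf + DELIM + cq;
    }

    // Record that the main-table row rowId contains the column cf:cq.
    public void put(String rowId, String cf, String cq) {
        index.computeIfAbsent(indexRow(cf, cq), k -> new ArrayList<>()).add(rowId);
    }

    // Look up main-table row IDs by querying only the index row.
    public List<String> lookup(String cf, String cq) {
        return index.getOrDefault(indexRow(cf, cq), Collections.emptyList());
    }

    public static void main(String[] args) {
        InvertedIndexSketch idx = new InvertedIndexSketch();
        idx.put("row1", "attr", "red");
        idx.put("row2", "attr", "red");
        idx.put("row3", "attr", "blue");
        // A single keyed lookup replaces a filter over every cf|cq in the main table.
        System.out.println(idx.lookup("attr", "red"));
    }
}
```

In a real deployment the index would be a second Accumulo table written alongside the main table, and the lookup would be a scan restricted to the single index row.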

Get accumulo monitoring stats via java code

2016-09-07 Thread vaibhav thapliyal
Dear all, I'm trying to visualize Accumulo metrics on a Kibana dashboard so that analysis can be performed on them. For that I need to index the individual metrics in an Elasticsearch index. Can you please tell me how I can get these metrics through Java code or the API? If not through the public api

Re: Different versions of accumulo libraries on client and server

2015-10-17 Thread vaibhav thapliyal
Hi Denis, In my experience working with Accumulo, ideally you should use the client libraries corresponding to your server version. But I have also used the 1.7.0 libraries on my 1.6.3 Accumulo cluster without ever facing a problem. Vaibhav On 17-Oct-2015 3:17 pm, "Denis"

Question regarding java being the choice for accumulo

2015-07-29 Thread vaibhav thapliyal
Hello everyone, I was wondering why the developers chose Java for writing Accumulo. What advantage does it have over any other language, say C++ (in which another popular NoSQL database, MongoDB, is written), in the context of Accumulo? Thanks Vaibhav

Re: Question regarding java being the choice for accumulo

2015-07-29 Thread vaibhav thapliyal
that do not need to be high performance in Java, like tablet assignment and management of cluster state is nice. On Wed, Jul 29, 2015 at 1:51 PM, vaibhav thapliyal vaibhav.thapliyal...@gmail.com wrote: Hello everyone, I was wondering why did the developers chose java for writing accumulo

Questions on intersecting iterator and partition ids

2015-07-13 Thread vaibhav thapliyal
Dear all, I have the following questions on the intersecting iterator and partition IDs used in document-sharded indexing: 1. Can we run a boolean AND query using the current intersecting iterator on a given range of IDs? These IDs are a subset of the total IDs stored in the column qualifier field

Connection pooling in accumulo

2015-06-24 Thread vaibhav thapliyal
Hi everyone, I wanted to ask whether Accumulo supports connection pooling. If so, is there something in the Java API that can be used to take advantage of it? Thanks Vaibhav

Re: BatchScanner taking too much time to scan rows

2015-05-14 Thread vaibhav thapliyal
queries go to one tablet, and 90% go to the other. -Eric On Wed, May 13, 2015 at 1:56 PM, vaibhav thapliyal vaibhav.thapliyal...@gmail.com wrote: Thank you Eric. I will surely do the same. Should uneven distribution across the tablets affect querying in accumulo? If this case

Re: BatchScanner taking too much time to scan rows

2015-05-13 Thread vaibhav thapliyal
. Your best approach is probably to create an index table on whatever you are originally trying to query (assuming those 1 ids came from some other query). Thanks, Emilio On 05/13/2015 09:14 AM, vaibhav thapliyal wrote: The rf files per tablet vary between 2 to 5 per tablet. The entries

Re: BatchScanner taking too much time to scan rows

2015-05-13 Thread vaibhav thapliyal
look-ups. -Eric On Wed, May 13, 2015 at 11:31 AM, vaibhav thapliyal vaibhav.thapliyal...@gmail.com wrote: Thank you Eric. One thing I would like to know. Does pre-splitting the data play a part in querying accumulo? Because I managed to somewhat decrease the querying time. I did

BatchScanner taking too much time to scan rows

2015-05-12 Thread vaibhav thapliyal
Hi, I am using BatchScanner to scan rows from an Accumulo table. The table has around 187m entries and I am using a 3-node cluster running Accumulo 1.6.1. I have passed 1 ids which are stored as row id in my table as a list in the setRanges() method. This whole process takes around 50

Re: BatchScanner taking too much time to scan rows

2015-05-12 Thread vaibhav thapliyal
threads if your hardware can support them. The worst that'll happen is the client program crashes during testing. If that happens, cut the number of threads in half. And so on. On Tue, May 12, 2015 at 1:58 PM, vaibhav thapliyal vaibhav.thapliyal...@gmail.com wrote: I have 194 tablets. Currently I

Re: BatchScanner taking too much time to scan rows

2015-05-12 Thread vaibhav thapliyal
tserver per thread. Each rpc call will include all tablets and associated ranges for that tserver. Keith On Tue, May 12, 2015 at 1:39 PM, vaibhav thapliyal vaibhav.thapliyal...@gmail.com wrote: Hi, I am using BatchScanner to scan rows from a accumulo table. The table has around 187m

Re: Videos of Accumulo summit talks

2015-05-03 Thread vaibhav thapliyal
Thank you josh. On 04-May-2015 7:05 am, Josh Elser josh.el...@gmail.com wrote: Dan replied back to me off list. He guessed ~2weeks. Josh Elser wrote: Dan -- do you have an ETA on videos of the talks for summit? On May 2, 2015 6:00 PM, vaibhav thapliyal vaibhav.thapliyal...@gmail.com

Videos of Accumulo summit talks

2015-05-02 Thread vaibhav thapliyal
Dear all, I would like to know whether the videos of the Accumulo summit talks will be released on YouTube. If so, can anybody tell me in how many days? It would be awesome to watch all the talks from the summit. Thanks and Regards Vaibhav

Re: Serialization error

2015-04-28 Thread vaibhav thapliyal
Hi josh, We solved it using the kryo serializer library to serialise the key class. Thanks vaibhav On 28-Apr-2015 11:14 pm, Josh Elser josh.el...@gmail.com wrote: Hi Madhvi, Thanks for posting this. I'm not super familiar, but my hunch is that Spark requires objects that it works with to

Re: Custom Iterator output

2015-04-27 Thread vaibhav thapliyal
emitValue; } @Override public boolean hasTop() { return emitKey != null; } @Override public void next() throws IOException { emitKey = null; emitValue = null; } } Regards, Dylan Hutchison On Fri, Apr 17, 2015 at 8:05 PM, vaibhav thapliyal vaibhav.thapliyal

Approach to hold the output of an iterator in memory to do further operations

2015-04-27 Thread vaibhav thapliyal
Hello everyone. I am trying to carry out max and min operations using Accumulo. But since Accumulo iterators only operate on the entries that are locally hosted, I get the local max and local min instead of a global max and min. To get this global max and min, I have to

Re: Custom Iterator output

2015-04-17 Thread vaibhav thapliyal
Hi, I also have a question that might be similar to Shweta's. What I want to do is process the key-value pairs that I get from the getTopKey() and getTopValue() methods and output the resulting value. Currently I was writing these values to tables from inside the iterators, but I read in the new

Re: Features that should he part of a rest client for accumulo

2015-04-14 Thread vaibhav thapliyal
: Consistent subdomain names should be used for your client developer portal (http://developer.soccer.restapi.org) I hope that helps. Kurt On 4/13/15 3:33 PM, vaibhav thapliyal wrote: Dear all, I am

Re: Features that should he part of a rest client for accumulo

2015-04-13 Thread vaibhav thapliyal
those problems up front might help you avoid bigger problems down the road. In general, start small and get a nice, tested codebase. That will help in the long run. Keep us informed! This sounds great :D vaibhav thapliyal wrote: Dear all, I am trying to build a Rest-api for accumulo. I would