RE: Losing tservers - Unusually high Last Contact times

2014-05-19 Thread dlmarion
You are hitting the zookeeper timeout, default 30s I believe. You said you are not oversubscribed for memory, but what about CPU? Are you running YARN processes on the same nodes as the tablet servers? Is the tablet server being pushed into swap or starved of CPU? -Original Message- From:

Re: Unable to load Iterator with setscaniter and setshelliter

2014-06-15 Thread dlmarion
What does your classpath settings look like in accumulo-site.xml. I recently made some fixes in 1.6.1-Snapshot where the context classloader was not being used in all cases. I dont think this case was affected though. Sent via the Samsung GALAXY S®4, an AT&T 4G LTE smartphone Original

Re: Unable to load Iterator with setscaniter and setshelliter

2014-06-15 Thread dlmarion
HOME/[^.].*.jar, $HADOOP_HOME/lib/[^.].*.jar, Classpaths that accumulo checks for updates and class files. When using the Security Manager, please remove the ".../target/classes/" values. On Sun, Jun 15, 2014 at 10:49 AM, dlmarion wrote: What does

Re: accumulo 1.6 and HDFS non-HA conversion to HDFS HA

2014-08-08 Thread dlmarion
I believe the problem that you are running into is that because dfs.default.uri was not specified, then fs.defaultFS was used to write entries to the accumulo.root and accumulo.metadata tables. Suggest doing the following: Update to the latest version of Accumulo 1.6.1-SNAPSHOT set instance.df

Re: (U) I do I tell Accumulo where the jars for my custom formatters and balancers are for a specific table?

2014-08-25 Thread dlmarion
Jeff, Which version of Accumulo are you using? Most of the classpath settings are in the accumulo-site.xml file. - Original Message - From: "Jeff Kubina" To: user@accumulo.apache.org Sent: Monday, August 25, 2014 3:17:17 PM Subject: Re: (U) I do I tell Accumulo where the jars for

Re: (U) I do I tell Accumulo where the jars for my custom formatters and balancers are for a specific table?

2014-08-25 Thread dlmarion
We fixed some issues recently with the classloader in 1.6.1, make sure you are using an up-to-date snapshot version. The classloader was changed in version 1.5 and now supports hosting jars from various locations, including HDFS. Additionally, it supports creating a table level classloader that

RE: Moving HDFS name node to another host in accumulo 1.6

2015-01-16 Thread dlmarion
What Accumulo version? What Hadoop version? From: Calvin Feder [mailto:calvin.fe...@argyledata.com] Sent: Friday, January 16, 2015 7:29 PM To: user@accumulo.apache.org Subject: Moving HDFS name node to another host in accumulo 1.6 We need to move the HDFS name node of our cluster to a new

RE: Moving HDFS name node to another host in accumulo 1.6

2015-01-16 Thread dlmarion
In 1.6, accumulo.metadata entries are absolute URLs. You can change the existing URLs using the instance.volume.replacments[1] parameter. [1] http://accumulo.apache.org/1.6/accumulo_user_manual.html#_instance_volumes_replacements > -Original Message- > From: webmas...@webmaster.ms [mai

RE: Installing custom authorizor

2015-02-18 Thread dlmarion
Did you restart Accumulo after you copied the jar? The general.classpaths locations are not monitored and loaded dynamically. Original message From: Srikanth Viswanathan Date:02/18/2015 8:42 PM (GMT-05:00) To: user@accumulo.apache.org Cc: Subject: Installing custom autho

Re: Custom Iterator output

2015-04-17 Thread dlmarion
via the getTopKey() and getTopValue() methods. [1] should be a simple example. [1] https://git-wip-us.apache.org/repos/asf?p=accumulo.git;a=blob;f=core/src/main/java/org/apache/accumulo/core/iterators/user/GrepIterator.java;h=043a729a778fc34d2ee87a0227056ffac81b7fe7;hb=refs/heads/master -

RE: TSDB on Accumulo row key

2015-07-20 Thread dlmarion
https://github.com/ericnewton/accumulo-opentsdb From: Ranjan Sen [mailto:ranjan_...@hotmail.com] Sent: Monday, July 20, 2015 6:25 PM To: user@accumulo.apache.org Subject: TSDB on Accumulo row key Hi All, Is there something like TSDB (Time series database) on Accumulo? Thanks Ranj

Re: Question regarding java being the choice for accumulo

2015-07-29 Thread dlmarion
Except for any server side iterator logic, table balancers, etc... - Original Message - From: "Jonathan Parise" To: user@accumulo.apache.org Sent: Wednesday, July 29, 2015 3:02:27 PM Subject: RE: Question regarding java being the choice for accumulo I’m not sure what the motivat

Re: Entry-based TableBalancer

2015-07-29 Thread dlmarion
Hotspotting was the first thing that came to my mind with the proposed balancer. The fservers don't keep all the K/V in memory. You are balancing query and live ingest across your resources. Original message From: Eric Newton Date: 07/29/2015 8:46 PM (GMT-05:00) T

RE: How to control Minor Compaction by programming

2015-07-30 Thread dlmarion
It sounds like you want to try and not minor compact during your ingest of your data. Is that correct? From: William Slacum [mailto:wsla...@gmail.com] Sent: Thursday, July 30, 2015 8:10 PM To: user@accumulo.apache.org Subject: Re: How to control Minor Compaction by programming See http:/

Re: Accumulo GC and Hadoop trash settings

2015-08-17 Thread dlmarion
It's not temporary files, it's any file that has been compacted away. If you keep files around longer than { dfs.namenode.checkpoint.period}, then you have a chance to recover in case your most recent checkpoint is corrupt. - Original Message - From: "James Hughes" To: user@accumulo

RE: Accumulo GC and Hadoop trash settings

2015-08-17 Thread dlmarion
you determine the MTTR and then schedule your backups accordingly; a backup in case you are not able to recover your database using the techniques in the current documentation. [1] https://github.com/dlmarion/raccovery From: James Hughes [mailto:jn...@virginia.edu] Sent: Monday, August

RE: Question about configuring the linux niceness of tablet servers?

2015-08-17 Thread dlmarion
Check out the implementation of https://issues.apache.org/jira/browse/ACCUMULO-3793. You could do something similar and start your process with 'nice' instead of 'numactl'. > -Original Message- > From: Christopher [mailto:ctubb...@apache.org] > Sent: Monday, August 17, 2015 7:15 PM > To:

Re: thrift versions in accumulo

2015-08-19 Thread dlmarion
+1 to Netty. I have used it and like it, but I think you will have to write all of the code yourself. Nifty (Facebook) could be an option to use Thrift and Netty together. Finagle (Twitter) would be another alternative. - Original Message - From: "Christopher" To: "Accumulo User Lis

RE: Accumulo Monitor NullPointerException

2015-08-21 Thread dlmarion
Anything in the master log? Original message From: Scott Date: 08/21/2015 6:15 PM (GMT-05:00) To: user@accumulo.apache.org Subject: Accumulo Monitor NullPointerException Hello,   We have Accumulo 1.6.3 set up on the HortonWorks install with a basic cluster of on

RE: [Accumulo Contrib Proposal] Graphulo: Server-side Matrix Math library

2015-08-28 Thread dlmarion
Dylan,   I am a little confused about whether you want to place this in the contrib area or whether you want to create a sub-project as both are mentioned in your proposal. Also, if you intend for this to be a sub-project, have you looked at the incubator process? From what I understand gi

Re: [Accumulo Contrib Proposal] Graphulo: Server-side Matrix Math library

2015-08-28 Thread dlmarion
and purpose, and there is large opportunity for synergy in that Graphulo development may help Accumulo development and vice versa. We're in that happy middle spot where a sub-project makes sense. That said, this is a community decision, and so I'm open to other opinions. Regards, Dy

Re: Accumulo: "BigTable" vs. "Document Model"

2015-09-04 Thread dlmarion
Both will work, but I think the answer depends on the amount of data that you will be querying over and your query latency requirements. I would include the wikisearch[1] storage scheme into your list as well (k/v table + indices). Then, personally, I would rate them in the following order as d

RE: imbalance in number of zookeeper clients

2015-09-10 Thread dlmarion
Hey Jeff,  Take a look at [1] and see if the zookeeper balance issue mentioned is applicable. Dave [1] https://accumulo.apache.org/release_notes/1.6.2.html Original message From: Jeff Turner Date: 09/10/2015 7:42 PM (GMT-05:00) To: user@accumulo.apache.org Subject

RE: Error BAD_AUTHORIZATIONS for user root

2015-09-13 Thread dlmarion
It’s likely that you created a table, inserted data with column visibilities, but never gave the user the authorization tokens to see the data. Try setting the authorization tokens for the user with the setauths command. Since you are using the ‘root’ user, it would be something like: setauths –

Re: Presplitting tables for the YCSB workloads

2015-09-18 Thread dlmarion
I don't have a script for you, but if you need to create one you could use the script command in the shell to do something similar to the hbase script. Some examples are in the comments in jira[1]. If you can figure out how you want the table split, and it can be scripted, I might have time thi

[ADVISORY] Possible data loss during HDFS decommissioning

2015-09-23 Thread dlmarion
BLUF: There exists the possibility of data loss when performing DataNode decommissioning with Accumulo running. This note applies to installations of Accumulo 1.5.0+ and Hadoop 2.5.0+. DETAILS: During DataNode decommissioning it is possible for the NameNode to report stale block locations (HD

Re: [ADVISORY] Possible data loss during HDFS decommissioning

2015-09-23 Thread dlmarion
Known issue in the release notes on the web page? We would have to update every version though. Seems like we need a known issues document that lists issues in dependencies that transcend Accumulo versions. - Original Message - From: "Josh Elser" To: d...@accumulo.apache.org Cc: use

Re: How does Accumulo process a r-files for bulk ingesting?

2015-10-07 Thread dlmarion
I believe the RFile is assigned to each tablet that contain keys in the RFiles key range. So, 1 RFile is assigned to one or more tablets. The bulk import RFile is removed after it has been compacted by all assigned tablets. - Original Message - From: "Jeff Kubina" To: user@accumulo.a

Re: Tserver's strange state.

2015-10-22 Thread dlmarion
Are you trying to shut the whole system down, or just a couple of tablet servers?Is your application reading and writing from/to Accumulo during this time? Original message From: Denis Date: 10/22/2015 6:03 PM (GMT-05:00) To: user@accumulo.apache.org Subject: Re:

RE: Tserver's strange state.

2015-10-22 Thread dlmarion
ed. > Yes, it is a production system with a lot of reads and writes. > > On 10/22/15, dlmarion wrote: > > > > > > Are you trying to shut the whole system down, or just a couple of > > tablet servers?Is your application reading and writing from/to > > Accu

RE: Accumulo Iterator painful development because TS don't pick up changes to Jars

2015-10-29 Thread dlmarion
Can you provide the relevant classpath sections of your accumulo-site.xml file? > -Original Message- > From: Rob Povey [mailto:r...@maana.io] > Sent: Thursday, October 29, 2015 8:01 PM > To: user@accumulo.apache.org > Subject: Accumulo Iterator painful development because TS don't pick up

RE: Accumulo Iterator painful development because TS don't pick up changes to Jars

2015-10-29 Thread dlmarion
So, without seeing your configuration, I would suggest trying something before upgrading to 1.7. In 1.5 we changed the classloader so that it could load from different locations. At the same time, we added the concept of classloader contexts which are basically names for locations for jars. Tabl

Re: Accumulo Iterator painful development because TS don't pick up changes to Jars

2015-10-30 Thread dlmarion
Try replacing the vfs jar in lib with a 2.1-SNAPSHOT. Several issues have been fixed, but one of them is that if more than one monitored resource changed then it would miss some of them. - Original Message - From: "Rob Povey" To: user@accumulo.apache.org Sent: Friday, October 30, 2

Re: Accumulo Iterator painful development because TS don't pick up changes to Jars

2015-10-30 Thread dlmarion
Also, turn the logging on the tservers up to DEBUG for org.apache.accumulo.start.classloader.*. You should see a line in the log that starts with "monitoring " - Original Message - From: dlmar...@comcast.net To: user@accumulo.apache.org Sent: Friday, October 30, 2015 12:22:53 PM Su

Re: loading iterator jars from file:// or http://

2015-11-24 Thread dlmarion
Suggest replacing the VFS 2.0 jar in the Accumulo lib directory with a VFS 2.1-SNAPSHOT jar from [1]. There are some bugs that have been fixed, but VFS 2.1 has not been released yet. I did this and have not had many issues loading from HDFS since. Loading jars from file:/// should work, I believ

Re: Class path for shell commands

2016-03-07 Thread dlmarion
try: ./accumulo -add shell - Original Message - From: "Christopher" To: user@accumulo.apache.org, "Sravankumar Reddy Javaji" Sent: Monday, March 7, 2016 4:28:03 PM Subject: Re: Class path for shell commands Are you using a 1.6.x or earlier version? I think we fixed a bug in 1.7

Re: Class path for shell commands

2016-03-07 Thread dlmarion
Another option is to drop your jar into $ACCUMULO_HOME/lib/ext on the node where you will be running the shell. - Original Message - From: "Sravankumar Reddy Javaji (BLOOMBERG/ 731 LEX)" To: user@accumulo.apache.org Sent: Monday, March 7, 2016 4:31:49 PM Subject: Re: Class path for

RE: Optimize Accumulo scan speed

2016-04-10 Thread dlmarion
Some other thoughts in addition to the sharding: 1. Are your tablets spread out evenly across your tablet servers? 2. How many threads are you using in your batch scanner? 3. What is the table.scan.max.memory setting? From: Andrew Hulbert [mailto:ahulb...@ccri.com] Sent: Sunday, April 1

[ANNOUNCE] Timely - Secure Time Series Database

2016-06-22 Thread dlmarion
Timely is a time series database application that provides secure access to time series data. It is designed to be used with Apache Accumulo for persistence and Grafana for visualization. Timely is located at https://github.com/NationalSecurityAgency/timely .

Re: java.lang.NoClassDefFoundError with fields of custom Filter

2016-07-07 Thread dlmarion
+1 good catch Josh - Original Message - From: "Josh Elser" To: user@accumulo.apache.org Sent: Thursday, July 7, 2016 2:12:42 PM Subject: Re: java.lang.NoClassDefFoundError with fields of custom Filter Beware using the HDFS classloader in any Accumulo release that does not contain

Re: Configuring batch writers

2016-07-15 Thread dlmarion
The batch writer has several knobs (latency time, memory buffer, etc) that you can tune to meet your requirements. The values for those settings will depend on a lot of variables, to include: - number of tablet servers - size of mutations - desired latency - memory buffer - configuration se

Re: Accumulo Seek performance

2016-08-24 Thread dlmarion
Doesn't this use the 6 batch scanners serially? - Original Message - From: "Sven Hodapp" To: "user" Sent: Wednesday, August 24, 2016 11:56:14 AM Subject: Re: Accumulo Seek performance Hi Josh, thanks for your reply! I've tested your suggestion with a implementation like that:

Re: Accumulo Seek performance

2016-08-25 Thread dlmarion
Calling BatchScanner.iterator() is what starts the work on the server side. You should do this first for all 6 batch scanners, then iterate over all of them in parallel. - Original Message - From: "Sven Hodapp" To: "user" Sent: Thursday, August 25, 2016 4:53:41 AM Subject: Re: Ac

Re: Accumulo Seek performance

2016-08-25 Thread dlmarion
But does toList exhaust the first iterator() before going to the next? - Dave - Original Message - From: "Sven Hodapp" To: "user" Sent: Thursday, August 25, 2016 9:42:00 AM Subject: Re: Accumulo Seek performance Hi dlmarion, toList should also call ite

Re: Get accumulo monitoring stats via java code

2016-09-07 Thread dlmarion
The internal metrics are exposed via JMX. You could connect to each server and pull the metrics. Or, you could use the Hadoop Metrics2 framework and push the metrics to some listening service. If you choose the latter route, then you can configure Hadoop also and get its metrics. An example of c

Re: 1 of 20 TServers unresponsive/slow, all writes fail?

2016-09-09 Thread dlmarion
What was happening on that 1 tserver? Was it in garbage collection? Was it having network or O/S issues? - Original Message - From: "Michael Moss (BLOOMBERG/ 731 LEX)" To: user@accumulo.apache.org Sent: Friday, September 9, 2016 9:40:42 AM Subject: 1 of 20 TServers unresponsive/slow

Re: 1 of 20 TServers unresponsive/slow, all writes fail?

2016-09-09 Thread dlmarion
We have seen this before: a tserver that is hosting metadata tablets has issues and starts causing problems within the cluster. You could try using the HostRegexTableLoadBalancer[1,2] to segregate your metadata tablets from the other tables. This doesn't fully eliminate the SPOF, but it should

RE: how do I list user permissions per table

2016-09-23 Thread dlmarion
You can also use the admin dumpConfig command to dump all of the configuration information, then look at the user permission files. From: Christopher [mailto:ctubb...@apache.org] Sent: Friday, September 23, 2016 5:39 PM To: user@accumulo.apache.org Subject: Re: how do I list user permissions

Re: [ANNOUNCE] Apache Fluo 1.0.0-incubating released

2016-10-14 Thread dlmarion
Congrats! - Original Message - From: "Mike Walch" To: user@accumulo.apache.org Sent: Friday, October 14, 2016 3:21:00 PM Subject: [ANNOUNCE] Apache Fluo 1.0.0-incubating released The Apache Fluo (incubating) team is happy to announce the release of Fluo 1.0.0-incubating: https:/

RE: 1.7 native library performance boost

2016-10-19 Thread dlmarion
I don't remember seeing anything about native library performance improvement, but I do know that there has been talk of a performance regression in Accumulo 1.7.0 using the default configuration. I don't remember the specific details off-hand, but it had something to do with the default value o

Re: Iterator as a Filter

2016-10-21 Thread dlmarion
So if I understand this correctly, for this use case, you could do the following: courseId studentId For either of your queries (1 and 2 below) you could use a BatchScanner with the set of Ranges being the course ids from input C. In your client you would add the resulting columnFamily (stu

[ANNOUNCE] Timely 0.0.3 Released

2016-10-27 Thread dlmarion
For those interested, we released the third version of Timely today which includes several feature changes and performance improvements. Release notes are at: https://nationalsecurityagency.github.io/timely/docs/#release-notes

Re: Bulk ingestion of different locality groups at different times

2016-10-28 Thread dlmarion
>>> Is Accumulo able to import these files, considering that they are two >>> different locality groups Yes. >>> without triggering a huge major compaction? Depends on your table.compaction.major.ratio and table.file.max settings. Sorry, not a real answer, but I think the answer is "it de

RE: New Accumulo Blog Post

2016-11-02 Thread dlmarion
Regarding #2 – I think there are two options here: 1. Modify Accumulo to take advantage of HDFS Heterogeneous Storage 2. Modify Accumulo WAL code to support volumes From: Jeff Kubina [mailto:jeff.kub...@gmail.com] Sent: Wednesday, November 02, 2016 9:02 PM To: user@accumulo.apache.org Sub

Re: List of Metrics2 Metrics

2016-11-10 Thread dlmarion
It used[1] to be in the documentation when it was hosted on the monitor. I did not see it looking at the current documentation. Looks like it was lost (and [1] is likely now out of date). [1] https://github.com/apache/accumulo/blob/1.4.0/docs/metrics.html - Original Message - From:

RE: Detecting database changes

2016-11-22 Thread dlmarion
What kind of changes are you looking to alert off? Is it data changes or configuration changes? > -Original Message- > From: vaibhav thapliyal [mailto:vaibhav.thapliyal...@gmail.com] > Sent: Tuesday, November 22, 2016 7:20 AM > To: user@accumulo.apache.org > Subject: Detecting database cha

RE: how to reduce re-seeking rate?

2016-11-22 Thread dlmarion
In one case, the tserver will send data back to the client when it fills its buffer. When this happens, it's possible that the iterator could be torn down and re-seeked to the last key returned. You could increase the size of this buffer to see if that helps (http://accumulo.apache.org/1.8/accumulo

RE: maximize usage of cluster resources during ingestion

2017-07-13 Thread dlmarion
Regarding the referenced paper, pre-splitting the tables, using an optimized zookeeper deployment, and increasing concurrent minor / major compactions are good things. I'm not sure that we want to recommend turning off the write ahead logs and replication for production deployments. -Origin

RE: accumulo.metadata table online but scans hang

2017-08-30 Thread dlmarion
Re #2: Does your Accumulo processes run as the hdfs user on the O/S, or as the accumulo user? Make sure you are checking the correct users trash folder. Also, check the Accumulo garbage collector log to see if the GC process deleted the WAL files. Take a look at [1] to see if you are hitting thi

RE: Problems with accumulo replication

2017-12-29 Thread dlmarion
You can also use the tserver.walog.max.age property to ensure that the walogs roll if there is no activity. The default is 24h and was backported to 1.7.2. See ACCUMULO-4004 for more info. -Original Message- From: Josh Elser [mailto:els...@apache.org] Sent: Friday, December 29, 2017 10:

RE: Accumulo performance on various hardware configurations

2018-08-29 Thread dlmarion
This may suggest an issue with client, either getting the data to the client or the client itself (although I think there are other performance related changes you could make). I’m curious what the end goal is here. Is this a real world use case? If you are using this type of benchmark to evalua

RE: Uniquely identifying metrics with multiple tservers per node.

2019-08-26 Thread dlmarion
IIRC the 'service.name' property in the metrics properties file is used to note from which service the metric was emitted. I would suggest testing this by setting the service.name property to tserver1 and see if your metrics show up with that name. If they do, then you might be able to modify the s

RE: example thrift client to get master metrics

2020-07-23 Thread dlmarion
The Monitor[1] gets a MasterMonitorInfo[2] object from the Master that contains the information you need. The tableMap member contains a TableInfo[3] object for each table that holds the information you are looking for. [1] https://github.com/apache/accumulo/blob/1.9/server/monitor/src/main/

RE: Tablet Server Session Id Out of Range

2022-06-21 Thread dlmarion
Please see: https://lists.apache.org/thread/p7mwtkfpbyb551pw5k7yg61jopf50m8s From: Josef Roehrl - PHEMI Sent: Monday, June 20, 2022 7:02 PM To: user@accumulo.apache.org Subject: Re: Tablet Server Session Id Out of Range Hi Jonathan, We too have exactly this issue as of a couple of day

RE: Accumulo On S3

2022-10-12 Thread dlmarion
I believe that S3 Guard is OBE, but you still need to put the WAL on HDFS as S3 does not support sync. If you put your WAL in S3, and you have a tserver failure, then it’s possible that you will lose data. From: Christopher Sent: Wednesday, October 12, 2022 4:12 PM To: user@accumulo.apache.

RE: Init accumulo with custom properties file

2023-02-02 Thread dlmarion
Take a look at https://github.com/dlmarion/accumulo-k8s. You can put the Accumulo config in a ConfigMap (accumulo-config.yaml), then mount it at the correct location in another container (see the server process yaml files). I used a job to do the init phase, see accumulo-init.yaml. From

RE: Problem creating tables in Accumulo

2023-02-22 Thread dlmarion
It looks like the issue is that `maxTime` is not being set at [0], so `time` is not being set at [1]. If I had to guess, Accumulo is having an issue talking with ZooKeeper. Were there errors in the log when initializing Accumulo? Are you able to `exec` into one of the Accumulo Kubernetes pods and c

RE: Impact from Enabling TLS

2023-06-05 Thread dlmarion
Does this apply? https://bugs.openjdk.org/browse/JDK-8190917 From: Sanjay Deshmukh Sent: Monday, June 5, 2023 3:28 PM To: user@accumulo.apache.org Subject: Re: Impact from Enabling TLS Sure thing. We're running Accumulo 1.10.1. Our configuration has 15 tablet servers and a client pool of

RE: Scan thread resource management [SEC=UNOFFICIAL]

2024-01-23 Thread dlmarion
Bruce, There are no future planned releases for the 1.x line. In 2.1 Scan Executors[1] offer a way for the user to configure the resource pools used for scanning a table. You could use a prioritizer to influence the execution order of scans in the default scan executor, or setup different sca

RE: Issue with TableOperations

2024-04-22 Thread dlmarion
My guess is that the error is happening on the server side and being raised at https://github.com/openjdk/jdk/blob/master/src/java.base/share/classes/java/util/Base64.java#L713. If there is a stack trace in the server log, then that would help pinpoint the issue. I’m guessing here, but it

Re: Running Accumulo straight from Memory

2012-09-12 Thread dlmarion
Matt,   Did you see Eric Newton's response yesterday? Running on a ram disk has been done; however minor and major compactions will still occur. - Dave - Original Message - From: "Matthew J. Moore" To: user@accumulo.apache.org Sent: Wednesday, September 12, 2012 12:32:31

Re: bulk ingested table showing zero entries on the monitor page

2012-09-21 Thread dlmarion
The number of entries will show up on the monitor after a compaction. Dave - Original Message - From: "ameet kini" To: user@accumulo.apache.org Sent: Friday, September 21, 2012 10:42:32 AM Subject: bulk ingested table showing zero entries on the monitor page I'm ingest

Re: Number of partitions for sharded table

2012-10-30 Thread dlmarion
Krishmin, In the wikisearch example there is a non-sharded index table and a sharded document table. The index table is used to reduce the number of tablets that need to be searched for a given set of terms. Is your setup similar? I'm a little confused since you mention using a sharded index t

Re: Accumulo VM

2012-11-30 Thread dlmarion
John,   Where is this VM located? Dave - Original Message - From: "John Vines" To: user@accumulo.apache.org Sent: Friday, November 30, 2012 3:17:48 PM Subject: Accumulo VM I recut an Accumulo-1.4.2 VM for people to have a quick start with Accumulo. It is configured to

Re: upgrading to CDH3u5

2013-02-07 Thread dlmarion
ACCUMULO-837 - Original Message - From: "John Vines" To: user@accumulo.apache.org Sent: Thursday, February 7, 2013 12:23:48 PM Subject: Re: upgrading to CDH3u5 I don't think that CDH3u5 requires a recompile, that's only the case for CDH4/hadoop 0.23/hadoop2. On Thu, Feb 7, 20

Re: v1.6.0; What are the different levels in the shell classpath command?

2013-04-12 Thread dlmarion
I think these might be more accurate descriptions based on the changes that occurred to the code after the point at which that snippet was put into the ACCUMULO-708 comments. 1. JVM s ystem c lasspath ---> JDK jars on the local server 2. JVM application classpath  ---> Hadoop, ZK, acc

Re: Accumulo / HBase migration

2013-07-09 Thread dlmarion
I believe that Brian Loss committed code in 1.5 for a column visibility correction iterator or something that you could use to do this. You could use that and compact the table after the import. - Original Message - From: "Donald Miner" To: user@accumulo.apache.org Sent: Tuesday, Jul

RE: Move high ingest tablets to alternate node? [SEC=UNOFFICIAL]

2013-08-20 Thread dlmarion
You can create a custom load balancer for your table. I think you said that your tables are organized by date. If the hot spot are contiguous days, then you could write a load balancer that does a round robin assignment. Sent via the Samsung GALAXY S®4, an AT&T 4G LTE smartphone Origi

Re: Straggler problem in Accumulo BatchScans

2013-08-21 Thread dlmarion
You can set it in the shell on the table. Just override the default tablet balancer for the table. I think the master has to use the Table load balancer also if it is not set by default. - Original Message - From: "David M. Slater" To: user@accumulo.apache.org Sent: Wednesday, Augus

RE: Bulk Ingest data latency [SEC=UNOFFICIAL]

2013-10-17 Thread dlmarion
Data is available immediately. The counts on the monitor wont update until a compaction occurs. Sent via the Samsung GALAXY S®4, an AT&T 4G LTE smartphone Original message From: "Dickson, Matt MR" Date: 10/17/2013 8:35 PM (GMT-05:00) To: "'user@accumulo.apache.org'" Sub

RE: [VOTE] Deprecate mock in 1.6.0

2013-11-14 Thread dlmarion
+1 From: Keith Turner [mailto:ke...@deenlo.com] Sent: Thursday, November 14, 2013 3:42 PM To: d...@accumulo.apache.org; user@accumulo.apache.org Subject: [VOTE] Deprecate mock in 1.6.0 Should we deprecate mock accumulo for 1.6.0? This was considered [1] for 1.5.0. I started thinking abou

RE: utility to print Accumulo config (feedback appreciated)

2013-12-17 Thread dlmarion
There is a dumpconfig command in 1.6.0. The source should help you: https://github.com/apache/accumulo/blob/1.6.0-SNAPSHOT/server/base/src/main/ java/org/apache/accumulo/server/util/Admin.java. Look at the printConfig method. From: Sujee Maniyam [mailto:su...@sujee.net] Sent: Tuesday, December

RE: Found two locations for the same extent

2014-01-25 Thread dlmarion
Have you tried killing one or both of the tservers? Sent via the Samsung GALAXY S®4, an AT&T 4G LTE smartphone Original message From: Anthony F Date:01/25/2014 10:12 AM (GMT-05:00) To: user@accumulo.apache.org Subject: Found two locations for the same extent I have a ta

Re: Found two locations for the same extent

2014-01-25 Thread dlmarion
s on !METADATA to root and was able to delete one of the offending entries.  At that point, the system stabilized.  Any tips on avoiding getting into this situation? On Sat, Jan 25, 2014 at 10:51 AM, dlmarion wrote: Have you tried killing one or both of the tservers? Sent via the Samsung GALAXY S®4

RE: best practice for deploying server side iterator jars

2014-02-20 Thread dlmarion
You will also want to set the replication on the directory/jars in HDFS to something high. I think in my testing I could not set it above 50. -Original Message- From: Josh Elser [mailto:josh.el...@gmail.com] Sent: Thursday, February 20, 2014 12:30 PM To: user@accumulo.apache.org Subject:

RE: pre-split table tablet server assignment

2014-03-02 Thread dlmarion
The master runs a load balancer. Sent via the Samsung GALAXY S®4, an AT&T 4G LTE smartphone Original message From: "Kepner, Jeremy - 0553 - MITLL" Date:03/02/2014 8:34 PM (GMT-05:00) To: user@accumulo.apache.org Subject: pre-split table tablet server assignment If I pre

RE: Hadoop HA with Accumulo 1.5

2014-03-20 Thread dlmarion
We had an issue in our testing (https://issues.apache.org/jira/browse/ACCUMULO-2480). The root cause was a misconfiguration for automatic failover. The sshfence feature does not handle network failures, so you have to configure it with the shell(/bin/true) command also (separated by a newline, unli

RE: Advice on increasing ingest rate

2014-04-08 Thread dlmarion
How many threads are you using in the AccumuloOutputFormat? What is your latency set to? From: Adam Fuchs [mailto:afu...@apache.org] Sent: Tuesday, April 08, 2014 5:36 PM To: user@accumulo.apache.org Subject: Re: Advice on increasing ingest rate MIke, What version of Accumulo are yo