Accumulo-Master - Build # 2907 - Unstable

2019-08-15 Thread Apache Jenkins Server
The Apache Jenkins build system has built Accumulo-Master (build #2907)

Status: Unstable

Check console output at https://builds.apache.org/job/Accumulo-Master/2907/ to 
view the results.

[GitHub] [accumulo-testing] milleruntime merged pull request #100: Drop example files

2019-08-15 Thread GitBox
milleruntime merged pull request #100: Drop example files
URL: https://github.com/apache/accumulo-testing/pull/100
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo] ctubbsii commented on issue #1327: No-chop merges

2019-08-15 Thread GitBox
ctubbsii commented on issue #1327: No-chop merges
URL: https://github.com/apache/accumulo/issues/1327#issuecomment-521809296
 
 
   This is a very involved change, but with high potential benefit.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo-testing] ctubbsii edited a comment on issue #100: Drop example files

2019-08-15 Thread GitBox
ctubbsii edited a comment on issue #100: Drop example files
URL: https://github.com/apache/accumulo-testing/pull/100#issuecomment-513972060
 
 
   > There are a few example files for other tests that I am not sure if we 
need them or not:
   > 
   > ```
   > ./test/agitator/agitator.ini.example
   > ./test/agitator/hosts.example
   > ./test/scalability/conf/Ingest.conf.example
   > ./test/scalability/conf/site.conf.example
   > ./test/stress/stress-env.sh.example
   > ```
   
   Probably best to leave them for now, since they're unrelated to these 
particular (most commonly used) files.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Resolved] (ACCUMULO-3235) No-chop merges

2019-08-15 Thread Christopher Tubbs (JIRA)


 [ 
https://issues.apache.org/jira/browse/ACCUMULO-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Christopher Tubbs resolved ACCUMULO-3235.
-
Resolution: Duplicate

Closing as duplicate of https://github.com/apache/accumulo/issues/1327 since we 
track issues on GitHub now.

> No-chop merges
> --
>
> Key: ACCUMULO-3235
> URL: https://issues.apache.org/jira/browse/ACCUMULO-3235
> Project: Accumulo
>  Issue Type: Improvement
>  Components: tserver
>Reporter: Christopher Tubbs
>Assignee: Christopher Tubbs
>Priority: Major
>
> Merges currently require a special "chop" major compaction, which can be 
> slow. The idea is simple: in the metadata entries for files, we track the 
> range in use for that file with a notation.
> This additional metadata would be used to merge tablets without re-writing 
> data. Only this extra metadata would be updated (to a narrower range when 
> split, and possibly collapsed if merging adjacent ranges). If a range is 
> missing for a file, the tablet's range will be used instead. This simplifies 
> things for backwards compatibility, and means we don't need to include a 
> redundant range if the tablet isn't splitting/merging.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Resolved] (ACCUMULO-4118) DataFileValue must support specifying ranges

2019-08-15 Thread Christopher Tubbs (JIRA)


 [ 
https://issues.apache.org/jira/browse/ACCUMULO-4118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Christopher Tubbs resolved ACCUMULO-4118.
-
Resolution: Duplicate

Closing as duplicate of https://github.com/apache/accumulo/issues/1327 since we 
track issues on GitHub now.

> DataFileValue must support specifying ranges
> 
>
> Key: ACCUMULO-4118
> URL: https://issues.apache.org/jira/browse/ACCUMULO-4118
> Project: Accumulo
>  Issue Type: Sub-task
>  Components: tserver
>Reporter: Christopher Tubbs
>Assignee: Christopher Tubbs
>Priority: Major
>
> Add support for byte ranges to DataFileValue schema



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[GitHub] [accumulo] ctubbsii opened a new issue #1327: No-chop merges

2019-08-15 Thread GitBox
ctubbsii opened a new issue #1327: No-chop merges
URL: https://github.com/apache/accumulo/issues/1327
 
 
   Related to #1050 
   Supersedes https://issues.apache.org/jira/browse/ACCUMULO-3235 and 
https://issues.apache.org/jira/browse/ACCUMULO-4118
   
   Merges currently require a special "chop" major compaction, which can be 
slow. The idea is simple: in the metadata entries for files, we track the range 
in use for that file with a notation.
   
   This additional metadata would be used to merge tablets without re-writing 
data. Only this extra metadata would be updated (to a narrower range when 
split, and possibly collapsed if merging adjacent ranges). If a range is 
missing for a file, the tablet's range will be used instead. This simplifies 
things for backwards compatibility, and means we don't need to include a 
redundant range if the tablet isn't splitting/merging.
   
   In order to achieve this, we will need a mechanism to serialize the range 
information into the file field of the tablet metadata, and we will need to 
ensure that all reads of the file are confined to the range serialized. Some 
pitfalls to look out for: files opened multiple times with different ranges, 
use of block cache, offline file reads, handling duplicate entries for a file 
with different ranges as the result of a merge, garbage collection of files, 
etc.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Reopened] (ACCUMULO-2873) Create utility that generates single line tablet information

2019-08-15 Thread Christopher Tubbs (JIRA)


 [ 
https://issues.apache.org/jira/browse/ACCUMULO-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Christopher Tubbs reopened ACCUMULO-2873:
-

> Create utility that generates single line tablet information
> 
>
> Key: ACCUMULO-2873
> URL: https://issues.apache.org/jira/browse/ACCUMULO-2873
> Project: Accumulo
>  Issue Type: New Feature
>Reporter: Keith Turner
>Priority: Major
>  Labels: newbie
>
> It would be very useful to have a utility that generates single line tablet 
> info.  The output of this could be fed to sort, awk, grep, etc inorder to 
> answer questions like which tablets have the most files.
> The output could look something like the following
> {noformat}
> $accumulo admin listTablets --table bigTable3
> #files #walogs #entries #size #status #location #tableid #endrow
> 6 2 40,001 50M ASSIGNED 10.1.9.9 4:9997[abc]  3 admin
> 3 1 50,002 40M ASSIGNED 10.1.9.9 5:9997[abc]  3 helpful
> {noformat}
> All of the information can be obtained by scanning the metadata table and 
> looking into zookeeper.   Could possibly contact tablet servers to get info 
> about entires in memory.
> The order of the columns in the example above is arbitrary, except for end 
> row.  Maybe end row column should come last because it can be of arbitrary 
> length.  Also the end row could contain any character, could look into using 
> a CSV library.   It would be nice to design the utility so that columns can 
> be added in future versions w/o impacting current scripts that use the 
> utility.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Resolved] (ACCUMULO-2873) Create utility that generates single line tablet information

2019-08-15 Thread Christopher Tubbs (JIRA)


 [ 
https://issues.apache.org/jira/browse/ACCUMULO-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Christopher Tubbs resolved ACCUMULO-2873.
-
Resolution: Duplicate

Duplicated by the issue on GitHub: 
https://github.com/apache/accumulo/issues/1317

> Create utility that generates single line tablet information
> 
>
> Key: ACCUMULO-2873
> URL: https://issues.apache.org/jira/browse/ACCUMULO-2873
> Project: Accumulo
>  Issue Type: New Feature
>Reporter: Keith Turner
>Priority: Major
>  Labels: newbie
>
> It would be very useful to have a utility that generates single line tablet 
> info.  The output of this could be fed to sort, awk, grep, etc inorder to 
> answer questions like which tablets have the most files.
> The output could look something like the following
> {noformat}
> $accumulo admin listTablets --table bigTable3
> #files #walogs #entries #size #status #location #tableid #endrow
> 6 2 40,001 50M ASSIGNED 10.1.9.9 4:9997[abc]  3 admin
> 3 1 50,002 40M ASSIGNED 10.1.9.9 5:9997[abc]  3 helpful
> {noformat}
> All of the information can be obtained by scanning the metadata table and 
> looking into zookeeper.   Could possibly contact tablet servers to get info 
> about entires in memory.
> The order of the columns in the example above is arbitrary, except for end 
> row.  Maybe end row column should come last because it can be of arbitrary 
> length.  Also the end row could contain any character, could look into using 
> a CSV library.   It would be nice to design the utility so that columns can 
> be added in future versions w/o impacting current scripts that use the 
> utility.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[GitHub] [accumulo-docker] ctubbsii commented on issue #11: Accumulo 2.0.0 is out...

2019-08-15 Thread GitBox
ctubbsii commented on issue #11: Accumulo 2.0.0 is out...
URL: https://github.com/apache/accumulo-docker/issues/11#issuecomment-521784165
 
 
   It seems like what might be required is for the Accumulo PMC to vote to 
release this Dockerfile as an official ASF release, and then anybody can issue 
a pull request to add an accumulo specification that points to the tagged ASF 
release at: https://github.com/docker-library/official-images


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo-docker] keith-turner commented on issue #11: Accumulo 2.0.0 is out...

2019-08-15 Thread GitBox
keith-turner commented on issue #11: Accumulo 2.0.0 is out...
URL: https://github.com/apache/accumulo-docker/issues/11#issuecomment-521776143
 
 
   I can update it for 2.0, but I Am not sure how to publish an image.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo-docker] ctubbsii commented on issue #11: Accumulo 2.0.0 is out...

2019-08-15 Thread GitBox
ctubbsii commented on issue #11: Accumulo 2.0.0 is out...
URL: https://github.com/apache/accumulo-docker/issues/11#issuecomment-521776161
 
 
   @larsw Do you know what's involved in doing that?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo] asfgit closed issue #1314: Garbage Collector problematic use of SimpleTimer

2019-08-15 Thread GitBox
asfgit closed issue #1314: Garbage Collector problematic use of SimpleTimer
URL: https://github.com/apache/accumulo/issues/1314
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo] EdColeman merged pull request #1299: Add optional -t tablename to importdirectory shell command.

2019-08-15 Thread GitBox
EdColeman merged pull request #1299: Add optional -t tablename to 
importdirectory shell command.
URL: https://github.com/apache/accumulo/pull/1299
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo] keith-turner commented on issue #1317: Provide single-line output of tablet info.

2019-08-15 Thread GitBox
keith-turner commented on issue #1317: Provide single-line output of tablet 
info.
URL: https://github.com/apache/accumulo/issues/1317#issuecomment-521765040
 
 
   > What is the proper way to determine status?
   
   Could use [TabletLocationState.getState()][2].  One way to get 
TabletLocationState objects is via [TabletStateStore][1].  The set of live 
tservers can be obtained using [LiveTServerSet][3].
   
   > The original ticket mentioned that some information could be pulled from 
zookeeper - with the current implementation, all information is coming from the 
metadata table. What information is missing / would be helpful?
   
   [LiveTServerSet][3] pulls the set of active tablet servers from zookeeper.
   
   [1]: 
https://github.com/apache/accumulo/blob/f39c2fef1c1c911e86985ce6352c5d1bc9967c5b/server/base/src/main/java/org/apache/accumulo/server/master/state/TabletStateStore.java
   
   [2]: 
https://github.com/apache/accumulo/blob/f39c2fef1c1c911e86985ce6352c5d1bc9967c5b/server/base/src/main/java/org/apache/accumulo/server/master/state/TabletLocationState.java#L104
   
   [3]: 
https://github.com/apache/accumulo/blob/f39c2fef1c1c911e86985ce6352c5d1bc9967c5b/server/base/src/main/java/org/apache/accumulo/server/master/LiveTServerSet.java


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo] EdColeman merged pull request #1318: Fix #1314 fixes gc thread issue by creating one timed task instance

2019-08-15 Thread GitBox
EdColeman merged pull request #1318: Fix #1314 fixes gc thread issue by 
creating one timed task instance
URL: https://github.com/apache/accumulo/pull/1318
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo] keith-turner closed issue #1326: TabletStateChangeIteratorIT fails due to #1309

2019-08-15 Thread GitBox
keith-turner closed issue #1326: TabletStateChangeIteratorIT fails due to #1309
URL: https://github.com/apache/accumulo/issues/1326
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo] asfgit closed issue #1288: Back port new Testing MR Bulk Ingest test to 1.9

2019-08-15 Thread GitBox
asfgit closed issue #1288: Back port new Testing MR Bulk Ingest test to 1.9
URL: https://github.com/apache/accumulo/issues/1288
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo] milleruntime merged pull request #1312: Backport BulkIngest test. Closes #1288

2019-08-15 Thread GitBox
milleruntime merged pull request #1312: Backport BulkIngest test. Closes #1288
URL: https://github.com/apache/accumulo/pull/1312
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] [accumulo-docker] larsw opened a new issue #11: Accumulo 2.0.0 is out...

2019-08-15 Thread GitBox
larsw opened a new issue #11: Accumulo 2.0.0 is out...
URL: https://github.com/apache/accumulo-docker/issues/11
 
 
   Hi,
   
   Ref. the README; will you publish an official image soon?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services