[ANNOUNCE] Apache Kudu 1.14.0 Released

2021-01-28 Thread Grant Henke
The Apache Kudu team is happy to announce the release of Kudu 1.14.0! Kudu is an open source storage engine for structured data which supports low-latency random access together with efficient analytical access patterns. It is designed within the context of the Apache Hadoop ecosystem and

Re: [proposal] Kudu operating system requirements changes

2020-11-25 Thread Grant Henke
ures in Kudu but can't upgrade host operating > systems > > > in near future. > > > > > > Rest looks good to me and can't wait to start using the new C++ > features > > > this will unlock. > > > > > > Thanks, > > > -Bankim. > > >

[proposal] Kudu operating system requirements changes

2020-11-23 Thread Grant Henke
Hello Kudu developers and users! The purpose of this email is to propose and collect feedback on changes to the documented "Operating System Requirements" on https://kudu.apache.org/docs/installation.html for the next Kudu release (1.14.0). There are a few goals to updating the documented

Re: kudu-spark2 Scala 2.12 support

2020-05-21 Thread Grant Henke
etely dropped Scala 2.11 >> support. See https://spark.apache.org/docs/2.4.5/: "For the Scala API, >> Spark 2.4.5 uses Scala 2.12. You will need to use a compatible Scala >> version (2.12.x).". >> >> Could you, please, build and publish to Maven repo kudu-spark2 Sc

Re: Will hash partitioning columns be used in query optimization?

2020-05-13 Thread Grant Henke
anding, because we already set b as hash partitioning column > Kudu will not do a full table scan. > > Instead Kudu will look for “c=2” in the hash bucket of “b=1”. > > Is my understanding correct? > > > -- Grant Henke Software Engineer | Cloudera gr...@cloudera.com | twitter.com/gchenke | linkedin.com/in/granthenke

Re: real-time pipeline with Kudu

2020-05-13 Thread Grant Henke
2/ >> >> Our real-time infra was pretty crucial during Covid19 to support all kinds >> of analytics. >> >> I really appreciate everyone's help along the way. Todd, Adar, Alexey, >> Cliff, Andrew, Grant, Tim, Hao, JD - man, only now I realized how many >> people of

Re: [ANNOUNCE] Welcoming Bankim Bhavsar as Kudu committer and PMC member

2020-04-21 Thread Grant Henke
st and > >>> efficient. One of the challenges here has been integration with Apache > >>> Impala, and providing a common abstraction that can be used by both > >>> codebases. This work is still ongoing but is drawing to a close pretty > >>> soon. > >>> > >>> Please join me in congratulating Bankim! > >>> > >> >

Re: Partitioning Rules of Thumb

2020-03-13 Thread Grant Henke
at, Mar 7, 2020 at 9:52 AM Boris Tyukin >>>>>>> wrote: >>>>>>> >>>>>>>> hey guys, >>>>>>>> >>>>>>>> I asked the same question on Slack and got no responses. I just >>>>

Re: [ANNOUNCE] Welcoming Yifan Zhang as Kudu committer and PMC member

2020-01-07 Thread Grant Henke
icas from such tablet > >> servers, which is a valuable building block for tserver > >> decommissioning. > >> - Most recently, deduplicating RPCs sent by Kudu masters to tablet > >> servers. > >> > >> Please join me in congratulating Yifan! > >> > > > > > > -- > > Andrew Wong > > >

Re: Long text and complex data types support

2019-09-11 Thread Grant Henke
t.github.com/dnafault/e55ea987c55d2960c738d94e4811d043 >> [2] >> https://clickhouse-docs.readthedocs.io/en/latest/data_types/nested_data_structures/nested.html >> >> On Mon, 9 Sep 2019 at 08:18, Grant Henke wrote: >> > >> > Hi Boris, >> > >> >

Re: Long text and complex data types support

2019-09-08 Thread Grant Henke
s, maps etc.) > > Thanks >

Re: Steps to know kudu.num_tablets_replica parameter value

2019-08-28 Thread Grant Henke
ote: > Hi Everyone > > I'm new in Kudu database and have a requirement to find the replication > factor value set in all Kudu tables, request to suggest me the steps to > find this value. > > Please help me on the priority > > Regards > Naresh > >

Kudu Community Survey

2019-08-27 Thread Grant Henke
Hello Kudu Users! Please fill out the first Apache Kudu community survey and tell us how you're using Kudu and how we can make it better. https://forms.gle/5pRzhh38sZwZt69U7 The results of this survey will help drive community focus, direction, and future blog content. It will also be used to

Re: [ANNOUNCE] Welcoming Lifu He, Yao Xu, and Yao Zhang as Kudu committers and PMC members

2019-08-26 Thread Grant Henke
of the largest Internet and video game > >>> companies in the world) where he helps operate their Kudu clusters. > >>> Both Yao Xu and Yao Zhang work at Ant Financial (Alipay Inc.) where > >>> they also help operate their very large Kudu deployments. All three > >>> have been instrumental in growing Kudu's presence within China as well > >>> as helping new Chinese users come up to speed with Kudu. > >>> > >>> Please join me in congratulating Lifu, Yao, and Yao! > >>> > >> > >> > >> -- > >> Andrew Wong > >> >

Re: Delete or Update by Query

2019-07-16 Thread Grant Henke
ions like the one below, but it deletes > only one row at a time. > > > > Delete delete = table.newDelete(); > > ... > > session.apply(delete); > > > > Is there some way to delete multiple rows using a condition? Maybe via > a Scanner or si

Re: Empty RowResultIterator with RangePartitions

2019-07-11 Thread Grant Henke
I created a jira to improve the Javadoc for the `nextRows` API here: https://issues.apache.org/jira/browse/KUDU-2891 If you are interested in contributing it would be a super simple contribution. On Thu, Jul 11, 2019 at 1:00 PM Grant Henke wrote: > Hi John, > > If you can leverage

Re: Kudu Reader is not working with Java

2019-04-22 Thread Grant Henke
any or its affiliates and > subsidiaries. If you are not the intended recipient, you are hereby > notified that any use of the information contained in or transmitted with > the communication or dissemination, distribution, or copying of this > communication is strictly prohibited. If you have received this > communication in error, please notify the Western Union sender immediately > by replying to this message and delete the original message >

Re: Check existing range partitions using the Java API

2019-03-06 Thread Grant Henke
the range partitions >>> > (or partitions at all) using the Java API? My use-case at the moment >>> > is to create range partitions based on the data I am about to insert, >>> > and to do so I want to first check if that range partition already >>> > exists, to p

Re: Kudu Master and T-Server

2019-02-19 Thread Grant Henke
AM Vignesh Shenbagarajan wrote: > Hi, > > > > I am having difficulties in starting and stopping kudu servers. Is there > any particular command to execute these tasks in version 1.8? > > > > Looking forward to your response. > > > > Thank You, >

Re: Re[2]: How to load kudu RDD with correct partitioner

2018-11-18 Thread Grant Henke
partitioning. So I > have to add rdd.partitionBy(new HashPartitioner(N)) before join operation > and it leads to shuffling > > Regards, Dmitry > > Wednesday, 14 November 2018, 18:58 +03:00 from Grant Henke: > > Unfortunately, I am not sure of a simple way to provide the parti

Re: How to load kudu RDD with correct partitioner

2018-11-14 Thread Grant Henke
ords from such table in key value fashion with > correct partitioner information in RDD? For example RDD[(record_id, row)] > Because when I try to use kuduRDD in spark the partitioner has None value > so I'm losing information about original (kudu) partitioning. > > Thanks

Re: Growing the Kudu community

2018-07-17 Thread Grant Henke
s shut down) >> >> Please offer your suggestions for where we can get a good bang for our >> collective buck, and if there is anything you would like to work on by all >> means please either speak up or feel free to reach out directly. >> >> Thanks, >> >> Grant and Mike >> >> > > -- > Mauricio Aristizabal > Architect - Data Pipeline > mauri...@impact.com | 323 309 4260 > https://impact.com > <https://www.facebook.com/ImpactMarTech/> > <https://twitter.com/impactmartech> > >

[ANNOUNCE] Apache Kudu 1.7.1 released

2018-07-12 Thread Grant Henke
The Apache Kudu team is happy to announce the release of Kudu 1.7.1. Kudu is an open source storage engine for structured data which supports low-latency random access together with efficient analytical access patterns. It is designed within the context of the Apache Hadoop ecosystem and supports

[ANNOUNCE] Apache Kudu 1.7.0 released

2018-03-23 Thread Grant Henke
://kudu.apache.org/releases/1.7.0/ Full release notes: https://kudu.apache.org/releases/1.7.0/docs/release_notes.html Regards, The Apache Kudu team

Re: INT128 Column Support Interest

2017-11-21 Thread Grant Henke
cimal precisions 39 to 77 with internal INT256 support (or VarInt). On Mon, Nov 20, 2017 at 6:51 PM, Todd Lipcon <t...@cloudera.com> wrote: > On Mon, Nov 20, 2017 at 1:12 PM, Grant Henke <ghe...@cloudera.com> wrote: > > > Thank you for the feedback. Below are some respon