[jira] [Updated] (FLINK-16627) when insert into kafkas ,how can i remove the keys with null values of json

jackray wang (Jira) Mon, 16 Mar 2020 22:23:16 -0700


     [ 
https://issues.apache.org/jira/browse/FLINK-16627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


jackray wang updated FLINK-16627:
---------------------------------
    Description: 
{code:sql}
CREATE TABLE sink_kafka (subtype STRING, svt STRING) WITH (……)
{code}
{code:sql}
CREATE TABLE source_kafka (subtype STRING, svt STRING) WITH (……)
{code}
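{code:scala}
import org.apache.flink.table.functions.ScalarFunction

// Scalar UDF that maps a null input to an empty string
// and returns any other value unchanged.
class ScalaUpper extends ScalarFunction {
  def eval(str: String): String =
    if (str == null) "" else str
}

// Register the UDF under the name "scala_upper".
btenv.registerFunction("scala_upper", new ScalaUpper())
{code}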
{code:sql}
INSERT INTO sink_kafka SELECT subtype, svt FROM source_kafka
{code}

 
----
Sometimes the value of svt is null, and the JSON written to Kafka then looks
like:
\{"subtype":"qin","svt":null}

For small amounts of data this is acceptable, but we process 10 TB of data
every day, and the JSON may contain many null fields, which hurts efficiency.
If a parameter could be added to the sink table definition to drop keys whose
values are null, performance would improve greatly.
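
To make the requested behavior concrete, here is a minimal sketch in plain Java of a serializer that skips null-valued keys. It is not Flink's JSON format and not an existing Flink option: the class name NullOmittingJson, the method toJson, and the flag omitNulls are hypothetical names chosen for illustration, and the sketch only handles flat rows of STRING columns such as (subtype, svt).

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

public class NullOmittingJson {

    // Escapes characters that must not appear raw inside a JSON string.
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) {
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.toString();
    }

    // Serializes a flat row of STRING columns to a JSON object.
    // With omitNulls = true, keys whose value is null are left out entirely;
    // with omitNulls = false, they are written as "key":null.
    static String toJson(Map<String, String> row, boolean omitNulls) {
        StringJoiner json = new StringJoiner(",", "{", "}");
        for (Map.Entry<String, String> e : row.entrySet()) {
            String key = "\"" + escape(e.getKey()) + "\":";
            String value = e.getValue();
            if (value == null) {
                if (omitNulls) {
                    continue; // drop the key, as the issue requests
                }
                json.add(key + "null");
            } else {
                json.add(key + "\"" + escape(value) + "\"");
            }
        }
        return json.toString();
    }

    public static void main(String[] args) {
        Map<String, String> row = new LinkedHashMap<>();
        row.put("subtype", "qin");
        row.put("svt", null);
        System.out.println(toJson(row, false)); // {"subtype":"qin","svt":null}
        System.out.println(toJson(row, true));  // {"subtype":"qin"}
    }
}
```

For the example row, dropping the null key shortens the record from 29 to 17 characters. The actual saving in a real job depends on how many columns are null and how long their names are.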

        Summary: when insert into kafkas ,how can i remove the keys with null 
values of json  (was: when insert into kafka ,how can i remove the keys with 
null value of json)

> when insert into kafkas ,how can i remove the keys with null values of json
> ---------------------------------------------------------------------------
>
>                 Key: FLINK-16627
>                 URL: https://issues.apache.org/jira/browse/FLINK-16627
>             Project: Flink
>          Issue Type: Improvement
>          Components: Table SQL / Client
>    Affects Versions: 1.10.0
>            Reporter: jackray wang
>            Priority: Major



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
