[GitHub] [incubator-pinot] dubin555 commented on issue #6648: Support nested json schema in realtime ingest

GitBox Sun, 07 Mar 2021 21:54:26 -0800


dubin555 commented on issue #6648:
URL: 
https://github.com/apache/incubator-pinot/issues/6648#issuecomment-792486881



   > For the json field, you can configure it as string field in the schema, 
and enable json index for it (Check 
https://docs.pinot.apache.org/basics/indexing/json-index for more details)
   > Note that the input data needs to be json strings. You may use 
`jsonFormat()` function to convert object to json string if needed.
   
   > For Kafka real-time ingestion, each Kafka partition is consumed by one 
thread (one consuming segment), and you can configure multiple replications of 
the segment
   
   Thanks for the reply! I checked the link you posted. If the upstream data 
is nested JSON, like this: `{"props":{"name":"tom"}}`, can the value be stored 
in a top-level column such as 'name', WITHOUT the nested 'props'? 
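
   To make the question concrete, here is a rough Python sketch of the 
flattening being asked about. It is only an illustration of the desired 
result, not Pinot configuration or Pinot code; the helper name 
`flatten_props` and its `parent` parameter are made up for this example.

```python
import json


def flatten_props(record_json: str, parent: str = "props") -> dict:
    """Lift the keys nested under `parent` to the top level of the record.

    Hypothetical helper for illustration only; it is not part of Pinot.
    """
    record = json.loads(record_json)
    # Keep every top-level field except the nested parent object.
    flat = {key: value for key, value in record.items() if key != parent}
    nested = record.get(parent)
    if isinstance(nested, dict):
        # Promote the nested fields, e.g. props.name becomes name.
        flat.update(nested)
    elif parent in record:
        # The parent is not an object, so leave it where it was.
        flat[parent] = nested
    return flat


row = flatten_props('{"props":{"name":"tom"}}')
print(row)
```

   In other words, the stored row would have a 'name' column and no 'props' 
column.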
   
   I don't quite understand the Kafka part. Suppose the upstream Kafka topic 
has 10 partitions, and I want to configure 2 consumer threads. Question 1: 
where do I configure this setting? Question 2: if consumer lag builds up and I 
want to increase to 10 threads, how do I change the setting?


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


