dubin555 commented on issue #6648: URL: https://github.com/apache/incubator-pinot/issues/6648#issuecomment-792486881
> For the json field, you can configure it as string field in the schema, and enable json index for it (Check https://docs.pinot.apache.org/basics/indexing/json-index for more details) > Note that the input data needs to be json strings. You may use `jsonFormat()` function to convert object to json string if needed. > For Kafka real-time ingestion, each Kafka partition is consumed by one thread (one consuming segment), and you can configure multiple replications of the segment Thanks for the reply! I check the link that you posted. If the upstream data is in nested json, like this: `{"props":{"name":"tom"}}`, can data be stored in a column field like 'name' WITHOUT nested 'props'? I don't quite understand. Suppose the upstream Kafka topic has 10 partitions, and I want to configure 2 consumer threads. Question 1, where to config the setting. Question 2. If lag happens, I want to increase to 10 threads, how to change the setting ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
