Hi all, we are working a project which should collect traces, journal and audit entries produced by an application running on a tomcat server in a central data-store. We are expecting about one million entries per day, about 150 MB data.
1.) The trace entries must be collected in the same order like produced from the application and we need a failover mechanism. The aimed configuration trace collection would be: - exact one producer on tomcat server creating trace entries - the producer sends the messages always to the same partition/broker - 2 brokers on different physical servers - 2 consumers running on both broker server - the consumers belong to the same group (e.g. 'trace') - just one consumer is processing the messages and the second one is for failover How should be the zookeper configuration ? - one zookeeper server for each brokers running on the server where tomcat server runs - or 2 clustered zookeeper server each running on the brokers physical server Is it a good idea to run the consumers on the same physical server as the brokers ? Makes this configuration sense ? 2.) For the journal and audit the order of the entries are not important. So the aimed configuration for these would be: - n producers running on the tomcat server - the producers send the messages randomly to available brokers - at least 2 brokers with m partitions on different physical server - at least 2 consumers running on both broker server with m threads - the consumers belong to different groups (e.g. 'journal' and 'audit') My question here is how to figure out the number of partitions. Are there any measure values or hints ? Many thanks, Ismail.