I am looking at integrating flume ng with our rest service API to record click stream data. Flow would be browser sends data to this REST service, which then acts as a client and send it to flume async. Flume then stores it in hdfs. I just want to make sure that this is a right use of flume.
I do have another question, how does flume organizes hdfs files? Does it create new directory based on the timestamp? Could someone help me with this in understanding how to efficiently organize and store files such that data can be clustered based on timestamp?
