I have a very large log (50T per day). My log events look like this:

url,visitid,requesttime
http://www.aaa.com?a=b&c=d&e=f, 1, 1463387380
http://www.aaa.com?a=b&c=d&e=fa, 1, 1463387280
http://www.aaa.com?a=b&c=d&e=fa, 2, 1463387280
http://www.aaa.com?a=b&c=d&e=fab, 2, 1463387280
http://www.aaa.com?a=b&c=d&e=f, 1, 1463387380

When a user enters part of a URL, the system should return the UV (unique visitors) and PV (page views) for the matching events. For example:

input: e=f*  output: uv=2, pv=5
input: e=fa  output: uv=2, pv=3

How should I design the rowkey? Thanks.
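To make the expected result concrete, here is a minimal brute-force sketch in Python over the five sample events. It is only meant to pin down the semantics, not to suggest a storage design. The names `LogEvent` and `uv_pv` are made up for illustration, and the matching rule is an assumption inferred from the two examples: the input is treated as a substring of the URL, and a trailing `*` is ignored.

```python
from typing import Iterable, NamedTuple, Tuple


class LogEvent(NamedTuple):
    """One log line: url, visitid, requesttime (hypothetical type name)."""
    url: str
    visitid: int
    requesttime: int


# The five sample events from the question.
EVENTS = [
    LogEvent("http://www.aaa.com?a=b&c=d&e=f", 1, 1463387380),
    LogEvent("http://www.aaa.com?a=b&c=d&e=fa", 1, 1463387280),
    LogEvent("http://www.aaa.com?a=b&c=d&e=fa", 2, 1463387280),
    LogEvent("http://www.aaa.com?a=b&c=d&e=fab", 2, 1463387280),
    LogEvent("http://www.aaa.com?a=b&c=d&e=f", 1, 1463387380),
]


def uv_pv(events: Iterable[LogEvent], pattern: str) -> Tuple[int, int]:
    """Return (uv, pv) for events whose URL contains the pattern.

    Assumption: the pattern is matched as a plain substring, and any
    trailing '*' characters are stripped before matching.
    """
    needle = pattern.rstrip("*")
    visitors = set()  # distinct visitid values among matching events
    pv = 0            # number of matching events
    for event in events:
        if needle in event.url:
            pv += 1
            visitors.add(event.visitid)
    return len(visitors), pv


print(uv_pv(EVENTS, "e=f*"))  # (2, 5)
print(uv_pv(EVENTS, "e=fa"))  # (2, 3)
```

This full scan obviously does not work at 50T per day, which is why I am asking about the rowkey.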
