Re: Scan result sequence

2014-08-16 Thread Mingtao Zhang
Gr8 :) Thank you! Mingtao On Sat, Aug 16, 2014 at 6:03 PM, Ted Yu wrote: > bq. hash(a.com) comes together with the timestamp sequence > > That should be the case - assuming your sessionId is of fixed width. > > Cheers > > > On Sat, Aug 16, 2014 at 2:55 PM, Mingtao Zhang > wrote: > > > Hi Ted,

Re: Scan result sequence

2014-08-16 Thread Ted Yu
bq. hash(a.com) comes together with the timestamp sequence That should be the case - assuming your sessionId is of fixed width. Cheers On Sat, Aug 16, 2014 at 2:55 PM, Mingtao Zhang wrote: > Hi Ted, > > I used murmurhash. Actually I don't care about the sequence between the > group of a.com a

Re: Scan result sequence

2014-08-16 Thread Mingtao Zhang
Hi Ted, I used murmurhash. Actually I don't care about the sequence between the group of a.com and b.com record. I am 120% :) as far as hash(a.com) comes together with the timestamp sequence. (hash(b.com) could come either before or after) Best Regards, Mingtao On Sat, Aug 16, 2014 at 5:44 PM, T

Re: Scan result sequence

2014-08-16 Thread Ted Yu
How do you generate hash based on pageurl ? The order between hash(a.com) and hash(b.com ) may not be what you expected. BTW See http://hbase.apache.org/book.html#row and http://hbase.apache.org/book.html#dm.sort Cheers On Sat, Aug 16, 2014 at 2:26 PM, Mingtao Zhang wrote: > Hi