On Wed, Feb 22, 2012 at 8:40 AM, Jacques <[email protected]> wrote: > We have a crawl table and here are a couple quick thoughts: > > - I'd suggest that you use reverse url as your primary key. Specifically, > reversed host name but normal path and query string.
Maybe this utility in hbase helps do what Jacque suggests above? http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/util/Keying.html#createKey(java.lang.String) St.Ack
