On Tue, Mar 15, 2011 at 10:19 AM, sreejith P. K. <[email protected]> wrote: > I need to maintain a huge table for a 'web crawler' project in HBASE. > Basically it contains thousands of keywords and for each keyword i need to > maintain a list of urls (it again will count in thousands). Corresponding to > each url, i need to store a number, which will in turn resemble the priority > value the keyword holds. > Let me explain you a bit, Suppose i have a keyword 'united states', i need > to store about ten thousand urls corresponding to that keyword. Each keyword > will be holding a priority value which is an integer. Again i have thousands > of keywords like that. The rare thing about this is i need to do the project > in PHP.
Have you looked at ElasticSearch? Seems like it would do what you want out of the box. In your PHP app you simply need to make REST calls with a bit of JSON here and there, and that would be all. http://www.elasticsearch.org/ -- Benoit "tsuna" Sigoure Software Engineer @ www.StumbleUpon.com
