On Tue, Mar 15, 2011 at 10:19 AM, sreejith P. K. <[email protected]> wrote:
> I need to maintain a huge table for a 'web crawler' project in HBASE.
> Basically it contains thousands of keywords and for each keyword i need to
> maintain a list of urls (it again will count in thousands). Corresponding to
> each url, i need to store a number, which will in turn resemble the priority
> value the keyword holds.
> Let me explain you a bit, Suppose i have a keyword 'united states', i need
> to store about ten thousand urls corresponding to that keyword. Each keyword
> will be holding a priority value which is an integer. Again i have thousands
> of keywords like that. The rare thing about this is i need to do the project
> in PHP.

Have you looked at ElasticSearch?  Seems like it would do what you
want out of the box.  In your PHP app you simply need to make REST
calls with a bit of JSON here and there, and that would be all.
http://www.elasticsearch.org/

-- 
Benoit "tsuna" Sigoure
Software Engineer @ www.StumbleUpon.com

Reply via email to