A little bit better than plain scraping..use lynx..
You don't have to parse HTML at least.


Thanks,
Abhishek


-----Original Message-----
From: Patai Sangbutsarakum [mailto:[email protected]] 
Sent: Thursday, October 18, 2012 2:47 PM
To: [email protected]
Subject: i am about to scrape a page

I finding a way to retrieve info about what jobs are running by what user, and 
on what pool(s); i am on cdh3u4 with fair scheduler.
I do know that jobtracker_host:50030/scheduler   is  showing that, so
scraping the page would be one way and handle with html table.

Is that any other more civilized way, json format, command line ?
hadoop job -list doesn't show the pool.. that's pretty sad.

Input is really appreciate :-)

Thanks
Patai

Reply via email to