A little bit better than plain scraping..use lynx.. You don't have to parse HTML at least.
Thanks, Abhishek -----Original Message----- From: Patai Sangbutsarakum [mailto:[email protected]] Sent: Thursday, October 18, 2012 2:47 PM To: [email protected] Subject: i am about to scrape a page I finding a way to retrieve info about what jobs are running by what user, and on what pool(s); i am on cdh3u4 with fair scheduler. I do know that jobtracker_host:50030/scheduler is showing that, so scraping the page would be one way and handle with html table. Is that any other more civilized way, json format, command line ? hadoop job -list doesn't show the pool.. that's pretty sad. Input is really appreciate :-) Thanks Patai
