Hi, I am working on a demo application which should have the following functionality: 1. There is a web interface that allows people to submit a data analysis job. It also allows to list all previois jobs and view the results of the jobs. 2. After a job is submitted, a hadoop job is submitted by my application. The progress of the job is reported on the job results web page of my app. 3. After the job is completed, my application would read the hadoop output directory and generate a report which would be available on the job results webpage. Though the definition is simple, there are a few queries that I had: 1. Would I require a server (in terms of software) or my application which would act as a job scheduling system and store the current status of the job and which would submit jobs to hadoop? 2. If I do not require a server, then how to get the status of the hadoop job to display it in my webpage? Any idea/thoughts about how this application can be developed would be of help. ~ Neeraj
