Hi all,

I am quite new to the Hadoop world and am working on a project that uses Hadoop and Pig. Data is continuously written to Hadoop by many producers. All producers write concurrently to the same file for 30 minutes; after 30 minutes, a new file is created and they start writing to it. I need to run Pig jobs that analyze the data in Hadoop incrementally and push the results into an RDBMS. What would be the right way to implement this?

Thanks,
RS
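To make the "incremental" part of the question concrete, here is a minimal sketch of how a scheduler might decide which 30-minute files are safe to analyze: only files whose window has fully elapsed, and which have not been analyzed before. Everything named here is an assumption for illustration, not part of the actual setup: the function names, the idea of tracking processed windows in a set, and the /data/events/ path layout are all hypothetical.

```python
from datetime import datetime, timedelta

# Rotation interval described in the post: a new file every 30 minutes.
WINDOW = timedelta(minutes=30)


def closed_unprocessed(window_starts, processed, now):
    """Return start times of windows that are complete and not yet analyzed.

    window_starts: iterable of datetime, one per file (hypothetical bookkeeping)
    processed:     set of datetime for windows already analyzed
    now:           current time, passed in so the function is easy to test
    """
    ready = []
    for start in sorted(window_starts):
        # A file is safe to read only once producers have moved to the next one.
        if start + WINDOW <= now and start not in processed:
            ready.append(start)
    return ready


def input_path(start):
    # Hypothetical layout: one file per window, named by its start time.
    return "/data/events/" + start.strftime("%Y%m%d-%H%M") + ".log"


if __name__ == "__main__":
    starts = [
        datetime(2011, 1, 1, 10, 0),
        datetime(2011, 1, 1, 10, 30),
        datetime(2011, 1, 1, 11, 0),
    ]
    done = {datetime(2011, 1, 1, 10, 0)}
    for s in closed_unprocessed(starts, done, datetime(2011, 1, 1, 11, 10)):
        print(input_path(s))
```

Each path returned this way could be handed to a Pig script as its input for that run, with the window marked as processed only after the results have been loaded into the RDBMS, so that a failed run is simply retried.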
- Getting Data into Data Warehouse from Pig rakesh sharma
- Re: Getting Data into Data Warehouse from Pig Guy Bayes
- Re: Getting Data into Data Warehouse from Pig Dmitriy Ryaboy
- Oracle as Pig data store rakesh sharma
- Re: Oracle as Pig data store Russell Jurney
- Re: Oracle as Pig data store Dmitriy Ryaboy
- RE: Oracle as Pig data store rakesh sharma
- Re: Oracle as Pig data store Hakan İlter
- RE: Oracle as Pig data store rakesh sharma
