btw, I am considering using it to speedup (parallel) online queries over large dataset. Is pig suitable for this, or just suitable for offline large data analysis? Will it be a better choice than distributed(parallel) database in terms of scalability and latency?
I really like the pig's programming interface. So I want to try to use it instead of using parallel database. Thanks! cheers, W. On Wed, Aug 4, 2010 at 9:08 PM, Wenhao Xu <xuwenhao2...@gmail.com> wrote: > Thanks! > Can PigServer handle concurrent requests? Because the store is a > synchronous interface, is there any asynchronous one? > > cheers, > W. > > > On Wed, Aug 4, 2010 at 9:01 PM, Harsh J <qwertyman...@gmail.com> wrote: > >> You need to use the class PigServer. >> >> PigServer pigServer = new PigServer("mapreduce"); // Or "local" for local >> mode >> pigServer.registerQuery("A = LOAD ..."); >> (...) // Your statements here. >> pigServer.store("A", "filename"); >> >> On Thu, Aug 5, 2010 at 9:26 AM, Wenhao Xu <xuwenhao2...@gmail.com> wrote: >> > Hi all, >> > I am new to pig. I am wondering is there any recommended way to call >> Pig >> > code from Java? >> > Is there any Java interface which can be called directly from Java and >> > makes them work smoothly? It seems each keyword (filter, group, cogrape, >> > generate) and data types in Pig can have a counterpart in Java by using >> > Class, interface and data type. Is these Java interface available to >> Java >> > programmers to use? If not, why not? >> > Thanks very much for help! >> > >> > regards, >> > Wenhao >> > >> > -- >> > ~_~ >> > >> >> >> >> -- >> Harsh J >> www.harshj.com >> > > > > -- > ~_~ > -- ~_~