Also, will it handle if the split results in no input? From the looks of it, it will write an empty string with a count of 1.
On May 27, 2010, at 5:14 PM, Robin Anil wrote: > fpg uses regex to split. Just add another option for using the regex > to match instead of splitting. Less work I guess > > > > On Fri, May 28, 2010 at 2:42 AM, Grant Ingersoll <[email protected]> wrote: >> I'd like to take a bunch of logs and extract a bit of each line and then put >> them into format for FPG. Was thinking a simple M/R job that took in a >> regex would suffice and then output in the format for FPG. Is that >> generally useful or am I missing something obvious? I want to do FPG on my >> query logs and it seems like a generally useful conversion. I suppose, in >> fact, it isn't even log specific. >> >> -Grant
