When I need reports, I do:

echo "My\tcolumn\tnames\n" > report.tsv
hdfs -cat my/pig/output/* >> report.tsv

If you need something more elaborate, you could use something like
http://search.cpan.org/dist/Spreadsheet-WriteExcel/ or simply load your TSV
into a database with a script after your pig job finishes, and use any of
the database reporting tools.

MySQL (with the Infobright engine if you have bigger data output) and
something like Pentaho would work: http://www.pentaho.com/
Tableau is really nice, and can load smaller TSV directly, but is Windows
only and a bit pricey.  http://www.tableausoftware.com/

Russ

On Mon, Jun 14, 2010 at 4:24 PM, elein <[email protected]> wrote:

>
> Does there exist any reporting tools that can run on top of
> pig or using pig?  Or does everyone load TSV results in some type of excel.
>
> I will need to create reports with labels and sequential pig queries
> and any fancy display stuff I can send out with email.
>
>
> elein
> [email protected]
>
>
>
>
>

Reply via email to