RE: Using CommonCrawl for POI regression-mass-testing

2016-01-15 Thread Allison, Timothy B.
e/collaborate on? > > Cheers, > >Tim > > > [0] > http://events.linuxfoundation.org/sites/events/files/slides/TikaEval_A > CNA15_allison_herceg_v2.pdf > [1] > https://issues.apache.org/jira/secure/attachment/12782054/reports_pdfb > ox_1_8_11-rc1.zip > &

RE: Using CommonCrawl for POI regression-mass-testing

2016-01-14 Thread Allison, Timothy B.
[mailto:dominik.stad...@gmx.at] Sent: Wednesday, January 13, 2016 2:09 PM To: POI Developers List <dev@poi.apache.org> Subject: Using CommonCrawl for POI regression-mass-testing Hi, FYI, I am playing with CommonCrawl data for some talk that I plan to do in 2016. As part of this I built a small framework to

Re: Using CommonCrawl for POI regression-mass-testing

2016-01-14 Thread Dominik Stadler
files/slides/TikaEval_ACNA15_allison_herceg_v2.pdf > [1] > https://issues.apache.org/jira/secure/attachment/12782054/reports_pdfbox_1_8_11-rc1.zip > > From: Dominik Stadler [mailto:dominik.stad...@gmx.at] > Sent: Wednesday, January 13, 2016 2:09 PM > To: POI Developers List <de

Using CommonCrawl for POI regression-mass-testing

2016-01-13 Thread Dominik Stadler
Hi, FYI, I am playing with CommonCrawl data for some talk that I plan to do in 2016. As part of this I built a small framework to let me run the POI integrationtest-framework on a large number of documents that I extracted from a number of CommonCrawl-runs. This is somewhat similar to what Tim is

Re: Using CommonCrawl for POI regression-mass-testing

2016-01-13 Thread Andreas Beeker
Hi Dominik, I'd like to have the X/HSLF files, so a few of the first 280ies and the NPE. Thank you for your efforts! Andi. - To unsubscribe, e-mail: dev-unsubscr...@poi.apache.org For additional commands, e-mail: