This is an automated email from the ASF dual-hosted git repository.
nick pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/tika.git.
from a0ffec1 Handle .epub files using .htm rather than .html extensions for the embedded contents (TIKA-1288)
new d0324f8 Add a test .sas7bdat file with labels, and generate the columnar/tabular test file in a few more formats
new ca2f5bc Add a time column to the test columnar files
new 1d7a113 CSV assert as best we can (no dedicated parser), start on XLS and SAS7BDAT consistency tests
new 7f89db3 Check header contents, check data rows count, add XLSX test
new cfd6256 Remaining values to check
The 5 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
Summary of changes:
.../org/apache/tika/parser/TabularFormatsTest.java | 196 +++++++++++++++++++--
.../apache/tika/parser/sas/SAS7BDATParserTest.java | 51 ++++--
.../resources/test-documents/test-columnar.csv | 37 ++--
.../resources/test-documents/test-columnar.sas.xml | 113 ++++++++++++
.../test-documents/test-columnar.sas7bdat | Bin 9216 -> 17408 bytes
.../resources/test-documents/test-columnar.xls | Bin 0 -> 6656 bytes
.../resources/test-documents/test-columnar.xlsx | Bin 0 -> 4941 bytes
.../resources/test-documents/test-columnar.xpt | Bin 0 -> 4720 bytes
.../src/test/resources/test-documents/testSAS2.sas | 67 +++++++
9 files changed, 405 insertions(+), 59 deletions(-)
create mode 100644 tika-parsers/src/test/resources/test-documents/test-columnar.sas.xml
create mode 100644 tika-parsers/src/test/resources/test-documents/test-columnar.xls
create mode 100644 tika-parsers/src/test/resources/test-documents/test-columnar.xlsx
create mode 100644 tika-parsers/src/test/resources/test-documents/test-columnar.xpt
create mode 100644 tika-parsers/src/test/resources/test-documents/testSAS2.sas
--
To stop receiving notification emails like this one, please contact
[email protected].