Tilman Hausherr created TIKA-3848:
-------------------------------------
Summary: IllegalArgumentException in DBFColumnHeader.setType()
Key: TIKA-3848
URL: https://issues.apache.org/jira/browse/TIKA-3848
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 2.4.1
Reporter: Tilman Hausherr
{noformat}
"commoncrawl/CC-MAIN-2021-31/2c/80/2c80ca18e8a34133b8defc9813f31bf0058341ab119959ed5f3a2d75affd919f",1,False,"1450","java.lang.IllegalArgumentException:
Unrecognized column type for column: 0� *. I regret I don't recognize: H
at
org.apache.tika.parser.dbf.DBFColumnHeader.setType(DBFColumnHeader.java:55)
at
org.apache.tika.parser.dbf.DBFFileHeader.readCol(DBFFileHeader.java:111)
at org.apache.tika.parser.dbf.DBFFileHeader.parse(DBFFileHeader.java:77)
at org.apache.tika.parser.dbf.DBFReader.<init>(DBFReader.java:59)
at org.apache.tika.parser.dbf.DBFReader.open(DBFReader.java:65)
at org.apache.tika.parser.dbf.DBFParser.parse(DBFParser.java:70)
{noformat}
Possible solution: in {{DBFFileHeader.readCol()}} catch the
{{IllegalArgumentException}} thrown by {{col.setType(colType);}} and throw a
better exception.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)