Tilman Hausherr created TIKA-3848:
-------------------------------------

             Summary: IllegalArgumentException in DBFColumnHeader.setType()
                 Key: TIKA-3848
                 URL: https://issues.apache.org/jira/browse/TIKA-3848
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 2.4.1
            Reporter: Tilman Hausherr


{noformat}
"commoncrawl/CC-MAIN-2021-31/2c/80/2c80ca18e8a34133b8defc9813f31bf0058341ab119959ed5f3a2d75affd919f",1,False,"1450","java.lang.IllegalArgumentException:
 Unrecognized column type for column: 0�       *. I regret I don't recognize: H
        at 
org.apache.tika.parser.dbf.DBFColumnHeader.setType(DBFColumnHeader.java:55)
        at 
org.apache.tika.parser.dbf.DBFFileHeader.readCol(DBFFileHeader.java:111)
        at org.apache.tika.parser.dbf.DBFFileHeader.parse(DBFFileHeader.java:77)
        at org.apache.tika.parser.dbf.DBFReader.<init>(DBFReader.java:59)
        at org.apache.tika.parser.dbf.DBFReader.open(DBFReader.java:65)
        at org.apache.tika.parser.dbf.DBFParser.parse(DBFParser.java:70)
{noformat}

Possible solution: in  {{DBFFileHeader.readCol()}} catch the 
{{IllegalArgumentException}} thrown by {{col.setType(colType);}} and throw a 
better exception.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to