Chinh Nguyen created TIKA-2857:
----------------------------------
Summary: Cannot parse PKCS7 files
Key: TIKA-2857
URL: https://issues.apache.org/jira/browse/TIKA-2857
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 1.20
Reporter: Chinh Nguyen
I cannot get Tika to parse any PKCS7 files; every file I tried throws a Java
exception. To reproduce, create three test files:
* Create a "cert-only" file: {{openssl crl2pkcs7 -nocrl -certfile /usr/share/ca-certificates/mozilla/Entrust_Root_Certification_Authority.crt -out cert.p7 -outform der}}
* Create a signed file: {{echo "hello world" | openssl cms -sign -inkey key.pem -signer cert.pem -md sha256 -outform der -nodetach -out sign.p7}}
* Create a digest file: {{echo "hello world" | openssl cms -digest_create -md sha256 -outform der -out digest.p7}}
Try to parse these files:
{quote}java -jar tika-app-1.20.jar cert.p7
Exception in thread "main" org.apache.tika.exception.TikaException: cannot
parse detached pkcs7 signature (no signed data to parse)
at org.apache.tika.parser.crypto.Pkcs7Parser.parse(Pkcs7Parser.java:81)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:209)
at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:496)
at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:149){quote}
{quote}java -jar tika-app-1.20.jar sign.p7
Exception in thread "main" org.apache.tika.exception.TikaException: TIKA-198:
Illegal IOException from org.apache.tika.parser.crypto.Pkcs7Parser@76a4d6c
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143){quote}
{quote}java -jar tika-app-1.20.jar --config=tika.xml digest.p7
Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected
RuntimeException from org.apache.tika.parser.ParserDecorator$1@5dfcfece
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:282)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:209)
at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:496)
at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:149)
Caused by: java.lang.ClassCastException: org.bouncycastle.asn1.DERSequenceParser cannot be cast to org.bouncycastle.asn1.ASN1SetParser
at org.bouncycastle.asn1.cms.SignedDataParser.getDigestAlgorithms(Unknown Source){quote}
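The ClassCastException in the last trace comes from {{SignedDataParser}}, which suggests (this is only a guess) that the digest file is being read as CMS SignedData, whereas {{openssl cms -digest_create}} should produce DigestedData. As a rough diagnostic aid, the sketch below reads the outer ContentInfo content type OID straight from the file using only the JDK. The class name {{CmsContentType}} and its helper methods are hypothetical and not part of Tika or Bouncy Castle; the OID names are taken from RFC 5652.

```java
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper, not part of Tika: prints the CMS ContentInfo content type.
public class CmsContentType {

    // Reads a BER/DER length at pos[0] and advances pos[0] past it.
    // Returns -1 for the BER indefinite-length form (0x80).
    static int readLength(byte[] der, int[] pos) {
        int first = der[pos[0]++] & 0xFF;
        if (first < 0x80) {
            return first;                       // short form
        }
        int count = first & 0x7F;
        if (count == 0) {
            return -1;                          // indefinite length
        }
        if (count > 4) {
            throw new IllegalArgumentException("length field too large");
        }
        int length = 0;
        for (int i = 0; i < count; i++) {
            length = (length << 8) | (der[pos[0]++] & 0xFF);
        }
        return length;
    }

    // Decodes the content octets of an OBJECT IDENTIFIER into dotted form.
    static String decodeOid(byte[] der, int offset, int length) {
        StringBuilder sb = new StringBuilder();
        long value = 0;
        boolean first = true;
        for (int i = offset; i < offset + length; i++) {
            int b = der[i] & 0xFF;
            value = (value << 7) | (b & 0x7F);  // base-128, high bit = "more follows"
            if ((b & 0x80) == 0) {
                if (first) {
                    long arc1 = Math.min(value / 40, 2);
                    sb.append(arc1).append('.').append(value - 40 * arc1);
                    first = false;
                } else {
                    sb.append('.').append(value);
                }
                value = 0;
            }
        }
        return sb.toString();
    }

    // Expects ContentInfo ::= SEQUENCE { contentType OBJECT IDENTIFIER, ... }.
    static String contentTypeOid(byte[] der) {
        if (der.length < 4) {
            throw new IllegalArgumentException("input too short");
        }
        int[] pos = {0};
        if ((der[pos[0]++] & 0xFF) != 0x30) {
            throw new IllegalArgumentException("outer element is not a SEQUENCE");
        }
        readLength(der, pos);                   // outer length is not needed here
        if ((der[pos[0]++] & 0xFF) != 0x06) {
            throw new IllegalArgumentException("first field is not an OID");
        }
        int length = readLength(der, pos);
        if (length < 1 || pos[0] + length > der.length) {
            throw new IllegalArgumentException("bad OID length");
        }
        return decodeOid(der, pos[0], length);
    }

    // Maps the CMS content type OIDs defined in RFC 5652 to their names.
    static String describe(String oid) {
        switch (oid) {
            case "1.2.840.113549.1.7.1": return "data";
            case "1.2.840.113549.1.7.2": return "signedData";
            case "1.2.840.113549.1.7.3": return "envelopedData";
            case "1.2.840.113549.1.7.5": return "digestedData";
            case "1.2.840.113549.1.7.6": return "encryptedData";
            default: return "unknown";
        }
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java CmsContentType <file>");
            return;
        }
        String oid = contentTypeOid(Files.readAllBytes(Paths.get(args[0])));
        System.out.println(oid + " (" + describe(oid) + ")");
    }
}
```

Compile with {{javac CmsContentType.java}} and run {{java CmsContentType digest.p7}}. If the three files are what the openssl commands suggest, cert.p7 and sign.p7 should report signedData and digest.p7 should report digestedData, which would be consistent with the parser treating every PKCS7 file as SignedData.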