Hi,

I am getting an exception trying to parse an eml file with the following
code:

String fileName = "215.eml"
FileInputStream inputStream = new FileInputStream(fileName);
Tika tika = new Tika();
String text = tika.parseToString(inputStream, metadata);

but I know that it is parsed OK with Luke.

Here is what I get though

org.apache.tika.exception.TikaException: Unexpected RuntimeException from
org.apache.tika.parser.mail.RFC822Parser@b7415b
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at org.apache.tika.Tika.parseToString(Tika.java:357)
at org.freeeed.main.FreeEedParser.parse(FreeEedParser.java:19)
at org.freeeed.main.FileProcessor.extractMetadata(FileProcessor.java:136)
at org.freeeed.main.FileProcessor.processFileEntry(FileProcessor.java:53)
at org.freeeed.main.SingleFileProcessor.process(SingleFileProcessor.java:15)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:30)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.extractEmails(PstProcessor.java:47)
at org.freeeed.main.PstProcessor.process(PstProcessor.java:24)
at
org.freeeed.main.ZipFileProcessor.processZipEntry(ZipFileProcessor.java:41)
at org.freeeed.main.ZipFileProcessor.process(ZipFileProcessor.java:29)
at org.freeeed.main.Map.map(Map.java:19)
at org.freeeed.main.Map.map(Map.java:11)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:639)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
Caused by: java.lang.NullPointerException
at
org.apache.tika.parser.mail.MailContentHandler.field(MailContentHandler.java:130)
at
org.apache.james.mime4j.parser.MimeStreamParser.parse(MimeStreamParser.java:121)
at org.apache.tika.parser.mail.RFC822Parser.parse(RFC822Parser.java:60)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
... 22 more
java.lang.NullPointerException: value cannot be null
at org.apache.lucene.document.Field.<init>(Field.java:396)
at org.apache.lucene.document.Field.<init>(Field.java:371)
at org.apache.lucene.document.Field.<init>(Field.java:350)
at org.freeeed.main.FileProcessor.createDocument(FileProcessor.java:115)
at org.freeeed.main.FileProcessor.isResponsive(FileProcessor.java:94)
at org.freeeed.main.FileProcessor.processFileEntry(FileProcessor.java:54)
at org.freeeed.main.SingleFileProcessor.process(SingleFileProcessor.java:15)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:30)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.extractEmails(PstProcessor.java:47)
at org.freeeed.main.PstProcessor.process(PstProcessor.java:24)
at
org.freeeed.main.ZipFileProcessor.processZipEntry(ZipFileProcessor.java:41)
at org.freeeed.main.ZipFileProcessor.process(ZipFileProcessor.java:29)
at org.freeeed.main.Map.map(Map.java:19)
at org.freeeed.main.Map.map(Map.java:11)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:639)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
org.apache.tika.exception.TikaException: Unexpected RuntimeException from
org.apache.tika.parser.mail.RFC822Parser@baba1a
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
at org.apache.tika.Tika.parseToString(Tika.java:357)
at org.freeeed.main.FreeEedParser.parse(FreeEedParser.java:19)
at org.freeeed.main.FileProcessor.extractMetadata(FileProcessor.java:136)
at org.freeeed.main.FileProcessor.processFileEntry(FileProcessor.java:53)
at org.freeeed.main.SingleFileProcessor.process(SingleFileProcessor.java:15)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:30)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
at org.freeeed.main.PstProcessor.extractEmails(PstProcessor.java:47)
at org.freeeed.main.PstProcessor.process(PstProcessor.java:24)
at
org.freeeed.main.ZipFileProcessor.processZipEntry(ZipFileProcessor.java:41)
at org.freeeed.main.ZipFileProcessor.process(ZipFileProcessor.java:29)
at org.freeeed.main.Map.map(Map.java:19)
at org.freeeed.main.Map.map(Map.java:11)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:639)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
Caused by: java.lang.NullPointerException
at
org.apache.tika.parser.mail.MailContentHandler.field(MailContentHandler.java:130)
at
org.apache.james.mime4j.parser.MimeStreamParser.parse(MimeStreamParser.java:121)
at org.apache.tika.parser.mail.RFC822Parser.parse(RFC822Parser.java:60)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
... 22 more


What should I do?
Thank you very much!

Sincerely,
Mark

Reply via email to