Hi, I am getting an exception trying to parse an eml file with the following code:

    String fileName = "215.eml";
    FileInputStream inputStream = new FileInputStream(fileName);
    Metadata metadata = new Metadata();
    Tika tika = new Tika();
    String text = tika.parseToString(inputStream, metadata);

I know that the file is parsed OK with Luke, but here is what I get:

org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mail.RFC822Parser@b7415b
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
    at org.apache.tika.Tika.parseToString(Tika.java:357)
    at org.freeeed.main.FreeEedParser.parse(FreeEedParser.java:19)
    at org.freeeed.main.FileProcessor.extractMetadata(FileProcessor.java:136)
    at org.freeeed.main.FileProcessor.processFileEntry(FileProcessor.java:53)
    at org.freeeed.main.SingleFileProcessor.process(SingleFileProcessor.java:15)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:30)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.extractEmails(PstProcessor.java:47)
    at org.freeeed.main.PstProcessor.process(PstProcessor.java:24)
    at org.freeeed.main.ZipFileProcessor.processZipEntry(ZipFileProcessor.java:41)
    at org.freeeed.main.ZipFileProcessor.process(ZipFileProcessor.java:29)
    at org.freeeed.main.Map.map(Map.java:19)
    at org.freeeed.main.Map.map(Map.java:11)
    at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:639)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
Caused by: java.lang.NullPointerException
    at org.apache.tika.parser.mail.MailContentHandler.field(MailContentHandler.java:130)
    at org.apache.james.mime4j.parser.MimeStreamParser.parse(MimeStreamParser.java:121)
    at org.apache.tika.parser.mail.RFC822Parser.parse(RFC822Parser.java:60)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
    ... 22 more

java.lang.NullPointerException: value cannot be null
    at org.apache.lucene.document.Field.<init>(Field.java:396)
    at org.apache.lucene.document.Field.<init>(Field.java:371)
    at org.apache.lucene.document.Field.<init>(Field.java:350)
    at org.freeeed.main.FileProcessor.createDocument(FileProcessor.java:115)
    at org.freeeed.main.FileProcessor.isResponsive(FileProcessor.java:94)
    at org.freeeed.main.FileProcessor.processFileEntry(FileProcessor.java:54)
    at org.freeeed.main.SingleFileProcessor.process(SingleFileProcessor.java:15)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:30)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.extractEmails(PstProcessor.java:47)
    at org.freeeed.main.PstProcessor.process(PstProcessor.java:24)
    at org.freeeed.main.ZipFileProcessor.processZipEntry(ZipFileProcessor.java:41)
    at org.freeeed.main.ZipFileProcessor.process(ZipFileProcessor.java:29)
    at org.freeeed.main.Map.map(Map.java:19)
    at org.freeeed.main.Map.map(Map.java:11)
    at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:639)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)

org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mail.RFC822Parser@baba1a
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:199)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:135)
    at org.apache.tika.Tika.parseToString(Tika.java:357)
    at org.freeeed.main.FreeEedParser.parse(FreeEedParser.java:19)
    at org.freeeed.main.FileProcessor.extractMetadata(FileProcessor.java:136)
    at org.freeeed.main.FileProcessor.processFileEntry(FileProcessor.java:53)
    at org.freeeed.main.SingleFileProcessor.process(SingleFileProcessor.java:15)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:30)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.collectEmails(PstProcessor.java:35)
    at org.freeeed.main.PstProcessor.extractEmails(PstProcessor.java:47)
    at org.freeeed.main.PstProcessor.process(PstProcessor.java:24)
    at org.freeeed.main.ZipFileProcessor.processZipEntry(ZipFileProcessor.java:41)
    at org.freeeed.main.ZipFileProcessor.process(ZipFileProcessor.java:29)
    at org.freeeed.main.Map.map(Map.java:19)
    at org.freeeed.main.Map.map(Map.java:11)
    at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:639)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
Caused by: java.lang.NullPointerException
    at org.apache.tika.parser.mail.MailContentHandler.field(MailContentHandler.java:130)
    at org.apache.james.mime4j.parser.MimeStreamParser.parse(MimeStreamParser.java:121)
    at org.apache.tika.parser.mail.RFC822Parser.parse(RFC822Parser.java:60)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
    ... 22 more

What should I do? Thank you very much!

Sincerely,
Mark
