[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14212258#comment-14212258 ]
Tim Allison commented on TIKA-1445: ----------------------------------- This is what we're currently doing in CompositeParser#getParsers(ParseContext context) {noformat} clobbering: o.a.t.p.gdal.GDALParser@677556a0 with o.a.t.p.hdf.HDFParser@488a5770 for application/x-hdf clobbering: o.a.t.p.gdal.GDALParser@677556a0 with o.a.t.p.image.ImageParser@72729f44 for image/x-ms-bmp clobbering: o.a.t.p.gdal.GDALParser@677556a0 with o.a.t.p.image.ImageParser@72729f44 for image/png clobbering: o.a.t.p.gdal.GDALParser@677556a0 with o.a.t.p.image.ImageParser@72729f44 for image/gif clobbering: o.a.t.p.image.ImageParser@72729f44 with o.a.t.p.image.ImageParser@72729f44 for image/x-ms-bmp clobbering: o.a.t.p.gdal.GDALParser@677556a0 with o.a.t.p.jpeg.JpegParser@4336640f for image/jpeg clobbering: o.a.t.p.microsoft.TNEFParser@27e33742 with o.a.t.p.microsoft.TNEFParser@27e33742 for application/vnd.ms-tnef clobbering: o.a.t.p.gdal.GDALParser@677556a0 with o.a.t.p.netcdf.NetCDFParser@3640e283 for application/x-netcdf clobbering: o.a.t.p.image.ImageParser@72729f44 with o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/x-ms-bmp clobbering: o.a.t.p.jpeg.JpegParser@4336640f with o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/jpeg clobbering: o.a.t.p.image.ImageParser@72729f44 with o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/png clobbering: o.a.t.p.image.TiffParser@570bd519 with o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/tiff clobbering: o.a.t.p.image.ImageParser@72729f44 with o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/gif clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.image-template clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.spreadsheet-template clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.chart-template clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.formula clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.text-web clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.text clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.formula-template clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.spreadsheet clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.text-master clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.text-template clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.graphics clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.graphics-template clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.presentation clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.image clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.presentation-template clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with o.a.t.p.odf.OpenDocumentParser@49d388f4 for application/vnd.oasis.opendocument.chart clobbering: o.a.t.p.pkg.CompressorParser@5ec47109 with o.a.t.p.pkg.CompressorParser@5ec47109 for application/gzip {noformat} > Figure out how to add Image metadata extraction to Tesseract parser > ------------------------------------------------------------------- > > Key: TIKA-1445 > URL: https://issues.apache.org/jira/browse/TIKA-1445 > Project: Tika > Issue Type: Bug > Components: parser > Reporter: Chris A. Mattmann > Assignee: Chris A. Mattmann > Fix For: 1.8 > > Attachments: TIKA-1445.Mattmann.101214.patch.txt, > TIKA-1445.Palsulich.102614.patch, TIKA-1445_tallison_20141027.patch.txt, > TIKA-1445_tallison_v2_20141027.patch, TIKA-1445_tallison_v3_20141027.patch > > > Now that Tesseract is the default image parser in Tika for many image types, > consider how to add back in the metadata extraction capabilities by the other > Image parsers. -- This message was sent by Atlassian JIRA (v6.3.4#6332)