[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14212258#comment-14212258
]
Tim Allison commented on TIKA-1445:
-----------------------------------
This is what we're currently doing in CompositeParser#getParsers(ParseContext
context)
{noformat}
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with
o.a.t.p.hdf.HDFParser@488a5770 for application/x-hdf
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with
o.a.t.p.image.ImageParser@72729f44 for image/x-ms-bmp
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with
o.a.t.p.image.ImageParser@72729f44 for image/png
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with
o.a.t.p.image.ImageParser@72729f44 for image/gif
clobbering: o.a.t.p.image.ImageParser@72729f44 with
o.a.t.p.image.ImageParser@72729f44 for image/x-ms-bmp
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with
o.a.t.p.jpeg.JpegParser@4336640f for image/jpeg
clobbering: o.a.t.p.microsoft.TNEFParser@27e33742 with
o.a.t.p.microsoft.TNEFParser@27e33742 for application/vnd.ms-tnef
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with
o.a.t.p.netcdf.NetCDFParser@3640e283 for application/x-netcdf
clobbering: o.a.t.p.image.ImageParser@72729f44 with
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/x-ms-bmp
clobbering: o.a.t.p.jpeg.JpegParser@4336640f with
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/jpeg
clobbering: o.a.t.p.image.ImageParser@72729f44 with
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/png
clobbering: o.a.t.p.image.TiffParser@570bd519 with
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/tiff
clobbering: o.a.t.p.image.ImageParser@72729f44 with
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/gif
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.image-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.spreadsheet-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.chart-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.formula
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.text-web
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.text
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.formula-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.spreadsheet
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.text-master
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.text-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.graphics
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.graphics-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.presentation
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.image
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.presentation-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with
o.a.t.p.odf.OpenDocumentParser@49d388f4 for
application/vnd.oasis.opendocument.chart
clobbering: o.a.t.p.pkg.CompressorParser@5ec47109 with
o.a.t.p.pkg.CompressorParser@5ec47109 for application/gzip
{noformat}
> Figure out how to add Image metadata extraction to Tesseract parser
> -------------------------------------------------------------------
>
> Key: TIKA-1445
> URL: https://issues.apache.org/jira/browse/TIKA-1445
> Project: Tika
> Issue Type: Bug
> Components: parser
> Reporter: Chris A. Mattmann
> Assignee: Chris A. Mattmann
> Fix For: 1.8
>
> Attachments: TIKA-1445.Mattmann.101214.patch.txt,
> TIKA-1445.Palsulich.102614.patch, TIKA-1445_tallison_20141027.patch.txt,
> TIKA-1445_tallison_v2_20141027.patch, TIKA-1445_tallison_v3_20141027.patch
>
>
> Now that Tesseract is the default image parser in Tika for many image types,
> consider how to add back in the metadata extraction capabilities by the other
> Image parsers.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)