[ 
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14212258#comment-14212258
 ] 

Tim Allison commented on TIKA-1445:
-----------------------------------

This is what we're currently doing in CompositeParser#getParsers(ParseContext 
context) 

{noformat}
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with 
o.a.t.p.hdf.HDFParser@488a5770 for application/x-hdf
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with 
o.a.t.p.image.ImageParser@72729f44 for image/x-ms-bmp
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with 
o.a.t.p.image.ImageParser@72729f44 for image/png
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with 
o.a.t.p.image.ImageParser@72729f44 for image/gif
clobbering: o.a.t.p.image.ImageParser@72729f44 with 
o.a.t.p.image.ImageParser@72729f44 for image/x-ms-bmp
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with 
o.a.t.p.jpeg.JpegParser@4336640f for image/jpeg
clobbering: o.a.t.p.microsoft.TNEFParser@27e33742 with 
o.a.t.p.microsoft.TNEFParser@27e33742 for application/vnd.ms-tnef
clobbering: o.a.t.p.gdal.GDALParser@677556a0 with 
o.a.t.p.netcdf.NetCDFParser@3640e283 for application/x-netcdf
clobbering: o.a.t.p.image.ImageParser@72729f44 with 
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/x-ms-bmp
clobbering: o.a.t.p.jpeg.JpegParser@4336640f with 
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/jpeg
clobbering: o.a.t.p.image.ImageParser@72729f44 with 
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/png
clobbering: o.a.t.p.image.TiffParser@570bd519 with 
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/tiff
clobbering: o.a.t.p.image.ImageParser@72729f44 with 
o.a.t.p.ocr.TesseractOCRParser@5dd72248 for image/gif
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.image-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.spreadsheet-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.chart-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.formula
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.text-web
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.text
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.formula-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.spreadsheet
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.text-master
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.text-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.graphics
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.graphics-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.presentation
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.image
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.presentation-template
clobbering: o.a.t.p.odf.OpenDocumentParser@49d388f4 with 
o.a.t.p.odf.OpenDocumentParser@49d388f4 for 
application/vnd.oasis.opendocument.chart
clobbering: o.a.t.p.pkg.CompressorParser@5ec47109 with 
o.a.t.p.pkg.CompressorParser@5ec47109 for application/gzip

{noformat}

> Figure out how to add Image metadata extraction to Tesseract parser
> -------------------------------------------------------------------
>
>                 Key: TIKA-1445
>                 URL: https://issues.apache.org/jira/browse/TIKA-1445
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 1.8
>
>         Attachments: TIKA-1445.Mattmann.101214.patch.txt, 
> TIKA-1445.Palsulich.102614.patch, TIKA-1445_tallison_20141027.patch.txt, 
> TIKA-1445_tallison_v2_20141027.patch, TIKA-1445_tallison_v3_20141027.patch
>
>
> Now that Tesseract is the default image parser in Tika for many image types, 
> consider how to add back in the metadata extraction capabilities by the other 
> Image parsers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to