[ https://issues.apache.org/jira/browse/TIKA-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14522905#comment-14522905 ]
Chris A. Mattmann commented on TIKA-443: ---------------------------------------- I fixed it by moving the extractContent function after the metadata extraction happens first. {noformat} [mattmann-0420740:~/tmp/tika1.9] mattmann% java -jar tika-app/target/tika-app-1.9-SNAPSHOT.jar -m tika-parsers/src/test/resources/test-documents/sampleFile.iso19139 May 01, 2015 12:57:16 AM org.apache.sis.internal.jaxb.gml.TM_Primitive setTimePeriod WARNING: This operation requires the “sis-temporal” module. AccessContraints : OTHER_RESTRICTIONS CharacterSet: UTF-8 CitationDate : CREATION-->Mon Dec 16 00:00:00 PST 2013 CitationDate : modified-->Wed Mar 11 00:00:00 PDT 2015 CitedResponsiblePartyEMail : holli...@gvsu.edu CitedResponsiblePartyName : Robert Hollister CitedResponsiblePartyName : Robert Hollister CitedResponsiblePartyRole : Role[POINT_OF_CONTACT] CitedResponsiblePartyRole : Role[AUTHOR] ContactPartyName-: UCAR/NCAR - CISL - ACADIS ContactRole: RESOURCE_PROVIDER Content-Length: 19370 Content-Type: text/iso19139+xml DateInfo : CREATION Mon Dec 16 05:26:08 PST 2013 DistributionFormatSpecificationAlternativeTitle : Other ASCII Distributor Contact : RESOURCE_PROVIDER Distributor Organization Name : UCAR/NCAR - CISL - ACADIS GeographicIdentifierAuthorityAlternativeTitle : Locations GeographicIdentifierAuthorityDate : REVISION Thu Aug 28 00:00:00 PDT 2014 GeographicIdentifierAuthorityTitle : NASA/GCMD Earth Science Keywords GeographicIdentifierCode : UNITED STATES OF AMERICA > ALASKA IdentificationInfoAbstract : These files contain data representing the periodic plant measures of species within each plot in a text tab delimited format. The data presented are seasonal growth of graminoids (length of leaf and length of inflorescence) and seasonal flowering of all species (number of inflorescences in flower within a plot), collected weekly during the summers of 2012-20XX for a subset of 30 grid plots at two sites (Barrow ARCSS grid and Atqasuk ARCSS grid). IdentificationInfoCitationTitle : Barrow Atqasuk ARCSS Plant IdentificationInfoLanguage-->: English IdentificationInfoStatus : ON_GOING IdentificationInfoTopicCategory-->: BIOTA Keywords 2: EARTH SCIENCE > BIOSPHERE > TERRESTRIAL ECOSYSTEMS > ALPINE/TUNDRA Keywords 3: FIELD SURVEY Keywords 4: POINT Keywords 5: LESS THAN 1 METER Keywords 6: DAILY TO WEEKLY KeywordsType 2: THEME KeywordsType 3: THEME KeywordsType 4: THEME KeywordsType 5: THEME KeywordsType 6: THEME MetaDataIdentifierCode: urn:x-wmo:md:org.aoncadis.www::4c1a919d-6690-11e3-9147-00c0f03d5b7c MetaDataResourceScope : DATASET MetaDataStandardEdition : ISO 19115:2003(E) MetaDataStandardTitle : ISO 19115 Geographic information - Metadata OtherConstraints : Access Constraints: No Access Constraints. Use Constraints: No Use Constraints. ParentMetaDataTitle: urn:x-wmo:md:org.aoncadis.www::d2e4e808-6830-11df-abb3-00c0f03d5b7c ResourceFormatSpecificationAlternativeTitle : Other ASCII ThesaurusNameAlternativeTitle 2: [Science and Services Keywords] ThesaurusNameAlternativeTitle 3: [Platforms] ThesaurusNameAlternativeTitle 4: [Spatial Data Type] ThesaurusNameAlternativeTitle 5: [Horizontal Data Resolution] ThesaurusNameAlternativeTitle 6: [Temporal Data Resolution] ThesaurusNameDate : REVISION-->Wed May 21 00:00:00 PDT 2014 ThesaurusNameDate : REVISION-->Tue Oct 07 00:00:00 PDT 2014 ThesaurusNameDate : REVISION-->Tue Oct 07 00:00:00 PDT 2014 ThesaurusNameDate : REVISION-->Wed May 21 00:00:00 PDT 2014 ThesaurusNameDate : REVISION-->Wed May 21 00:00:00 PDT 2014 ThesaurusNameTitle 2: NASA/GCMD Earth Science Keywords ThesaurusNameTitle 3: ACADIS Keywords ThesaurusNameTitle 4: ACADIS Keywords ThesaurusNameTitle 5: NASA/GCMD Earth Science Keywords ThesaurusNameTitle 6: NASA/GCMD Earth Science Keywords TransferOptionsOnlineDescription : Metadata Link TransferOptionsOnlineFunction : DOWNLOAD TransferOptionsOnlineLinkage : https://www.aoncadis.org/dataset/id/4c1a919d-6690-11e3-9147-00c0f03d5b7c.html TransferOptionsOnlineName : Barrow Atqasuk ARCSS Plant TransferOptionsOnlineProfile : browser TransferOptionsOnlineProtocol : https UserConstraints : OTHER_RESTRICTIONS X-Parsed-By: org.apache.tika.parser.DefaultParser X-Parsed-By: org.apache.tika.parser.geoinfo.GeographicInformationParser resourceName: sampleFile.iso19139 [mattmann-0420740:~/tmp/tika1.9] mattmann% {noformat} Works great! Committing. > Geographic Information Parser > ----------------------------- > > Key: TIKA-443 > URL: https://issues.apache.org/jira/browse/TIKA-443 > Project: Tika > Issue Type: New Feature > Components: parser > Reporter: Arturo Beltran > Assignee: Chris A. Mattmann > Labels: new-parser > Attachments: getFDOMetadata.xml > > > I'm working in the automatic description of geospatial resources, and I think > that might be interesting to incorporate new parser/s to Tika in order to > manage and describe some geo-formats. These geo-formats include files, > services and databases. > If anyone is interested in this issue or want to collaborate do not hesitate > to contact me. Any help is welcome. -- This message was sent by Atlassian JIRA (v6.3.4#6332)