[ 
https://issues.apache.org/jira/browse/TIKA-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14522905#comment-14522905
 ] 

Chris A. Mattmann commented on TIKA-443:
----------------------------------------

I fixed it by moving the extractContent function after the metadata extraction 
happens first.

{noformat}
[mattmann-0420740:~/tmp/tika1.9] mattmann% java -jar 
tika-app/target/tika-app-1.9-SNAPSHOT.jar -m 
tika-parsers/src/test/resources/test-documents/sampleFile.iso19139
May 01, 2015 12:57:16 AM org.apache.sis.internal.jaxb.gml.TM_Primitive 
setTimePeriod
WARNING: This operation requires the “sis-temporal” module.
AccessContraints : OTHER_RESTRICTIONS
CharacterSet: UTF-8
CitationDate : CREATION-->Mon Dec 16 00:00:00 PST 2013
CitationDate : modified-->Wed Mar 11 00:00:00 PDT 2015
CitedResponsiblePartyEMail : holli...@gvsu.edu
CitedResponsiblePartyName : Robert Hollister
CitedResponsiblePartyName : Robert Hollister
CitedResponsiblePartyRole : Role[POINT_OF_CONTACT]
CitedResponsiblePartyRole : Role[AUTHOR]
ContactPartyName-: UCAR/NCAR - CISL - ACADIS
ContactRole: RESOURCE_PROVIDER
Content-Length: 19370
Content-Type: text/iso19139+xml
DateInfo : CREATION Mon Dec 16 05:26:08 PST 2013
DistributionFormatSpecificationAlternativeTitle : Other ASCII
Distributor Contact : RESOURCE_PROVIDER
Distributor Organization Name : UCAR/NCAR - CISL - ACADIS
GeographicIdentifierAuthorityAlternativeTitle : Locations
GeographicIdentifierAuthorityDate : REVISION Thu Aug 28 00:00:00 PDT 2014
GeographicIdentifierAuthorityTitle : NASA/GCMD Earth Science Keywords
GeographicIdentifierCode : UNITED STATES OF AMERICA > ALASKA
IdentificationInfoAbstract : These files contain data representing the periodic 
plant measures of species within each plot in a text tab delimited format. The 
data presented are seasonal growth of graminoids (length of leaf and length of 
inflorescence) and seasonal flowering of all species (number of inflorescences 
in flower within a plot), collected weekly during the summers of 2012-20XX for 
a subset of 30 grid plots at two sites (Barrow ARCSS grid and Atqasuk ARCSS 
grid).
IdentificationInfoCitationTitle : Barrow Atqasuk ARCSS Plant
IdentificationInfoLanguage-->: English
IdentificationInfoStatus : ON_GOING
IdentificationInfoTopicCategory-->: BIOTA
Keywords 2: EARTH SCIENCE > BIOSPHERE > TERRESTRIAL ECOSYSTEMS > ALPINE/TUNDRA
Keywords 3: FIELD SURVEY
Keywords 4: POINT
Keywords 5: LESS THAN 1 METER
Keywords 6: DAILY TO WEEKLY
KeywordsType 2: THEME
KeywordsType 3: THEME
KeywordsType 4: THEME
KeywordsType 5: THEME
KeywordsType 6: THEME
MetaDataIdentifierCode: 
urn:x-wmo:md:org.aoncadis.www::4c1a919d-6690-11e3-9147-00c0f03d5b7c
MetaDataResourceScope : DATASET
MetaDataStandardEdition : ISO 19115:2003(E)
MetaDataStandardTitle : ISO 19115 Geographic information - Metadata
OtherConstraints : Access Constraints: No Access Constraints. Use Constraints: 
No Use Constraints.
ParentMetaDataTitle: 
urn:x-wmo:md:org.aoncadis.www::d2e4e808-6830-11df-abb3-00c0f03d5b7c
ResourceFormatSpecificationAlternativeTitle : Other ASCII
ThesaurusNameAlternativeTitle 2: [Science and Services Keywords]
ThesaurusNameAlternativeTitle 3: [Platforms]
ThesaurusNameAlternativeTitle 4: [Spatial Data Type]
ThesaurusNameAlternativeTitle 5: [Horizontal Data Resolution]
ThesaurusNameAlternativeTitle 6: [Temporal Data Resolution]
ThesaurusNameDate : REVISION-->Wed May 21 00:00:00 PDT 2014
ThesaurusNameDate : REVISION-->Tue Oct 07 00:00:00 PDT 2014
ThesaurusNameDate : REVISION-->Tue Oct 07 00:00:00 PDT 2014
ThesaurusNameDate : REVISION-->Wed May 21 00:00:00 PDT 2014
ThesaurusNameDate : REVISION-->Wed May 21 00:00:00 PDT 2014
ThesaurusNameTitle 2: NASA/GCMD Earth Science Keywords
ThesaurusNameTitle 3: ACADIS Keywords
ThesaurusNameTitle 4: ACADIS Keywords
ThesaurusNameTitle 5: NASA/GCMD Earth Science Keywords
ThesaurusNameTitle 6: NASA/GCMD Earth Science Keywords
TransferOptionsOnlineDescription : Metadata Link
TransferOptionsOnlineFunction : DOWNLOAD
TransferOptionsOnlineLinkage : 
https://www.aoncadis.org/dataset/id/4c1a919d-6690-11e3-9147-00c0f03d5b7c.html
TransferOptionsOnlineName : Barrow Atqasuk ARCSS Plant
TransferOptionsOnlineProfile : browser
TransferOptionsOnlineProtocol : https
UserConstraints : OTHER_RESTRICTIONS
X-Parsed-By: org.apache.tika.parser.DefaultParser
X-Parsed-By: org.apache.tika.parser.geoinfo.GeographicInformationParser
resourceName: sampleFile.iso19139
[mattmann-0420740:~/tmp/tika1.9] mattmann% 
{noformat}

Works great!
Committing. 

> Geographic Information Parser
> -----------------------------
>
>                 Key: TIKA-443
>                 URL: https://issues.apache.org/jira/browse/TIKA-443
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Arturo Beltran
>            Assignee: Chris A. Mattmann
>              Labels: new-parser
>         Attachments: getFDOMetadata.xml
>
>
> I'm working in the automatic description of geospatial resources, and I think 
> that might be interesting to incorporate new parser/s to Tika in order to 
> manage and describe some geo-formats. These geo-formats include files, 
> services and databases.
> If anyone is interested in this issue or want to collaborate do not hesitate 
> to contact me. Any help is welcome.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to