Hi,

when is the Dublin Core XML parser used to parse XML files?
Is there a configuration required to enable the DcXMLParser?

There is a difference between 1.27 and 2.1.0:

$> java -jar tika-app-1.27.jar -J \
      https://news.haltonhills.halinet.on.ca/dc.xml \
   | jq '.[0]."dc:title"'
"Deaths"
$> java -jar tika-app-2.1.0.jar ...
null

$> java -jar tika-app-1.27.jar -J \
      https://news.haltonhills.halinet.on.ca/dc.xml \
   | jq '.[0]."X-Parsed-By"'
[
  "org.apache.tika.parser.DefaultParser",
  "org.apache.tika.parser.xml.DcXMLParser"
]
$> java -jar tika-app-2.1.0.jar -J \
      https://news.haltonhills.halinet.on.ca/dc.xml \
   | jq '.[0]."X-TIKA:Parsed-By"'
[
  "org.apache.tika.parser.DefaultParser",
  "org.apache.tika.parser.xml.XMLParser"
]


Thanks,
Sebastian

Reply via email to