I believe this was corrected recently in Tika 1.9 https://issues.apache.org/jira/browse/TIKA-2761 <https://issues.apache.org/jira/browse/TIKA-2761>
Are you using the latest version of Tika? What tags in particular have you noticed are missing? Nick > On Nov 27, 2018, at 2:57 PM, Feng Ye <[email protected]> wrote: > > Hi Experts, > I found that XML tags are removed when using Tika to process the xml files. > As tags contain useful metadata info (such as author etc), is there an option > to keep the tags? Your timely reply will be appreciated! > > Thanks! > feng
