I am working with a collection of XML documents that I need to parse in
order to index them in Lucene. There is a DTD file that describes the
structure of these documents, but when I started working with it I found out
that it was an SGML DTD file. I found some advice on the internet about how
to convert an SGML DTD file so that it complies with XML DTD standards, and
I made changes to enable the DTD file to be used by an XML parser. There was
one !ENTITY declaration that I didn't know what to do with. It looks like
this:

 

<!ENTITY #DEFAULT SYSTEM >

 

Things seem to work ok if I just comment this out, but I wonder what it is
for, and whether there is a way to make it acceptable to an XML parser in
such a way that it does what the original author meant for it to do.
Apparently the XML parser doesn't like the # at the start of #DEFAULT.
Thanks.

Mike O'Leary

Reply via email to