Hi Danny,

Yes, the URI lexicon is enabled. That has told me how many XML files I have. Other than limiting processing to the XML files, I'm not sure how this helps. Maybe I'm under-utilizing this? I do not have any applicable range indexes in place today.

Regarding DTDs and namespaces, you can have XML files conforming to DTDs where namespaces are not defined. You only get the doctype declaration before the root node; e.g.:

    <!DOCTYPE mathml PUBLIC "-//W3C//DTD MathML 2.0//EN" "mathml2/mathml2.dtd">
    <mathml [no namespace atts]>...</>

In the above case, I'd like to retrieve the public ID of "-//W3C//DTD MathML 2.0//EN" and the system ID of "mathml2/mathml2.dtd", or simply the entire doctype declaration.

Thank you.

-Brent

________________________________
From: [email protected] [mailto:[email protected]] On Behalf Of Danny Sokolsky
Sent: Thursday, December 18, 2008 1:12 PM
To: General Mark Logic Developer Discussion
Subject: RE: [MarkLogic Dev General] How best to identify all referenced contentmodels?

Hi Brent,

Do you have the URI lexicon enabled for your database? It might help. Also, do you have any range indexes on the elements or attributes in question? If so, you can do range queries or lexicon lookups on those values.

I am not sure what you mean by your concern about DTDs and namespaces. Perhaps if you gave a sample XML snippet or two showing what the "public and system IDs" look like, that might help with a more specific answer.

-Danny

From: [email protected] [mailto:[email protected]] On Behalf Of Hartwig, Brent (CL Tech Sv)
Sent: Thursday, December 18, 2008 9:48 AM
To: General Mark Logic Developer Discussion
Subject: [MarkLogic Dev General] How best to identify all referenced contentmodels?

Hello and Happy Holidays,

I'm trying to identify the public and system IDs of all content models our XML files reference. The XML files are in ML 3.2 and may conform to a DTD or XML Schema. Given the number of XML files, I would prefer not to take the I/O hit for each file. I am also interested in the reverse: URIs of XML files not defining the content model.

I see many namespace-related functions but am concerned these will not help with the majority of files, which conform to DTDs. Any ideas?

Thank you in advance.

-Brent
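
To make the doctype question in this thread concrete, here is a minimal sketch in Python of pulling the public and system IDs out of a doctype declaration. It assumes the raw XML text is readable outside the database (for example, from the original files on disk); nothing in the thread confirms that. The names `Doctype` and `parse_doctype` and the regular expression are illustrative assumptions, not MarkLogic APIs. The pattern covers only the `PUBLIC "pubid" "sysid"` and `SYSTEM "sysid"` forms and does not skip declarations that appear inside comments or CDATA sections.

```python
import re
from typing import NamedTuple, Optional


class Doctype(NamedTuple):
    """Hypothetical container for the pieces of a doctype declaration."""
    root: str
    public_id: Optional[str]
    system_id: Optional[str]


# Matches <!DOCTYPE root PUBLIC "pub" "sys"> or <!DOCTYPE root SYSTEM "sys">.
# Either quote character is accepted; the external ID is optional.
_DOCTYPE_RE = re.compile(
    r"""<!DOCTYPE\s+(?P<root>[^\s>\[]+)
        (?:\s+PUBLIC\s+(?P<q1>["'])(?P<pub>.*?)(?P=q1)
             (?:\s+(?P<q2>["'])(?P<sys1>.*?)(?P=q2))?
          |\s+SYSTEM\s+(?P<q3>["'])(?P<sys2>.*?)(?P=q3)
        )?""",
    re.VERBOSE | re.DOTALL,
)


def parse_doctype(xml_text: str) -> Optional[Doctype]:
    """Return the first doctype declaration found, or None if there is none."""
    m = _DOCTYPE_RE.search(xml_text)
    if m is None:
        return None
    # The system ID lives in a different group depending on PUBLIC vs SYSTEM.
    system_id = m.group("sys1") or m.group("sys2")
    return Doctype(m.group("root"), m.group("pub"), system_id)


if __name__ == "__main__":
    sample = (
        '<!DOCTYPE mathml PUBLIC "-//W3C//DTD MathML 2.0//EN" '
        '"mathml2/mathml2.dtd">\n<mathml>...</mathml>'
    )
    print(parse_doctype(sample))
```

Files for which `parse_doctype` returns `None` would be candidates for the reverse list (no DTD-based content model declared). References to an XML Schema, such as an `xsi:schemaLocation` attribute, are not covered by this sketch.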
_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general
