[lxml] Re: python lxml.objectify gives no attribute access to gco:CharacterString node

Stefan Behnel Thu, 03 Mar 2022 00:52:14 -0800

Dr. Volker Jaenisch schrieb am 01.03.22 um 16:06:

To find the desired sibling the code loops over all childern and matches(parentNamespace, propertyName) against them.
The correct operation of _findFollowingSibling should IMHO be:
Make a lookup on all children (with the python property name only). If onematch is found then return this match. If none or more than one match isfound then no answer is possible.

I see a major drawback with this behaviour, and that is non-localdependencies. If you have this XML:


    <a:root xmlns:a="A" xmlns:b="B">
        <b:ch1/>
        <b:ch2/>
    </a:root>

then "root.ch1" would give you the first child. Great, so you use that inyour code. Now, someone decides to send you an input document that lookslike this:


    <a:root xmlns:a="A" xmlns:b="B" xmlns:c="C">
        <b:ch1/>
        <b:ch2/>
        <c:ch1/>
    </a:root>

And your code will suddenly fail to find "root.ch1". Depending on what yourcode does and how it does it, it may fail with an exception, or it may failsilently to find the desired data and just keep working without it.

Note that the content of the XML file that your code is designed to processdid not change at all. It's just that some entirely unrelated content wasadded, in a completely different and unrelated namespace. And it was justexternally added to the input data, or maybe just some tiny portion it,without telling you or your code about it. Especially in places withoptional content, where different namespaces are already a little morecommon than elsewhere, this is fairly likely to go unnoticed.

I find this kind of behaviour dangerous enough to restrict the "magic" inthe API to what is easy to understand and predict.


Stefan
_______________________________________________
lxml - The Python XML Toolkit mailing list -- [email protected]
To unsubscribe send an email to [email protected]
https://mail.python.org/mailman3/lists/lxml.python.org/
Member address: [email protected]

[lxml] Re: python lxml.objectify gives no attribute access to gco:CharacterString node

Reply via email to