On Sunday, 28 June 2015 03:46:56 UTC-4, Stefan Behnel  wrote:
> Denis McMahon schrieb am 26.06.2015 um 09:44:
> > xml data is an unordered list, and are trying to assign an order to it.
> > 
> > If the xml data was ordered, either each tag would be different, or each 
> > tag would have an attribute specifying a sequence number.
> 
> XML is not unordered. The document order is well defined and entirely
> obvious from the data. Whether this order is relevant and has a meaning or
> not is, however, not part of XML itself but is left to the semantics of the
> specific document format at hand. Meaning, XML document formats can choose
> to ignore that order and define it as irrelevant. That doesn't mean it's
> not there for a given document, but it may mean that a re-transmission of
> the same document would be allowed to use a different order without
> changing the information.
> 
> This property applies to pretty much all structured data formats and not
> just XML, by the way, also to CSV and other tabular formats.
> 
> Stefan

@Stefan, Ned, and Robert: You have all hit the nail on the head. I do not have 
an authentic and veritable XSD (or XML data structures for that matter). So far 
it is all deprecated and/or anonymized data from the client. Therefore, I can 
only hypothesize what the end output will be for the database architecture that 
I am working with. 

>From what I understand, therefore, based on your constructive insight, is that 
>the 14 occurrences of the same tag (regardless of placement relative to 
>neighbouring children and the root are all being defined as the same key. 
>However, their individual values are also being treated as the same (from the 
>algorithm that I wrote in my Stack Overflow post (please see above)). The 
>constraint is that I am anticipating terabytes of data every day from the 
>client in the coming months. The algorithm should be able to parse, and write 
>out to CSV in the most efficient manner. That is my design constraint. I 
>welcome your feedback on this. 

Here is the post, again, for your convenience: 

http://stackoverflow.com/questions/31058100/enumerate-column-headers-in-csv-that-belong-to-the-same-tag-key-in-python
 


-- 
https://mail.python.org/mailman/listinfo/python-list

Reply via email to