|
Hello!
I have got an XML which I don�t know how to create
a valid schema for.
<info>
Om
<emph
type="italic">vacker</emph>
syftar
p�
<emph type="italic">det h�ga landet</emph>
�r det kongruensfel
</info>
Note the String objects between the
emph-tags.
I thought of something like this:
<xs:element name="info" minOccurs="1" maxOccurs="1"
>
<xs:complexType mixed="true" > <xs:sequence> <xs:element name="emph" minOccurs="0" maxOccurs="unbounded" > <xs:complexType mixed="true" > <xs:attribute name="type" type="xs:string" use="required" /> </xs:complexType> </xs:element> </xs:sequence> </xs:complexType> </xs:element> But as you can see, I will loose the order between
the String-objects and the Emph-objects. The method in Info will be getContent()
for the whole String "Om syftar p� �r det kongruensfel." and a sequence of two
Emph-objects.
Is it possible to parse this and keep the Emph's
positions between the String objects? The main point is that I want to keep the
String-objects in the Emph-objects, so the result would be "Om vacker syftar p�
det h�ga landet �r det kongruensfel".
Please note that this is a small part of a
bigger document I am trying to parse. The cheapest but somewhat danger solution
would be to filter the incoming stream from the emph tag ( and the
counterpart </emph>). Danger because this annoying emph-tag may occur at
other places in the document.
Best Regards
/Stefan H Westlund
PS. Don't blame me for writing this weird document
style, it was done before my time :-) DS.
|
- Re: [castor-dev] Schema for a weird document Stefan H Westlund
- Re: [castor-dev] Schema for a weird document Arnaud Blandin
- Re: [castor-dev] Schema for a weird document Cyril Godefroy
