Nick - I often use xmlstarlet to wade through unfamiliar XML files, like so:

First, the structure:

$ xml el -u b008s14v.xml

iplayerMedia
iplayerMedia/concept
iplayerMedia/concept/longSynopsis
iplayerMedia/concept/masterbrand
iplayerMedia/concept/masterbrand/ident
iplayerMedia/concept/masterbrand/ident/height
iplayerMedia/concept/masterbrand/ident/identifier
iplayerMedia/concept/masterbrand/ident/mediaType
iplayerMedia/concept/masterbrand/ident/server
iplayerMedia/concept/masterbrand/ident/width
iplayerMedia/concept/masterbrand/thumbnail
iplayerMedia/concept/masterbrand/thumbnail/height
iplayerMedia/concept/masterbrand/thumbnail/mediaType
iplayerMedia/concept/masterbrand/thumbnail/url
iplayerMedia/concept/masterbrand/thumbnail/width
iplayerMedia/concept/mediumSynopsis
iplayerMedia/concept/pid
iplayerMedia/concept/relatedConcepts
iplayerMedia/concept/relatedConcepts/concept
iplayerMedia/concept/relatedConcepts/concept/mediumSynopsis
iplayerMedia/concept/relatedConcepts/concept/pid
iplayerMedia/concept/relatedConcepts/concept/subtitle
iplayerMedia/concept/relatedConcepts/concept/thumbnail
iplayerMedia/concept/relatedConcepts/concept/thumbnail/height
iplayerMedia/concept/relatedConcepts/concept/thumbnail/mediaType
iplayerMedia/concept/relatedConcepts/concept/thumbnail/url
iplayerMedia/concept/relatedConcepts/concept/thumbnail/width
iplayerMedia/concept/relatedConcepts/concept/title
iplayerMedia/concept/relatedConcepts/concept/url
iplayerMedia/concept/shortSynopsis
iplayerMedia/concept/subtitle
iplayerMedia/concept/thumbnail
iplayerMedia/concept/thumbnail/height
iplayerMedia/concept/thumbnail/mediaType
iplayerMedia/concept/thumbnail/url
iplayerMedia/concept/thumbnail/width
iplayerMedia/concept/title
iplayerMedia/concept/url
iplayerMedia/concept/versions
iplayerMedia/concept/versions/version
iplayerMedia/concept/versions/version/available
iplayerMedia/concept/versions/version/duration
iplayerMedia/concept/versions/version/guidance
iplayerMedia/concept/versions/version/guidance/text
iplayerMedia/concept/versions/version/name
iplayerMedia/concept/versions/version/pid


Then the sel command, to pull out the values I want:

$ xml sel -t -m "//iplayerMedia/concept" -v "title" -n -v "subtitle" \
    -v "versions/version/guidance" b008s14v.xml

Torchwood: Series 2
Kiss Kiss, Bang Bang
            Contains some violence.

$
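
If the padding in front of the guidance line gets in the way, wrapping
the expression in XPath's normalize-space() should trim it. A minimal
sketch, assuming the binary is installed as either xmlstarlet or xml,
and using a cut-down stand-in file (sample.xml is a made-up name, and
its layout is guessed from the structure listing above):

```shell
# Assumption: the binary is called xmlstarlet or xml, depending on the package.
XML=$(command -v xmlstarlet || command -v xml || true)

# Stand-in for b008s14v.xml, reduced to the fields used here (assumed layout).
cat > sample.xml <<'EOF'
<iplayerMedia>
  <concept>
    <title>Torchwood: Series 2</title>
    <subtitle>Kiss Kiss, Bang Bang</subtitle>
    <versions>
      <version>
        <guidance>
          <text>Contains some violence.</text>
        </guidance>
      </version>
    </versions>
  </concept>
</iplayerMedia>
EOF

if [ -n "$XML" ]; then
  # normalize-space() strips leading/trailing whitespace and collapses runs.
  "$XML" sel -t -m "/iplayerMedia/concept" \
    -v "title" -n -v "subtitle" -n \
    -v "normalize-space(versions/version/guidance)" -n sample.xml
else
  echo "xmlstarlet not found" >&2
fi
```

With the stand-in file this should print the title, subtitle and
guidance on three lines, without the leading padding.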

xmlstarlet runs on pretty much every platform. What I love about it is
that the same operation can then be applied to a thousand files. It
generates XML by default but can output text as above, ready for
formatting or parsing by anything else.
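
To give an idea of the thousand-files case, here is a rough sketch of a
loop that prints one line per metafile. The directory, the copy.xml
file name and the list_programmes function are made up for
illustration, and the stand-in file only mimics the fields used above;
-T asks for text output and -o emits a literal string.

```shell
# Assumption: the binary is called xmlstarlet or xml, depending on the package.
XML=$(command -v xmlstarlet || command -v xml || true)

# Hypothetical directory of metafiles: two copies of a cut-down stand-in.
dir=$(mktemp -d)
cat > "$dir/b008s14v.xml" <<'EOF'
<iplayerMedia>
  <concept>
    <title>Torchwood: Series 2</title>
    <subtitle>Kiss Kiss, Bang Bang</subtitle>
    <versions>
      <version>
        <guidance>
          <text>Contains some violence.</text>
        </guidance>
      </version>
    </versions>
  </concept>
</iplayerMedia>
EOF
cp "$dir/b008s14v.xml" "$dir/copy.xml"

# Print one line per file: filename, then title | subtitle | guidance.
list_programmes() {
  for f in "$1"/*.xml; do
    printf '%s\t' "$(basename "$f")"
    "$XML" sel -T -t -m "/iplayerMedia/concept" \
      -v "title" -o " | " -v "subtitle" -o " | " \
      -v "normalize-space(versions/version/guidance)" -n "$f"
  done
}

if [ -n "$XML" ]; then
  list_programmes "$dir"
else
  echo "xmlstarlet not found" >&2
fi
```

The output is plain text, so it can go straight into sort, grep or awk.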

Sean


On Jan 18, 2008 10:56 AM, Nick Ludlam <[EMAIL PROTECTED]> wrote:
> On 17 Jan 2008, at 19:08, Matthew Somerville wrote:
>
> > Andy wrote:
> >> Also I am almost certain there was an XML meta file stored somewhere
> >> that corresponded to each programme and now I can't find it. Any
> >> help?
> >
> > http://www.bbc.co.uk/iplayer/metafiles/episode/b008s14v.xml - you
> > just need the programme episode PIP.
> >
> > Related concepts for that episode include Doctor Who and...
> > Doctors. ;-)
>
> It doesn't seem to contain anything about being 16 or over though. The
> guidance section, after wading through all that whitespace, is:
>
> <version>
>    <name>AudioDescribed,Original</name>
>    <pid>b008s14n</pid>
>    <guidance>
>      <text>Contains some violence.</text>
>    </guidance>
>    <duration>00:50:00</duration>
>    <available>true</available>
> </version>
>
> This is something I'd love to see included in the broadcast EPG as well.
> Currently, I don't think any UK broadcaster is using the parental rating
> descriptor that's available in the EN 300 468 SI spec.
>
>
> --
> Nick Ludlam
> [EMAIL PROTECTED]
>
> -
> Sent via the backstage.bbc.co.uk discussion group.  To unsubscribe, please 
> visit http://backstage.bbc.co.uk/archives/2005/01/mailing_list.html.  
> Unofficial list archive: 
> http://www.mail-archive.com/backstage@lists.bbc.co.uk/
>
