On Tue, 25 Sep 2001 19:23:36 +0200, in perl.perl5.porters you wrote:

> [Disclaimer: I haven't read perlpodspec yet, so some of my questions may
> be answered there. Feel free to point me there.]

OK, here are my comments after having read perlpodspec.

> the concept of "paragraph" doesn't appear to get defined explicitly.

Ah, these appear to be better defined in perlpodspec.

> And you don't seem to say explicitly whether the format name on the =end
> has to match the format name used on the =begin

Perlpodspec mentions that this closes "the end of the region opened by
the matching "=begin formatname" region.".

> (it could conceivably just be documentation, so you could use =begin html
> / =end stuff, or perhaps even =begin html / =end (without format name on
> the =end, sort of like #ifdef foo matching #endif).

So I suppose the format name has to be specified on =end, and it has to
match a =begin command.

> Also, whether =begin/=end blocks may be nested, e.g. for specialisations
> of format, and if so, what constraints are placed be on the =end
> commands.

But perlpodspec does not appear to say anything about nested formats. To
contrive an example,

    =begin html

    <p>This is (generic) HTML</p>

    =begin html4

    <p>This is HTML that needs a parser for HTML 4.0 or higher</p>

    =end html4

    <p>And more generic HTML</p>

    =end html

Or, more strangely,

    =begin html

    <p>This is (generic) HTML</p>

    =begin html4

    <p>This is HTML that needs a parser for HTML 4.0 or higher</p>

    =end html

    <p>This is, er, what?</p>

    =end html4

(overlapping regions)

Hm, near the end there's an example using nested regions. But it doesn't
explicitly prohibit overlapping regions.

> Also, what's the format of an I<formatname>? /\S+/? /\w+/? /.*/?
> /[:\w]+/?

I see that perlpodspec has some advice (viz., m/^:?[-a-zA-Z0-9_]+$/s
[sic]).

> Any valid HTML entity name? Or only those that map to Latin-1? What
> about &trade;, for example? Or &euro;?

Ah, perlpodspec implies "all of them", since it explicitly mentions that
some may only support those mapping to Latin-1.

> As I understand it, the intent is that exactly one whitespace character
> is deleted while rendering such multiple-bracket delimiters

perlpodspec vaguely implies that all whitespace following the <{2,}
delimiter and preceding the >{2,} delimiter is snipped by specifying the
format as "two or more "<"'s, one or more whitespace characters, any
number of characters, one or more whitespace characters, and ending with
the first matching sequence of two or more ">"'s". But it doesn't say
so.

Cheers,
Philip

Reply via email to