On 7/7/25 18:05, Hans Hagen via ntg-context wrote:
> [...]
> Indeed. When there are useful and reliable mappings we can of course add 
> more. In that case there also should be intended structure mentioned 
> because we can have a richer structure in context than possible with 
> those mappings to pdf which then can result in an invalidated mapping.
> 
> It's mostly about playing safe.

Of course, I totally agree with that. I only want to play to be able to
investigate which options are the right ones.

This is my way of looking into this, just by testing (or breaking) it first.

> [...]
> Did you test it with ua 1 replaced by ua 2 ?

No, it works fine. Sorry, but with lines such as:

        description        = { pua = "ua1", pdf = "Note"       },

`pua = "ua1"` made me think that it was UA-1 only.

> If needed we can pick up on it next month or so unless there are clean 
> and easy additions to make (we're currently not in that mode and it's 
> not the most interesting stuff to play with so we like to concentrate 
> it, and the more it rains the better).

Tagging is boring and despairing for me (but subcolumnsets for parallel
texts are in pause mode now).

As far as I can recall, there are three things pending regarding spans:
languages, alternative text and actual text (/StructElem is missing there).

Language tagging should be able for some block elements (at least,
delimitedtext and probably description).

> Thanks for looking into it (I have no use cases and no possibilities to 
> test and Mikael depends on what means and demands the university puts on 
> it.)

Many thanks for mentioning it.

The latest tagging sample mentioned in the list (shorter version,
https://perspectivia.net/servlets/MCRFileNodeServlet/pnet_derivate_00007290/nolte_spaces.pdf)
contains also some tagging that is really hard to believe it might be
right, such as:

  /P <</Lang (de-DE)/MCID 312 >>BDC
  0.145 Tw 9 0 0 9 66.4399 67.6929 Tm
  [(e)13.6 (.)27.4 (g)16.2 (. Peter Frankopan)-7 (, )]TJ
  EMC
  /P <</Lang (de-DE)/MCID 313 >>BDC
  /TT1 1 Tf
  10.022 0 Td
  [(The Earth Transformed: An )-0.6 (Untold History)]TJ
  EMC

At least, there are the following issues here:

1. Too many /P elements for what should be /Span elements.
2. “The Earth Transformed: An Untold Story” is English, not German.
3. /MCID are orphan (they lack their corresponding /StructElem).

#2 may be a human honest mistake, but #1 and #3?

My impression is that many (when not most) people may be struggling with
proper tagging.

Many thanks for your reply and your help,

Pablo
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the 
Wiki!

maillist : ntg-context@ntg.nl / 
https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage  : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive  : https://github.com/contextgarden/context
wiki     : https://wiki.contextgarden.net
___________________________________________________________________________________

Reply via email to