On 7/7/25 18:05, Hans Hagen via ntg-context wrote: > [...] > Indeed. When there are useful and reliable mappings we can of course add > more. In that case there also should be intended structure mentioned > because we can have a richer structure in context than possible with > those mappings to pdf which then can result in an invalidated mapping. > > It's mostly about playing safe.
Of course, I totally agree with that. I only want to play to be able to investigate which options are the right ones. This is my way of looking into this, just by testing (or breaking) it first. > [...] > Did you test it with ua 1 replaced by ua 2 ? No, it works fine. Sorry, but with lines such as: description = { pua = "ua1", pdf = "Note" }, `pua = "ua1"` made me think that it was UA-1 only. > If needed we can pick up on it next month or so unless there are clean > and easy additions to make (we're currently not in that mode and it's > not the most interesting stuff to play with so we like to concentrate > it, and the more it rains the better). Tagging is boring and despairing for me (but subcolumnsets for parallel texts are in pause mode now). As far as I can recall, there are three things pending regarding spans: languages, alternative text and actual text (/StructElem is missing there). Language tagging should be able for some block elements (at least, delimitedtext and probably description). > Thanks for looking into it (I have no use cases and no possibilities to > test and Mikael depends on what means and demands the university puts on > it.) Many thanks for mentioning it. The latest tagging sample mentioned in the list (shorter version, https://perspectivia.net/servlets/MCRFileNodeServlet/pnet_derivate_00007290/nolte_spaces.pdf) contains also some tagging that is really hard to believe it might be right, such as: /P <</Lang (de-DE)/MCID 312 >>BDC 0.145 Tw 9 0 0 9 66.4399 67.6929 Tm [(e)13.6 (.)27.4 (g)16.2 (. Peter Frankopan)-7 (, )]TJ EMC /P <</Lang (de-DE)/MCID 313 >>BDC /TT1 1 Tf 10.022 0 Td [(The Earth Transformed: An )-0.6 (Untold History)]TJ EMC At least, there are the following issues here: 1. Too many /P elements for what should be /Span elements. 2. “The Earth Transformed: An Untold Story” is English, not German. 3. /MCID are orphan (they lack their corresponding /StructElem). #2 may be a human honest mistake, but #1 and #3? My impression is that many (when not most) people may be struggling with proper tagging. Many thanks for your reply and your help, Pablo ___________________________________________________________________________________ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl webpage : https://www.pragma-ade.nl / https://context.aanhet.net (mirror) archive : https://github.com/contextgarden/context wiki : https://wiki.contextgarden.net ___________________________________________________________________________________