Martin Duerst wrote:

Good to know. The following are potential problems:
- Can the properties be set for each mode individually?
- Can the properties change based on context? As a simple
  example, attributes in XML can be quoted with single quotes
  or with double quotes. In the attribute value, the 'other'
  quote can then be used literally. So if we have something
  like
  <element attr1="What's up?" attr2='He cried "Help!" and ran away.' />
                 1    2     1       1         2     2              1
  then for the places marked 1, which are XML syntax, we want to
  change the properties (e.g. to LTR), but for the cases marked
  with 2, which are content, we do not want to change the properties.
  Would such a thing be possible?
- In our current approach, we not only change the properties of some
  characters, but also introduce additional embedding (or occasionally
  override) levels. Is this possible in your implementation?

Have you taken a look at the source for nxml mode? You might be able to hack it up do do something like this. As I understand it, in Emacs you can set whatever properties you like on each character in a buffer. See node 32.19, "Text Properties", of the Elisp manual.

Now the interesting possibility presents itself, which is to add Unicode-related "special" (ie. built-in) text properties. In particular, directional class. Then allow the user to override the Unicode-defined class for any particular character or set of chars in a buffer. So in your example above, the nxml parser identifies weakly directional chars in the XML syntax, and switches their "unicode-directionality" property to strong LTR. Voila! The bidi reordering obeys the user-set directionality and does the right thing.

-g


_______________________________________________
emacs-bidi mailing list
[email protected]
http://lists.gnu.org/mailman/listinfo/emacs-bidi

Reply via email to