FYI, it seems like 08xx is reserved for RTL scripts.

http://www.unicode.org/Public/UCD/latest/ucd/extracted/DerivedBidiClass.txt
# The unassigned code points that default to R are in the ranges:
#     [\u0590-\u05FF *\u07C0-\u089F* \uFB1D-\uFB4F \U00010800-\U00010FFF \U0001E800-\U0001EDFF \U0001EF00-\U0001EFFF]

http://unicode.org/roadmaps/bmp/
08  Samaritan <http://www.unicode.org/charts/PDF/U0800.pdf>
    Mandaic <http://www.unicode.org/charts/PDF/U0840.pdf>
    (SyrSup) <http://www.unicode.org/L2/L2015/15156-syriac-malayalam.pdf>
    ??? ??? ???
    Arabic Extended-A <http://www.unicode.org/charts/PDF/U08A0.pdf>

http://unicode.org/roadmaps/smp/
00010800-00010FFF  Alphabetic and syllabic RTL scripts
0001E800-0001EFFF  RTL scripts
- Color highlighting is used to indicate blocks and unassigned ranges which default to right-to-left character behavior.

markus
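
P.S. A minimal sketch, in case it helps to check a code point against the default-R ranges quoted above. The range table is copied by hand from the excerpt in this message, so it may not match the latest DerivedBidiClass.txt. The names DEFAULT_R_RANGES and defaults_to_r are made up for illustration. Python's unicodedata.bidirectional() returns an empty string for unassigned code points, which is why the ranges are hardcoded here.

```python
# Default-R ranges for unassigned code points, copied by hand from the
# DerivedBidiClass.txt excerpt quoted in this message (assumption: the
# latest file may list different ranges).
DEFAULT_R_RANGES = [
    (0x0590, 0x05FF),
    (0x07C0, 0x089F),
    (0xFB1D, 0xFB4F),
    (0x10800, 0x10FFF),
    (0x1E800, 0x1EDFF),
    (0x1EF00, 0x1EFFF),
]


def defaults_to_r(code_point: int) -> bool:
    """Return True if code_point lies in one of the quoted default-R ranges.

    This only tests range membership. It does not look up the actual
    Bidi_Class of code points that are already assigned.
    """
    return any(lo <= code_point <= hi for lo, hi in DEFAULT_R_RANGES)


if __name__ == "__main__":
    # A few code points around the 08xx row and the SMP RTL area.
    for cp in (0x0860, 0x089F, 0x08A0, 0x10800, 0x0041):
        print(f"U+{cp:04X}: {defaults_to_r(cp)}")
```

Note that with the ranges as quoted, the default-R area stops at U+089F, so U+08A0 (the start of Arabic Extended-A) falls outside it.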

