Hi,

I've found what seems to be a minor bug in fossil (details below), and
I'm not sure what the procedure is for reporting it. Can someone
enlighten me?

The bug:
In lookslike.c, invalid_utf8() returns 'invalid' for the input 0xE0,
0xB8, 0x94, which is the Thai character 'do dek' (U+0E14). This can be
easily reproduced by trying to commit a file that contains those three
bytes and nothing else - you will get the "this file contains invalid
UTF-8..." warning.

I replaced the code in invalid_utf8() with this UTF-8 validator:
https://www.cl.cam.ac.uk/~mgk25/ucs/utf8_check.c
and that worked OK, so I'm pretty sure invalid_utf8() is incorrect.

Thanks,
Ross
_______________________________________________
fossil-users mailing list
fossil-users@lists.fossil-scm.org
http://lists.fossil-scm.org:8080/cgi-bin/mailman/listinfo/fossil-users

Reply via email to