subject:"\[Issue 18844\] std.utf.decode skips valid character on invalid multibyte sequence"

[Issue 18844] std.utf.decode skips valid character on invalid multibyte sequence

2023-04-26 Thread d-bugmail--- via Digitalmars-d-bugs

https://issues.dlang.org/show_bug.cgi?id=18844

Ate Eskola  changed:

   What|Removed |Added

 CC||ajiesk...@gmail.com

--- Comment #2 from Ate Eskola  ---
Additional observation: the documentation is misleading with both the present
and proposed behaviour. It says "If the code point is not well-formed, then a
UTFException is thrown and index remains unchanged.".

Well, we don't throw here since we're using replacement characters, so maybe
it's a hint that the part about index remaining unchanged does not apply
either. On the other hand, the documentation doesn't say what happens to the
index instead. At least for me, it gave the wrong impression that index
wouldn't be advanced.

--

[Issue 18844] std.utf.decode skips valid character on invalid multibyte sequence

2022-12-17 Thread d-bugmail--- via Digitalmars-d-bugs

https://issues.dlang.org/show_bug.cgi?id=18844

Iain Buclaw  changed:

   What|Removed |Added

   Priority|P1  |P4

--

[Issue 18844] std.utf.decode skips valid character on invalid multibyte sequence

2018-05-09 Thread d-bugmail--- via Digitalmars-d-bugs

https://issues.dlang.org/show_bug.cgi?id=18844

--- Comment #1 from FeepingCreature  ---
Repro:

string s = cast(string) [cast(ubyte) 'ä', 't'];
size_t i = 0;
auto ch = decode!(UseReplacementDchar.yes, string)(s, i);
writefln("ch = %s, i = %s, should be 1", ch, i);

ch = �, i = 2, should be 1.

--

[Issue 18844] std.utf.decode skips valid character on invalid multibyte sequence

[Issue 18844] std.utf.decode skips valid character on invalid multibyte sequence

[Issue 18844] std.utf.decode skips valid character on invalid multibyte sequence

3 matches

Site Navigation

Mail list logo

Footer information