Mike Beckerle created DAFFODIL-2883:
---------------------------------------
Summary: Pre-existing PUA characters in data cause SDE
Key: DAFFODIL-2883
URL: https://issues.apache.org/jira/browse/DAFFODIL-2883
Project: Daffodil
Issue Type: Bug
Components: Back End
Affects Versions: 3.6.0
Reporter: Mike Beckerle
If data contains Unicode PUA characters, these cause the Infoset Outputter to
convert the RemapPUACharDetected into an SDE.
We can't get away with this. We need to tolerate PUA characters in data and
have them either cause a ParseError, or just tolerate them. (Or have a switch
to choose between those modes)
This was discovered by fuzz testing.
If the existence of PUA characters means the data is gibberish, then perhaps
the parser is speculating down a path that should be backtracked. We need a
parse error in that case.
If the existence of PUA characters is acceptable, then we need no error at all
from them.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)