[
https://issues.apache.org/jira/browse/DAFFODIL-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17576725#comment-17576725
]
Dave Thompson commented on DAFFODIL-1959:
-----------------------------------------
Below are the results from running the nightly data with the -I exi infoset
option.
|*EXI Parse/Unparse Results*|
|*Data Format*|*Source File*|*Issue/Comments*|
|Non-Public-Schemas| |\" is converted to &quot; in unparsed file|
|< and > are converted to &lt; and &gt; in unparsed file|
|& is converted to &amp; in unparsed file|
|*Not an issue with Xml parse/unparse*|
| |[error] Unparse Error: Value '' is not a valid xs:int
Schema context: element reference ucops:SelectID Location line 639 column 12 in
file:/home/dfdl/dfdl-dataformats/data-formats/uscg_ucop_lh/src/main/resources/com/tresys/uscg_ucop_lh/xsd/sets.dfdl.xsd
*Not an issue with Xml parse/unparse*|
| |- (dashes) are removed
*Not an issue with Xml parse/unparse*|
|bmp|MARBLES.BMP
21m-file.bmp|*No issue, same as source file.*|
|cef|2012-04-25-16-43-37.done.cef.txt|\" is converted to &quot; in unparsed
file|
|< and > are converted to &lt; and &gt; in unparsed file|
|*Not an issue with Xml parse/unparse*|
|disv6|DIS-v6-EntityStatePDU-Valid.bin|*No issue, same as source file.*|
|DIS-v6-FirePDU-Valid.bin|*No issue, same as source file.*|
|gif|950k-gif-file.gif|*No issue, same as source file.*|
|gmtif|kirkuk_7d.4607|Hex FF 9F FE changed to hex FC 00 00
*Same as with Xml parse/unparse*|
|HL7|ORU.txt|[error] Unparse Error: Data too long by 32 bits. Unable to
truncate.
Schema context: hl7:SubComponentSeparator Location line 5039 column 22 in
file:/home/dfdl/dfdl-dataformats/data-formats/HL7/src/main/resources/segments.xsd
Data location was preceding byte 12
*Not an issue with Xml parse/unparse*|
|Xml parse/unparse removed an &|
|iCal|DTCalendar-2.ics|" within html is converted to &quot; in unparsed file
*Not an issue with Xml parse/unparse*|
|< and > within html are converted to &lt; and &gt; in unparsed file
*Not an issue with Xml parse/unparse*|
|" (quotes) removed from entries
Example:
DTEND;TZID="Eastern Standard Time":20171006T150000
Changed to:
DTEND;TZID=Eastern Standard Time:20171006T150000
*Same as with Xml parse/unparse*|
|Carriage returns removed and replaced with ; (semi-colon)
Example:
MS Exchange Server ve
rsion
Changed to:
MS Exchange Server ve;rsion
*Same as with Xml parse/unparse*|
|, (comma) changed to ; (semicolon) in entries
Example:
X-MS-OLK-WKHRDAYS:MO,TU,WE,TH,FR
Changed to:
X-MS-OLK-WKHRDAYS:MO;TU;WE;TH;FR
*Same as with Xml parse/unparse*|
|Imf|CalendarInvite-Series.imf|Removes carriage returns in some cases
Example:
Received: from Metascan (Unknown [10.231.35.100])
by mail.sw1.mdex.org with ESMTPA
; Tue, 24 Apr 2018 11:14:33 -0400
Changed to:
Received: from Metascan (Unknown [10.231.35.100]) by
mail.sw1.mdex.org with
ESMTPA; Tue, 24 Apr 2018 11:14:33 -0400
*Same as with Xml parse/unparse*|
|Adds " (quotes) to entries
Example:
Content-Type: text/calendar; method=REQUEST;
Changed to:
Content-Type: text/calendar; method="REQUEST";
*Same as with Xml parse/unparse*|
|Adds carriage returns in some entries
Example:
x-tituslabs-classifications-30: TLPropertyRoot=Titus;…..
Changed to:
x-tituslabs-classifications-30:
TLPropertyRoot=Titus;CLASSIFICATION=UNCLASSIFIED;
*Same as with Xml parse/unparse*|
|" (quotes) in entries are converted to &quot; in unparsed file
Example:
ATTENDEE;CN="Dan Thomas";RSVP=TRUE:mailto:[email protected]
Changed to:
ATTENDEE;CN=&quot;Dan Thomas&quot;;RSVP=TRUE: mailto:[email protected]
*Not an issue with Xml parse/unparse*|
|Removes leading tabs
Example:
DESCRIPTION:Switched to using skype.\n\n...................................
..........................................................................
............................\n\n \n\nJoin by phone\n\n \n\n….
+1 (345) 567-8765\, access code: 12342254…..
English (United States) \n\n \n\nFind a local….
alin.lync.com/51a211f3-a1be-470b-ac21-cbafc….
Conference ID: 12342254 (same as access code….
…….
Changed to:
DESCRIPTION:Switched to using skype.\n\n...................................
...........................................................................
...........................\n\n \n\nJoin by phone\n\n \n\nToll number:
+1 (345) 567-8765\, access code: 12342254 (Dial-in Number)
English (United States) \n\n \n\nFind a local number…..
lin.lync.com/51a211f3-a1be-470b-ac21-cbafc69f839e?.....
nConference ID: 12342254 (same as access code above)\.....
………………………………………………………
*Same as with Xml parse/unparse*|
|" within html is converted to &quot; in unparsed file
*Not an issue with Xml parse/unparse*|
|< and > within html are converted to &lt; and &gt; in unparsed file
*Not an issue with Xml parse/unparse*|
|& within &nbsp\; is changed to &amp;
Example:
Number) &nbsp\;&nbsp\;&nbsp\;&nbsp\;&nbsp\;&nbsp\;…….
Changed to:
Number) &amp;nbsp\;&amp;nbsp\;&amp;nbsp\;&amp;…….
*Not an issue with Xml parse/unparse*|
|Jpeg|IMG_2308.JPG|*No issue, same as source file.*|
|Jpeg2000|55mb-jpeg2000-file.jp2|[error] Unparse Error: Data too long by 1344
bits. Unable to truncate.
Schema context: Content Location line 977 column 18 in
file:/home/dfdl/dfdl-dataformats/data-formats/jpeg2000/src/main/resources/com/mitre/jpeg2000/xsd/jpeg_2000.dfdl.xsd
Data location was preceding byte 893
*Not an issue with Xml parse/unparse*|
|relax.jp2|*No issue, same as source file.*|
|Json|DIS-v6-FirePDU-Valid.bin.json|Resulting json is output on a single line.
*Same as with Xml parse/unparse*|
|NACHA|ctx_valid_1.txt|[error] Unparse Error: Data too long by 32 bits. Unable
to truncate.
Schema context: element reference \{ach:2013}PaymentRelatedInfo Location line
244 column 14 in
file:/home/dfdl/dfdl-dataformats/data-formats/NACHA/2013/nacha_records.xsd
Data location was preceding byte 463
*Not an issue with Xml parse/unparse*|
|nitf|ns3119b.nsf|[error] Unparse Error: Value '' is not a valid xs:date:
Failed to parse xs:date from string:
Schema context: DowngradeDate Location line 877 column 8 in
file:/home/dfdl/dfdl-dataformats/data-formats/NITF/
src/main/resources/com/tresys/nitf/xsd/nitf.dfdl.xsd
*Not an issue with Xml parse/unparse*|
|Pcap|10k.pcap
100m.pcap|*No issue, same as source file.*|
|png|100kb_file.png|*No issue, same as source file.*|
|shp|ne_10m_airports.shp|*No issue, same as source file.*|
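
The kind of check behind the "same as source file" and "converted to &quot;" entries above can be sketched roughly as follows. This is only an illustration, not the nightly test harness: the function names and the result labels are hypothetical, and it assumes the source file and the unparsed file have already been read into memory as bytes.

```python
import html
from typing import Optional


def first_difference(original: bytes, unparsed: bytes) -> Optional[int]:
    """Return the offset of the first differing byte, or None if identical."""
    for offset, (a, b) in enumerate(zip(original, unparsed)):
        if a != b:
            return offset
    if len(original) != len(unparsed):
        # One input is a prefix of the other; they diverge where the shorter ends.
        return min(len(original), len(unparsed))
    return None


def classify_round_trip(original: bytes, unparsed: bytes) -> str:
    """Label a round-trip result (labels are illustrative, not Daffodil output)."""
    if first_difference(original, unparsed) is None:
        return "same as source"
    # If undoing entity escaping recovers the source exactly, the only change
    # was that characters such as < > & " were written out as entities.
    text = unparsed.decode("utf-8", errors="replace")
    if html.unescape(text).encode("utf-8") == original:
        return "entity-escaped"
    return "different"
```

A result of "entity-escaped" would correspond to the quote, angle bracket, and ampersand rows in the table, while "different" covers the remaining changes (removed characters, separators swapped, line folding).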
> EXIficient Inputter and outputter for XML EXI representation
> ------------------------------------------------------------
>
> Key: DAFFODIL-1959
> URL: https://issues.apache.org/jira/browse/DAFFODIL-1959
> Project: Daffodil
> Issue Type: New Feature
> Components: Back End
> Affects Versions: 2.1.0
> Reporter: Mike Beckerle
> Assignee: Josh Adams
> Priority: Critical
> Labels: beginner
> Fix For: 3.4.0
>
>
> Create EXI (dense binary XML) representation using an EXI-specific
> InfosetInputter and InfosetOutputter.
> These would be very similar to the XML InfosetInputter and Outputter - as EXI
> libraries such as EXIficient https://github.com/EXIficient/exificient
> already have SAX/StAX APIs, etc.