[ 
https://issues.apache.org/jira/browse/DAFFODIL-1959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17576725#comment-17576725
 ] 

Dave Thompson commented on DAFFODIL-1959:
-----------------------------------------

Below is the results from running the nightly data using the -I exi infoset 
option.
|*EXI Parse/Unparse Results*|
|*Data Format*|*Source File*|*Issue/Comments*|
|Non-Public-Schemas| |\” are converted to " in unparsed file|
|< and > are converted to &lt; and &gt; in unparsed file|
|& are converted to &amp; in unparsed file|
|*Not an issue with Xml parse/unparse*|
| |[error] Unparse Error: Value '' is not a valid xs:int
Schema context: element reference ucops:SelectID Location line 639 column 12 in 
file:/home/dfdl/dfdl-dataformats/data-formats/uscg_ucop_lh/src/main/resources/ 
com/tresys/ uscg_ucop_lh/xsd/sets.dfdl.xsd
*Not an issue with Xml parse/unparse*|
| |- (dashes) are removed
*Not an issue with Xml parse/unparse*|
|bmp|MARBLES.BMP
21m-file.bmp|*No issue, same as source file.*|
|cef|2012-04-25-16-43-37.done.cef.txt|\” are converted to &quot; in unparsed 
file|
|< and > are converted to &lt; and &gt; in unparsed file|
|*Not an issue with Xml parse/unparse*|
|disv6|DIS-v6-EntityStatePDU-Valid.bin|*No issue, same as source file.*|
|DIS-v6-FirePDU-Valid.bin|*No issue, same as source file.*|
|gif|950k-gif-file.gif|*No issue, same as source file.*|
|gmtif|kirkuk_7d.4607|Hex FF 9F FE changed to hex FC 00 00
*Same as xml parse/unparsed.*|
|HL7|ORU.txt|[error] Unparse Error: Data too long by 32 bits. Unable to 
truncate.
Schema context: hl7:SubComponentSeparator Location line 5039 column 22 in 
file:/home/dfdl/dfdl-dataformats/data-formats/HL7/src/main/resources/segments.xsd
Data location was preceding byte 12
*Not an issue with Xml parse/unparsed*|
|Xml parse/unparsed removed an &|
|iCal|DTCalendar-2.ics|” within html are converted to &quot; in unparsed file
*Not an issue with Xml parse/unparse*|
|< and > within html are converted to &lt; and &gt; in unparsed file
*Not an issue with Xml parse/unparse*|
|“ (quotes) removed from entries
Example:
DTEND;TZID="Eastern Standard Time":20171006T150000
Changed to:
DTEND;TZID=Eastern Standard Time:20171006T150000
*Same as with Xml parse/unparse*|
|Carriage returns removed and replaced with ; (semi-colon)
Example:
MS Exchange Server ve
               rsion
Changed to:
MS Exchange Server ve;rsion
*Same as with Xml parse/unparse*|
|, (comma) change to ; (semi colons) in entries
Example:
X-MS-OLK-WKHRDAYS:MO,TU,WE,TH,FR
Changed to:
X-MS-OLK-WKHRDAYS:MO;TU;WE;TH;FR
*Same as with Xml parse/unparse*|
|Imf|CalendarInvite-Series.imf|Removes carriage returns in some cases
Example:
Received: from Metascan (Unknown [10.231.35.100])
               by mail.sw1.mdex.org with ESMTPA
               ; Tue, 24 Apr 2018 11:14:33 -0400
Changed to:
Received: from Metascan (Unknown [10.231.35.100])              by 
mail.sw1.mdex.org with
 ESMTPA; Tue, 24 Apr 2018 11:14:33 -0400
*Same as with Xml parse/unparse*|
|Adds “ (quotes) to entries
Example:
Content-Type: text/calendar; method=REQUEST;
Changed to:
Content-Type: text/calendar; method="REQUEST";
*Same as with Xml parse/unparse*|
|Adds carriage returns in some entries
Example:
x-tituslabs-classifications-30: TLPropertyRoot=Titus;…..
Changed to:
x-tituslabs-classifications-30:
 TLPropertyRoot=Titus;CLASSIFICATION=UNCLASSIFIED;
*Same as with Xml parse/unparse*|
|” (quotes) in entries are converted to &quot; in unparsed file
Example:
ATTENDEE;CN="Dan Thomas";RSVP=TRUE:mailto:[email protected]
Changed to:
ATTENDEE;CN=&quot;Dan Thomas&quot;;RSVP=TRUE: mailto:[email protected]
*Not an issue with Xml parse/unparse*|
|Removes leading tabs
Example:
DESCRIPTION:Switched to using skype.\n\n...................................
               
..........................................................................
               ............................\n\n \n\nJoin by phone\n\n \n\n….  
                     +1 (345) 567-8765\, access code: 12342254…..
                          English (United States)  \n\n \n\nFind a local….
               alin.lync.com/51a211f3-a1be-470b-ac21-cbafc….
               Conference ID: 12342254 (same as access code….
               …….
Changed to:
DESCRIPTION:Switched to using skype.\n\n...................................
 ...........................................................................
 ...........................\n\n \n\nJoin by phone\n\n \n\nToll number:    
     +1 (345) 567-8765\, access code: 12342254 (Dial-in Number)            
       English (United States)  \n\n \n\nFind a local number…..
 lin.lync.com/51a211f3-a1be-470b-ac21-cbafc69f839e?.....
 nConference ID: 12342254 (same as access code above)\.....
 ………………………………………………………
*Same as with Xml parse/unparse*|
|” within html are converted to &quot; in unparsed file
*Not an issue with Xml parse/unparse*|
|< and > within html are converted to &lt; and &gt; in unparsed file
*Not an issue with Xml parse/unparse*|
|& within &nbsp\; is changed to &amp;
Example:
Number) &nbsp\;&nbsp\;&nbsp\;&nbsp\;&nbsp\;&nbsp\;…….
Changed to:
Number) &amp;nbsp\;&amp;nbsp\;&amp;nbsp\;&amp;…….
*Not an issue with Xml parse/unparse*|
|Jpeg|IMG_2308.JPG|No issue, same as source file.|
|Jpeg2000|55mb-jpeg2000-file.jp2|[error] Unparse Error: Data too long by 1344 
bits. Unable to truncate.
Schema context: Content Location line 977 column 18 in 
file:/home/dfdl/dfdl-dataformats/data- formats/jpeg2000/ 
src/main/resources/com/mitre/jpeg2000/xsd/ jpeg_2000.dfdl.xsd
Data location was preceding byte 893
*Not an issue with Xml parse/unparse*|
|relax.jp2|*No issue, same as source file.*|
|Json|DIS-v6-FirePDU-Valid.bin.json|Resulting json is output on a single line.
*Same as with Xml parse/unparse*|
|NACHA|ctx_valid_1.txt|[error] Unparse Error: Data too long by 32 bits. Unable 
to truncate.
Schema context: element reference \{ach:2013}PaymentRelatedInfo Location line 
244 column 14 in 
file:/home/dfdl/dfdl-dataformats/data-formats/NACHA/2013/nacha_records.xsd
Data location was preceding byte 463
*Not an issue with Xml parse/unparse*|
|nitf|ns3119b.nsf|[error] Unparse Error: Value '' is not a valid xs:date: 
Failed to parse xs:date from string:
Schema context: DowngradeDate Location line 877 column 8 in 
file:/home/dfdl/dfdl-dataformats/data-formats/NITF/ 
src/main/resources/com/tresys/nitf/xsd/nitf.dfdl.xsd
*Not an issue with Xml parse/unparse*|
|Pcap|10k.pcap
100m.pcap|*No issue, same as source file.*|
|png|100kb_file.png|*No issue, same as source file.*|
|shp|ne_10m_airports.shp|*No issue, same as source file.*|

> EXIficient Inputter and outputter for XML EXI representation
> ------------------------------------------------------------
>
>                 Key: DAFFODIL-1959
>                 URL: https://issues.apache.org/jira/browse/DAFFODIL-1959
>             Project: Daffodil
>          Issue Type: New Feature
>          Components: Back End
>    Affects Versions: 2.1.0
>            Reporter: Mike Beckerle
>            Assignee: Josh Adams
>            Priority: Critical
>              Labels: beginner
>             Fix For: 3.4.0
>
>
> Create EXI (dense binary XML) representation using an EXI-specific 
> InfosetInputter and InfosetOutputter.
> These would be very similar to the XML InfosetInputter and Outputter - as EXI 
> libraries such as EXIfficient https://github.com/EXIficient/exificient 
> already have SAX/StAX, APIs, etc. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to