Short answer: No. Two bit/byte order combinations are easily visualized. The other two "mixed" ones are not.
bigEndian, mostSignificantBitFirst is easy. Write bytes left to right, number bytes left to right, number bits left to right. littleEndian, leastSignificantBitFirst is also easy. Write bytes right to left, number bytes right to left, number bits right to left. In both the above regimes, a numeric field that spans a byte boundary still occupies an adjacent string of bits as viewed on the display. Furthermore, in both the above regimes, a numeric field that spans (or doesn't span) a byte boundary can be understood looking at its bits/bytes with most significant digit/bit on the left, preceding to less significant bits to the right, which is the normal way we (humans) read numbers. For a signed integer, the left-most bit will be the sign bit. It is the mixed cases that are hard. E.g., littleEndian byte order, but mostSignificantBitFirst. Practically speaking, when I have seen this, the data was byte-centric and didn't have bit fields that consumed partial bytes and spanned from one byte to another. In that case it's not too bad. A *long* time ago I started a tutorial on bit order: https://daffodil.apache.org/tutorials/bitorder.tutorial.tdml.xml. This file is runnable TDML with XHTML annotations in tutorial elements, which is an approach to creating tutorials that works to make a web page display using just an XSLT stylesheet. It's not as easy to write as one would like, XHTML, like HTML, is still remarkably tedious. This tutorial is incomplete however. It has some TBDs, and at some point it just stops annotating the examples that are in the file. The source code for the tutorial lives in the Daffodil source tree here: https://github.com/apache/daffodil/blob/main/tutorials/src/main/resources/bitorder.tutorial.tdml.xml On Mon, Dec 11, 2023 at 8:40 AM Roger L Costello <coste...@mitre.org> wrote: > Last week Steve described how to identify the fields in a data format > (such as Link-16 or VMF) with bitOrder=leastSignificantBitFirst, when the > data is viewed in a hex editor: > > > > Ø read each *byte* left-to-right, but read the *bits within each byte* > right-to-left … tricky when fields overlap byte boundaries (which is very > common in VMF/Link-16) > > > > That is really complicated. > > > > Question: If I write the binary on a piece of paper and manually process > the bits right-to-left – the first field is the rightmost two bits, the > second field is the next five bits, the third field is the next two bits, > etc. – then each field is sequential, one after the other, right? > > > > In one of the emails, Mike mentioned that Link-16 is Little Endian. Does > Endianess impact the strategy described in the previous paragraph? I > imagine the Endianess would impact this situation: when processing the > fields right-to-left, I encounter a 16-bit unsigned integer field. As I > recall, Little Endian means this: the most significant byte is at the > highest memory address. In the 16-bit unsigned integer field, which byte is > at the highest memory address? Suppose the 16 bits are these: > > > > What is the (decimal) value of that field? > > > > > > *From:* Steve Lawrence <slawre...@apache.org> > *Sent:* Thursday, December 7, 2023 3:28 PM > *To:* users@daffodil.apache.org > *Subject:* Re: Bits are streamed into an application ... the bits are put > in memory ... is the first bit received at the lowest memory address? > > > > Lin16/VMF is definitely confusing. A hex editor will actually display the > data like this: 4E .. . . . 21 Another way to think about how to read > link16/vmf data (which is more useful when using hex editors which don't > understand the idea of right-to-left > > Lin16/VMF is definitely confusing. A hex editor will actually display > > the data like this: > > > > 4E ..... 21 > > > > Another way to think about how to read link16/vmf data (which is more > > useful when using hex editors which don't understand the idea of > > right-to-left data), is to read each *byte* left-to-right, but read the > > *bits within each byte* right-to-left. So if you look at your example > > data in a hex editor you'll see something like this > > > > low memory address high memory address > > 4 E . . . 2 1 > > 0100 1110 ........ ........ ........ 0010 0001 > > > > If we read the bytes left-to-right like a hex editor would show, our > > first byte is 01001110 (0x4E). But within that byte, we read the bits > > right-to-left, which means our first 2-bit field would be "10" (taken > > from the right of the 0xE, not left of the 0x4). As an further example, > > if there was a 3-bit field following our 2-bit field, it would be 011. > > > > Note that once we consume the entire first byte, we start reading from > > the right-most bit of the 2nd byte, and so on. This can make things > > tricky when fields overlap byte boundaries (which is very common in > > VMF/Link16). For example, let's say we've already consumed 5 bits in the > > first byte (marked in x's below), and the next field is 6-bits long > > (marked from 1-6 in the order to read the bits below), this is where the > > bits would come from for that field: > > > > 321x xxxx .... .654 > > > > So this field crosses a byte boundary, and some bits come from the left > > of the first byte and others from the right of the second byte. In this > > example, the the value of this 6-bit field 654321. > > > > It can definitely be confusing to read *bytes* left-to-right but *bits* > > right-to-left. So it is sometimes helpful to mentally (or even manually) > > reorder all the bytes so they are reversed (without reversing the bits), > > and then just read the bits from the end of the file, right-to-left and > > bottom-to-top. > > > > In fact, I believe the data viewer in the Daffodil VS Code extension is > > planning (or already has?) the ability to show bytes in the reverse > > order at bit resolution, which would make this a little easier so you > > wouldn't have to do it yourself. > > > > > > On 2023-12-07 02:38 PM, Roger L Costello wrote: > > > Thanks Steve. > > > > > > As it turns out, I am working with the Link-16 data format. You are > > > saying that the stream of received bits – 0 then 1 then 1 then 1 then 0 > > > then 0 then 1 then 0 then … - are viewed in this fashion: > > > > > > High memory address Low memory address > > > > > > 0010 0001 ................................................ 0100 1110 > > > > > > Hex: 21 ……………………………………. 4E > > > > > > That is consistent with the Link-16 specification. > > > > > > Here’s where I get confused. Suppose I store those bits into a file. > > > Then I open the file in a hex editor. How will the hex editor display > > > the data? > > > > > > Will the hex editor display the four bits at the lowest memory address > > > (1110) in hex digit form (E), followed by the four bits at the > > > next-to-lowest memory address (0100) in hex digit form (4): > > > > > > E4 ……………………………… 12 > > > > > > Or will the hex editor see the bits in reverse order: > > > > > > Low memory address High memory address > > > > > > 0111 0010 ………………………………………… 1000 0100 > > > > > > Hex: 72 ………………………………………………. 84 > > > > > > And display this: > > > > > > 72 ………………………………………….. 84 > > > > > > That is wildly different! > > > > > > Eek! I am so confused! Help! > > > > > > *From:*Steve Lawrence <slawre...@apache.org> > > > *Sent:* Thursday, December 7, 2023 1:16 PM > > > *To:* users@daffodil.apache.org > > > *Subject:* [EXT] Re: Bits are streamed into an application ... the bits > > > are put in memory ... is the first bit received at the lowest memory > > > address? > > > > > > The most likely interpretation is that the 2-bit field will have a value > > > of "01", with a decimal value of 1. This is what DFDL calls dfdl: > > > bitOrder="mostSignificantBitFirst" and is what most data formats use. > > > DFDL also has dfdl: bitOrder="leastSignificantBitFirst", > > > > > > The most likely interpretation is that the 2-bit field will have a value > > > > > > of "01", with a decimal value of 1. This is what DFDL calls > > > > > > dfdl:bitOrder="mostSignificantBitFirst" and is what most data formats use. > > > > > > DFDL also has dfdl:bitOrder="leastSignificantBitFirst", which is common > > > > > > in some old military formats like VMF and Link16. One way to imagine > > > > > > this is as if the memory addresses and bits were ordered and read > > > > > > right-to-left. So, your example would look like this: > > > > > > High memory address Low memory address > > > > > > 0 0 ................................................ 1 1 0 > > > > > > So we have the same data, and you still read the low memory address bits > > > > > > first, it's just all backwards. Because we read right-to-left, the first > > > > > > two bits are now "10". Note that interpreting the value of the bits is > > > > > > the same as normal, so "10" evaluates to 2--all that changes is the > > > > > > order in which we read bits. > > > > > > I haven't seen any formats where the high memory address bits would be > > > > > > read first and would lead to the 2-bit field being "00". DFDL doesn't > > > > > > have a way to model this. > > > > > > On 2023-12-07 12:28 PM, Roger L Costello wrote: > > > > > >> Hi Folks, > > > > > >> > > > > > >> A basic question about bits. > > > > > >> > > > > > >> Scenario: an application is receiving a message. The message arrives as a > >> stream of bits. The first bit received is 0. The second bit received is 1. > >> The third bit received is another 1. ... The second-to-last bit received > >> is 0. The last bit received is 0. > > > > > >> > > > > > >> The application stores the message in memory. > > > > > >> > > > > > >> Will the first bit received be in the lowest memory address and the last > >> bit received in the highest memory address? I.e., > > > > > >> > > > > > >> Low memory address High memory address > > > > > >> 0 1 1 ............................................................. 0 0 > > > > > >> > > > > > >> The message contains a series of bit fields. The specification for the > >> message says the first field is two bits. > > > > > >> > > > > > >> Is the first field the bits 0 1 or the bits 00? > > > > > >> > > > > > >> /Roger > > > > > > >