I think we're getting into CSS here. The acid3 html file has a text/css section at the top including this: #instructions:last-child { white-space: pre-wrap; white-space: x-bogus; }

What are your feelings about css? I have been making a claim that I think there's some evidence for, but I'm not positive: Even though the bulk of CSS is not useful or interesting to the edbrowse renderer, we might still be interested in CSS because sites use the presence of CSS names and values as a workaround for user-agent spoofing. The collection of results from poking and prodding 100 attributes is what they take to be your browser and OS fingerprint, overriding what you said it was. Diabolical, huh?

Do you think this is a compelling reason to get into CSS? I think I have found some 3rd-party JS code that we might be interested in, if we wanted to do something with this. It might save work. There's one object that is a CSS parser. It would turn a .css file into JSON, where it is easier to traverse afterwards. There is also a JS implementation of querySelectorAll, which works like getElementsByTagName, only the discernment of the result elements is based on selector syntax, rather than tag or name. The colon, the period, the hash mark have particular hardcoded meanings for different types of selections.

thanks
Kevin




On Sat, 19 Aug 2017, Karl Dahlke wrote:

With Kevin pointing the way, I started looking at the first of 100 acid tests.
It runs into a problem in that it expects a pure whitespace node that is not 
there.
Note the following html.

<body>
<p>paragraph 1</p>
<p>paragraph 2</p>
</body>

Browse with db5 and tidy gives us the two paragraph nodes in sequence, there is 
no node in between with the newline (whitespace) character.
The javascript expects it to be there.
Why is it not there?

Note html-tidy.c line 126.
I tell tidy not to drop empty elements, or empty paragraphs.
Geoff, or anyone else, any insights?

Karl Dahlke


--------
Kevin Carhart * 415 225 5306 * The Ten Ninety Nihilists
_______________________________________________
Edbrowse-dev mailing list
[email protected]
http://lists.the-brannons.com/mailman/listinfo/edbrowse-dev

Reply via email to