Re: JSON

Gabriel Sechan Tue, 25 Oct 2005 15:13:47 -0700

From: Andrew Lentvorski <[EMAIL PROTECTED]>
Everybody whines about XML.  I don't understand why.

Because XML is the Emperor's New Clothes of computer technology. Somepeople praise it to the heavans as the solution to everything. In reality,it does next to nothing. We've had parser generators since the 70s, atleast. And parsing is the *easy* part of dealing with data. XML doesn'thelp at all with the hard part- acting on the parsed tokens. So inexchange for doing a minimal amount of work for you, you get a bloated, CPUand bandwidth wasting format with a huge annoyingly overengineered spec anda slow ass parser. Oh, and you get to throw in XML on your buzzword page.

XML is overkill. It tries to swat a mosquito with a sledgehammer. It takesa problem that can generally solved in minutes, and gives you hours of fundebugging XML code. I have *never* seen a problem solved by XML thatcouldn't have been done just as easily- if not more so- without it.

Oh and for those who scream that XML is magicly open and anyone can now readany format- no you can't. Its just as easy to come up with a convoluted,proprietary schema as it is any other format. You buy yourself nothingthere.

Actually, I do:
First, parsers are *hard*. Every idiot CS major thinks he a can write aparser for his "little language". They are all wrong.

Writing a parser for a spec like XML is hard- thats why most XML parsersare buggy. Writing a parser for a small domain language is quite easy. Itsa simple state machine. For middle sized languages, you have lex and yacc.

The hard part of parsing data isn't the parsing- its dealing with thetokens after its parsed, and designing a good language to begin with. XMLhelps neither of those activities. You still need to deal with the tokens,you still need to design a good schema. The second one perhaps being thebiggest problem- when you see buggy non-XML parsers, chances are thelanguage spec is too convoluted. Of course, if they changed it to XML tagsit wouldn't be magicly better- you'd still have a convoluter schema, wrappedin tags.

XML *forces* these morons to have to interface with a structured, debuggedparser. SAX and DOM have their faults, but at least they

No it doesn't. THey still use regexes as often as not, which is a bad thingwith XML, since XML is such a top heavy, corner-case ridden spec.

get debugged. Watching programmers writhe in agony because the XML parserthrew an exception on a boundary case that their puny little minds are toonarrow to anticipate is a most rewarding experience.

And 90% of the time, this boundary case only exists in the parsers mind. Anadditional 9% of the time, the corner case is due to XML itself and notfailing to follow the DTD/schema.

Second, internationalization is hard. How many ways are there to spellTchaikovsky? The same morons from above get *forced* into dealing withthis kind of crud with XML when they bump into another program whichrefuses to accept that Author, Composer, etc is a unique key. Oops. Andthe whole fact that XML *specifies* Unicode is beautiful--no more slackingoff and only accepting ASCII or, worse, only accepting letters and digits.

In 99% of apps, internationalization is overkill. Unless a human is meantto be editing the file (such as a config file), its just a waste of CPUpower and time.

Third, XML parsers *complain* when you feed them garbage. If you don't getyour formatting and nesting correct, most XML parsers are free to dump yourcrud into the bitbucket any way they please.

Yup, because just dumping the doc rather than trying to route around theproblem is a great idea. Nah, I didn't really want all that data. So whatsa few missing bank transactions gonna cost anyway?

And herein lies the source of the XML verbosity that everybody complainsabout--balanced close tags. Syntax errors almost always *immediately*cause parsing errors because they tend to bump into unbalanced tags; nosilent degradation here--I approve.

Except they end up just using the <tagname /> syntax. Oops, now you havespelling errors and annoying syntax.

The same nitwits who think they can write parsers and can't deal with thefact that almost nothing in real life is a useful unique key desperatelywant XML parsers to be "liberal in what they accept" so that they don'thave to debug their XML generation code. Hogwash! Clap them in irons forpromulgating their dreck amongst the public!

I can think of plenty of things that are useful unique keys in real life.Space-time co-ordinats, SSNs, license plates, book title and author, etc.

As for being liberal in what you expect being a bad thing- lets try anexperiment. For the next month, you can only go to webpages that are WC3validated, and who's servers put out perfect HTTP. Come back to us with howmany sites you visited. I'll be impressed if you could make double digits.

There's a reason most real world programs are liberal with inputs- theyhave to be. You can't expect the other guy to get his shit right,especially if he's not employed with you. And failing to the end user isnot a good option, not when the error can be routed around.

I will happily accept the restrictions that XML places upon me because *Idon't find them to be restrictions*. I wind up putting in the work to dealwith this kind of stuff anyway. I can avoid most of the gnarly, nastycorners of XML (namespaces and schemas/DTD's) while still retaining most ofthe advantages all while knowing that the gnarly, nasty stuff is availableif I really need it.

And you could have saved yourself a lot of work in 99% of cases by not usingXML, and not having to worry about the nasty gnarly stuff at all. Justwrite a language that does what you need, no more no less.


Gabe


--
[email protected]
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-lpsg

Re: JSON

Reply via email to