Daniel Carrera wrote:

Randomthots wrote:

So if I have two files... same format... but one is twice as big as the other... the bigger file isn't going to take longer to load?


Irrelevant example. The fact that a bigger file loads slower doesn't mean that the fault lies with the size of the tags. Several things increase with the size of the file: the number of elements, the complexity of the tree, the amount of content, etc. All of those can cause a slowdown, and none of them has anything to do with the size of the tags.

I was speaking in general terms. Get away from ODS and XML for a second and consider two files, JPEGs for example. The bigger file will take longer to process simply because it will take more cycles to work your way through it.



Please understand the difference between data and data structures. If you open a CSV file and immediately save it as OpenDocument, you are saving it into a more complex data structure. Just like an n-ary tree is a more complex data structure than a two-dimensional array, regardless of what data you store in them.
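
(For illustration, a minimal Python sketch of the two shapes. The Node class below is invented for the example, not taken from any real code base:)

    # A CSV maps naturally onto a flat two-dimensional structure:
    grid = [
        ["Name", "Qty"],
        ["Bolts", 40],
    ]

    # An OpenDocument body is an n-ary tree: every element is a node
    # carrying a tag, attributes, and any number of children.
    class Node:
        def __init__(self, tag, attrs=None, children=None):
            self.tag = tag                  # element name
            self.attrs = attrs or {}        # style refs, spans, etc.
            self.children = children or []  # nested rows, cells, ...

    doc = Node("sheet", children=[
        Node("row", children=[Node("cell"), Node("cell")]),
    ])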

I looked at the content.xml. Once you got past the namespace declarations and such, the overall structure was pretty much like this:

<sheet>
        <row>
                <cell/>
                <cell/>
                ...
        </row>
        ...
</sheet>
...

Very much like a table structure in HTML. I was sort of surprised that there wasn't any indication of row or cell addresses. And other than the style information, which just took on the defaults anyway, it was hard to see where the XML added much information. I understand that it can hold more information, but the overall architecture will be the same for any spreadsheet. It can't delve off into 16 dimensions, for example, so it can't actually be any deeper than the above.
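
(A side note on the missing addresses: a cell's address is implicit in its position in document order, so nothing needs to store it. A rough Python sketch, using simplified tag names; the real ones are table:table-row and table:table-cell:)

    import xml.etree.ElementTree as ET

    # Toy document with the same shape as the sketch above.
    xml = ("<sheet><row><cell>1</cell><cell>2</cell></row>"
           "<row><cell>3</cell></row></sheet>")

    root = ET.fromstring(xml)
    for r, row in enumerate(root.iter("row")):
        for c, cell in enumerate(row):
            # The address (row r, column c) falls out of
            # element order alone; the file never records it.
            print(r, c, cell.text)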



And are you telling me that the cell, sheet, chart, etc. objects in working memory... the stuff you are actually manipulating when you work with the spreadsheet... aren't the same regardless of the format of the original data file?


I fail to see what this has to do with your argument.

Just that in one case you start with a 2 or 3 MB data file and in the other you start with a 45 MB XML file, but you end up with precisely the same information content to manipulate. Now, after I add a couple of formulas, pretty it up, and draw a graph or two, CSV doesn't work anymore; obviously ODF is capable of representing much more than CSV.



Statistically, it would be unlikely if we were talking about a difference of a couple of MB. But 45 MB is a substantial fraction of 256 MB.


But here's where you're making silly claims. The fact that unzipping the file produces a 45 MB set of XML files doesn't mean that it will actually take up 45 MB when it's loaded into memory. It won't.

It has to if you don't write the unzipped file to disk first. Where else is it going to go?

When you load an XML file into memory, the XML tags are replaced by a pointer structure.

But not until you actually have the unzipped XML to start chewing on.

This goes back to the example of compiled software. When you compile software, variable names are replaced by pointers, and the size of the binary is not affected by the length of the variable names. In a similar way, when you read an XML file, the tags are replaced by pointers, and the length of the XML tags does not affect the size of the data stored in RAM.
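
(A toy Python sketch of the idea, under the assumption that the parser interns tag names; the details vary by parser:)

    # One shared string object per distinct tag name; every element
    # node then stores only a pointer to it, so a verbose tag name
    # costs its length once, not once per element.
    _tag_table = {}

    def intern_tag(name):
        return _tag_table.setdefault(name, name)

    a = intern_tag("table:table-cell")
    b = intern_tag("table:table-cell")
    assert a is b  # same object: two pointers, one copy of the string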

But before you can get to the binary data you have to have the raw XML to process.
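
(Both halves of this are true as far as they go: the parser must see every byte of the raw XML, but a streaming parser only holds a small buffer of it at any moment. A minimal sketch with Python's expat bindings, assuming content.xml has already been extracted:)

    import xml.parsers.expat

    counts = {}

    def start(tag, attrs):
        counts[tag] = counts.get(tag, 0) + 1

    parser = xml.parsers.expat.ParserCreate()
    parser.StartElementHandler = start

    # Feed the raw XML one chunk at a time; only the current chunk
    # needs to be in memory while the statistics are built up.
    with open("content.xml", "rb") as f:
        for chunk in iter(lambda: f.read(64 * 1024), b""):
            parser.Parse(chunk, False)
    parser.Parse(b"", True)  # end of document

    print(counts)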


Please read up on data structures. Find out what an n-ary tree is and what an array is.

I know what n-ary trees and arrays are. I was working with them (arrays anyway) on what passed for a desktop computer back in '85. 16K of RAM, no hard drive, and just a BASIC interpreter in ROM.


Another question: Is the XML processed in a serial fashion? Is it necessary to hold the entire file in memory to parse it?


In theory it's not necessary, but in practice most content is in the same place (content.xml), which puts a bit of a limit on how you can optimize the parsing. For example, if all you wanted was to extract the author of the document, I could write a program that could get that information lightning fast, regardless of the size of your document. But most of the time that's not what you want; you want to actually load the document contents into the application.
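
(A sketch of why that lookup stays fast: in the standard ODF package layout, document metadata sits in its own small meta.xml member, so content.xml never has to be touched. "big.ods" is a hypothetical file name:)

    import xml.etree.ElementTree as ET
    import zipfile

    DC = "{http://purl.org/dc/elements/1.1/}"

    def author(path):
        # meta.xml is a few KB no matter how large the document is.
        with zipfile.ZipFile(path) as z:
            meta = ET.fromstring(z.read("meta.xml"))
        el = meta.find(".//" + DC + "creator")
        return el.text if el is not None else None

    print(author("big.ods"))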

So you finally admit that the raw XML (content.xml, which is like 99% of this file) has to reside in memory while you build the internal data structure that the program actually uses? That 45 MB has to sit there, with the program walking through it, until you get to the point where you can use it. Only then can it be unloaded from memory.


If I had the time I would. Unfortunately, I have to study for certification exams and wade through some mostly useless labs for Advanced Switching and Network Security classes. You see, I'm not technically illiterate;


What year are you in?

Depends on how you count, I suppose. End of the second year of classes. I attended during the summer of '04, and I'll be done with classes in May. In the end I'll have a Master's in Telecommunications and Information Networking plus Cisco Network Professional, Wireless, and Network Security certs. Bring it on, Verizon! :)


telling me how silly and stupid I am.


I never said you're stupid. I said you said some very silly things.

Still unnecessary and not very nice. For example, Ian made a comment the other day that calendars and email don't have much to do with each other. I could have said that was silly, given that Sunbird, Evolution, and Outlook all have a button or menu item that says "Send Invitation by Email". You know, for people that aren't on your iCal server. But I resisted. Oops... sorry, I guess I just did it, huh?


I'm not sure I like you very much anymore.


My goal in life is not that you like me or dislike me.


Then it's kind of hard to fail in that regard, huh.  ;)

--

Rod

