Re: [PD-dev] strings

Hans-Christoph Steiner Sat, 16 Dec 2006 09:40:34 -0800


On Dec 16, 2006, at 4:55 AM, Bryan Jurish wrote:

morning,
On 2006-12-16 01:40:03, Mathieu Bouchard <[EMAIL PROTECTED]>appears to have written:
On Fri, 15 Dec 2006, Hans-Christoph Steiner wrote:
An advantage using the list-of-bytes approach is that because eachcharacter can be represented by a rather large integer, it can beextended to work on lists-of-characters meaning quickly, if thereis a [utf8decode] and [utf8encode] to turn bytes into charactersand back; also it's a method that is available now and reuses theexisting list objects; and it's a method that supports \0 (NUL)characters.Disadvantages are that it takes more time to convert to C stringsand back, it takes more space in .pd files, it isn't readable astext in .pd files, it takes up to 4 times more space to representin .pd files, and exactly 4 times more space in RAM (in the casethat just iso-latin-1 is used), and also that you can't make listsof strings like that.
i count (sizeof(int)+sizeof(float)-1)*strlen(message) wasted bytesper string object, not counting the selector. as i think we'vediscussed before, using ieee floats, which should be able tolosslessly encode a 24 bit integer, that can be tweaked down to(sizeof(int)+sizeof(float)-1)*strlen(message)/3 on average, but onmy system (32 bit floats), that still amounts to one wasted byteper character for the representation, and it's hellishly cryptic toboot.
(By the time we can have real strings, we can have nested-lists,and the other way around, because they'd use the same mechanisms.whether it's better to make them two types or one type, is a goodquestion.)
... but then again, what else are ascii 0x1c-0x1f (28-31 ={fs,gs,rs,us}) for? it's another ugly hack, would reserve some ofthe ascii range, and would require additional parsing objects(potentially constructable with [list]), but it's a possibility,should anyone actually need nested lists as strings...
please don't get me wrong: i'm all in favor of "real" strings,nested lists, and associative arrays - i wrote [pdstring] because ineeded to send some generated text over OSC to someone who couldonly interpret ascii values: i'm glad if it's helpful to anyonebesides myself, and i don't see much difficulty in adding supportfor low-level c-type string operations ([toupper], [tolower], atsome later point maybe even regexes), but i can't bring myself tobelieve that the list-of-bytes approach is really the "right" wayto do it, although i don't have a better idea at the moment...

One advantage of this approach is that many C string functions liketoupper, tolower, strcat, strcmp, etc. would be pretty easy toimplement in Pd, rather than C. A regexp object in C would be prettystraightforward.

How about using a selector "string" for these lists? I suppose thatcould cause mayhem since it would make the list into a selectorseries and run into all the vagaries of handling them.


.hc
------------------------------------------------------------------------

Man has survived hitherto because he was too ignorant to know how torealize his wishes. Now that he can realize them, he must eitherchange them, or perish. -William Carlos Williams




_______________________________________________
PD-dev mailing list
[email protected]
http://lists.puredata.info/listinfo/pd-dev

Re: [PD-dev] strings

Reply via email to