Hello
Python's new modular io system is awesome, but I'm currently running
into some of its limits while replacing the raw FileIO with a more
advanced stream.
So here are a few ideas and questions regarding the mechanisms of this
IO system. Note that I'm speaking in Python terms, but these ideas
should also apply to the C implementation (with more programming
hassle, of course).
- some streams have specific attributes (e.g. mode, name...), but since
they'll often be wrapped inside buffering or encoding streams, these
attributes will not be available to the end user.
So wouldn't it be great to implement some "transversal inheritance",
simply by delegating to the underlying buffer/raw stream any attribute
lookup which fails on the current stream? A little __getattr__ should
do the trick, shouldn't it?
By the way, I'm having trouble with the "name" attribute of raw files,
which can be a string or an integer (confusing), is ambiguous when it
contains a relative path, and can't handle the new case of my library,
i.e. opening a file from an existing file handle (which is ALSO an
integer, like C file descriptors...); I propose we deprecate it in
favour of more precise attributes, like "path" (absolute path) and
"origin" (which could be "path", "fileno", "handle", and could be
extended...).
Methods, too, would deserve some auto-forwarding. If you want to buffer
a raw stream which also offers size(), times(), lock_file() and other
methods, how can these be accessed from a top-level buffering/text
stream? So it would be interesting to have a system through which a
stream can expose its additional features to top-level streams, and at
the same time tell them whether they must flush() before calling these
new methods (e.g. asking for the inode number of a file doesn't require
flushing, but knowing its real size DOES require it). A rough sketch of
what I mean follows below.
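To make this concrete, here is a minimal pure-Python sketch of such
forwarding. ExtendedFileIO, needs_flush, size() and inode() are invented
names just for the example (and it writes a scratch file "demo.bin");
only __getattr__ and the standard io classes are real:

    import io
    import os

    def needs_flush(method):
        # hypothetical marker: this raw-stream method only makes sense
        # once the layers above have flushed their buffers
        method._needs_flush = True
        return method

    class ExtendedFileIO(io.FileIO):
        # a raw stream exposing extra features (invented names)

        @needs_flush
        def size(self):
            # real on-disk size: needs the buffers above to be flushed
            return os.fstat(self.fileno()).st_size

        def inode(self):
            # pure metadata: no flush needed
            return os.fstat(self.fileno()).st_ino

    class ForwardingBufferedWriter(io.BufferedWriter):
        # forwards unknown attributes/methods to the raw stream,
        # flushing itself first when the target method asks for it

        def __getattr__(self, name):
            attr = getattr(self.raw, name)  # AttributeError propagates
            if callable(attr) and getattr(attr, "_needs_flush", False):
                def guarded(*args, **kwargs):
                    self.flush()            # push pending bytes down first
                    return attr(*args, **kwargs)
                return guarded
            return attr

    raw = ExtendedFileIO("demo.bin", "w")
    buffered = ForwardingBufferedWriter(raw)
    buffered.write(b"hello")
    print(buffered.inode(), buffered.size())  # size() flushes first
    buffered.close()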
- I feel thread-safety locking and stream status checking are currently
overly complicated. All methods are filled with locking calls and
CheckClosed() calls, which is both a performance loss (most io streams
will have 3 such levels of locking, when 1 would suffice) and
error-prone (some time ago I saw in the sources several functions in
which checks and locks seemed to be lacking).
Since we're in the mood for nesting streams anyway, why not simply add
a "safety stream" on top of each stream chain returned by open()? That
layer could gracefully handle mutex locking, CheckClosed() calls, and
maybe even the attribute/method forwarding I evoked above (a possible
sketch follows below). I know a pure metaprogramming solution might not
satisfy performance-seekers, but static implementations should be
doable as well.
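For what it's worth, a naive pure-Python sketch of such a layer could
look like the following; SafetyWrapper and its _forwarded list are
invented names, and a real version would of course need a C counterpart
for speed:

    import threading

    class SafetyWrapper:
        # hypothetical top-level layer: one lock and one closed-check
        # for the whole chain, instead of one per stream per method

        _forwarded = ("read", "readline", "write", "seek", "tell",
                      "flush", "truncate")

        def __init__(self, stream):
            self._stream = stream
            self._lock = threading.RLock()

        def __getattr__(self, name):
            attr = getattr(self._stream, name)
            if name in self._forwarded and callable(attr):
                def guarded(*args, **kwargs):
                    with self._lock:              # single mutex for the chain
                        if self._stream.closed:   # single CheckClosed()
                            raise ValueError(
                                "I/O operation on closed stream")
                        return attr(*args, **kwargs)
                return guarded
            return attr

        @property
        def closed(self):
            return self._stream.closed

        def close(self):
            with self._lock:
                self._stream.close()

    f = SafetyWrapper(open("demo.txt", "w", encoding="utf-8"))
    f.write("hello")
    f.close()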
- some semantic decisions of the current system are somewhat dangerous.
For example, flushing errors occurring on close are swallowed. It seems
to me that it's of the utmost importance that the user be warned if the
bytes he wrote disappeared before reaching the kernel; shouldn't we
decidedly enforce a "don't hide errors" policy everywhere in the io
module? A sketch of the close() semantics I have in mind is below.
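Just to illustrate those semantics, here is a sketch of a close() that
still releases the underlying stream but refuses to hide a flush error;
StrictClosingWrapper is an invented name, not an existing class:

    class StrictClosingWrapper:
        # hypothetical wrapper: close() still closes the underlying
        # stream, but a failed flush() is re-raised, not swallowed

        def __init__(self, stream):
            self._stream = stream

        def __getattr__(self, name):
            return getattr(self._stream, name)

        def close(self):
            if self._stream.closed:
                return
            try:
                self._stream.flush()      # e.g. OSError if the disk is full
            except Exception:
                try:
                    self._stream.close()  # still release the descriptor
                except Exception:
                    pass                  # the flush error matters most
                raise
            else:
                self._stream.close()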
Regards,
Pascal