Re: Address of data that is static, be it shared or tls or __gshared or immutable on o/s

Cecil Ward via Digitalmars-d-learn Sun, 10 Sep 2017 14:41:28 -0700

On Wednesday, 6 September 2017 at 15:55:35 UTC, Ali Çehreli wrote:

On 09/06/2017 08:27 AM, Cecil Ward wrote:
> If someone has some static data somewhere, be it in tls or
marked shared
> __gshared or immutable or combinations (whatever), and
someone takes the
> address of it and pass that address to some other routine of
mine that
> does not have access to the source code of the original
definition of
> the object in question, then is it possible to just use 'the
address'
> passed without knowing anything about that data? I'm assuming
that the
> answer might also depend on compilers, machine architectures
and
> operating systems?
>
> If this kind of assumption is very ill-advised, is there
anything
> written up about implementation details in different
operating systems /
> compilers ?
Yes, they are all valid operations. Further, the object neednot be a static one; you can do the same with any object evenit's on the stack. However,
- The object must remain alive whenever the other routine usesit. This precludes the case of the object being on the stackand the other routine saving it for later use. When that lateruse happens, there is no object any more. (An exception: Theobject may be kept alive by a closure; so even that case isvalid.)
- Remember that in D data is thread-local by default; e.g. amodule variable will appear to be on the same address to allthreads but each thread will have its own copy. So, if the datais going to be used in another thread, it must be defined as'shared'. Otherwise, although the code will look like it'sworking, different threads will be accessing different data.(Sometimes this is exactly what is desired but not what you'relooking for.) (Fortunately, many high-level thread operationslike the ones in std.concurrency will not let you share dataunless it's 'shared'.)
Ali

Ali, I have worked on operating systems' development in r+d. Mydefinitions of terms are hopefully the same as yours. If we referto two threads, if they both belong to the same process, thenthey share a common address space, by my definition of the terms'thread' and 'process'. I use thread to mean basically a stack,plus register set, a cpu execution context, but has nothing to dowith virtual memory spaces or o/s ownership of resources, the oneexception being a tls space, which by definition isone-per-thread. A process is one or more threads plus an addressspace and a set of all the resources owned by the processaccording to the o/s. I'm just saying this so you know how I'mused to approving this.

Tls could I suppose either be dealt with by having allocatedregions within a common address space that are all visible to oneanother. Objects inside a tls could (1) be referenced by absolutevirtual addresses that are meaningful to all the threads in theprocess, but not meaningful to (threads belong to) otherprocesses. (By definition of 'process'.) or (2) be referencedmost often by section-offsets, relative addresses from the startof a tls section, which constantly have to be made usable byhaving the tls base virtual address added to them before they canbe dereferenced adding a big runtime cost and making tls very badnews. I have worked on a system like (2). But even in (2) anaddress of a type-2 tls object can still be converted to areadily usable absolute virtual address and used by any thread inthe process with zero overhead. A third option though could be touse processor segmentation, so tls objects have to (3a) bedereferenced using a segment prefixed operation, and then it'simpossible to just have a single dereference operation such asstar without knowing whether to use the segment prefix or not.But if it is again possible to use forbidden or officialknowledge to convert the segmented form into a process-widemeaningful straight address (as in 8086 20-bit addresses) then wecould term this 3a addressing. If this is not possible because vmhardware translation is in use then I will term this 3b. In 3a Iam going to assume that vm hardware is used merely to providerelocation, address offsetting, so the use of a segmentationprefix basically merely adds a per-thread fixed offset to thevirtual address and if you could discover that offset then youdon't need to bother with the segment prefix. In 3b, vm hardwaremaps virtual addresses to a set of per-tls pages usingwho-knows-what mechanism, anyway something that apps cannot justbypass using forbidden knowledge to generate a singleprocess-wide virtual address. This means that 3b threads areprobably breaking my definition of thread vs process, althoughthey threads of one process do also have a common address spaceand they share resources.

I don't know what d's assumptions if any are. I have very brieflylooked at some code generated by GDC and LDC for Linux x64. Itseems to me that these are 3a systems, optimised strongly enoughby the compilers to remove 3a inefficiency that they are nearly1. But I must admit, I haven't looked into it properly, justnoted a few things in passing and haven't written any test casesas I don't know d well enough yet. I haven't seen the code thesecompilers generate for Windows.

[Many thanks for your superb book btw, which I am just readingfor the second time round. I wouldn't have got very far withoutit.]

Re: Address of data that is static, be it shared or tls or __gshared or immutable on o/s

Reply via email to