Re: [openssl-dev] [openssl-team] Discussion: design issue: async and -lpthread

Nico Williams Mon, 23 Nov 2015 09:46:19 -0800

[Viktor asked me for my advice on this issue and bounced me the post
 that I'm following up to.  -Nico]

The summary of what I've to say is that making libcrypto and libssl need
-lpthread is something that does require discussion, as it will have
detrimental effects on some users.  Personally, I think that those
detrimental effects are a good thing (see below), but nonetheless I
encourage you to discuss whether this is actually what OpenSSL should
do.  In particular, it may be possible to avoid -lpthread on some
systems and still get a subset of lipthread functionality from libc or
the compiler (e.g., thread-locals), and that may be worth doing.

On Mon, Nov 23, 2015 at 01:53:47AM +0000, Viktor Dukhovni wrote:
> As a side-effect of the MacOS/X MR I've become aware that the async
> code in its current state links master with "-lphtread", and defines
> macros that enable multi-theaded (as opposed to merely thread-safe)
> compilation into OpenSSL.
> 
>     commit 757d14905e3877abaa9f258f3dd0694ae3c7c270
>     Author: Matt Caswell <m...@openssl.org>
>     Date:   Thu Nov 19 14:55:09 2015 +0000
> 
>       Add pthread support
> 
>       The forthcoming async code needs to use pthread thread local variables. 
> This
>       updates the various Configurations to add the necessary flags. In many 
> cases
>       this is an educated guess as I don't have access to most of these
>       environments! There is likely to be some tweaking needed.
> 
>       Reviewed-by: Kurt Roeckx <k...@openssl.org>
> 
> This is quite possibly not be the right thing to do, and deserves
> some attention from the team.  We might even seek outside som
> outside advice from folks well versed in platform-specific library
> engineering (Christos Zoulas from NetBSD, Nico Williams formerly
> from Sun, ...).
> 
> My concern is that introducing -lpthread automatically converts
> single-threaded applications that link with OpenSSL into threaded
> applications (with a single thread).  This may well have undesirable
> consequences.

Some background may be needed.  When threading was introduced in the
90s, and in some cases still to this day, generally the end result was
that the system had to support a number "process models", with potential
transitions from one to another at run-time:

 - single-threaded, dynamically linked
 - single-threaded, statically  linked
 - multi-threaded,  dynamically linked
 - multi-threaded,  statically  linked
 - multi-threaded,  mixed linkage

The statically-linked models can become mixed-linkage models via
dlopen().  The single-threaded model can become a threaded model via
dlopen() of an object linked with -lpthread, or by dlopen()ing
libpthread itself.

Solaris 9 and under used to have a veritable rats nest of code to deal
with the process model transitions from single-threaded to
multi-threaded.  Solaris 10 unified these and moved libpthread and libdl
into libc [with filters left behind for backwards compatibility].  Thus,
on Solaris 10 and up, and Illumos, OpenSSL using -lpthread or not makes
no difference.

I'm quite fond of the approach taken by Solaris 10 and up (and thus also
Illumos): there is but one process model, and it is threaded, with
pthreads in libc.  But that need not be the way it works everywhere.
Some systems may still support multiple process models.

For a library like OpenSSL making use of -lpthread does mean dictating
to users that they may only use a threaded process model.  OTOH, not
using -lpthread allows the user to choose a process model unconstrained
by such a library.

Until now OpenSSL has avoided forcing the user to choose any particular
process model.  Now with this commit OpenSSL is now taking the reverse
stance.  This seems like a very significant change that should at least
be noted prominently in the release notes, but it should also be
disucssed, indeed.

Personally, I believe this change is a good thing, as OpenSSL really
ought to either automatically initialize its "lock callbacks" or do away
with them completely (leaving backwards compatibility stubs) and use the
OS' locking facility by default / only.  Automatic lock callback
initialization without forcing the use of -lpthread and still allowing
static linking would be tricky indeed (for example: OpenSSL couldn't use
weak symbols to detect when -lpthread gets brought in).

Still, if -lpthread avoidance were still desired, you'd have to find an
alternative to pthread_key_create(), pthread_getspecific(), and friends.

For thread-specifics the obvious answer is to use C compiler
thread-local variable support.  This might or might not be available;
this would have to be determined at build configuration time.  Still, if
where the compiler supports thread-locals, OpenSSL could avoid
-lpthread.

For pthread mutex functions (for lock callbacks) and, perhaps,
pthread_once() (for automatic initialization of lock callbacks,
perhaps), it's very difficult to create alternatives to -lpthread, but
it can conceivably be done within a library like OpenSSL.  I don't think
it's worth doing though!  Instead I think it's best to just use
-lpthread and let the OS and its libc and libpthread take care of any
desirable optimizations in single-threaded cases, as well as any
automatic transitions to non-optimized implementations upon creation of
the first additional thread.

The single-threaded and statically linked process model has a number of
detrimental effects, and ISTM that by bending over backwards to avoid
forcing the choice of process model, OpenSSL and similar allow users to
believe in fairy tales.  One detrimental effect has been OpenSSL's
bloody lock callbacks, which in turn mean that it is impossible to use
OpenSSL perfectky thread-safely from another library!  Must every
program use -lcrypto and -lpthread, and setup lock callbacks just in
case some other library the program really does use calls dlopen()?

Other detrimental effects of static linking include:

 - the flattened dependency topology forced by static linking (with
   attendant confusion in *-cofig(1) programs), with attendant
   accidental symbol interposition;

 - partial loss of ASLR protection (one random load address for the
   entire executable text and all its libraries, versus one for each
   object)

 - the need for complex single- to multi-threaded transition support
   code in various system libraries (mostly in libc).

 - slower load times.  Yes, slower.  A single statically-linked
   executable will load faster than the same program dynamically-linked.
   But a larger system than just one executable (e.g., the host OS,
   related applications, helper programs, ...) will load slower because
   multiple copies of common texts will not be shareable.

If a major library like OpenSSL were to force operating systems to deal
with the shortcommings of mixed process models and static linking, that
would be a good thing.  That's what OpenSSL would be doing by using
-lpthread.

> The C-library on many platforms is carefully designed to operate
> efficiently and locklessly in single-threaded applications, and to
> only switch to thread-aware implementations of functions when the
> pthread library is loaded.

I don't think the loss of the lockless optimizations should be of
particular concern to OpenSSL: if that's a problem for a given OS, let
its implementors deal with it.  (They can do it, by, e.g., hot-swapping
lock functions at pthread_create() call time rather than at the time
that libpthread is loaded.)

> In NetBSD, for example, the <pthread.h> header file carefully
> defines non-threaded (or rather lazily threaded only when pthread
> is also loaded) versions of various pthread functions, see the
> extract of pthread.h below my signature.

It shouldn't be the loading of libpthread that causes transition to a
threaded model (and, therefore, loss of single-threaded optimizations),
but calling pthread_create().  If NetBSD really does the former, well,
too bad: it really ought to be changed to do the latter.

> Therefore, on any platform where the C-library is properly engineered
> in this way (at least NetBSD and MacOS/X), we should not be
> introducing -lpthread or definining "-D_REENTRANT", ...

Nonetheless, this is good advice.  If the necessary functionality from
libpthread can be found in libc (or, in the case of thread-locals, in
the compiler), then OpenSSL should avoid bringing in -lpthread.

As long as they support seamless upgrade to threaded model via dlopen(),
this is good advice.

Of course, this does mean that OpenSSL needs more complexity in its
configurator: to decide when to use -lpthread and when not to.

> We are a thread-safe library, *not* a threaded library, and must
> not introduce threading into threaded applications.

That's a fairy tale.

OpenSSL is only thread-safe when either a) used only in single-threaded
processes that never transition to multi-threaded process models, or b)
when the program initializes the OpenSSL lock callbacks before any
libraries that need OpenSSL.  Those two constraints are so difficult to
meet that I don't think it's fair to call OpenSSL a thread-safe library
at all.

It's time to fix this.  If OpenSSL will now require -lpthread, then it
gets easier, certainly, so I'm not exactly against OpenSSL requiring
-lpthread!  ;)

> We should think carefully about which platforms get async support
> (perhaps one at time, and before it has been tested), and whether
> "-pthread" is an acceptable penalty for any potential future
> advantage of async support (none at present).
> 
> Thoughts?  Comments?

See above.  I'm not sure you'll be please with mine :)

> -- 
>       Viktor.
> 
> [NetBSD header commentary extracts elided]

Nico
-- 
_______________________________________________
openssl-dev mailing list
To unsubscribe: https://mta.openssl.org/mailman/listinfo/openssl-dev

Re: [openssl-dev] [openssl-team] Discussion: design issue: async and -lpthread

Reply via email to