On 23.09.25 at 17:35, Dmitry Olshansky wrote:
On Monday, 22 September 2025 at 11:14:17 UTC, Sönke Ludwig wrote:
On 22.09.25 at 09:49, Dmitry Olshansky wrote:
On Friday, 19 September 2025 at 17:37:36 UTC, Sönke Ludwig wrote:
So you don't support timeouts when waiting for an event at all?
Otherwise I don't see why a separate API would be required; this
should be implementable with plain Posix APIs within vibe-core-lite
itself.
Photon's API is the syscall interface. So to wait on an event you
just call poll.
Behind the scenes it will just wait on the right fd to change state.
Now vibe-core-light wants something like read(buffer, timeout), which
is not a syscall API but could be added. But since I'm going to add new
API anyway, I'd rather have something consistent and sane, not just a
bunch of ad-hoc functions to satisfy the vibe.d interface.
Why can't you then use poll() to, for example, implement `ManualEvent`
with timeout and interrupt support? And shouldn't recv() with a timeout
be implementable the same way: poll with a timeout and only read when
ready?
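A rough sketch of that idea with plain Posix calls (illustration only, not photon's or vibe-core's actual API; `recvTimeout` and its return convention are made up):

```d
import core.sys.posix.poll;
import core.sys.posix.sys.socket : recv;

// recv with a timeout as poll + recv (timeout in milliseconds).
// Returns the bytes read, -1 on error, or -2 if the timeout expired
// (-2 is just a sentinel for this illustration).
ptrdiff_t recvTimeout(int sock, void[] buf, int timeoutMs)
{
    auto pfd = pollfd(sock, POLLIN, 0);
    const r = poll(&pfd, 1, timeoutMs);
    if (r == 0) return -2;   // timed out
    if (r < 0)  return -1;   // poll error
    return recv(sock, buf.ptr, buf.length, 0);
}
```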
Yes, recv with timeout is basically poll+recv. The problem is that then
I need to support interrupts in poll. Nothing really changed.
As far as ManualEvent goes, I've implemented that with a custom condition
variable and mutex. That mutex is not interruptible, as it's backed by a
semaphore on the slow path in the form of an eventfd.
I might create a custom mutex that is interruptible, I guess, but the
notion of interrupts would have to be introduced to photon. I don't
really like that.
I'd probably create an additional event FD per thread that is used to
signal interruption, and also pass that to any poll() that is used for an
interruptible wait.
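A minimal sketch of that scheme, assuming a Linux eventfd per thread (the `waitReadable`/`interruptWaiter` names are made up for illustration):

```d
import core.sys.linux.sys.eventfd : eventfd, EFD_CLOEXEC;
import core.sys.posix.poll;
import core.sys.posix.unistd : read, write;

int interruptFd;               // thread-local by default in D

static this()                  // one eventfd per thread
{
    interruptFd = eventfd(0, EFD_CLOEXEC);
}

// Wait for fd to become readable, with timeout and interruption support.
// Returns true if fd is readable, false on timeout or interrupt.
bool waitReadable(int fd, int timeoutMs)
{
    pollfd[2] fds = [pollfd(fd, POLLIN, 0), pollfd(interruptFd, POLLIN, 0)];
    const r = poll(fds.ptr, fds.length, timeoutMs);
    if (r <= 0) return false;                 // timeout or error
    if (fds[1].revents & POLLIN) {
        ulong tmp;
        read(interruptFd, &tmp, tmp.sizeof);  // consume the interrupt signal
        return false;                         // interrupted
    }
    return (fds[0].revents & POLLIN) != 0;
}

// Called from another fiber/thread, given the waiter's interrupt fd.
void interruptWaiter(int waiterInterruptFd)
{
    ulong one = 1;
    write(waiterInterruptFd, &one, one.sizeof);  // wakes any poll() on that fd
}
```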
I think we have a misunderstanding of what vibe.d is supposed to be.
It seems like you are only focused on the web/server role, while to me
vibe-core is a general-purpose I/O and concurrency system with no
particular specialization in server tasks. With that view, your
statement to me sounds like "Clearly D is not meant to do multi-
threading, since main() is only running in a single thread".
The defaults are what is important. Go defaults to multi-threading, for
instance.
D defaults to multi-threading, because TLS by default is certainly a mark
of a multi-threaded environment. std.concurrency defaults to a new thread
per spawn; again, this tells me it's about multi-threading. I intend to
support multi-threading by default. I understand that we view this issue
differently.
But you are comparing different defaults here. With plain D, you also
have to import either `core.thread` or
`std.concurrency`/`std.parallelism` to do any multi-threaded work. The
same is true for vibe-core. What you propose would be more comparable to
having foreach() operate like parallelForeach(), with far-reaching
consequences.
If we are just talking about naming - runTask/runWorkerTask vs.
go/goOnSameThread - that is of course debatable, but in that case I
think it's blown very much out of proportion to take that as the basis
for claiming "it's meant to be used single-threaded".
Anything client-side involving a user interface has plenty of
opportunities for employing secondary tasks or long-running, sparsely
updated state logic that is not CPU bound. Most of the time is
spent idle there. Specific computations, on the other hand, can of
course still be handed off to other threads.
Latency is still going to be better if multiple cores are utilized.
And I'm still not sure what the example is.
We are comparing fiber switches and working on data with a shared
cache and no synchronization to synchronizing data access and control
flow between threads/cores. There is such a broad spectrum of
possibilities for one of those to be faster than the other that it's
just silly to make a general statement like that.
The thing is that if you always share data between threads, you have
to pay for that for every single data access, regardless of whether
there is actual concurrency going on or not.
Obviously, we should strive to share responsibly. Photon has Channels
much like vibe-core has Channel. Mine are MPSC though, mostly to model
Input/Output range concepts.
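For reference, a small sketch of cross-thread producer/consumer use of vibe-core's Channel (based on my reading of vibe.core.channel; treat the exact method names as assumptions):

```d
import vibe.core.channel : createChannel;
import vibe.core.core : runTask, runWorkerTask;

void channelExample()
{
    auto chan = createChannel!int();

    // Producer on a worker thread; the channel handle itself is safe to pass.
    runWorkerTask((typeof(chan) c) nothrow {
        try {
            foreach (i; 0 .. 10)
                c.put(i);
            c.close();
        } catch (Exception e) {}
    }, chan);

    // Consumer as a task on the current thread.
    runTask(() nothrow {
        try {
            int v;
            while (chan.tryConsumeOne(v)) {
                // process v ...
            }
        } catch (Exception e) {}
    });
}
```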
True, but it's still not free (as in CPU cycles and code complexity) and
you can't always control all code involved.
If you want a concrete example, take a simple download dialog with a
progress bar. There is no gain in off-loading anything to a separate
thread here, since this is fully I/O bound, but it adds quite some
communication complexity if you do. CPU performance is simply not a
concern here.
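A sketch of that scenario as a single task on the event-loop thread (the `updateProgressBar` helper and the `InputStream` source are stand-ins for the real UI and download code):

```d
import vibe.core.core : runTask;
import vibe.core.stream : InputStream, IOMode;

// Hypothetical UI hook, just for illustration.
void updateProgressBar(double fraction) nothrow {}

// `source` stands in for the HTTP response body or TCP stream being downloaded.
void startDownload(InputStream source, ulong totalBytes)
{
    // Runs as a fiber on the current (event-loop) thread; every read that has
    // to wait simply yields, so the UI stays responsive and no data crosses threads.
    runTask(() nothrow {
        try {
            ubyte[4096] chunk;
            ulong received;
            while (received < totalBytes) {
                const n = source.read(chunk[], IOMode.once);
                if (n == 0) break;              // premature end of stream
                received += n;
                updateProgressBar(cast(double) received / totalBytes);
            }
        } catch (Exception e) { /* report the error to the dialog */ }
    });
}
```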
Channels tame the complexity. Yes, channels could get more expensive in a
multi-threaded scenario, but we already agreed that it's not CPU bound.
If you have code that does a lot of these things, this just degrades
code readability for absolutely no practical gain, though.
The problem is that, for example, you might have a handle that was
created in thread A and is not valid in thread B, or you set some state
in thread A and thread B doesn't see that state. This would mean
that you are limited to a single task for the complete library
interaction.
Or just initialize it lazily in all threads that happen to use it.
Otherwise, this basically amounts to sticking to one thread.
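Something along these lines, with a hypothetical C library (`LibHandle`/`lib_open` are made-up names):

```d
// Hypothetical C library binding, for illustration only.
struct LibHandle;
extern(C) LibHandle* lib_open() nothrow @nogc;

LibHandle* threadHandle;    // module-level variables are thread-local in D

LibHandle* getHandle()
{
    if (threadHandle is null)
        threadHandle = lib_open();  // first use in this thread opens its own handle
    return threadHandle;
}
```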
But then it's a different handle representing a different object -
that's not the same thing. I'm not just talking about initializing the
library as a whole. But even then, there are a lot of libraries that
don't use TLS and are simply not thread-safe at all.
Something that is not thread-safe at all is a dying breed; we've had
multi-core machines for 20 years now. Most libraries can be initialized
once per thread, which is quite naturally modeled with a TLS handle to
said library. Communicating between fibers via a shared TLS handle is not
something I would recommend, regardless of the default spawn behavior.
Unfortunately, those libraries are an unpleasant reality that you can't
always avoid.
BTW, one of the worst offenders is Apple's whole Objective-C API.
Auto-release pools in particular make it extremely fragile to work with
fibers at all, and of course there are all kinds of hidden thread
dependencies inside.
This doesn't make sense; in the original vibe-core, you can simply
choose between spawning in the same thread or in "any" thread.
`shared`/`immutable` is correctly enforced in the latter case to
avoid unintended data sharing.
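For comparison, a sketch of the vibe-core side; runWorkerTask refuses arguments that can't safely cross threads, while runTask stays on the calling thread:

```d
import vibe.core.core : runTask, runWorkerTask;

void spawnExamples()
{
    int[] local = [1, 2, 3];

    // Same thread: the task shares the caller's thread-local state.
    runTask(() nothrow {
        // `local` and other thread-local data can be used here freely
    });

    // Another thread: arguments must be immutable/shared/isolated,
    // otherwise this call does not compile.
    runWorkerTask((immutable(int)[] data) nothrow {
        // `data` is immutable, so reading it from a worker thread is fine
    }, local.idup);

    // runWorkerTask((int[] data) nothrow {}, local); // rejected: mutable, unshared
}
```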
I have go and goOnSameThread. Guess which is the encouraged option.
Does go() enforce proper use of shared/immutable when passing data to
the scheduled "go routine"?
It uses the same API as we have for threads - a delegate - so sharing
becomes the user's responsibility. I may add a function + args variant
for better handling of resources passed to the lambda.
That means that this is completely un`@safe` - C++ level memory safety.
IMO this is an unacceptable default for web applications.
The GC/malloc is the main reason why this is mostly false in
practice, but it extends to any central contention source within the
process - yes, often you can avoid that, but often that takes a lot
of extra work and processes sidestep that issue in the first place.
As one can observe by looking at other languages and runtimes, malloc
is not the bottleneck it used to be. Our particular GC, which doesn't
have thread caches, is a bottleneck.
malloc() will also always be a bottleneck under the right load. Just
the n-times larger amount of virtual address space required may start
to become an issue for memory-heavy applications. But even if we ignore
that, ruling out using the existing GC doesn't sound like a good idea
to me.
The existing GC is basically 20+ years old; of course we need a better GC,
and thread-cached allocation solves contention in multi-threaded environments.
An alternative memory allocator is doing great on 320-core machines. I
cannot tell you which allocator that is or what exactly these servers
are, though even jemalloc does okay-ish.
And the fact is that, even with relatively mild GC use, a web
application will not scale properly with many cores.
I only partially agree: Java's GC handles the load just fine and runs faster
than vibe.d(-light), and it does allocations on its serving code path.
I was just talking about the current D GC here. Once we have a better
implementation, this can very well become a much weaker argument!
However, speaking more generally, the other arguments for preferring to
scale using processes still stand, and even with a better GC I would
still argue that leading library users to do multi-threaded request
handling is not necessarily the best default (of course it still *can*
be for some applications).
Anyway, the main point from my side is just that the semantics of what
*is* in vibe-core-light should really match the corresponding functions
in vibe-core. Apart from that, I was just telling you that your
impression of it being intended to be used single-threaded is not right,
which doesn't mean that the presentation couldn't emphasize the
multi-threaded functionality and multi-threaded request processing more.
Separate processes also have the advantage of being more robust and of
enabling seamless restarts and updates of the executable. They also
facilitate an application design that lends itself to scaling across
multiple machines.
Then give me example code to run multiple vibe.d instances in parallel
processes (it should be similar to runDist) and we can compare
approaches. For all I know it could be faster than multi-threaded
vibe.d-light. Also, honestly, if vibe.d's target is multiple processes,
it should probably start like that by default.
Again, the "default" is a high-level issue and none of vibe-core's
business. The simplest way to have that work is to use
`HTTPServerOption.reusePort` and then start as many processes as desired.
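For reference, a minimal sketch of that setup with vibe.d's HTTP server (error handling omitted); then start as many of these processes as there are cores:

```d
import vibe.core.core : runApplication;
import vibe.http.server;

void handleRequest(HTTPServerRequest req, HTTPServerResponse res)
{
    res.writeBody("Hello, World!");
}

void main()
{
    auto settings = new HTTPServerSettings;
    settings.port = 8080;
    // Allow several independent processes to bind the same port; the kernel
    // then distributes incoming connections between them.
    settings.options |= HTTPServerOption.reusePort;

    listenHTTP(settings, &handleRequest);
    runApplication();
}
```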
So I did just that. To my surprise it indeed speeds up all of my D
server examples.
The speed-ups are roughly:
On vibe-http-light:
 8 cores: 1.14x
12 cores: 1.10x
16 cores: 1.08x
24 cores: 1.05x
32 cores: 1.06x
48 cores: 1.07x

On vibe-http-classic:
 8 cores: 1.33x
12 cores: 1.45x
16 cores: 1.60x
24 cores: 2.54x
32 cores: 4.44x
48 cores: 8.56x

On plain photon-http:
 8 cores: 1.15x
12 cores: 1.10x
16 cores: 1.09x
24 cores: 1.05x
32 cores: 1.07x
48 cores: 1.04x
We should absolutely tweak the vibe.d TechEmpower benchmark to run vibe.d
as a process per core! As far as the photon-powered versions go, I see
there is a point where per-process becomes less of a gain with more cores,
so I would think there are two factors at play, one positive and one
negative, with the negative one tied to the number of processes.
Lastly, I have found opportunities to speed up vibe-http even without
switching to vibe-core-light. Will send PRs.
Interesting, I wonder whether it's the REUSE_PORT connection distribution
that gets more expensive when it works across processes. Agreed that
the TechEmpower benchmark is in dire need of being looked at. In fact, I
had the code checked out for a long while, intending to look into it,
because it obviously didn't scale like my own benchmarks, but then I
never got around to doing it, being too busy with other things.