Re: iopipe alpha 0.0.1 version

Steven Schveighoffer via Digitalmars-d-announce Fri, 13 Oct 2017 11:42:04 -0700

On 10/13/17 11:59 AM, Martin Nowak wrote:

On Thursday, 12 October 2017 at 04:22:01 UTC, Steven Schveighoffer wrote:
I added a tag for iopipe and added it to the dub registry so people can try it out.
I didn't want to add it until I had fully documented and unittested it.

http://code.dlang.org/packages/iopipe
https://github.com/schveiguy/iopipe
Great news to see continued work on this.
I'll just use this thread to get started on design discussions. If there is a better place for that, let me know ;).

This is as good a place as any :) I may create some issue reports on github to track things better.

Questions/Ideas
- You can move docs out of the repo to fix search, e.g. by pushing them to a `gh-pages` branch of your repo.


When I tried the search it seemed to work...

See https://github.com/MartinNowak/bloom/blob/736dc7a7ffcd2bbca7997f273a09e272e0484596/travis.sh#L13 for an automated setup using Travis-CI and ddox/scod.

I admit complete ignorance on this, I need to look into it, but at the moment, I'm OK with committing the generated docs directly as an ugly extra step. When I looked at the options for adding a "pages" piece to the project, I saw that if I put things under a "docs" directory, it could use that, so that's what I went with.

- Standard device implementation?
Your library already has the notion of devices as thin abstractions over file/socket handles. Should we start with such an unbuffered IO library as a foundation, including support hooks for Fiber-based event loops? Something along the lines of https://code.dlang.org/packages/io? Without a standard device lib, IOPipe could not be used in APIs.

I absolutely think this would be a great idea. In fact, you could use Jason White's io package with iopipes directly, as his low-level types have the necessary read function: https://github.com/jasonwhite/io/blob/master/source/io/file/stream.d#L335

Perhaps we could coax the basic types out of that library to provide a base for both iopipe and his high-level stuff. The stream portion of my library is really just a throwaway piece that is not a focus of the library. Indeed, I created it because unbuffered stream types didn't exist anywhere (the IODev type predates iopipe, as it was part of my original attempt to rewrite Phobos io).

- What's the plan for @safe buffer/window invalidation? Right now you're handing out raw access to internal buffers with an inherent memory safety problem.

I don't plan to put any restrictions on this. In fact the core purpose of iopipe is to give raw buffer access to aid in writing higher-level routines around it. As I said here: https://github.com/schveiguy/iopipe/blob/master/source/iopipe/buffer.d#L217

If the Allocator supports deallocation I call it, but it may not be the correct thing to do. There is a sticky point in std.experimental.allocator: the GC allocator defines deallocate, because it's available, but the *presence* of that member may be taken to mean you have to call it to deallocate. There is no member saying whether deallocation is optional.

In my wrapper GCNoPointerAllocator (which I needed to support allocating ubyte buffers without having to scan them), I leave out the deallocate function, so technically it's @safe with that allocator.
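
A minimal sketch of that "call deallocate only if the member exists" pattern. The two allocator structs and the `releaseBuffer` helper are hypothetical stand-ins written for illustration; they are not the iopipe or std.experimental.allocator code.

```d
// Hypothetical allocator that exposes deallocate (illustration only).
struct WithDeallocate
{
    void[] allocate(size_t n) { return new ubyte[](n); }
    bool deallocate(void[] b) { return true; } // pretend the memory was freed
}

// Hypothetical allocator that leaves deallocate out entirely.
struct WithoutDeallocate
{
    void[] allocate(size_t n) { return new ubyte[](n); }
}

// Returns true if the allocator's deallocate member was called.
bool releaseBuffer(Allocator)(ref Allocator a, void[] b)
{
    // Decided at compile time: only the presence of the member is
    // visible here, not whether calling it is actually required.
    static if (__traits(hasMember, Allocator, "deallocate"))
        return a.deallocate(b);
    else
        return false; // nothing to call; the memory is left to the GC
}
```

The `static if` can only see whether the member exists, which is exactly the ambiguity described above: it cannot tell a mandatory deallocate from an optional one.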

I will say though, at some point, I'm going to focus on making as much of iopipe @safe as possible. That may require using the GC for buffering.

   ```d
   auto w = f.window();
   f.extend(random());
   w[0]; // ⚡ dangling pointer ⚡
   ```
I can see how the compiler could catch that if we'd go with compile-time enforced safety for RC and friends. But that's still unclear atm. and we might end up with a runtime RC/weak ptr mechanism instead, which wouldn't be too good a fit for that window mechanism.

What would be nice is a mechanism to detect this situation, since the above is both un-@safe and incorrect code.

Possibly you could instrument a window with a mechanism to check to see if it's still correct on every access, to be used when compiled in non-release mode for checking program correctness.

But in terms of @safe code in release mode, I think the only option is really to rely on the GC or reference counting to allow the window to still exist.
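
One way the per-access check could look, as a rough sketch only. `CheckedBuffer` and `CheckedWindow` are hypothetical names, not iopipe types, and the sketch assumes the buffer itself is not moved or copied while windows are outstanding. The buffer bumps a generation counter on every extend, and the window asserts that its remembered generation still matches; D compiles `assert` out with `-release`, so the check only exists in non-release builds.

```d
// Hypothetical buffer that bumps a generation counter whenever its
// storage may have been reallocated.
struct CheckedBuffer
{
    private ubyte[] data;
    private size_t generation;

    // Grow the buffer; any window handed out earlier becomes stale.
    void extend(size_t n)
    {
        data.length += n;
        ++generation;
    }

    // Hand out a window that remembers the generation it was created in.
    CheckedWindow window()
    {
        return CheckedWindow(&this, data, generation);
    }
}

// Hypothetical window type that validates itself on element access.
struct CheckedWindow
{
    private CheckedBuffer* owner;
    private ubyte[] slice;
    private size_t generation;

    // True while the buffer has not been extended since window().
    bool valid() const { return owner.generation == generation; }

    ref ubyte opIndex(size_t i)
    {
        // Removed by -release, so release builds pay nothing for it.
        assert(valid(), "window used after the buffer was extended");
        return slice[i];
    }
}
```

This detects the stale access in debug builds but does nothing for memory safety in release mode, which is why the GC or reference counting would still be needed there.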

- What about the principle that the caller should choose allocation/ownership?


It can, BufferManager takes an Allocator compile-time option.

It's also possible to create your own ownership or allocation scheme as long as you implement the required iopipe methods.

Having an extend method means the IOPipe is responsible for growing/allocating buffers, so you'll end up with IOPipeMalloc, IOPipeGC, IOPipeAllocatorGrowExp (or their template alternatives), not very nice for APIs.

extend is a core part of the iopipe system. The point of the library is that your higher-level code doesn't have to manage buffering, memory ownership, or allocation. I've used so many buffered streams where I still have to create my own buffer because a quirk in the way I have to process the data doesn't fit the API of the stream. This mitigates that by giving you direct control over how much data should be buffered, but not burdening you with the details of managing that memory. The mechanism was clear to me in Dmitry Olshansky's simple back-reference toy library that he made a while back (and actually was the inspiration for making iopipe instead of what I was doing before).


I can't find his library any more, but here is the post he made:

https://forum.dlang.org/post/[email protected]

- Why contiguous memory? The current implementation reallocs and, even weirder, memmoves data in extend. https://github.com/schveiguy/iopipe/blob/3589a4c9fc72b844eb4efd3ae718773faf9ab9ed/source/iopipe/buffer.d#L171
   Shouldn't a modern IO library be as zero-copy as possible?
The docs say random access, that should be supported by ring buffers or lists/arrays of buffers. Any plans towards that direction?


Yes and no :)

My original idea was that once I got simple array buffers working, I would move on to circular buffers, and linked lists of buffers, etc, with all the details hidden by the range itself. I still might implement this. Windows and Posix support the notion of scatter read so you can easily implement a way for streams to fit perfectly on top of these things.

But what I realized is that in practice (and especially when battling to beat Phobos byLine and libc's getline), avoiding copying may not be as important as I thought. For one thing, the focused data (the data you care about currently) is generally much smaller than the real buffer size. So when it is calling memmove, you are generally only moving a tiny piece of the buffer.

Second, the CPU is really good at dealing with arrays (and searching through arrays), especially when dereferencing data.

Third, every single access to a non-array is going to have to go through some mechanism to check which actual array the index falls into. When implementing iopipe's byline, I got a SIGNIFICANT speedup by copying members of the ByLine struct (e.g. the dchar being searched for) into a local variable. If you have a custom range for a circular buffer whose division point has to be read on every element index, the penalties are going to add up.
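
To make the per-element cost concrete, here is a small sketch of what indexing into a circular buffer involves. `RingView` is a hypothetical type written for illustration, not something in iopipe; the point is only that every `opIndex` call has to read the division point and branch, where a plain array index does not.

```d
// Hypothetical random-access view over a circular buffer.
struct RingView
{
    ubyte[] storage; // the underlying fixed-size array
    size_t start;    // physical index of logical element 0
    size_t length;   // number of logical elements

    ubyte opIndex(size_t i) const
    {
        assert(i < length);
        size_t pos = start + i;
        // This wrap-around test runs on every element access.
        if (pos >= storage.length)
            pos -= storage.length;
        return storage[pos];
    }
}
```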

The trade-offs might still be worth it. For instance if your focused data is a larger percentage of the total buffer (like 70%), moving it to the front of the buffer is going to hurt performance. I don't know whether it would overcome slower access per element. The good news is, I can implement it, and see how it fares, since the higher level code is abstracted from the buffer type.
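
For reference, the "move the focused data to the front" step can be sketched as below. `FlatBuffer` and its field names are hypothetical and the real buffer.d logic may differ; the sketch just shows that the cost of the memmove is proportional to the live bytes, not to the size of the whole buffer.

```d
import core.stdc.string : memmove;

// Hypothetical flat buffer with a released prefix and a live region.
struct FlatBuffer
{
    ubyte[] storage;
    size_t released; // bytes before this index are no longer needed
    size_t valid;    // bytes in released .. valid are still live

    // Slide the live bytes to the front; returns how many bytes moved.
    size_t compact()
    {
        immutable live = valid - released;
        if (released > 0 && live > 0)
            memmove(storage.ptr, storage.ptr + released, live);
        released = 0;
        valid = live;
        return live;
    }
}
```

With a 16-byte buffer where only the last 2 bytes are live, `compact` moves 2 bytes; with 70% of the buffer live, it would move most of it.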

And of course, any existing (non-infinite) random-access range can be hooked as a non-extendable iopipe (see how arrays are hooked).
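
As an illustration of what a non-extendable pipe over an array might look like, here is a hedged sketch. It assumes the required methods are `window`, `extend`, and `release` (only `window` and `extend` appear in this thread; `release` is an assumption), and `ArrayPipe` is a hypothetical wrapper, not the way iopipe actually hooks arrays.

```d
// Hypothetical wrapper; iopipe's real array support may differ.
struct ArrayPipe(T)
{
    private T[] data;

    // The whole remaining array is always the current window.
    T[] window() { return data; }

    // Nothing more can ever arrive, so extending adds 0 elements.
    size_t extend(size_t elements) { return 0; }

    // Drop the first `elements` items from the front of the window.
    void release(size_t elements) { data = data[elements .. $]; }
}
```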


Thanks for all your thoughts on this, Martin!

-Steve
