Re: [akka-user] [akka-stream-experimental-1.0] How to reuse akka.stream.scaladsl.Tcp connections?

Simon Schäfer Tue, 25 Aug 2015 17:03:31 -0700

Hi Johannes,

On 08/25/2015 05:25 PM, Johannes Rudolph wrote:

Hi Simon,
I think there are two conceptual difficulties you need to tackle:
The first is the problem which you describe with infinite / finite streams which is actually more one of the "traditional" (= actor-based) push-style asynchronous programming versus the "new" [*] pull-style of reactive/akka streams which was introduced to deal with backpressure. The issue with backpressure is that it only works if all components take part in it. If you have one component that opts out of backpressure it will have to fail or drop elements if it becomes overloaded, and this component will become the weakest link (or the "Sollbruchstelle", the predetermined breaking point) of your application under load. Akka currently supports `Source.actorRef` (and `Sink.actorRef` respectively) which does exactly this translation from a push-style Actor API to the pull-style streams API. You usually don't want to use them as they will be limited by design and bound to fail under load.
Pull-style means that you need to write your program so that it is completely passive and waits for demand (you could also call that style "reactive": you set up your program to passively wait for a source to provide elements and then react to them). Writing "passive" programs is perfectly suited to services that follow the request / response principle. You set up your handler as a flow and just put it between the Source[Request] / Sink[Response].
But what does it mean for a client program which usually actively tries to achieve something? I think you can also build such a program in a passive style: if it doesn't take any dynamic input it is easy as you can create all the sources and sinks from static data. If it does take dynamic input (like user input), you just need a Source of that user input that only prompts the user for more input if there's demand. It should be possible to structure a program like this but it will be a pervasive change that cannot be recommended in all cases.
So, in reality, for client applications you will probably use things like the brittle `Source.actorRef` and just statically configure the size of the buffers and queues to be big enough for the common use cases. (You could say that `Source.actorRef` is no more brittle than the JVM itself, which you also need to configure with a maximum heap size.) In any case, using streams will force you to think about these kinds of issues.
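
To make the "fail or drop under overload" point concrete, here is a toy model of a push-to-pull bridge in plain Scala. It is only a sketch of the idea, not Akka's implementation; the class `PushBridge` and all of its members are hypothetical names invented for illustration, and it always drops on overflow where a real bridge might fail instead.

```scala
import scala.collection.mutable

// Toy model (illustrative names only): upstream pushes whenever it likes,
// downstream pulls by signalling demand, and a bounded buffer sits in between.
final class PushBridge[A](bufferSize: Int) {
  private val buffer = mutable.Queue.empty[A]
  private var demand = 0L
  val delivered = mutable.ArrayBuffer.empty[A]
  var dropped = 0

  /** Downstream signals that it can take n more elements. */
  def request(n: Long): Unit = {
    demand += n
    // Serve buffered elements first, as long as there is demand.
    while (demand > 0 && buffer.nonEmpty) {
      delivered += buffer.dequeue()
      demand -= 1
    }
  }

  /** Upstream pushes an element without asking first. */
  def push(elem: A): Unit =
    if (demand > 0) { delivered += elem; demand -= 1 } // pass straight through
    else if (buffer.size < bufferSize) buffer.enqueue(elem) // park it
    else dropped += 1 // no demand and no room: this toy model drops
}
```

In this model a bridge created with `new PushBridge[Int](0)` loses every element pushed at a moment when no demand is outstanding, while a buffer of size 1 is enough as long as the producer never gets more than one element ahead of the consumer.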

Although I had read about the concepts, I was not really aware of them. Thanks for the clarification on that area.

The second difficulty is a shortcoming in your description (IMO) regarding your notion of "reusing a connection" that is also uncovered by your use of streams. Look at what this line means:
val resp = Source(byteString).via(tcpFlow).runFold(ByteString.empty)(_ ++ _)
It says, "open a TCP connection, stream the source byteString to the connection, read all data *until the connection is closed by the other side* and return this data". So, the end of the response is determined by looking for the end of the TCP stream. To be able to reuse a connection you will need a different end-of-response marker than the signal that the TCP connection has been closed. You will need some framing protocol on top of TCP that can discern where one response ends and the next one starts and implement a streaming parser for that. You would start by implementing a
def requestRenderer: Flow[Request, ByteString]

and a

def responseParser: Flow[ByteString, Response]

Between those you can put the tcp connection:
def pipeline: Flow[Request, Response] = Flow[Request].via(requestRenderer).via(Tcp.outgoingConnection).via(responseParser)
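
As an illustration of what such a framing protocol and streaming parser could look like, here is a minimal sketch in plain Scala without any Akka types. The wire format is an assumption made up for this example (a 4-byte big-endian length followed by the payload), and `FrameDecoder` is a hypothetical name; a real `responseParser` would wrap comparable logic in a stream stage.

```scala
// Assumed framing for this sketch: 4-byte big-endian length, then the payload.
// The decoder accepts arbitrary chunks, as they might arrive from a TCP
// connection, and emits only complete payloads.
final class FrameDecoder {
  private var buffer = Vector.empty[Byte]

  /** Appends a chunk and returns every frame it completes, in order. */
  def feed(chunk: Array[Byte]): Vector[Array[Byte]] = {
    buffer = buffer ++ chunk
    var frames = Vector.empty[Array[Byte]]
    var done = false
    while (!done) {
      if (buffer.length < 4) done = true // length header not complete yet
      else {
        val len = buffer.take(4).foldLeft(0)((acc, b) => (acc << 8) | (b & 0xff))
        require(len >= 0, "negative frame length")
        if (buffer.length < 4 + len) done = true // payload not complete yet
        else {
          frames = frames :+ buffer.slice(4, 4 + len).toArray
          buffer = buffer.drop(4 + len)
        }
      }
    }
    frames
  }
}

object FrameDecoder {
  /** Renders one payload as a length-prefixed frame. */
  def encode(payload: Array[Byte]): Array[Byte] = {
    val n = payload.length
    Array((n >>> 24).toByte, (n >>> 16).toByte, (n >>> 8).toByte, n.toByte) ++ payload
  }
}
```

The point of the sketch is that the decoder never relies on the connection closing: a chunk may end in the middle of a frame or contain several frames, and the length header alone decides where one message stops and the next one starts.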
Now you still have the problem of how to interface with that Flow. (And maybe that is what your whole question is about.) If you can structure your program as hinted above, then you could create a
// prompts user for more input
def userInput: Source[UserInput]

and a

def userInputParser: Flow[UserInput, Request]

and a

def output: Sink[Response]

so you could finally create and run your program as

userInput.via(userInputParser).via(pipeline).to(output).run()
(If you are into functional programming, that may actually be very similar to how you would have structured your program in any case).
For the rest of us, it would be nice if we could wrap the `pipeline` above with something to either get a function `Request => Future[Response]` or an ActorRef to which requests could be sent and which would send back a message after the response was received. Unfortunately, The Right Solution (TM) for that case is still missing. It would be nice if there was a one-to-one Flow wrapper in akka-stream that would do exactly this translation, but there currently isn't one readily available. You can build such a component yourself (Mathias actually built a specific solution for akka-http to implement `Http.singleRequest()` which has exactly the same problem).
So, how can you build something like that? Here is a sketch:

class RequestResponseActor extends Actor {
  // run() should return the ActorRef for the Source.actorRef
  val pipelineActor =
    Source.actorRef[Request].via(pipeline).to(Sink.actorRef(self)).run()

  def receive = {
    case req: Request =>
      // put request and sender ref at the end of a FIFO data structure
      register(req, sender)
      pipelineActor ! req
    case res: Response =>
      // gets original request and sender from the head of the FIFO data structure
      val (req, originalSender) = unregister()
      // reply with the response (not the request) to whoever sent the request
      originalSender ! res
    // what happens on error? what on premature closing of the connection? etc.
  }
}
All of this is based on the premise that your framing protocol and the semantics of the service you are talking to are using a request/response style (like HTTP with HTTP pipelining enabled) where requests are answered with responses in a FIFO manner. Also, in the sketch I skimmed over a lot of configuration and subtle semantic details you may have to consider (this is another reason there's no such shrink-wrapped component in akka-stream).
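
The FIFO bookkeeping behind `register` / `unregister` can also be sketched without actors, which gives the `Request => Future[Response]` shape mentioned earlier. This is a minimal illustration in plain Scala under the same FIFO premise; `FifoCorrelator` and the two case classes are hypothetical stand-ins, not part of akka-stream.

```scala
import scala.collection.mutable
import scala.concurrent.{Future, Promise}

// Hypothetical stand-ins for the protocol's message types.
final case class Request(body: String)
final case class Response(body: String)

// Correlates responses with requests purely by arrival order. Not thread-safe:
// like actor state, it is meant to be touched from one thread at a time.
final class FifoCorrelator {
  private val pending = mutable.Queue.empty[(Request, Promise[Response])]

  /** Call when a request is written to the connection. */
  def register(req: Request): Future[Response] = {
    val promise = Promise[Response]()
    pending.enqueue((req, promise))
    promise.future
  }

  /** Call for each parsed response; completes the oldest pending request. */
  def complete(res: Response): Request = {
    if (pending.isEmpty)
      throw new IllegalStateException("response without a pending request")
    val (req, promise) = pending.dequeue()
    promise.success(res)
    req
  }

  /** Call when the connection fails or closes early; fails everything pending. */
  def failAll(cause: Throwable): Unit =
    while (pending.nonEmpty) pending.dequeue()._2.failure(cause)
}
```

`failAll` is one possible answer to the "what happens on error?" question in the sketch: every caller still waiting gets a failed Future instead of waiting forever. Whether that is the right policy depends on the service being talked to.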
Does that answer most of your question?

Yes, this was tremendously useful. You described exactly my use case and a way to achieve what I want. Big thank you!

For now it solved all of my problems. Actually it was more an answer to my other question: https://groups.google.com/forum/#!topic/akka-user/GviQjB08rS0

Not creating multiple connections also solved my problem that some data was lost, even though I don't understand where the problem was. I guess data was sent to the wrong connection or something like this.

There are still a few open questions, but nothing that blocks me, so I will look into them when they turn into problems. With the knowledge of how to feed the streams, how to get the values out of them, and how to connect the subcomponents, I should be less helpless.


The only thing I didn't understand was this part:

  Source.actorRef(1, OverflowStrategy.fail)

When I replace the 1 with a 0 (which is allowed according to the documentation) I get this error message: Dropping element because there is no downstream demand

Why do I get it? I expected that, since there is a client which waits for data, I wouldn't need a buffer. Maybe it is because the client (connected over TCP) is not a reactive stream, i.e. it didn't signal beforehand that it awaits data? Anyway, 1 seems to be enough, even for 100s of requests.


Thanks again for all your help!

Simon

This became quite a long answer but it also covers a lot of stuff :)

HTH
Johannes
[*] Of course, there's not too much conceptually new here. E.g. UNIX shell pipes and filters are very similar to the whole reactive streams concept (but constrained to byte streams): you have a buffer that can be asynchronously written to from one side and read from on the other side. The reader must poll if no data is currently available while the writer must poll while the buffer is full. Demand is signalled over the capacity of the shared buffer. Similar for TCP where demand is exchanged by notifying the peer of the capacity of the receive buffer. Etc.
--
>>>>>>>>>> Read the docs: http://akka.io/docs/
>>>>>>>>>> Check the FAQ: http://doc.akka.io/docs/akka/current/additional/faq.html
>>>>>>>>>> Search the archives: https://groups.google.com/group/akka-user
---
You received this message because you are subscribed to a topic in the Google Groups "Akka User List" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/akka-user/qpqWePkADwU/unsubscribe.
To unsubscribe from this group and all its topics, send an email to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/akka-user.
For more options, visit https://groups.google.com/d/optout.

