Oh, my mistake then, sorry :-) I missed the fact that you're using a GZIPInputStream and not "just the file", all good then - InputStreamSource should be good for that use case :-)
-- Cheers, Konrad 'ktoso’ Malawski Akka @ Typesafe On 23 June 2015 at 17:30:16, BigAl (albert.pastr...@intenthq.com) wrote: @Konrad, not sure if we understood properly SynchronousFileSource but we can't find a way of making it work with a gzipped input file. @drewhk, thanks a lot, I will update the blog post with the information and the link you've provided! On Tuesday, 23 June 2015 17:14:41 UTC+2, drewhk wrote: And one more thing... (Columbo style ;)) There are other ways to exploit paralellisation other than mapAsyncUnordered. This section in the docs explains the most important patterns in detail: http://doc.akka.io/docs/akka-stream-and-http-experimental/1.0-RC3/scala/stream-parallelism.html -Endre On Tue, Jun 23, 2015 at 5:11 PM, Endre Varga <endre...@typesafe.com> wrote: Btw, in the blog you mention " you can use mapConcat instead (note that it works only onimmutable.Seq[T]).". This is no longer true after RC4, since you will be able to emit an immutable.Iterable. In fact you can emit an infinite sequence if that is what you want :) -Endre On Tue, Jun 23, 2015 at 3:31 PM, BigAl <albert....@intenthq.com> wrote: Thanks a lot! Will use the InputStreamSource for sure, looking forward for the RC4 :-) On Tuesday, 23 June 2015 15:08:25 UTC+2, drewhk wrote: Hi, On Tue, Jun 23, 2015 at 12:29 PM, BigAl <albert....@intenthq.com> wrote: Hi all, In my company we've been running some tests to evaluate if Akka Streams would be useful for us to process both the Wikidata and Wikipedia dumps. I have to say that both PoC were very satisfactory and we found Akka Streams a really nice framework to play with :-). I think what you do is actually a sweet spot for Akka Streams, these kind of use-cases match well with the underlying assumptions. After that, we wanted to share our findings with the rest of the world :) so we've created a small repo that contains part of the PoC, we have structured in a way that can be explain in a single (and simple) blog post: Github repo: https://github.com/intenthq/wikidata-akka-streams Blog post: http://engineering.intenthq.com/2015/06/wikidata-akka-streams/ We would really appreciate any comments about it. There is an InputStreamSource available, so you can use it directly where you read the file: http://doc.akka.io/api/akka-stream-and-http-experimental/1.0-RC3/#akka.stream.io.InputStreamSource$ The above does not do line parsing, but the upcoming RC4 contains a simple line parsing (or other delimiter based parsing) stage: https://github.com/akka/akka/blob/release-2.3-dev/akka-stream/src/main/scala/akka/stream/io/Framing.scala#L53 You can try that combination after RC4 comes out. -Endre Thanks, BigAl -- >>>>>>>>>> Read the docs: http://akka.io/docs/ >>>>>>>>>> Check the FAQ: >>>>>>>>>> http://doc.akka.io/docs/akka/current/additional/faq.html >>>>>>>>>> Search the archives: https://groups.google.com/group/akka-user --- You received this message because you are subscribed to the Google Groups "Akka User List" group. To unsubscribe from this group and stop receiving emails from it, send an email to akka-user+...@googlegroups.com. To post to this group, send email to akka...@googlegroups.com. Visit this group at http://groups.google.com/group/akka-user. For more options, visit https://groups.google.com/d/optout. -- >>>>>>>>>> Read the docs: http://akka.io/docs/ >>>>>>>>>> Check the FAQ: >>>>>>>>>> http://doc.akka.io/docs/akka/current/additional/faq.html >>>>>>>>>> Search the archives: https://groups.google.com/group/akka-user --- You received this message because you are subscribed to the Google Groups "Akka User List" group. To unsubscribe from this group and stop receiving emails from it, send an email to akka-user+...@googlegroups.com. To post to this group, send email to akka...@googlegroups.com. Visit this group at http://groups.google.com/group/akka-user. For more options, visit https://groups.google.com/d/optout. -- >>>>>>>>>> Read the docs: http://akka.io/docs/ >>>>>>>>>> Check the FAQ: >>>>>>>>>> http://doc.akka.io/docs/akka/current/additional/faq.html >>>>>>>>>> Search the archives: https://groups.google.com/group/akka-user --- You received this message because you are subscribed to the Google Groups "Akka User List" group. To unsubscribe from this group and stop receiving emails from it, send an email to akka-user+unsubscr...@googlegroups.com. To post to this group, send email to akka-user@googlegroups.com. Visit this group at http://groups.google.com/group/akka-user. For more options, visit https://groups.google.com/d/optout. -- >>>>>>>>>> Read the docs: http://akka.io/docs/ >>>>>>>>>> Check the FAQ: >>>>>>>>>> http://doc.akka.io/docs/akka/current/additional/faq.html >>>>>>>>>> Search the archives: https://groups.google.com/group/akka-user --- You received this message because you are subscribed to the Google Groups "Akka User List" group. To unsubscribe from this group and stop receiving emails from it, send an email to akka-user+unsubscr...@googlegroups.com. To post to this group, send email to akka-user@googlegroups.com. Visit this group at http://groups.google.com/group/akka-user. For more options, visit https://groups.google.com/d/optout.