Hello
This is what my use case looks like
*Use Case*
- Given many log files in range (2MB - 2GB), I need to parse each of these
logs and apply some processing, generate Java POJO.
- For this problem, lets assume that we have just 1 log file
- Also, the idea is to making best use of System. Multiple cores are
available.
*Alternative 1*
- Open file (synchronous), read each line, generate POJOs
FileActor -> read each line -> List<POJO>
*Pros*: simple to understand
*Cons*: Serial Process, not taking advantage of multiple cores in the system
*Alternative 2*
- Open File (synchronous), read N lines (N is configurable), pass on to
different actors to process
/ LogLineProcessActor
1FileActor -> LogLineProcessRouter (with 10 Actors) -- LogLineProcessActor 2
\ LogLineProcessActor 10
*Pros* Some parallelization, by using different actors to process part of
lines. Actors will make use of available cores in the system (? how, may
be?)
*Cons* Still Serial, because file read in serial fashion
*Questions*
- is any of the above choice a good choice?
- Are there better alternatives?
Please provide valuable thoughts here
Thanks a lot
--
>>>>>>>>>> Read the docs: http://akka.io/docs/
>>>>>>>>>> Check the FAQ:
>>>>>>>>>> http://doc.akka.io/docs/akka/current/additional/faq.html
>>>>>>>>>> Search the archives: https://groups.google.com/group/akka-user
---
You received this message because you are subscribed to the Google Groups "Akka
User List" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/akka-user.
For more options, visit https://groups.google.com/d/optout.