Hello,

Here is what my use case looks like:

*Use Case*

- Given many log files, ranging from 2 MB to 2 GB each, I need to parse each
log, apply some processing, and generate Java POJOs.
- For this problem, let's assume that we have just one log file.
- The goal is to make the best use of the system. Multiple cores are
available.

*Alternative 1*
- Open file (synchronous), read each line, generate POJOs

FileActor -> read each line -> List<POJO>  

*Pros*: Simple to understand
*Cons*: Serial processing; does not take advantage of the multiple cores in the system
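
To make Alternative 1 concrete, here is a minimal sketch in plain Java, without any actors. The names SerialLogParser and LogEntry, and the "<LEVEL> <message>" line format, are placeholders made up for illustration; the real POJO and parsing rules would differ.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

final class SerialLogParser {

    // Hypothetical POJO; the real fields depend on the actual log format.
    static final class LogEntry {
        final String level;
        final String message;

        LogEntry(String level, String message) {
            this.level = level;
            this.message = message;
        }
    }

    // Assumed line format: "<LEVEL> <message>". Replace with the real parsing rules.
    static LogEntry parseLine(String line) {
        int space = line.indexOf(' ');
        if (space < 0) {
            return new LogEntry(line, "");
        }
        return new LogEntry(line.substring(0, space), line.substring(space + 1));
    }

    // Reads the file one line at a time and parses each line on the calling thread.
    static List<LogEntry> parseFile(Path file) throws IOException {
        List<LogEntry> entries = new ArrayList<>();
        try (BufferedReader reader = Files.newBufferedReader(file, StandardCharsets.UTF_8)) {
            String line;
            while ((line = reader.readLine()) != null) {
                entries.add(parseLine(line));
            }
        }
        return entries;
    }
}
```

Everything here runs on one thread, which is exactly the limitation listed under Cons.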

*Alternative 2*
- Open file (synchronous), read N lines at a time (N is configurable), and pass
each batch on to different actors for processing

                                                    / LogLineProcessActor 1
FileActor -> LogLineProcessRouter (with 10 Actors) -- LogLineProcessActor 2
                                                    \ LogLineProcessActor 10

*Pros*: Some parallelization, since different actors each process a batch of
lines. The actors should make use of the available cores in the system
(though I am not sure how).
*Cons*: Still serial at the read stage, because the file is read sequentially
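
As a rough sketch of the shape of Alternative 2, here is the same "read N lines, hand the batch to a worker" idea using a plain java.util.concurrent thread pool in place of the actor router. This is not Akka code; it only illustrates the flow. BatchedLogParser, LogEntry, the batchSize and workers parameters, and the "<LEVEL> <message>" line format are all hypothetical.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

final class BatchedLogParser {

    // Hypothetical POJO; the real fields depend on the actual log format.
    static final class LogEntry {
        final String level;
        final String message;

        LogEntry(String level, String message) {
            this.level = level;
            this.message = message;
        }
    }

    // Assumed line format: "<LEVEL> <message>". Replace with the real parsing rules.
    static LogEntry parseLine(String line) {
        int space = line.indexOf(' ');
        if (space < 0) {
            return new LogEntry(line, "");
        }
        return new LogEntry(line.substring(0, space), line.substring(space + 1));
    }

    // Work done by one worker: parse a whole batch of lines.
    static List<LogEntry> parseBatch(List<String> lines) {
        List<LogEntry> out = new ArrayList<>(lines.size());
        for (String line : lines) {
            out.add(parseLine(line));
        }
        return out;
    }

    // One thread reads the file sequentially; batches of lines are parsed on the pool.
    static List<LogEntry> parseFile(Path file, int batchSize, int workers)
            throws IOException, InterruptedException, ExecutionException {
        ExecutorService pool = Executors.newFixedThreadPool(workers);
        try {
            List<Future<List<LogEntry>>> pending = new ArrayList<>();
            try (BufferedReader reader = Files.newBufferedReader(file, StandardCharsets.UTF_8)) {
                List<String> batch = new ArrayList<>(batchSize);
                String line;
                while ((line = reader.readLine()) != null) {
                    batch.add(line);
                    if (batch.size() == batchSize) {
                        final List<String> full = batch;
                        pending.add(pool.submit(() -> parseBatch(full)));
                        batch = new ArrayList<>(batchSize);
                    }
                }
                if (!batch.isEmpty()) {
                    final List<String> rest = batch;
                    pending.add(pool.submit(() -> parseBatch(rest)));
                }
            }
            // Futures are collected in submission order, so file order is preserved.
            List<LogEntry> entries = new ArrayList<>();
            for (Future<List<LogEntry>> future : pending) {
                entries.addAll(future.get());
            }
            return entries;
        } finally {
            pool.shutdown();
        }
    }
}
```

A call such as BatchedLogParser.parseFile(path, 1000, 10) would mirror the ten workers in the diagram. Note that this sketch keeps every parsed entry in memory, which may not be acceptable for a 2 GB file.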

*Questions*
- Is either of the above a good choice?
- Are there better alternatives?

Any thoughts would be appreciated.

Thanks a lot
