Reporting ETLs are taking too long on RDS set-up
------------------------------------------------

                 Key: MIFOSBI-306
                 URL: http://mifosforge.jira.com/browse/MIFOSBI-306
             Project: mifos business intelligence
          Issue Type: Bug
          Components: Data Warehouse
            Reporter: emilytucker
            Priority: Major


From John W:
> I just want to highlight an issue with running ETL against the 'RDS' 
> implementations of mysql
> 
> Basically the jobs take 4 times as long as opposed to when the ETL is 
> running on the same machine as mysql because masses of data gets moved 
> from mysql to java box (if mifos is chatty then ETL hogs the whole 
> conversation)
> 
> e.g. full ETL for Keef data on my laptop or the dev sandbox takes 
> about 1.25 hrs but took nearly 5 hours against the RDS in test keef 
> today

Also note that the SECDEP ETL takes about 9 hours.

John asks: Is there any way to get the ETL running on the same box as RDS?

AdamM responds:
No, because RDS instances only provide the MySQL service. They may just be EC2
instances under the hood, but we don't have low-level (i.e., shell) access to
those machines.

Of course we can go back to running MySQL ourselves on EC2 instances, but I'm 
not keen on that since we'd have to once again shoulder that admin burden, and 
with fewer staff.

Some ideas off the top of my head:
* implement incremental ETL (or is this in progress? I forget)
* test, then change back to the setup where ETL reads from the read
  replica and writes to the master (comes back to understanding why
  ETL is slow w/RDS, though; also, replication appears to have been
  working fine for the last week)
* optimize the MySQL server on RDS
  * ask Oracle/MySQL support for help (we're paying for it, and Todd
    Farmer is still around)
* move to a larger size RDS instance
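
To make the first idea concrete, here is a minimal sketch of what an
incremental (watermark-based) extract could look like. It is an illustration
only, not the actual Mifos ETL: the table names (source_txn, dw_txn), the
updated_at column, and the function names are all hypothetical, and it uses
Python's built-in sqlite3 as a stand-in for MySQL so that it runs anywhere.
The point is that each run pulls only rows changed since the previous run,
so far less data would cross the network between RDS and the ETL box.

```python
import sqlite3


def incremental_extract(conn, last_watermark):
    """Fetch only rows changed after last_watermark (hypothetical schema).

    Note: a strict '>' comparison can miss rows that share the watermark
    timestamp; a real job would need to handle that case.
    """
    rows = conn.execute(
        "SELECT id, amount, updated_at FROM source_txn "
        "WHERE updated_at > ? ORDER BY updated_at",
        (last_watermark,),
    ).fetchall()
    # The new watermark is the latest change seen, or unchanged if no rows.
    new_watermark = rows[-1][2] if rows else last_watermark
    return rows, new_watermark


def load(conn, rows):
    """Upsert the extracted rows into the (hypothetical) warehouse table."""
    conn.executemany(
        "INSERT OR REPLACE INTO dw_txn (id, amount, updated_at) "
        "VALUES (?, ?, ?)",
        rows,
    )
    conn.commit()


def demo():
    """Run two ETL passes against an in-memory database."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE source_txn (
            id INTEGER PRIMARY KEY, amount REAL, updated_at TEXT);
        CREATE TABLE dw_txn (
            id INTEGER PRIMARY KEY, amount REAL, updated_at TEXT);
        """
    )
    conn.executemany(
        "INSERT INTO source_txn VALUES (?, ?, ?)",
        [
            (1, 10.0, "2011-05-01 00:00:00"),
            (2, 20.0, "2011-05-02 00:00:00"),
        ],
    )

    # First pass: everything is newer than the initial watermark.
    rows, watermark = incremental_extract(conn, "1970-01-01 00:00:00")
    load(conn, rows)
    first_pass = len(rows)

    # Source changes: one row is updated and one is added.
    conn.execute(
        "UPDATE source_txn SET amount = 25.0, "
        "updated_at = '2011-05-03 00:00:00' WHERE id = 2"
    )
    conn.execute(
        "INSERT INTO source_txn VALUES (3, 30.0, '2011-05-04 00:00:00')"
    )

    # Second pass: only the changed and new rows are moved.
    rows, watermark = incremental_extract(conn, watermark)
    load(conn, rows)
    second_pass = len(rows)

    total = conn.execute("SELECT COUNT(*) FROM dw_txn").fetchone()[0]
    return first_pass, second_pass, total, watermark


if __name__ == "__main__":
    print(demo())
```

In this sketch the second pass moves two rows rather than re-reading the
whole source table. Whether the real source tables carry a reliable
last-modified column is an assumption that would need checking first.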


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
