After completely losing the database on Saturday, I started anew. I am now facing almost EXACTLY the same issue as described here:
http://groups.google.com/group/scalr-discuss/browse_thread/thread/b1cd2d259581db0a/7110e1d4ed56ec65
except that my db hangs completely.

To me it seems almost clear that the data bundle runs into trouble when there is database activity. After restarting, I left the db idle or mostly doing reads. When I resumed populating the db, the problem surfaced again.

I say again that this is a real issue. As far as I am concerned, I cannot rely on this solution.

When is the new release coming out?

Arie.

On Jan 24, 6:48 pm, Arie Fishler <[email protected]> wrote:
> also...I just noticed, and this is the worst case now...the new master
> extracted a snapshot successfully. All db files seem to be in place....db
> even shows me the list of tables and still any select action on the tables
> returns with an error that THE TABLE DOES NOT EXIST.
>
> I don't really know what the status of the database is and whether it is
> totally lost at this point. slave behaves exactly the same.
>
> On Sat, Jan 24, 2009 at 6:29 PM, Arie Fishler <[email protected]> wrote:
> > not exactly. the slave was 100% stuck in "not being the master". the second
> > slave that started did not succeed in starting (several started and failed).
> > I tried to shut down mysql on the slave before executing the script -> failed
> > tried executing the script -> failed (it also tried to stop mysql)
> > only then rebooted.
> > mysql did not start, saying it executed slave2master on a non slave.
> > Probably the reboot did initiate the process to become a master but the fact
> > I ran it manually confused it.
> > I was left with no master up....then terminated all instances and scalr
> > automatically started instances right.
> > Naturally there was no db during this period.
> >
> > I will keep following.
> >
> > On Sat, Jan 24, 2009 at 6:14 PM, Nickolas Toursky <[email protected]> wrote:
> >
> >> Slave should promote itself to master automatically. If it has
> >> failed, the only way to do it is to execute the
> >> /usr/local/aws/bin/mysql-slave2master.sh script in the shell.
> >> As I can see, you have rebooted the master instance before it was
> >> initialized - this has broken this instance and did not allow slave to
> >> initialize properly.
> >
> >> On Jan 24, 10:19 am, afishler <[email protected]> wrote:
> >> > Hi,
> >> >
> >> > Currently after losing the master again, the slave is not promoted to
> >> > be a master. How do I do it manually?
> >> >
> >> > New instance that is starting does not seem to survive and keeps
> >> > starting over.
> >> >
> >> > This is real frustrating. I really don't think this is an issue of
> >> > occasional disk issues on AWS. Master dying happens around twice a
> >> > day....too frequent to be considered a "normal" problem
> >> >
> >> > Here is the log of the new slave failing to start
> >> > 24-01-2009 01:46:56 ERROR i-f9800290/instance-up.sh /usr/local/aws/bin/mysql-init.sh failed. Exiting.
> >> > 24-01-2009 01:46:55 INFO i-f9800290/mysql-init.sh Traceback (most recent call last):
> >> >   File "/usr/bin/s3cmd", line 415, in
> >> >     error("S3 error: " + str(e))
> >> >   File "/usr/lib/python2.5/site-packages/S3/S3.py", line 41, in __str__
> >> >     retval += (": %s" % self.info["Code"])
> >> > KeyError: 'Code'. Retrying.
> >> > 24-01-2009 01:46:55 ERROR i-f9800290/mysql-init.sh Could not fetch MySQL data snapshot using index s3://farm-1173-918348349691/farm-mysql/mysql-snapshot.tar.
> >> > 24-01-2009 01:46:55 ERROR i-f9800290/mysql-init.sh Failed to fetch 's3://farm-1173-918348349691/farm-mysql/mysql-snapshot.tar' to '/mnt/mysql-misc/tmp.TySmbs2490/mysql-snapshot.tar' for 4 tries.
> >> > 24-01-2009 01:46:24 INFO i-f9800290/mysql-init.sh Traceback (most recent call last):
> >> >   File "/usr/bin/s3cmd", line 415, in
> >> >     error("S3 error: " + str(e))
> >> >   File "/usr/lib/python2.5/site-packages/S3/S3.py", line 41, in __str__
> >> >     retval += (": %s" % self.info["Code"])
> >> > KeyError: 'Code'. Retrying.
> >> > 24-01-2009 01:46:04 INFO i-f9800290/mysql-init.sh Traceback (most recent call last):
> >> >   File "/usr/bin/s3cmd", line 415, in
> >> >     error("S3 error: " + str(e))
> >> >   File "/usr/lib/python2.5/site-packages/S3/S3.py", line 41, in __str__
> >> >     retval += (": %s" % self.info["Code"])
> >> > KeyError: 'Code'. Retrying.
> >> > 24-01-2009 01:45:53 INFO i-f9800290/mysql-init.sh Traceback (most recent call last):
> >> >   File "/usr/bin/s3cmd", line 415, in
> >> >     error("S3 error: " + str(e))
> >> >   File "/usr/lib/python2.5/site-packages/S3/S3.py", line 41, in __str__
> >> >     retval += (": %s" % self.info["Code"])
> >> > KeyError: 'Code'. Retrying.
> >> > 24-01-2009 01:45:53 INFO i-f9800290/mysql-init.sh Trying to fetch previous MySQL snapshot from s3://farm-1173-918348349691/farm-mysql/mysql-snapshot.tar (10527569920 bytes).
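
A note on the traceback in the quoted log: the KeyError: 'Code' is raised while the S3 error is being turned into a string, so the log never shows what S3 actually returned. The sketch below is only an illustration of that pattern and one defensive way around it. The class S3Error, its constructor arguments and the helper describe() are hypothetical stand-ins written for this example; this is not the real s3cmd source, and it says nothing about why the snapshot fetch failed in the first place.

```python
class S3Error(Exception):
    """Hypothetical stand-in for the error class seen in the traceback."""

    def __init__(self, status, reason, info=None):
        super().__init__(status, reason)
        self.status = status
        self.reason = reason
        # Fields parsed from the error response body. Assumption: this can
        # be empty when the response carried no "Code" element.
        self.info = info or {}

    def __str__(self):
        retval = "%d (%s)" % (self.status, self.reason)
        # The logged line indexes self.info["Code"] directly, which raises
        # KeyError when the key is missing. Using .get() keeps the HTTP
        # status and reason visible instead of masking them.
        code = self.info.get("Code")
        if code is not None:
            retval += ": %s" % code
        return retval


def describe(status, reason, info=None):
    """Format an error the way a caller doing str(e) would see it."""
    return str(S3Error(status, reason, info))
```

With formatting like this, a failed fetch would at least log the HTTP status and reason, which may help tell a transient S3 problem apart from a missing or damaged snapshot.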
