RE: RMAN recovery
Dennis, I have been admiring you from far. Thank you so much to reply to my email. The renamed data file has been tested and there is no corruption of any kind. The nightly physical and logical backups were successful completed with no error. However, I got an ora-19502 error when I tried to use these backups to restore/duplicate the database from a remote node. It might be asynch io problem. I am trying to set the 'fileperset to 1'. Do you have any other ideas? Thanks again, Anne -Original Message- Sent: Thursday, August 07, 2003 12:25 PM To: Multiple recipients of list ORACLE-L Anne What version of Oracle is this? Okay, you renamed a production database file 10 days ago. Since then, has Oracle been able to use this file? Can you export the table that is stored on this file without error? Have you examined your RMAN backup log to ensure this file is specifically listed as being backed up? Is it possible that the error you are receiving has nothing to do with the production database, but is entirely due to your backup or test database? In other words, maybe the test system has a bad drive? Another possibility, awhile back on this list several people reported that they had datafiles with errors, but RMAN did not detect these errors when it was backing them up. That is why I suggest exporting the table. Dennis Williams DBA, 80%OCP, 100% DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 07, 2003 9:39 AM To: Multiple recipients of list ORACLE-L Dear List, I have renamed a datafile in a production database ten days ago. I have no error to back up this database but I cannot duplicate/recover this database since. I am getting ora-19502 write error on this file. Please advise. Many thanks, -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: Anne Yu INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN recovery
Anne What version of Oracle is this? Okay, you renamed a production database file 10 days ago. Since then, has Oracle been able to use this file? Can you export the table that is stored on this file without error? Have you examined your RMAN backup log to ensure this file is specifically listed as being backed up? Is it possible that the error you are receiving has nothing to do with the production database, but is entirely due to your backup or test database? In other words, maybe the test system has a bad drive? Another possibility, awhile back on this list several people reported that they had datafiles with errors, but RMAN did not detect these errors when it was backing them up. That is why I suggest exporting the table. Dennis Williams DBA, 80%OCP, 100% DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 07, 2003 9:39 AM To: Multiple recipients of list ORACLE-L Dear List, I have renamed a datafile in a production database ten days ago. I have no error to back up this database but I cannot duplicate/recover this database since. I am getting ora-19502 write error on this file. Please advise. Many thanks, -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN recovery
Anne Gee, things must appear better from Texas! Thanks for the comment. The only other thing I can suggest is to have your network administrator take a careful look at any connection settings. I'm sorry I can't be more specific, but we had a situation where the network/system administrator assumed RMAN would be opening just one or two connections and it turned out that RMAN was opening a whole bunch of connections. That can cause weird problems. I think you are at the point of opening a TAR with Metalink and hoping they will have some other suggestions. In our case, we had an NFS-mounted drive and they insisted we move off that. We did, the problem went away, so the sys admin got real interested in digging into his setup. Again, this is a long shot. Good luck. Dennis Williams DBA, 80%OCP, 100% DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 07, 2003 1:01 PM To: '[EMAIL PROTECTED]' Cc: DENNIS WILLIAMS Dennis, I have been admiring you from far. Thank you so much to reply to my email. The renamed data file has been tested and there is no corruption of any kind. The nightly physical and logical backups were successful completed with no error. However, I got an ora-19502 error when I tried to use these backups to restore/duplicate the database from a remote node. It might be asynch io problem. I am trying to set the 'fileperset to 1'. Do you have any other ideas? Thanks again, Anne -Original Message- Sent: Thursday, August 07, 2003 12:25 PM To: Multiple recipients of list ORACLE-L Anne What version of Oracle is this? Okay, you renamed a production database file 10 days ago. Since then, has Oracle been able to use this file? Can you export the table that is stored on this file without error? Have you examined your RMAN backup log to ensure this file is specifically listed as being backed up? Is it possible that the error you are receiving has nothing to do with the production database, but is entirely due to your backup or test database? In other words, maybe the test system has a bad drive? Another possibility, awhile back on this list several people reported that they had datafiles with errors, but RMAN did not detect these errors when it was backing them up. That is why I suggest exporting the table. Dennis Williams DBA, 80%OCP, 100% DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 07, 2003 9:39 AM To: Multiple recipients of list ORACLE-L Dear List, I have renamed a datafile in a production database ten days ago. I have no error to back up this database but I cannot duplicate/recover this database since. I am getting ora-19502 write error on this file. Please advise. Many thanks, -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN recovery
Title: Message Dear List, I have renamed a datafile in a production databaseten days ago. I have no error to back up this database but I cannotduplicate/recover this database since. I am getting ora-19502 write error on this file. Please advise. Many thanks,
RE: RMAN Recovery Question: Restore with loss of control files an
1. I have configured RMAN for controlfile autobackup, so that every DB backup produces a backup of the controlfiles and spfile as well. With these, I can easily recover from a loss of every single database file. I can't imagine why anyone wouldn't do it this way (reponses invited). 2. MetaLink states that the location of a backup file is embedded in the file itself, that it can't be used if it's moved to another location, and that this is a feature (yeah, right). On Unix, symlinks will do the trick; on Windoze, the suggested workaround is to back up the backup to another location that matches the one from which you want to use it. I ran into this when restoring a backup on another system that had a different drive layout; rather than backing up the backup, I just juggled drive letters. The lesson here is to ensure that your backup directories are configured and named consistently across all database servers. --- DENNIS WILLIAMS [EMAIL PROTECTED] wrote: Timothy I will tell you what I've figured out. Perhaps someone else can provide additional information. 1. Some people have reported success in extracting the control file from the RMAN backup. I got stuck on the issue that RMAN wouldn't function unless the target instance was up with STARTUP MOUNT, but I couldn't mount the database until I had a control file to mount. I finally gave up and took the expedient of doing a separate SQL BACKUP CONTROLFILE statement, using the RMAN backup directory. To perform a disaster recovery, I just move my copy of the control file into position first. 2. I have not been able to convince RMAN to use a different backup location. Fortunately with Unix it is easy to create a link that looks like the original location. Dennis Williams DBA, 80%OCP, 100% DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Monday, June 30, 2003 10:35 AM To: Multiple recipients of list ORACLE-L catalo database Hi there everyone, I have read the documentation and I am missing something, I am stuck on two points: (1) How do I recover from a RMAN backup to disk if I have lost all the original database files (control files, data files and redo logs) (2) How do I change the location for RMAN to check for the backup up file i.e. Backup file was on E: but now in F:, when I enter restore it only checks the original location. Specifications: Oracle 8.1.7.4.10 Windows 2000SP4 I know that in the backup part I can use the FORMAT string to specify the filename and location. Cheers Tim Clarkson -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). __ Do you Yahoo!? SBC Yahoo! DSL - Now only $29.95 per month! http://sbc.yahoo.com -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: Paul Baumgartel INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
Re: RMAN Recovery Question: Restore with loss of control files an
I'm now working on the problem of restoring a database from one server to another, with different directory locations and names. ( on Windoze ) I've got the syntax down, but there's something missing in the Veritas docs, and I'm not quite there yet. Guess I'll have to call Veritas support now. If you've ever complained about Oracle support... I'll post the solution when done. Jared On Tuesday 01 July 2003 08:40, Paul Baumgartel wrote: 1. I have configured RMAN for controlfile autobackup, so that every DB backup produces a backup of the controlfiles and spfile as well. With these, I can easily recover from a loss of every single database file. I can't imagine why anyone wouldn't do it this way (reponses invited). 2. MetaLink states that the location of a backup file is embedded in the file itself, that it can't be used if it's moved to another location, and that this is a feature (yeah, right). On Unix, symlinks will do the trick; on Windoze, the suggested workaround is to back up the backup to another location that matches the one from which you want to use it. I ran into this when restoring a backup on another system that had a different drive layout; rather than backing up the backup, I just juggled drive letters. The lesson here is to ensure that your backup directories are configured and named consistently across all database servers. --- DENNIS WILLIAMS [EMAIL PROTECTED] wrote: Timothy I will tell you what I've figured out. Perhaps someone else can provide additional information. 1. Some people have reported success in extracting the control file from the RMAN backup. I got stuck on the issue that RMAN wouldn't function unless the target instance was up with STARTUP MOUNT, but I couldn't mount the database until I had a control file to mount. I finally gave up and took the expedient of doing a separate SQL BACKUP CONTROLFILE statement, using the RMAN backup directory. To perform a disaster recovery, I just move my copy of the control file into position first. 2. I have not been able to convince RMAN to use a different backup location. Fortunately with Unix it is easy to create a link that looks like the original location. Dennis Williams DBA, 80%OCP, 100% DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Monday, June 30, 2003 10:35 AM To: Multiple recipients of list ORACLE-L catalo database Hi there everyone, I have read the documentation and I am missing something, I am stuck on two points: (1) How do I recover from a RMAN backup to disk if I have lost all the original database files (control files, data files and redo logs) (2) How do I change the location for RMAN to check for the backup up file i.e. Backup file was on E: but now in F:, when I enter restore it only checks the original location. Specifications: Oracle 8.1.7.4.10 Windows 2000SP4 I know that in the backup part I can use the FORMAT string to specify the filename and location. Cheers Tim Clarkson -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). __ Do you Yahoo!? SBC Yahoo! DSL - Now only $29.95 per month! http://sbc.yahoo.com -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: Jared Still INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN Recovery Question: Restore with loss of control files an
Timothy I will tell you what I've figured out. Perhaps someone else can provide additional information. 1. Some people have reported success in extracting the control file from the RMAN backup. I got stuck on the issue that RMAN wouldn't function unless the target instance was up with STARTUP MOUNT, but I couldn't mount the database until I had a control file to mount. I finally gave up and took the expedient of doing a separate SQL BACKUP CONTROLFILE statement, using the RMAN backup directory. To perform a disaster recovery, I just move my copy of the control file into position first. 2. I have not been able to convince RMAN to use a different backup location. Fortunately with Unix it is easy to create a link that looks like the original location. Dennis Williams DBA, 80%OCP, 100% DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Monday, June 30, 2003 10:35 AM To: Multiple recipients of list ORACLE-L catalo database Hi there everyone, I have read the documentation and I am missing something, I am stuck on two points: (1) How do I recover from a RMAN backup to disk if I have lost all the original database files (control files, data files and redo logs) (2) How do I change the location for RMAN to check for the backup up file i.e. Backup file was on E: but now in F:, when I enter restore it only checks the original location. Specifications: Oracle 8.1.7.4.10 Windows 2000SP4 I know that in the backup part I can use the FORMAT string to specify the filename and location. Cheers Tim Clarkson -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
Re: RMAN Recovery Question: Restore with loss of control files and no catalo database
Hi Timothy , I am guessing about the following answers, and I work on 9.2.0.1.0 but anyway here goes : (1) How do I recover from a RMAN backup to disk if I have lost all the original database files (control files, data files and redo logs) In nomount First run SET DBID SET UNTIL SEQUENCE number THREAD 1; Then RESTORE CONTROLFILE ... FROM 'media_handle' or TAG; mount database; restore database; recover database ; open resetlogs database; Don't know the answer to the second question, let me know if you find out . -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: [EMAIL PROTECTED] INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
Re: RMAN Recovery - RBS (Non System) tbs
--- Layzee DBA [EMAIL PROTECTED] wrote: Hi DBA's I am in the process of establishing Crash Recovery scenarios using RMAN and i came across one situation wherein i am able to recover a tbs having (non system tbs) rbs in it using svrmgrl but the same damn thing does not work with rman. I want to know where i am going wrong with rman. Info is avbl.in http://www.geocities.com/layzeedba/rman_rbs.html Any input on this would be highly appreciated. Please reply to me and directly to the list. === Not so Lazy DBA === Hi Guys, Lot of RMAN talk has been going on in this forum. Good. But can any gentleman help me sort out this issue?. Thanks __ Do you Yahoo!? Yahoo! Tax Center - File online, calculators, forms, and more http://tax.yahoo.com -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: Layzee DBA INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
For -Rachel Carmichael - re Rman recovery...
Rachel, Did you ever determine if the unix crossmount or such to be the complete problem? Did the Rman backup turn out to be ok after all- no corruptions? Ever find out if they run validate command? Thanks Rachel, Brian Spears -Original Message- Sent: Monday, March 03, 2003 1:24 PM To: Multiple recipients of list ORACLE-L now you know why I script everything and run it as a background job on the database server itself. :) That I know of, there is no way to restart an import... unless you have primary key or unique constraints on all objects and you are will to do ignore=y so that you get constraint errors on every existing row but do load all the remaining ones. There isn't anything clean that I know of though. Sorry --- Scott Canaan [EMAIL PROTECTED] wrote: I ran into a problem yesterday afternoon. I had exported a production database, via a full database export, and ftp'd it to another machine to import it there and make that the new production machine. It was a long import (about 8 hours) and about 2/3 through it, my network connection was dropped. I was running it in a ssh shell. When the connection disappeared, the import died. Since I had to get the job done by morning, I chose to drop the database, rebuild it, then re-import. That worked. What I am wondering is: Is there a way to resume an import, once it has been interrupted? I have been looking in metalink and on the net and haven't found anything that says how to do it. I tried technet, but it was way too slow. Any thoughts would be greatly appreciated. Thank you, -- Scott Canaan ([EMAIL PROTECTED]) (585) 475-7886 Life is like a sewer, what you get out of it depends on what you put into it. - Tom Lehrer. -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: Scott Canaan INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). __ Do you Yahoo!? Yahoo! Tax Center - forms, calculators, tips, more http://taxes.yahoo.com/ -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: Rachel Carmichael INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.net -- Author: Spears, Brian INET: [EMAIL PROTECTED] Fat City Network Services-- 858-538-5051 http://www.fatcity.com San Diego, California-- Mailing list and web hosting services - To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN recovery stuck - SOLVED!
Thanks to everyone for your help. I finally was able to complete the RMAN disaster recovery! I am backing up to NFS-mounted disk, which is subsequently copied to tape. Oracle 8.1.6 on Compaq Tru64. Here is what I learned: 1. Even if you use the RMAN catalog to back up your database, you can recover the database from the control file alone. But I wouldn't set the parameter that clears RMAN data from the control file really low. 2. Just because you write RMAN backup files to an NFS-mounted drive doesn't mean RMAN will be able to recover them. That was the bottom line on my stuck symptoms. Eventually Oracle support suggested that I take NFS out of the picture. I moved the files to local disk and recovery worked fine. Since then my sys admin pushed harder with the system vendor, got further suggestions for things like IP threads, and I am recovering successfully from NFS. 3. You will need the archive log files from around the time of the backup. Evidently when RMAN is doing its online backup, it is counting on using the data in the redo logs that are being produced while the backup is in progress to complete the recovery. If you are using RMAN to back up your archive logs, this isn't an issue. In my case, it seemed to just be doubling the disk space needed for storing archive logs online. 4. Thanks to everyone for the date format suggestions. The one that finally worked was: set until time to_date('082620021229','mmddhh24mi'); 5. Even after applying the archive logs, RMAN was still not able to complete the recovery. It was probably wanting the active redo logs from the time of the backup. I exited RMAN and fired up svrmgrl and issued alter database open resetlogs and I was home free. Recovery was complete. In a true disaster recovery situation where you only have the backup tape, I'm not going to quibble about the last transaction or so. Again, thanks to everyone for their help, and I hope others can learn something from my experiences. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Friday, August 16, 2002 12:05 PM To: '[EMAIL PROTECTED]' Thanks Jim. This morning I did something close to what you describe. I just did run { allocate channel d1 type disk; restore database; } Same result. It recovers the same set of files and hangs. At least this rules out a lot of things like the time format of time. Now I'm thinking maybe I should kill this RMAN session and issue it again. Since it reports successful recovery of these files, maybe it will go past them and recover more of the files. Thanks to you and everyone for their great suggestions. As long as I have ideas I can keep plugging away at this. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Friday, August 16, 2002 9:39 AM To: Multiple recipients of list ORACLE-L Dennis, Try this: sql startup mount; sql exit rman target sys/password nocatalog then, run { allocate channel d1 type disk; restore database; recover database until cancel; alter database open resetlogs; } Is you're controlfile coming from the RMAN backup set ora are you copying it from the production box? I discovered the control file is backed-up during an RMAN Level 0 at the beginning of the process before the datafiles, therefore the backed-up control file doesn't know about the backup set being run ( at least this was the case during a database clone attempt). ...JIM... [EMAIL PROTECTED] 8/15/02 6:29:16 PM Oops, I forgot to clarify that I have the production database in archivelog mode, but the recovery database not in archivelog mode. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 5:08 PM To: '[EMAIL PROTECTED]' James I think you may have put your finger on a possible misconception of mine. Here is my situation/understanding. - On production, - Archive logging. - RMAN backup to disk without shutting the database down. - Not using RMAN to backup the archive logs. - Disaster recovery scenario. - This is a burn the server scenario. Imagine the computer room no longer exists. All you have is the backup tape from the offsite storage. No stringent recovery timeframe. If you tell the managers that it will take you a week to recover the data, no big deal. If you tell them you cannot recover the data because you forgot to copy some critical file to tape, that is a big deal. - I would assume that all of you that use RMAN have performed such a test. - My concept was to perform the equivalent of a cold backup/cold recovery. Just recover using the RMAN backup set that was written to disk and subsequently written to tape. - What RMAN commands should I use to perform this recovery? I have made assumptions, but they may not be correct. - I assumed this would be an incomplete recovery since I can't recover to the present time. So I inserted the SET UNTIL TIME
RE: RMAN recovery stuck
Dennis, Try this: sql startup mount; sql exit rman target sys/password nocatalog then, run { allocate channel d1 type disk; restore database; recover database until cancel; alter database open resetlogs; } Is you're controlfile coming from the RMAN backup set ora are you copying it from the production box? I discovered the control file is backed-up during an RMAN Level 0 at the beginning of the process before the datafiles, therefore the backed-up control file doesn't know about the backup set being run ( at least this was the case during a database clone attempt). ...JIM... [EMAIL PROTECTED] 8/15/02 6:29:16 PM Oops, I forgot to clarify that I have the production database in archivelog mode, but the recovery database not in archivelog mode. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 5:08 PM To: '[EMAIL PROTECTED]' James I think you may have put your finger on a possible misconception of mine. Here is my situation/understanding. - On production, - Archive logging. - RMAN backup to disk without shutting the database down. - Not using RMAN to backup the archive logs. - Disaster recovery scenario. - This is a burn the server scenario. Imagine the computer room no longer exists. All you have is the backup tape from the offsite storage. No stringent recovery timeframe. If you tell the managers that it will take you a week to recover the data, no big deal. If you tell them you cannot recover the data because you forgot to copy some critical file to tape, that is a big deal. - I would assume that all of you that use RMAN have performed such a test. - My concept was to perform the equivalent of a cold backup/cold recovery. Just recover using the RMAN backup set that was written to disk and subsequently written to tape. - What RMAN commands should I use to perform this recovery? I have made assumptions, but they may not be correct. - I assumed this would be an incomplete recovery since I can't recover to the present time. So I inserted the SET UNTIL TIME command because I think that is how you get RMAN to perform an incomplete recovery. I picked a time just after the end of the RMAN backup. - Do I need any archive logs under this scenario? I can change my production procedure to use RMAN to backup the archive logs if that is what is required. - Do I need the RECOVER DATABASE command? Any ideas would be appreciated. Thanks for everyone's patience while I flail around with this. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 4:08 PM To: Multiple recipients of list ORACLE-L Dennis, This is just a wild guess and I'm probably wrong but I saw in you're original post this DB was not in archivelog mode, try putting it in archivelog mode and running the restore again??? ...JIM... [EMAIL PROTECTED] 8/15/02 2:58:31 PM Okay, I implemented everyone's comments and re-executed the RMAN recovery. Here is what I did and the results. 1. Action: Removed alter database open resetlogs from the run statement. Result: No change. 2. Action: Added trace=1 to the allocate channel command. Result: No trace file is produced in udump. 3. Action: Reviewed Note 145624.1 Result: Did not see the solution to my problem. Most of the suggestions seem appropriate to backup rather than recovery jobs. 4. Action: Added log and debug trace statements to rman invocation line. Result: Produced log and trace file. The trace file at the point of the recovery hang contains the following: krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #10 completed immediately RMAN-08523: restoring datafile 6 to /ora05/ams/data0501.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #11 completed immediately RMAN-08523: restoring datafile 00017 to /ora05/ams/rbs01.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #12 completed immediately RMAN-08523: restoring datafile 00025 to /ora05/ams/mls_data0401.dbf krmxrpc: xc=5372006336 kpurpc2 returned 3123 krmxrpc: xc=5372006336 starting longrunning RPC #13 to target: DBMS_BACKUP_RESTO RE.RESTOREBACKUPPIECE krmxr: xc=5372006336 started long running rpc krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: callback returned TRUE, skipping sleep krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 1 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 2 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 4 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 8 seconds And the trace continues with this statement. Any suggestions would be appreciated, as are the suggestions to this point. I'm
RE: RMAN recovery stuck
James Will try your suggestion. I dodged the problem you describe with the control file. I couldn't get RMAN to create the controlfile without the database being mounted, so I've just been doing a backup controlfile. Thanks for the ideas. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Friday, August 16, 2002 9:39 AM To: Multiple recipients of list ORACLE-L Dennis, Try this: sql startup mount; sql exit rman target sys/password nocatalog then, run { allocate channel d1 type disk; restore database; recover database until cancel; alter database open resetlogs; } Is you're controlfile coming from the RMAN backup set ora are you copying it from the production box? I discovered the control file is backed-up during an RMAN Level 0 at the beginning of the process before the datafiles, therefore the backed-up control file doesn't know about the backup set being run ( at least this was the case during a database clone attempt). ...JIM... [EMAIL PROTECTED] 8/15/02 6:29:16 PM Oops, I forgot to clarify that I have the production database in archivelog mode, but the recovery database not in archivelog mode. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 5:08 PM To: '[EMAIL PROTECTED]' James I think you may have put your finger on a possible misconception of mine. Here is my situation/understanding. - On production, - Archive logging. - RMAN backup to disk without shutting the database down. - Not using RMAN to backup the archive logs. - Disaster recovery scenario. - This is a burn the server scenario. Imagine the computer room no longer exists. All you have is the backup tape from the offsite storage. No stringent recovery timeframe. If you tell the managers that it will take you a week to recover the data, no big deal. If you tell them you cannot recover the data because you forgot to copy some critical file to tape, that is a big deal. - I would assume that all of you that use RMAN have performed such a test. - My concept was to perform the equivalent of a cold backup/cold recovery. Just recover using the RMAN backup set that was written to disk and subsequently written to tape. - What RMAN commands should I use to perform this recovery? I have made assumptions, but they may not be correct. - I assumed this would be an incomplete recovery since I can't recover to the present time. So I inserted the SET UNTIL TIME command because I think that is how you get RMAN to perform an incomplete recovery. I picked a time just after the end of the RMAN backup. - Do I need any archive logs under this scenario? I can change my production procedure to use RMAN to backup the archive logs if that is what is required. - Do I need the RECOVER DATABASE command? Any ideas would be appreciated. Thanks for everyone's patience while I flail around with this. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 4:08 PM To: Multiple recipients of list ORACLE-L Dennis, This is just a wild guess and I'm probably wrong but I saw in you're original post this DB was not in archivelog mode, try putting it in archivelog mode and running the restore again??? ...JIM... [EMAIL PROTECTED] 8/15/02 2:58:31 PM Okay, I implemented everyone's comments and re-executed the RMAN recovery. Here is what I did and the results. 1. Action: Removed alter database open resetlogs from the run statement. Result: No change. 2. Action: Added trace=1 to the allocate channel command. Result: No trace file is produced in udump. 3. Action: Reviewed Note 145624.1 Result: Did not see the solution to my problem. Most of the suggestions seem appropriate to backup rather than recovery jobs. 4. Action: Added log and debug trace statements to rman invocation line. Result: Produced log and trace file. The trace file at the point of the recovery hang contains the following: krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #10 completed immediately RMAN-08523: restoring datafile 6 to /ora05/ams/data0501.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #11 completed immediately RMAN-08523: restoring datafile 00017 to /ora05/ams/rbs01.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #12 completed immediately RMAN-08523: restoring datafile 00025 to /ora05/ams/mls_data0401.dbf krmxrpc: xc=5372006336 kpurpc2 returned 3123 krmxrpc: xc=5372006336 starting longrunning RPC #13 to target: DBMS_BACKUP_RESTO RE.RESTOREBACKUPPIECE krmxr: xc=5372006336 started long running rpc krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: callback returned TRUE, skipping sleep krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 1 seconds krmxpoq: xc=5372006336,
Re: RMAN recovery stuck
And that is why you have to use the controlfile from the time of the backup. Your current constrolfile doesn't know about back then, You can then roll forward to whenever you want. HTH, Ruth - Original Message - To: Multiple recipients of list ORACLE-L [EMAIL PROTECTED] Sent: Thursday, August 15, 2002 4:03 PM Ruth - You suggested using a backup controlfile. What I have been doing is some time after the RMAN backup creates the backup set, I do a alter database backup controlfile to file, and use that. Does that sound okay to you? I really don't care when it recovers to, I'm just trying to perform a disaster recovery test so I can quit doing weekly non-RMAN backups. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 10:39 AM To: Multiple recipients of list ORACLE-L You have to use the backup controlfile to do a point-in-time recovery that is prior to the current time. This may be your probelm. Ruth - Original Message - To: Multiple recipients of list ORACLE-L [EMAIL PROTECTED] Sent: Thursday, August 15, 2002 12:28 AM The problem is probably in the RESETLOGS part. Do you have many large log files? If you do, your instance is trying to initialize them all. Take a look at the system monitor and observe the CPU consumption. If it is large, you have a problem. If it isn't, use sar -b 5 50 or something alike and see how many preads/pwrites are you getting. Many preads/pwrites means that your database software is doing IO. for 8 redo groups, consisting of two members each, 512M per member. it used to take almost 40 minutes to do open resetlogs, after a successful recovery. Solution was a cheap one, FC/AL based EMC Symmetrix 1+0 RAID with 32 GB cache. That has helped. Such solution costs $.25 per MB of the effective storage, which means that 1TB costs $250,000. On 2002.08.14 16:28 DENNIS WILLIAMS wrote: I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Mladen Gogala -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Mladen Gogala INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing
RE: RMAN recovery stuck
Thanks Ruth. Actually the controlfile keeps records for quite a few backups. I actually don't care about rolling forward. In a disaster recovery I will probably only have the level zero backup from tape. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Friday, August 16, 2002 10:54 AM To: Multiple recipients of list ORACLE-L And that is why you have to use the controlfile from the time of the backup. Your current constrolfile doesn't know about back then, You can then roll forward to whenever you want. HTH, Ruth - Original Message - To: Multiple recipients of list ORACLE-L [EMAIL PROTECTED] Sent: Thursday, August 15, 2002 4:03 PM Ruth - You suggested using a backup controlfile. What I have been doing is some time after the RMAN backup creates the backup set, I do a alter database backup controlfile to file, and use that. Does that sound okay to you? I really don't care when it recovers to, I'm just trying to perform a disaster recovery test so I can quit doing weekly non-RMAN backups. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 10:39 AM To: Multiple recipients of list ORACLE-L You have to use the backup controlfile to do a point-in-time recovery that is prior to the current time. This may be your probelm. Ruth - Original Message - To: Multiple recipients of list ORACLE-L [EMAIL PROTECTED] Sent: Thursday, August 15, 2002 12:28 AM The problem is probably in the RESETLOGS part. Do you have many large log files? If you do, your instance is trying to initialize them all. Take a look at the system monitor and observe the CPU consumption. If it is large, you have a problem. If it isn't, use sar -b 5 50 or something alike and see how many preads/pwrites are you getting. Many preads/pwrites means that your database software is doing IO. for 8 redo groups, consisting of two members each, 512M per member. it used to take almost 40 minutes to do open resetlogs, after a successful recovery. Solution was a cheap one, FC/AL based EMC Symmetrix 1+0 RAID with 32 GB cache. That has helped. Such solution costs $.25 per MB of the effective storage, which means that 1TB costs $250,000. On 2002.08.14 16:28 DENNIS WILLIAMS wrote: I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Mladen Gogala --
RE: RMAN recovery stuck
Thanks Jim. This morning I did something close to what you describe. I just did run { allocate channel d1 type disk; restore database; } Same result. It recovers the same set of files and hangs. At least this rules out a lot of things like the time format of time. Now I'm thinking maybe I should kill this RMAN session and issue it again. Since it reports successful recovery of these files, maybe it will go past them and recover more of the files. Thanks to you and everyone for their great suggestions. As long as I have ideas I can keep plugging away at this. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Friday, August 16, 2002 9:39 AM To: Multiple recipients of list ORACLE-L Dennis, Try this: sql startup mount; sql exit rman target sys/password nocatalog then, run { allocate channel d1 type disk; restore database; recover database until cancel; alter database open resetlogs; } Is you're controlfile coming from the RMAN backup set ora are you copying it from the production box? I discovered the control file is backed-up during an RMAN Level 0 at the beginning of the process before the datafiles, therefore the backed-up control file doesn't know about the backup set being run ( at least this was the case during a database clone attempt). ...JIM... [EMAIL PROTECTED] 8/15/02 6:29:16 PM Oops, I forgot to clarify that I have the production database in archivelog mode, but the recovery database not in archivelog mode. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 5:08 PM To: '[EMAIL PROTECTED]' James I think you may have put your finger on a possible misconception of mine. Here is my situation/understanding. - On production, - Archive logging. - RMAN backup to disk without shutting the database down. - Not using RMAN to backup the archive logs. - Disaster recovery scenario. - This is a burn the server scenario. Imagine the computer room no longer exists. All you have is the backup tape from the offsite storage. No stringent recovery timeframe. If you tell the managers that it will take you a week to recover the data, no big deal. If you tell them you cannot recover the data because you forgot to copy some critical file to tape, that is a big deal. - I would assume that all of you that use RMAN have performed such a test. - My concept was to perform the equivalent of a cold backup/cold recovery. Just recover using the RMAN backup set that was written to disk and subsequently written to tape. - What RMAN commands should I use to perform this recovery? I have made assumptions, but they may not be correct. - I assumed this would be an incomplete recovery since I can't recover to the present time. So I inserted the SET UNTIL TIME command because I think that is how you get RMAN to perform an incomplete recovery. I picked a time just after the end of the RMAN backup. - Do I need any archive logs under this scenario? I can change my production procedure to use RMAN to backup the archive logs if that is what is required. - Do I need the RECOVER DATABASE command? Any ideas would be appreciated. Thanks for everyone's patience while I flail around with this. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 4:08 PM To: Multiple recipients of list ORACLE-L Dennis, This is just a wild guess and I'm probably wrong but I saw in you're original post this DB was not in archivelog mode, try putting it in archivelog mode and running the restore again??? ...JIM... [EMAIL PROTECTED] 8/15/02 2:58:31 PM Okay, I implemented everyone's comments and re-executed the RMAN recovery. Here is what I did and the results. 1. Action: Removed alter database open resetlogs from the run statement. Result: No change. 2. Action: Added trace=1 to the allocate channel command. Result: No trace file is produced in udump. 3. Action: Reviewed Note 145624.1 Result: Did not see the solution to my problem. Most of the suggestions seem appropriate to backup rather than recovery jobs. 4. Action: Added log and debug trace statements to rman invocation line. Result: Produced log and trace file. The trace file at the point of the recovery hang contains the following: krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #10 completed immediately RMAN-08523: restoring datafile 6 to /ora05/ams/data0501.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #11 completed immediately RMAN-08523: restoring datafile 00017 to /ora05/ams/rbs01.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #12 completed immediately RMAN-08523: restoring datafile 00025 to /ora05/ams/mls_data0401.dbf krmxrpc: xc=5372006336 kpurpc2 returned 3123 krmxrpc: xc=5372006336 starting longrunning RPC #13 to target:
Re: RMAN recovery stuck
You have to use the backup controlfile to do a point-in-time recovery that is prior to the current time. This may be your probelm. Ruth - Original Message - To: Multiple recipients of list ORACLE-L [EMAIL PROTECTED] Sent: Thursday, August 15, 2002 12:28 AM The problem is probably in the RESETLOGS part. Do you have many large log files? If you do, your instance is trying to initialize them all. Take a look at the system monitor and observe the CPU consumption. If it is large, you have a problem. If it isn't, use sar -b 5 50 or something alike and see how many preads/pwrites are you getting. Many preads/pwrites means that your database software is doing IO. for 8 redo groups, consisting of two members each, 512M per member. it used to take almost 40 minutes to do open resetlogs, after a successful recovery. Solution was a cheap one, FC/AL based EMC Symmetrix 1+0 RAID with 32 GB cache. That has helped. Such solution costs $.25 per MB of the effective storage, which means that 1TB costs $250,000. On 2002.08.14 16:28 DENNIS WILLIAMS wrote: I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Mladen Gogala -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Mladen Gogala INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Ruth Gramolini INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN recovery stuck
Robert, Tim, Mladen, Bruce - Thanks for the excellent suggestions. I will try them today. Bruce - Yes, you have an excellent memory. Not only for the Metalink note, but recalling my posting from a month ago. I was pulled into some other projects and just now getting back to this one, trying the last round of suggestions. Also, I thought maybe a control-file based recovery was just my own offbeat idea until Joe mentioned he was using that. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- [mailto:[EMAIL PROTECTED]] Sent: Wednesday, August 14, 2002 11:33 PM To: Multiple recipients of list ORACLE-L This reminded me of a Metalink note I once found. Dennis - you might want to look at Note:145624.1 (RMAN: Resolving an RMAN Hung Job) for some more hints information. Tim - the note mentions the debug command line parameter but doesn't show the trace=1 phrase so its good to learn that. Dennis - Is this the same hanging issue you had a month or so ago? Regards, Bruce Reardon -Original Message- Sent: Thursday, 15 August 2002 13:23 Call RMAN from command-line as follows: rman nocatalog log=logfilename debug trace=tracefilename Both the logfilename and tracefilename should have copious amounts of output, which can provide a clue. When you allocate the channel, make sure to add the phrase trace=1 to the end of the ALLOCATE command. This will produce .trc files in your USER_DUMP_DEST directory with additional diagnostic output... Hope this helps... - Original Message - To: Multiple recipients of list ORACLE-L [EMAIL PROTECTED] I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Reardon, Bruce (CALBBAY) INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN recovery stuck
Okay, I implemented everyone's comments and re-executed the RMAN recovery. Here is what I did and the results. 1. Action: Removed alter database open resetlogs from the run statement. Result: No change. 2. Action: Added trace=1 to the allocate channel command. Result: No trace file is produced in udump. 3. Action: Reviewed Note 145624.1 Result: Did not see the solution to my problem. Most of the suggestions seem appropriate to backup rather than recovery jobs. 4. Action: Added log and debug trace statements to rman invocation line. Result: Produced log and trace file. The trace file at the point of the recovery hang contains the following: krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #10 completed immediately RMAN-08523: restoring datafile 6 to /ora05/ams/data0501.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #11 completed immediately RMAN-08523: restoring datafile 00017 to /ora05/ams/rbs01.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #12 completed immediately RMAN-08523: restoring datafile 00025 to /ora05/ams/mls_data0401.dbf krmxrpc: xc=5372006336 kpurpc2 returned 3123 krmxrpc: xc=5372006336 starting longrunning RPC #13 to target: DBMS_BACKUP_RESTO RE.RESTOREBACKUPPIECE krmxr: xc=5372006336 started long running rpc krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: callback returned TRUE, skipping sleep krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 1 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 2 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 4 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 8 seconds And the trace continues with this statement. Any suggestions would be appreciated, as are the suggestions to this point. I'm about ready to think it is TAR time. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Wednesday, August 14, 2002 2:23 PM To: '[EMAIL PROTECTED]' I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN recovery stuck
Ruth - You suggested using a backup controlfile. What I have been doing is some time after the RMAN backup creates the backup set, I do a alter database backup controlfile to file, and use that. Does that sound okay to you? I really don't care when it recovers to, I'm just trying to perform a disaster recovery test so I can quit doing weekly non-RMAN backups. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 10:39 AM To: Multiple recipients of list ORACLE-L You have to use the backup controlfile to do a point-in-time recovery that is prior to the current time. This may be your probelm. Ruth - Original Message - To: Multiple recipients of list ORACLE-L [EMAIL PROTECTED] Sent: Thursday, August 15, 2002 12:28 AM The problem is probably in the RESETLOGS part. Do you have many large log files? If you do, your instance is trying to initialize them all. Take a look at the system monitor and observe the CPU consumption. If it is large, you have a problem. If it isn't, use sar -b 5 50 or something alike and see how many preads/pwrites are you getting. Many preads/pwrites means that your database software is doing IO. for 8 redo groups, consisting of two members each, 512M per member. it used to take almost 40 minutes to do open resetlogs, after a successful recovery. Solution was a cheap one, FC/AL based EMC Symmetrix 1+0 RAID with 32 GB cache. That has helped. Such solution costs $.25 per MB of the effective storage, which means that 1TB costs $250,000. On 2002.08.14 16:28 DENNIS WILLIAMS wrote: I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Mladen Gogala -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Mladen Gogala INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Ruth Gramolini INET: [EMAIL PROTECTED] Fat City Network Services--
RE: RMAN recovery stuck
Dennis, This is just a wild guess and I'm probably wrong but I saw in you're original post this DB was not in archivelog mode, try putting it in archivelog mode and running the restore again??? ...JIM... [EMAIL PROTECTED] 8/15/02 2:58:31 PM Okay, I implemented everyone's comments and re-executed the RMAN recovery. Here is what I did and the results. 1. Action: Removed alter database open resetlogs from the run statement. Result: No change. 2. Action: Added trace=1 to the allocate channel command. Result: No trace file is produced in udump. 3. Action: Reviewed Note 145624.1 Result: Did not see the solution to my problem. Most of the suggestions seem appropriate to backup rather than recovery jobs. 4. Action: Added log and debug trace statements to rman invocation line. Result: Produced log and trace file. The trace file at the point of the recovery hang contains the following: krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #10 completed immediately RMAN-08523: restoring datafile 6 to /ora05/ams/data0501.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #11 completed immediately RMAN-08523: restoring datafile 00017 to /ora05/ams/rbs01.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #12 completed immediately RMAN-08523: restoring datafile 00025 to /ora05/ams/mls_data0401.dbf krmxrpc: xc=5372006336 kpurpc2 returned 3123 krmxrpc: xc=5372006336 starting longrunning RPC #13 to target: DBMS_BACKUP_RESTO RE.RESTOREBACKUPPIECE krmxr: xc=5372006336 started long running rpc krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: callback returned TRUE, skipping sleep krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 1 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 2 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 4 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 8 seconds And the trace continues with this statement. Any suggestions would be appreciated, as are the suggestions to this point. I'm about ready to think it is TAR time. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Wednesday, August 14, 2002 2:23 PM To: '[EMAIL PROTECTED]' I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: James Howerton INET: [EMAIL PROTECTED] Fat
RE: RMAN recovery stuck
Dennis, Annother thought, I had similar hang problems while trying to clone a database to annother box. I can't remember exactly what I did, it was too long ago. I set the until time in a different format than you are using. run { set until time '03-FEB-2002 03:30:59'; allocate channel d1type disk ... ... } HTH ...JIM... [EMAIL PROTECTED] 8/15/02 2:58:31 PM Okay, I implemented everyone's comments and re-executed the RMAN recovery. Here is what I did and the results. 1. Action: Removed alter database open resetlogs from the run statement. Result: No change. 2. Action: Added trace=1 to the allocate channel command. Result: No trace file is produced in udump. 3. Action: Reviewed Note 145624.1 Result: Did not see the solution to my problem. Most of the suggestions seem appropriate to backup rather than recovery jobs. 4. Action: Added log and debug trace statements to rman invocation line. Result: Produced log and trace file. The trace file at the point of the recovery hang contains the following: krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #10 completed immediately RMAN-08523: restoring datafile 6 to /ora05/ams/data0501.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #11 completed immediately RMAN-08523: restoring datafile 00017 to /ora05/ams/rbs01.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #12 completed immediately RMAN-08523: restoring datafile 00025 to /ora05/ams/mls_data0401.dbf krmxrpc: xc=5372006336 kpurpc2 returned 3123 krmxrpc: xc=5372006336 starting longrunning RPC #13 to target: DBMS_BACKUP_RESTO RE.RESTOREBACKUPPIECE krmxr: xc=5372006336 started long running rpc krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: callback returned TRUE, skipping sleep krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 1 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 2 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 4 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 8 seconds And the trace continues with this statement. Any suggestions would be appreciated, as are the suggestions to this point. I'm about ready to think it is TAR time. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Wednesday, August 14, 2002 2:23 PM To: '[EMAIL PROTECTED]' I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like
RE: RMAN recovery stuck
I was thinking along those same lines...set your NLS_DATE_FORMAT environment variable, for example, to '-MM-DD:HH24:MI:SS' and then use a matching format in your script: set until time '2002-02-03:03:30:59'; This format is what I use. Debi At 01:13 PM 8/15/2002 -0800, James Howerton wrote: Dennis, Annother thought, I had similar hang problems while trying to clone a database to annother box. I can't remember exactly what I did, it was too long ago. I set the until time in a different format than you are using. run { set until time '03-FEB-2002 03:30:59'; allocate channel d1type disk ... ... } HTH ...JIM... [EMAIL PROTECTED] 8/15/02 2:58:31 PM Okay, I implemented everyone's comments and re-executed the RMAN recovery. Here is what I did and the results. 1. Action: Removed alter database open resetlogs from the run statement. Result: No change. 2. Action: Added trace=1 to the allocate channel command. Result: No trace file is produced in udump. 3. Action: Reviewed Note 145624.1 Result: Did not see the solution to my problem. Most of the suggestions seem appropriate to backup rather than recovery jobs. 4. Action: Added log and debug trace statements to rman invocation line. Result: Produced log and trace file. The trace file at the point of the recovery hang contains the following: krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #10 completed immediately RMAN-08523: restoring datafile 6 to /ora05/ams/data0501.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #11 completed immediately RMAN-08523: restoring datafile 00017 to /ora05/ams/rbs01.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #12 completed immediately RMAN-08523: restoring datafile 00025 to /ora05/ams/mls_data0401.dbf krmxrpc: xc=5372006336 kpurpc2 returned 3123 krmxrpc: xc=5372006336 starting longrunning RPC #13 to target: DBMS_BACKUP_RESTO RE.RESTOREBACKUPPIECE krmxr: xc=5372006336 started long running rpc krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: callback returned TRUE, skipping sleep krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 1 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 2 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 4 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 8 seconds And the trace continues with this statement. Any suggestions would be appreciated, as are the suggestions to this point. I'm about ready to think it is TAR time. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Wednesday, August 14, 2002 2:23 PM To: '[EMAIL PROTECTED]' I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from
RE: RMAN recovery stuck
James I think you may have put your finger on a possible misconception of mine. Here is my situation/understanding. - On production, - Archive logging. - RMAN backup to disk without shutting the database down. - Not using RMAN to backup the archive logs. - Disaster recovery scenario. - This is a burn the server scenario. Imagine the computer room no longer exists. All you have is the backup tape from the offsite storage. No stringent recovery timeframe. If you tell the managers that it will take you a week to recover the data, no big deal. If you tell them you cannot recover the data because you forgot to copy some critical file to tape, that is a big deal. - I would assume that all of you that use RMAN have performed such a test. - My concept was to perform the equivalent of a cold backup/cold recovery. Just recover using the RMAN backup set that was written to disk and subsequently written to tape. - What RMAN commands should I use to perform this recovery? I have made assumptions, but they may not be correct. - I assumed this would be an incomplete recovery since I can't recover to the present time. So I inserted the SET UNTIL TIME command because I think that is how you get RMAN to perform an incomplete recovery. I picked a time just after the end of the RMAN backup. - Do I need any archive logs under this scenario? I can change my production procedure to use RMAN to backup the archive logs if that is what is required. - Do I need the RECOVER DATABASE command? Any ideas would be appreciated. Thanks for everyone's patience while I flail around with this. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 4:08 PM To: Multiple recipients of list ORACLE-L Dennis, This is just a wild guess and I'm probably wrong but I saw in you're original post this DB was not in archivelog mode, try putting it in archivelog mode and running the restore again??? ...JIM... [EMAIL PROTECTED] 8/15/02 2:58:31 PM Okay, I implemented everyone's comments and re-executed the RMAN recovery. Here is what I did and the results. 1. Action: Removed alter database open resetlogs from the run statement. Result: No change. 2. Action: Added trace=1 to the allocate channel command. Result: No trace file is produced in udump. 3. Action: Reviewed Note 145624.1 Result: Did not see the solution to my problem. Most of the suggestions seem appropriate to backup rather than recovery jobs. 4. Action: Added log and debug trace statements to rman invocation line. Result: Produced log and trace file. The trace file at the point of the recovery hang contains the following: krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #10 completed immediately RMAN-08523: restoring datafile 6 to /ora05/ams/data0501.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #11 completed immediately RMAN-08523: restoring datafile 00017 to /ora05/ams/rbs01.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #12 completed immediately RMAN-08523: restoring datafile 00025 to /ora05/ams/mls_data0401.dbf krmxrpc: xc=5372006336 kpurpc2 returned 3123 krmxrpc: xc=5372006336 starting longrunning RPC #13 to target: DBMS_BACKUP_RESTO RE.RESTOREBACKUPPIECE krmxr: xc=5372006336 started long running rpc krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: callback returned TRUE, skipping sleep krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 1 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 2 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 4 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 8 seconds And the trace continues with this statement. Any suggestions would be appreciated, as are the suggestions to this point. I'm about ready to think it is TAR time. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Wednesday, August 14, 2002 2:23 PM To: '[EMAIL PROTECTED]' I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The
RE: RMAN recovery stuck
Oops, I forgot to clarify that I have the production database in archivelog mode, but the recovery database not in archivelog mode. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 5:08 PM To: '[EMAIL PROTECTED]' James I think you may have put your finger on a possible misconception of mine. Here is my situation/understanding. - On production, - Archive logging. - RMAN backup to disk without shutting the database down. - Not using RMAN to backup the archive logs. - Disaster recovery scenario. - This is a burn the server scenario. Imagine the computer room no longer exists. All you have is the backup tape from the offsite storage. No stringent recovery timeframe. If you tell the managers that it will take you a week to recover the data, no big deal. If you tell them you cannot recover the data because you forgot to copy some critical file to tape, that is a big deal. - I would assume that all of you that use RMAN have performed such a test. - My concept was to perform the equivalent of a cold backup/cold recovery. Just recover using the RMAN backup set that was written to disk and subsequently written to tape. - What RMAN commands should I use to perform this recovery? I have made assumptions, but they may not be correct. - I assumed this would be an incomplete recovery since I can't recover to the present time. So I inserted the SET UNTIL TIME command because I think that is how you get RMAN to perform an incomplete recovery. I picked a time just after the end of the RMAN backup. - Do I need any archive logs under this scenario? I can change my production procedure to use RMAN to backup the archive logs if that is what is required. - Do I need the RECOVER DATABASE command? Any ideas would be appreciated. Thanks for everyone's patience while I flail around with this. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Thursday, August 15, 2002 4:08 PM To: Multiple recipients of list ORACLE-L Dennis, This is just a wild guess and I'm probably wrong but I saw in you're original post this DB was not in archivelog mode, try putting it in archivelog mode and running the restore again??? ...JIM... [EMAIL PROTECTED] 8/15/02 2:58:31 PM Okay, I implemented everyone's comments and re-executed the RMAN recovery. Here is what I did and the results. 1. Action: Removed alter database open resetlogs from the run statement. Result: No change. 2. Action: Added trace=1 to the allocate channel command. Result: No trace file is produced in udump. 3. Action: Reviewed Note 145624.1 Result: Did not see the solution to my problem. Most of the suggestions seem appropriate to backup rather than recovery jobs. 4. Action: Added log and debug trace statements to rman invocation line. Result: Produced log and trace file. The trace file at the point of the recovery hang contains the following: krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #10 completed immediately RMAN-08523: restoring datafile 6 to /ora05/ams/data0501.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #11 completed immediately RMAN-08523: restoring datafile 00017 to /ora05/ams/rbs01.dbf krmxrpc: xc=5372006336 kpurpc2 returned 0 krmxrpc: xc=5372006336 RPC #12 completed immediately RMAN-08523: restoring datafile 00025 to /ora05/ams/mls_data0401.dbf krmxrpc: xc=5372006336 kpurpc2 returned 3123 krmxrpc: xc=5372006336 starting longrunning RPC #13 to target: DBMS_BACKUP_RESTO RE.RESTOREBACKUPPIECE krmxr: xc=5372006336 started long running rpc krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: callback returned TRUE, skipping sleep krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 1 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 2 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 4 seconds krmxpoq: xc=5372006336, action=013 STARTED, col_l=15, ind=0, sid=13 krmxr: sleeping for 8 seconds And the trace continues with this statement. Any suggestions would be appreciated, as are the suggestions to this point. I'm about ready to think it is TAR time. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Wednesday, August 14, 2002 2:23 PM To: '[EMAIL PROTECTED]' I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; }
RE: RMAN recovery stuck
There is a bug in 8.1.6 that is fixed in 8.1.7: 1164440 The RMAN command release channel reports RMAN-20020 if there are set until and alter database open reset logs commands in the same run block. While this isn't exactly like your problem, it might be related. So, I'd remove the last alter database command, run the rest, and then manually do the alter database and see if it works. RF Robert G. Freeman - Oracle OCP Oracle Database Architect CSX Midtier Database Administration Author Oracle9i RMAN Backup and Recovery (Oracle Press - Oct 2002) Oracle9i New Features (Oracle Press) Mastering Oracle8i (Sybex) The avalanche has begun, It is too late for the pebbles to vote. -Original Message- Sent: Wednesday, August 14, 2002 4:28 PM To: Multiple recipients of list ORACLE-L I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Freeman, Robert INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN recovery stuck
Robert - Thanks a million. I'll get started on that tomorrow. Yep, I figured I'd made some type of novice error which is why I posted the commands I'm using. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -Original Message- Sent: Wednesday, August 14, 2002 4:33 PM To: Multiple recipients of list ORACLE-L There is a bug in 8.1.6 that is fixed in 8.1.7: 1164440 The RMAN command release channel reports RMAN-20020 if there are set until and alter database open reset logs commands in the same run block. While this isn't exactly like your problem, it might be related. So, I'd remove the last alter database command, run the rest, and then manually do the alter database and see if it works. RF Robert G. Freeman - Oracle OCP Oracle Database Architect CSX Midtier Database Administration Author Oracle9i RMAN Backup and Recovery (Oracle Press - Oct 2002) Oracle9i New Features (Oracle Press) Mastering Oracle8i (Sybex) The avalanche has begun, It is too late for the pebbles to vote. -Original Message- Sent: Wednesday, August 14, 2002 4:28 PM To: Multiple recipients of list ORACLE-L I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Freeman, Robert INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
Re: RMAN recovery stuck
Call RMAN from command-line as follows: rman nocatalog log=logfilename debug trace=tracefilename Both the logfilename and tracefilename should have copious amounts of output, which can provide a clue. When you allocate the channel, make sure to add the phrase trace=1 to the end of the ALLOCATE command. This will produce .trc files in your USER_DUMP_DEST directory with additional diagnostic output... Hope this helps... - Original Message - To: Multiple recipients of list ORACLE-L [EMAIL PROTECTED] Sent: Wednesday, August 14, 2002 2:28 PM I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Tim Gorman INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
Re: RMAN recovery stuck
The problem is probably in the RESETLOGS part. Do you have many large log files? If you do, your instance is trying to initialize them all. Take a look at the system monitor and observe the CPU consumption. If it is large, you have a problem. If it isn't, use sar -b 5 50 or something alike and see how many preads/pwrites are you getting. Many preads/pwrites means that your database software is doing IO. for 8 redo groups, consisting of two members each, 512M per member. it used to take almost 40 minutes to do open resetlogs, after a successful recovery. Solution was a cheap one, FC/AL based EMC Symmetrix 1+0 RAID with 32 GB cache. That has helped. Such solution costs $.25 per MB of the effective storage, which means that 1TB costs $250,000. On 2002.08.14 16:28 DENNIS WILLIAMS wrote: I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: DENNIS WILLIAMS INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing). -- Mladen Gogala -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Mladen Gogala INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).
RE: RMAN recovery stuck
This reminded me of a Metalink note I once found. Dennis - you might want to look at Note:145624.1 (RMAN: Resolving an RMAN Hung Job) for some more hints information. Tim - the note mentions the debug command line parameter but doesn't show the trace=1 phrase so its good to learn that. Dennis - Is this the same hanging issue you had a month or so ago? Regards, Bruce Reardon -Original Message- Sent: Thursday, 15 August 2002 13:23 Call RMAN from command-line as follows: rman nocatalog log=logfilename debug trace=tracefilename Both the logfilename and tracefilename should have copious amounts of output, which can provide a clue. When you allocate the channel, make sure to add the phrase trace=1 to the end of the ALLOCATE command. This will produce .trc files in your USER_DUMP_DEST directory with additional diagnostic output... Hope this helps... - Original Message - To: Multiple recipients of list ORACLE-L [EMAIL PROTECTED] I am trying to perform an RMAN disaster recovery task. While I use an RMAN catalog to make backups, I am trying to recover using just the control file information. Oracle 8.1.6, Compaq/HP Tru64 I start RMAN with rman target sys/password nocatalog then, startup mount run { set until time to_date('08/11/2002 01:00:00','MM/DD/ HH24:MI:SS'); allocate channel d1 type disk; restore database; recover database; alter database open resetlogs; } Everything appears normal for awhile. In the alert log RMAN tries to find each file, doesn't find them. Then it successfully recovers 5 data files (including system and rollback) and reports success in the alert log. Then . . nothing for hours. RMAN doesn't return an error. The RMAN shadow processes are still present but with no CPU consumption. Nothing is written to the alert log. I check V$SESSION_WAIT, and the only entry for the RMAN shadow processes is one is SQL*Net message to client with seconds_in_wait = 0, state = waited unknown time. In V$SYSTEM_EVENT, time_waited and average_wait are zero for all events. The following events have values of total_waits that are increasing: Increase in total_waits in 10-minutes rdbms ipc message 401 pmon timer 57 control file parallel write 56 SQL*Net message to client24 SQL*Net message from client 24 virtual circuit status5 dispatch timer3 smon timer1 Archiving is turned off. I have attempted this recovery many times using different RMAN backup sets, but the system always hangs at this point. Any ideas would be appreciated. Dennis Williams DBA Lifetouch, Inc. [EMAIL PROTECTED] -- Please see the official ORACLE-L FAQ: http://www.orafaq.com -- Author: Reardon, Bruce (CALBBAY) INET: [EMAIL PROTECTED] Fat City Network Services-- (858) 538-5051 FAX: (858) 538-5051 San Diego, California-- Public Internet access / Mailing Lists To REMOVE yourself from this mailing list, send an E-Mail message to: [EMAIL PROTECTED] (note EXACT spelling of 'ListGuru') and in the message BODY, include a line containing: UNSUB ORACLE-L (or the name of mailing list you want to be removed from). You may also send the HELP command for other information (like subscribing).