Data Guard: Concepts and implementation

General Concepts: Components, Roles, Interfaces
Architecture
Data Guard Protection Modes
Physical Standby
Implementation with RMAN (Recommended)
Physical Standby Implementation  (OLD WAY)
Logical Standby Implementation with RMAN (Recommended)
Logical Standby Sanity Check
Troubleshooting a Logical Standby
Logical Standby Database Activation (Role Transition) - Switchover and Failover
Suspend Physical Standby Recovery
Monitoring Physical Data Guard  (Detect Gap)
Activate Physical Standby (on Read-Only Mode or PROD Mode)
Switchover between Prod and Standby DB
Physical Standby Switchover Steps
Physical Standby Failover Steps
Implementation Tips
Applying Patches with Standby
Resolving Problems
Synchronize a GAP on the STANDBY when ARCH Logs are lost
Using Flashback with Data Guard
Monitor Data Guard


General Concepts
 
Data Guard Components
Oracle Data Guard consists of the following components:

Primary Database:
A primary database is a production database.  The primary database is used to create a standby database.  Every standby database is associated with one and only one primary database.

Standby Database:

A physical or logical standby database is a database replica created from a backup of a primary database. A Data Guard configuration consists of one production (or primary) database and up to nine standby databases. The databases in a Data Guard configuration are connected by Oracle Net and may be dispersed geographically. There are no restrictions on where the databases are located, provided that they can communicate with each other.

Log Transport Services:
Enables and controls the automated transfer of redo data within a Data Guard configuration from the primary site to each of its standby sites.
Log transport services also controls the level of data protection for your database. The DBA will configure log transport services to balance data protection and availability against database performance. Log transport services will also coordinate with log apply services and role management services for switchover and failover operations.

Log Apply Services:
Log apply services apply the archived redo logs to the standby database. 

Data Guard Broker:

Data Guard Broker is the management and monitoring component with which you configure, control, and monitor a fault-tolerant system consisting of a primary database protected by one or more standby databases. Data Guard can be used in combination with other Oracle High Availability (HA) solutions such as Real Application Clusters (RAC), Oracle Flashback, Oracle Recovery Manager (RMAN), and database options such as Oracle Active Data Guard and Oracle Advanced Compression, to provide a high level of data protection.


Fig 1: DATA GUARD GENERAL VIEW



Fig 2: DATA GUARD WITH DETAILS OF PROCESSES

Data Guard Roles

A database can operate in one of two mutually exclusive roles: primary or standby. 

Failover
A failover is done when the primary database (all instances of an Oracle RAC primary database) fails or has become unreachable and one of the standby databases is transitioned to take over the primary role. Failover should be performed when the primary database cannot be recovered in a timely manner. Failover may or may not result in data loss depending on the protection mode in effect at the time of the failover.

Switchover

A switchover is a role reversal between the primary database and one of its standby databases. A switchover guarantees no data loss and is typically done for planned maintenance of the primary system. During a switchover, the primary database transitions to a standby role, and the standby database transitions to the primary role.

 

Data Guard Interfaces

Oracle provides three ways to manage a Data Guard environment:
SQL*Plus and SQL Statements
You can use SQL*Plus and SQL commands to manage the Data Guard environment. The following SQL statement initiates a switchover operation:
SQL> alter database commit to switchover to physical standby;

Data Guard Broker GUI Interface (Data Guard Manager)
Data Guard Manager is the GUI version of the Data Guard broker interface that allows you to automate many of the tasks involved in configuring and monitoring a Data Guard environment. 

Data Guard Broker Command-Line Interface (CLI)
It is useful if you want to use the broker from batch programs or scripts.  You can perform most of the activities required to manage and monitor the Data Guard environment using the CLI. The following example lists the available commands:

$ dgmgrl
Welcome to DGMGRL, type "help" for information.

DGMGRL> help

The following commands are available:
add            Adds a member to the broker configuration
connect        Connects to an Oracle database instance
convert        Converts a database from one type to another
create         Creates a broker configuration
disable        Disables a configuration, a member, or fast-start failover
edit           Edits a configuration or a member
enable         Enables a configuration, a member, or fast-start failover
exit           Exits the program
failover       Changes a standby database to be the primary database
help           Displays description and syntax for a command
quit           Exits the program
reinstate      Changes a database marked for reinstatement into a viable standby
rem            Comment to be ignored by DGMGRL
remove         Removes a configuration or a member
show           Displays information about a configuration or a member
shutdown       Shuts down a currently running Oracle database instance
sql            Executes a SQL statement
start          Starts the fast-start failover observer
startup        Starts an Oracle database instance
stop           Stops the fast-start failover observer
switchover     Switches roles between a primary and standby database
validate       Performs an exhaustive set of validations for a database
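
For example, a minimal DGMGRL session to inspect the configuration and perform a switchover might look like the following (a sketch only; the credentials and the DR_FGUARD broker database name are assumptions for illustration, substitute your own broker configuration):

$ dgmgrl sys/oracle@FGUARD
DGMGRL> show configuration;
DGMGRL> show database verbose 'DR_FGUARD';
DGMGRL> switchover to 'DR_FGUARD';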




Process Architecture

Physical Standby Processes Architecture (Apply Redo Logs)
A physical standby database is a byte for byte exact copy of the primary database. This also means that rowids stay the same in a physical standby database environment.
On the primary database site, the log writer process (LGWR) collects transactions from the log buffer and writes to the online redo logs.  The archiver process (ARCH) creates a copy of the online redo logs, and writes to the local archive destination.  Depending on the configuration, the archiver process or log writer process can also transmit redo logs to the standby database.  When using the log writer process, you can specify synchronous or asynchronous network transmission of redo logs to remote destinations.  Data Guard achieves asynchronous network I/O using the LGWR network server process (LNS).  These network server processes are configured with the LOG_ARCHIVE_DEST_n initialization parameter. Data Guard's asynchronous log transport (i.e. the Maximum Performance mode) is recommended for a configuration in which the network distance is up to thousands of miles, providing continual maximum performance, while minimizing the risks of transaction loss in the event of a disaster.
On the standby database site, the remote file server process (RFS) receives archived redo logs from the primary database.  The RFS process is started on the standby site during the first log transfer from the primary.  The redo information received by the RFS process can be stored as either standby redo logs or archived redo logs.  Data Guard introduces the concept of standby redo logs (a separate pool of log file groups).  Standby redo logs must be archived by the ARCH process to the standby archived destination before the managed recovery process (MRP) applies the redo information to the standby database.
The fetch archive log (FAL) client is the MRP process.  The fetch archive log (FAL) server is a foreground process that runs on the primary database and services the fetch archive log requests coming from the FAL client.  A separate FAL server is created for each incoming FAL client.
FAL_SERVER specifies the FAL (fetch archive log) server for a standby database. The value is an Oracle Net service name, which is assumed to be configured properly on the standby database system to point to the desired FAL server.
FAL_CLIENT specifies the FAL client name that is used by the FAL service, configured through the FAL_SERVER parameter, to refer to the FAL client. The value is an Oracle Net service name, which is assumed to be configured properly on the FAL server system to point to the FAL client (standby database).
Thanks to the FAL_CLIENT and FAL_SERVER parameters, the managed-recovery process in the physical database will automatically check and resolve gaps at the time redo is applied. This helps in the sense that you don't need to perform the transfer of those gaps by yourself.
FAL_CLIENT and FAL_SERVER only need to be defined in the initialization parameter file for the standby database(s). It is possible, however, to define these two parameters in the initialization parameter file for the primary database as well, to reduce the amount of work that would be needed if the primary database were required to transition its role.
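
As a sketch (using the PROD_FGUARD and FGUARD service names from the example configuration later in this document), the FAL parameters could be set on the standby with:

-- On the standby database; the service names are illustrative
ALTER SYSTEM SET fal_server = 'PROD_FGUARD' SCOPE=BOTH;
ALTER SYSTEM SET fal_client = 'FGUARD' SCOPE=BOTH;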
When using Data Guard Broker (DG_BROKER_START = TRUE), the monitor agent process named Data Guard Broker Monitor (DMON) runs on every site (primary and standby) and maintains two-way communication.

Logical Standby Processes Architecture (redo logs converted to sql, called SQL APPLY)
The major difference between the logical and physical standby database architectures is in its log apply services.
On a Logical Standby, you can query it while simultaneously applying transactions from the primary. This is ideal for a business that requires a near real-time copy of the production DB for reporting.
The key advantage of logical standby databases is that they are opened read/write, even while they are in apply mode. That is, they can be used to generate reports and the like. It is indeed a fully functional database. Also, additional indexes, materialized views and so on can be created.
Oracle (or more exactly, log apply services) takes the primary database's redo logs, transforms them into SQL statements and replays them on the logical standby database. SQL Apply uses LogMiner technology to reconstruct DML statements from the redo generated on the primary.
The logical standby process (LSP) is the coordinator process for two groups of parallel execution processes (PX) that work concurrently to read, prepare, build, and apply completed SQL transactions from the archived redo logs sent from the primary database.  The first group of PX processes reads log files and extracts the SQL statements by using LogMiner technology; the second group of PX processes applies these extracted SQL transactions to the logical standby database.  The mining and applying occur in parallel.  A logical standby database does not use standby online redo logs.  A logical standby database does not have FAL capabilities.  All gaps are resolved by the proactive gap resolution mechanism running on the primary, which polls the standby to see if it has a gap.  

In addition to data protection and availability, Data Guard standby databases deliver high return on investment by supporting ad-hoc queries, reporting, backups, or test activity, while in standby role. Specifically:
- The Active Data Guard option enables a physical standby database to be used for read-only applications while simultaneously receiving updates from the primary database. Queries executed on an active standby database return up-to-date results. An Active Data Guard standby database is unique compared to other physical replication methods in its ability to guarantee the same level of read consistency as the primary database while replication is active. More information HERE
- Snapshot Standby enables a physical standby database to be open read-write for testing or any activity that requires a read-write replica of production data. A Snapshot Standby continues to receive, but not apply, updates generated by the primary. When testing is complete, the Snapshot Standby is converted back into a synchronized physical standby database by first discarding the changes made while open read-write, and then applying the redo received from the primary database. Primary data is protected at all times. More information HERE
- A physical standby database, because it is an exact replica of the primary database, can also be used to offload the primary database of the overhead of performing backups. All recovery operations are interchangeable between primary and Data Guard physical standby databases.
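
As a sketch of the Snapshot Standby conversion described above (assuming 11g or later with a flash recovery area configured on the standby):

-- On the physical standby: stop redo apply and convert to a snapshot standby
ALTER DATABASE RECOVER MANAGED STANDBY DATABASE CANCEL;
ALTER DATABASE CONVERT TO SNAPSHOT STANDBY;
ALTER DATABASE OPEN;
-- ... run read-write tests ...
-- When testing is complete, convert back and resume redo apply
SHUTDOWN IMMEDIATE
STARTUP MOUNT
ALTER DATABASE CONVERT TO PHYSICAL STANDBY;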



Data Protection Modes
Maximum Protection: It offers the highest level of data availability for the primary database. 
In order to provide this level of protection, the redo data needed to recover each transaction must be written to both the local (online) redo log and to a standby redo log on at least one standby database before the transaction can be committed. In order to guarantee no loss of data, the primary database will shut down if a fault prevents it from writing its redo data to at least one remote standby redo log.
Redo records are synchronously transmitted from the primary database to the standby database using the LGWR process.  A transaction is not committed on the primary database until it has been confirmed that the transaction data is available on at least one standby database.  This mode is usually configured with at least two standby databases.  Standby online redo logs are required in this mode.  Therefore, a logical standby database cannot participate in a maximum protection configuration. The log_archive_dest_n parameter needs to have the LGWR SYNC AFFIRM option, for example:
log_archive_dest_2='service=testdb_standby LGWR SYNC AFFIRM'

Maximum Availability: Provides the highest level of data protection that is possible without affecting the availability of the primary database.
This protection mode is very similar to maximum protection where a transaction will not commit until the redo data needed to recover that transaction is written to both the local (online) redo log and to at least one remote standby redo log.
Redo records are synchronously transmitted from the primary database to the standby database using the LGWR process.  Unlike maximum protection mode, however, the primary database will not shut down if a fault prevents it from writing its redo data to a remote standby redo log. Instead, the primary database will operate in maximum performance mode until the fault is corrected and all log gaps have been resolved. After all log gaps have been resolved, the primary database automatically resumes operating in maximum availability mode. This protection mode supports both physical and logical standby databases. Standby online redo logs are required in this mode. The log_archive_dest_n parameter needs to have the LGWR SYNC AFFIRM option, for example:
log_archive_dest_2='service=testdb_standby LGWR SYNC AFFIRM'


Maximum Performance: It is the default protection mode. 
It offers slightly less primary database protection than maximum availability mode but with higher performance.  Redo logs are asynchronously shipped from the primary database to the standby database using either LGWR or ARCH process.  When operating in this mode, the primary database continues its transaction processing without regard to data availability on any standby databases and there is little or no effect on performance. It supports both physical and logical standby databases. The log_archive_dest_n parameter needs to have the LGWR ASYNC AFFIRM or NOAFFIRM option, for example:
log_archive_dest_2='service=testdb_standby ARCH NOAFFIRM'
or
log_archive_dest_2='service=testdb_standby LGWR ASYNC NOAFFIRM'

 

Mode: Maximum Protection (zero data loss, double failure protection)
  Log writing process:        LGWR
  Network transmission mode:  SYNC
  Disk write option:          AFFIRM
  Redo log reception option:  Standby redo logs are required
  Supported on:               Physical standby databases.
  Commit behavior:            Commit success is signaled to the application only after acknowledgement is received from a standby database that the redo for that transaction is hardened to disk.

Mode: Maximum Availability (zero data loss, single failure protection)
  Log writing process:        LGWR
  Network transmission mode:  SYNC
  Disk write option:          AFFIRM
  Redo log reception option:  Standby redo logs
  Supported on:               Physical and logical standby databases.
  Commit behavior:            Commit success is signaled to the application only after acknowledgement is received from a standby database or after the NET_TIMEOUT threshold period expires, whichever occurs first.

Mode: Maximum Performance (potential for minimal data loss)
  Log writing process:        LGWR or ARCH
  Network transmission mode:  ASYNC
  Disk write option:          NOAFFIRM
  Redo log reception option:  Standby redo logs
  Supported on:               Physical and logical standby databases.
  Commit behavior:            The primary never waits for standby acknowledgement to signal commit success to the application.

 

Setting the Level of Protection
This is achieved by using the following command syntax executed on the primary database.
ALTER DATABASE SET STANDBY DATABASE TO MAXIMIZE { PROTECTION | AVAILABILITY | PERFORMANCE };

The protection mode can be found by executing this query. PERFORMANCE is the default.
SELECT name, protection_mode, protection_level FROM v$database;

NAME      PROTECTION_MODE      PROTECTION_LEVEL
--------- -------------------- --------------------
STBY      MAXIMUM PERFORMANCE  MAXIMUM PERFORMANCE

- No Data Loss Mode. The PROTECTION mode applies only to physical standby databases, uses LGWR SYNC, and will shut down the primary database if no standby database can be written to.
- Minimal Data Loss. The AVAILABILITY mode prevents a transaction from committing on the primary until all redo entries are written to at least one standby database. SYNC transport is required, and this option is available to both logical and physical standby databases. Unlike PROTECTION mode, which shuts down the primary database in the event of failure to pass redo entries to the standby, this mode simply lowers the protection level to PERFORMANCE until the error is corrected.
- No Data Divergence. PERFORMANCE mode is the default setting and is available for both physical and logical standby databases. A transaction will commit on the primary before all redo entries are written to any standby database.

To ensure that minimal data loss will be encountered, execute this command on the primary database. The database must be mounted in exclusive mode to execute this command.
ALTER DATABASE SET STANDBY DATABASE TO MAXIMIZE AVAILABILITY;


Terms or Options to know
AFFIRM assures that archive logs are written to disk, primary or standby.
MANDATORY assures that redo logs are not overwritten until archive logs are successfully created. This should only apply to the primary database.
REOPEN=30 means that there will be a 30-second delay before the LGWR process tries again on a MANDATORY destination which failed.
DELAY is in minutes and does not delay the copy of an archive log file to the standby server; it delays the application of redo on the standby after the archive log has been copied there. This will not help primary database performance.
• Using ARCH instead of LGWR for the second standby database may help primary database performance, but smaller log files would probably be required. SYNC=PARALLEL applies to LGWR only. With ARCH, redo is copied and applied at a log switch on the primary; with LGWR, entries are copied to a standby archive log as they are generated but applied only at the switch. LGWR is more efficient since it writes redo entries to all standby databases at once, but a primary and two standby databases could possibly cause a performance issue for the primary database (possibly, but unlikely!). Additionally, multiple archiver processes can be created on the primary database. Increase the value of the LOG_ARCHIVE_MAX_PROCESSES parameter to start additional archiver processes. The default on my machine appears to be 2 and not 1 as stated in the manuals, probably because I have two standby databases.
• The ARCHIVE_LAG_TARGET parameter can be used to increase the frequency of log switches, sending smaller amounts of redo to the standby databases more often. It specifies the maximum number of seconds between log switches, forcing a log switch when that interval is reached. Used on the physical implementation only (see the example below).
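
For illustration, these parameters could be adjusted as follows (a sketch only; the values and the DR_FGUARD service name are examples taken from the configuration later in this document, not recommendations):

-- Example settings only; tune for your own environment
ALTER SYSTEM SET log_archive_max_processes = 4 SCOPE=BOTH;
ALTER SYSTEM SET archive_lag_target = 1800 SCOPE=BOTH;   -- force a log switch at least every 30 minutes
ALTER SYSTEM SET log_archive_dest_2 = 'SERVICE=DR_FGUARD LGWR ASYNC NOAFFIRM REOPEN=60' SCOPE=BOTH;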



Using Data Guard Redo Apply in a LAN the following is recommended:
• Use Maximum Protection or Maximum Availability modes for zero data loss; the performance impact was less than 3% in all synchronous tests. With a single remote archive destination, use the NOPARALLEL option (“lgwr sync=noparallel”).
• For very good performance and a minimal risk of transaction loss in the event of a disaster, use Maximum Performance mode, with LGWR ASYNC and a 10 MB async buffer (ASYNC=20480). LGWR ASYNC performance degraded no more than 1% as compared to using the ARCH transport. LGWR ASYNC also bounds the risk of potential transaction loss much better than the ARCH transport. The 10 MB async buffer outperformed smaller buffer sizes and reduced the chance of network timeout errors in a high latency / low bandwidth network.


Metropolitan and Wide Area Network (WAN)
Data Guard is used across metropolitan area networks (MANs) or WANs to get complete disaster recovery protection. Typically a MAN covers a large metropolitan area and has network Round-Trip-Times (RTT) from 2-10 ms. For the MAN/WAN tests, different network RTTs were simulated during testing to measure the impact of the RTT on primary database performance. The tests were conducted for the following RTTs: 2 ms (MAN), 10 ms, 50 ms, and 100 ms (WAN). Additionally, tests using Secure Shell (SSH) port forwarding with compression were also done for different RTTs.
Best practices recommendations are:
• Use Maximum Protection and Maximum Availability modes over a MAN for zero data loss. For these modes, the network RTT overhead over a WAN can impact response time and throughput of the primary database. The performance impact was less than 6% with a 10 ms network RTT and a high transaction rate.
• For very good performance and a minimal risk of transaction loss in the event of a disaster, use Maximum Performance mode, with LGWR ASYNC and a 10 MB async buffer (ASYNC=20480). LGWR SYNC performance degraded no more than 2% as compared to remote archiving. The 10 MB async buffer outperformed smaller buffer sizes and reduced the chance of network timeout errors in a high latency / low bandwidth network.
• For optimal primary database performance throughput, use remote archiving (i.e. the ARCH process as the log transport). This configuration is best used when network bandwidth is limited and when your applications can risk some transaction loss in the event of a disaster.
• If you have sufficient memory, then set the TCP send and receive buffer sizes (these affect the advertised TCP window sizes) to the bandwidth delay product, the bandwidth times the network round trip time. This can improve transfer time to the standby by as much as 10 times, especially with the ARCH transport.


Best Practices for Network Configuration and Highest Network Redo Rates
Set SDU=32768 (32K) for the Oracle Net connections between the primary and standby. Setting the Oracle Net session data unit (SDU) to its maximum setting of 32K resulted in a 5% throughput improvement over the default setting of 2048 (2K) for the LGWR ASYNC transport and a 10% improvement for the LGWR SYNC transport. SDU designates the size of the Oracle Net buffer used to collect data before it is delivered to the TCP network layer for transmission across the network. Oracle internal testing of Oracle Data Guard has demonstrated that the maximum setting of 32767 performs best. The gain in performance is a result of the reduced number of system calls required to pass the data from Oracle Net buffers to the operating system TCP network layer. SDU can be set on a per-connection basis with the SDU parameter in the local naming configuration file (tnsnames.ora) and the listener configuration file (listener.ora), or SDU can be set for all Oracle Net connections with the profile parameter DEFAULT_SDU_SIZE in the sqlnet.ora file. This is especially true for WAN environments.
Use SSH port forwarding with compression for WANs with a large RTT when using maximum performance mode. Do not use SSH with compression for Maximum Protection and Maximum Availability modes since it adversely affects primary throughput. Using SSH port forwarding with compression reduced the network traffic by 23-60% at a 3-6% increase in CPU usage. This also eliminated network timeout errors. With the ARCH transport, using SSH also reduced the log transfer time for RTTs of 50 ms or greater. For RTTs of 10 ms or less, the ARCH transport log transfer time increased when using SSH with compression.
Ensure TCP.NODELAY is YES
To preempt delays in buffer flushing in the TCP protocol stack, disable the TCP Nagle algorithm by setting TCP.NODELAY to YES in the SQLNET.ORA file on both the primary and standby systems.
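
A sketch of the corresponding Oracle Net settings (the host and service names are the illustrative ones used elsewhere in this document):

# sqlnet.ora on both primary and standby
DEFAULT_SDU_SIZE = 32767
TCP.NODELAY = YES

# tnsnames.ora entry on the primary, with a per-connection SDU
DR_FGUARD =
  (DESCRIPTION =
    (SDU = 32767)
    (ADDRESS = (PROTOCOL = TCP)(HOST = server2)(PORT = 1521))
    (CONNECT_DATA =
      (SERVICE_NAME = FGUARD)
    )
  )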


Physical Standby Implementation

NOTE = If you want to use the new way to setup Data Guard with RMAN, please check the following LINK

The following example shows how to set up Data Guard MANUALLY in this given environment:
1.  The production database name is FGUARD
2.  One primary database instance called FGUARD on host server_01; one physical standby database instance called FGUARD on host server_02.
3. Listener LISTENER is on host server_01, and is pointed to by TNS entry FGUARD (PROD_FGUARD from the standby site).
4. Listener LISTENER is on host server_02, and is pointed to by TNS entry FGUARD (DR_FGUARD from the primary site).
5. The TNS entries DR_FGUARD and PROD_FGUARD are used by the LGWR/ARCH process to ship redo logs to the standby site, and by the FAL process to fetch redo logs from the primary site.
6. We will be implementing the configuration manually, but we will set dg_broker_start to true, so Data Guard broker can be used later.
7. The protection mode is set to maximum performance.  Therefore, only the local archive destination (log_archive_dest_1) is set to mandatory; the standby archive destination (log_archive_dest_2) is set to optional for the LGWR process, with a network transmission method of asynchronous and a disk write option of no affirm.
8. The standby site is not using standby redo logs.  Therefore, the redo log reception option is archived logs.


Section 1:   Site Information
Primary Site:
Database Name: FGUARD
Primary Server  : server_01
Primary Instance Name: FGUARD
Primary Listener: LISTENER
Recovery Database: DR_FGUARD

Standby Site:
Database Name: FGUARD
Standby Server: server_02
Standby Instance name: FGUARD
Standby Listener: LISTENER
Production DB: PROD_FGUARD



Section 2:   Oratab /etc/oratab entry:
Primary Site:   FGUARD:/u01/app/oracle/product/11.2.0:Y
Standby Site:   FGUARD:/u01/app/oracle/product/11.2.0:N


Section 3:  Parameter file

Primary init.ora file:

archive_lag_target = 1800  #specifies how often (in seconds) we will force a log switch
db_name                   = FGUARD
db_unique_name            = FGUARD
#
fal_server                = DR_FGUARD   #PROD DB used on tnsnames.ora
#fal_client               = FGUARD      #this DB used on tnsnames.ora
log_archive_dest_1        = 'LOCATION=/u02/arch/PROD MANDATORY'   #Local Location of Archive Log Files
log_archive_dest_2        = 'SERVICE=DR_FGUARD reopen=60'         #Remote Service Name based on tnsnames.ora
log_archive_dest_state_1  = 'enable'
log_archive_dest_state_2  = 'enable'
log_archive_format        = 'arch_t%t_s%s.dbf'
log_archive_start         = true      (not used on 10g)
standby_archive_dest      = '/oracle/arch'
standby_file_management   ='AUTO'    #If AUTO, datafiles added on the primary are automatically created on the standby; if MANUAL, they must be created manually on the standby.
dg_broker_start           = true

New Ones
service_names = FGUARD
instance_name = FGUARD
log_archive_config="dg_config=(FGUARD,DR_FGUARD)"   -->This parameter is required by the Data Guard Broker
log_archive_max_processes=5
log_archive_dest_1='location=D:\oracle\product\10.1.0\flash_recovery_area\FGUARD\ARCHIVELOG valid_for=(ALL_LOGFILES,ALL_ROLES) db_unique_name=FGUARD'
log_archive_dest_2= 'service=standby LGWR SYNC AFFIRM valid_for=(ONLINE_LOGFILES,PRIMARY_ROLE) DB_UNIQUE_NAME=DR_FGUARD'
standby_archive_dest=D:\oracle\product\10.1.0\flash_recovery_area\PRIMARY\ARCHIVELOG


Optional parameters:
log_archive_dest_2='service=stby lgwr sync affirm mandatory reopen=180'
LOG_ARCHIVE_DEST_2 - Specifies the net service name of the standby database (check tnsnames.ora on the primary database). For each destination you can use either LGWR or ARCH (or both); due to network traffic it is advised to use LGWR for at most one remote destination. The network transmission mode (SYNC or ASYNC) also has to be specified in case primary database modifications are propagated by the LGWR. A NO DATA LOSS configuration demands SYNC mode: control is not returned to the executing application or user until the redo information is received by the standby site (this can have an impact on performance, as mentioned).


Standby init.ora file:
db_name                   = FGUARD
db_unique_name            = DR_FGUARD  --> MUST BE DIFFERENT FROM PRIMARY SITE
fal_server                = PROD_FGUARD  #PROD DB used on tnsnames.ora
fal_client                = FGUARD       #this DB used on tnsnames.ora
log_archive_dest_1        = 'LOCATION=/oracle/arch MANDATORY'  #This parameter should always coincide with the standby_archive_dest parameter
log_archive_dest_state_1  = 'enable'
#log_archive_dest_2       = 'SERVICE=PROD_FGUARD reopen=60'
#log_archive_dest_state_2 = 'enable'
log_archive_format        = 'arch_t%t_s%s.dbf'
log_archive_start         = true      (not used on 10g)
standby_archive_dest      = '/oracle/arch'   #This parameter should always coincide with the log_archive_dest_1 parameter
standby_file_management   ='AUTO'
dg_broker_start           = true

New Ones
service_names = 'DR_FGUARD'
instance_name = DR_FGUARD
control_files='D:\oradata\PRIMARY_STDBY.CTL'
log_archive_config="dg_config=(FGUARD,DR_FGUARD)"  -->This parameter is required by the Data Guard Broker
log_archive_max_processes=5
log_archive_dest_1='location=D:\oracle\product\10.1.0\flash_recovery_area\PRIMARY\ARCHIVELOG valid_for=(ALL_LOGFILES,ALL_ROLES) db_unique_name=DR_FGUARD'
log_archive_dest_2='service=FGUARD LGWR SYNC AFFIRM valid_for=(ONLINE_LOGFILES,PRIMARY_ROLE) DB_UNIQUE_NAME=FGUARD'
standby_archive_dest=D:\oracle\product\10.1.0\flash_recovery_area\PRIMARY\ARCHIVELOG

Optional Parameters:
db_file_name_convert=('/disk1/oracle/oradata/payroll/','/disk1/oracle/oradata/payroll/standby/')
or
db_file_name_convert=('/PROD2/','/PROD')

log_file_name_convert=('/disk1/oracle/oradata/payroll/','/disk1/oracle/oradata/payroll/standby/')

or
log_file_name_convert=('/PROD2/','/PROD')
DB_FILE_NAME_CONVERT - Specifies the location of datafiles on the standby database. The two arguments that this parameter needs are: the location of datafiles on the primary database, and the location of datafiles on the standby database. This parameter converts the filenames of the primary database datafiles to the filenames of the standby datafiles. If the standby database is on the same system as the primary database, or if the directory structure where the datafiles are located on the standby site is different from the primary site, then this parameter is required. See Section 3.2.1 for the location of the datafiles on the primary database. Used on Physical Implementation ONLY.
LOG_FILE_NAME_CONVERT - Specifies the location of redo logfiles on the standby database. The two arguments that this parameter needs are: the location of redo logfiles on the primary database, and the location of redo logfiles on the standby database. This parameter converts the filenames of the primary database logs to the filenames of the standby logs. If the standby database is on the same system as the primary database, or if the directory structure where the logs are located on the standby site is different from the primary site, then this parameter is required. Used on Physical Implementation ONLY.


Section 4:  Tnsnames.ora file
Primary Site:
FGUARD =
  (DESCRIPTION =
    (ADDRESS_LIST =
      (ADDRESS = (PROTOCOL = TCP)(HOST = server1)(PORT = 1521))
    )
    (CONNECT_DATA =
      (SERVER = DEDICATED)
      (SERVICE_NAME = FGUARD)
    )
  )

DR_FGUARD =
  (DESCRIPTION =
    (ADDRESS_LIST =
      (ADDRESS = (PROTOCOL = TCP)(HOST = server2)(PORT = 1521))
    )
    (CONNECT_DATA =
      (SERVER = DEDICATED)
      (SERVICE_NAME = FGUARD)
    )
  )

Standby Site:
FGUARD=
  (DESCRIPTION =
    (ADDRESS_LIST =
      (ADDRESS = (PROTOCOL = TCP)(HOST = server2)(PORT = 1521))
    )
    (CONNECT_DATA =
      (SERVER = DEDICATED)
      (SERVICE_NAME = FGUARD)
    )
  )

PROD_FGUARD =
  (DESCRIPTION =
    (ADDRESS_LIST =
      (ADDRESS = (PROTOCOL = TCP)(HOST = server1)(PORT = 1521))
    )
    (CONNECT_DATA =
      (SERVER = DEDICATED)
      (SID = FGUARD)
    )
  )

Steps to Set up a PHYSICAL Data Guard Environment
The following steps show how to set up a Physical Data Guard environment:

Step 1: The Preparation

- Ensure primary database is in ARCHIVELOG mode
If Archive Log Mode is not enabled for your primary database, enable it using the following. First, you will need to define the following initialization parameters:
alter system set log_archive_dest_1 = 'LOCATION=/oracle/arch MANDATORY' scope=both;
alter system set log_archive_dest_state_1 = 'enable' scope=both;
alter system set log_archive_format = 'arch_t%t_s%s.dbf' scope=both;
shutdown immediate;
startup mount;
alter database archivelog;
alter database open;
archive log list;

- Enable Forced Logging. This option ensures that even in the event that a 'nologging' operation is done, force logging takes precedence and all operations are logged into the redo logs.
alter database force logging;
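
You can confirm that force logging is enabled with a quick check (FORCE_LOGGING should return YES):

select force_logging from v$database;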

Create standby redo log files (recommended):
Standby redo logs are necessary for the higher protection levels such as Guaranteed, Instant, and Rapid.  In these protection modes LGWR from the Primary host writes transactions directly to the standby redo logs. This enables no data loss solutions and reduces the amount of data loss in the event of failure.  Standby redo logs are not necessary if you are using the delayed protection mode.
If you configure standby redo logs on the standby, then you should also configure standby redo logs on the primary database. Even though the standby redo logs are not used while the database is running in the primary role, configuring them on the primary database is recommended in preparation for an eventual switchover operation.
Standby redo logs must be archived before the data can be applied to the standby database. The standby archival operation occurs automatically, even if the standby database is not in ARCHIVELOG mode. However, the archiver process must be started on the standby database. Note that the use of the LGWR transport on the primary is a requirement for a standby redo log to be selected.
You must have the same number of standby redo logs on the standby as you have online redo logs on production. They must also be exactly the same size.

select * from v$logfile;
GROUP#     STATUS  TYPE    MEMBER                                                                 IS_
---------- ------- ------- ---------------------------------------------------------------------- ---
1                  ONLINE  /export/home/oracle/temp/oracle/data/redo01.log                        NO
2                  ONLINE  /export/home/oracle/temp/oracle/data/redo02.log                        NO
3                  ONLINE  /export/home/oracle/temp/oracle/data/redo03.log                        NO

select bytes from v$log;

BYTES
----------
52428800
52428800
52428800

The following syntax is used to create standby redo logs:
SQL> alter database add standby logfile GROUP 4 size 50m;
SQL> alter database add standby logfile GROUP 5 size 50m;
SQL> alter database add standby logfile GROUP 6 size 50m;

select * from v$logfile;
GROUP#     STATUS  TYPE    MEMBER                                                                 IS_
---------- ------- ------- ---------------------------------------------------------------------- ---
1                  ONLINE  /export/home/oracle/temp/oracle/data/redo01.log                        NO
2                  ONLINE  /export/home/oracle/temp/oracle/data/redo02.log                        NO
3                  ONLINE  /export/home/oracle/temp/oracle/data/redo03.log                        NO
4                  STANDBY /export/home/oracle/temp/oracle/data/standbyredo01.dbf                 NO
5                  STANDBY /export/home/oracle/temp/oracle/data/standbyredo02.dbf                 NO
6                  STANDBY /export/home/oracle/temp/oracle/data/standbyredo03.dbf                 NO


select * from v$standby_log;
GROUP#   DBID        THREAD# SEQUENCE#  BYTES     USED  ARC STATUS FIRST_CHANGE# FIRST_TIM LAST_CHANGE# LAST_TIME
-------- ----------- ------- ---------- --------- ----- ------------------------------------------------------------------
4        UNASSIGNED  0       0          52428800  512   YES UNASSIGNED 0 0
5        UNASSIGNED  0       0          52428800  512   YES UNASSIGNED 0 0
6        UNASSIGNED  0       0          52428800  512   YES UNASSIGNED 0 0

·         Setup the init.ora file for both primary and standby databases. (see section 3)
 NOTE: In the above example db_file_name_convert and log_file_name_convert are not needed as the directory structure on the two hosts is the same. If the directory structure is not the same, then setting these parameters is recommended. Please reference ML notes 47325.1 and 47343.1 for further information.
Note that the Primary init.ora kept on the Standby host must have log_archive_dest_2 use the alias that points to the Primary host, and the Standby init.ora kept on the Primary host must have fal_server and fal_client use the appropriate aliases for when the standby is running on the Primary host.
·         Setup the tnsnames.ora and listener.ora file for both primary and standby databases. (see section 4)

Step 2: Backup  the primary Database Datafiles

·         Shut down the primary database.
·         Backup the primary database datafiles and online redo logs. A backup of the online redo logs is necessary to facilitate switchover.
$ cp /u02/oradata/prod/* /oracle/BCKUP
or
set pages 50000 lines 120 head off veri off flush off ti off
select 'cp ' || file_name || ' /oracle/BCKUP' from dba_data_files
UNION
select 'cp ' || file_name || ' /oracle/BCKUP' from dba_temp_files
UNION
select 'cp ' || member || ' /oracle/BCKUP' from v$logfile;

Step 3: Create the Physical standby Database Control File

·         Startup the Primary database and issue the following command to create the standby control file (it must be done AFTER the backup):
SQL> alter database create standby controlfile as '/oracle/BCKUP/standby.ctl';
NOTE: The controlfile must be created after the last time stamp for the backup datafiles.

Step 4: Transfer the Datafiles and Standby Control File to the Standby Site

·         Transfer the backed-up datafiles, redo log files and archived redo logs to the standby site.
$ rcp  /oracle/BCKUP/* server2:/u02/oradata/FGUARD
·         Transfer the standby control file to the standby site
$ rcp  /oracle/BCKUP/standby.ctl server2:/u01/app/oracle/admin/FGUARD/ctl/control01.ctl

·         Also copy the orapwd file and init.ora files. A password file must be created on the Primary and copied over to the Standby site. The sys password must be identical on both sites. This is a key prerequisite in order to be able to ship and apply archived logs from Primary to Standby. If you need to generate a password file perform the following:
cd $ORACLE_HOME/dbs
orapwd file=orapwFGUARD password=oracle force=y ignorecase=y

If needed perform the following on the other system:
chmod 6751 orapwSID
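
As a quick check on either site, you can verify that SYS appears in the password file:

select * from v$pwfile_users;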

·         Put all of them in the correct places

Step 5: Start the Listeners on both Primary and Standby Site

·         If the standby system is running on a Windows-based system, use the ORADIM utility to create a Windows Service and password file. For example:
     WINNT> oradim -NEW -SID databaseid -INTPWD password -STARTMODE manual

·         Start the listener on the primary and standby database
$ lsnrctl start


Step 6: Start the Standby Database (Primary Database already running)

·         Set the correct Oracle environment and copy all the files to each location
If the standby is on a separate site with the same directory structure as the primary database then you can use the same path names for the standby files as the primary files. In this way, you do not have to rename the primary datafiles in the standby control file.
If the standby is on the same site as the primary database, or the standby database is on a separate site with a different directory structure, then you must rename the primary datafiles in the standby control file after copying them to the standby site. This can be done using the db_file_name_convert and log_file_name_convert parameters or manually using alter database statements. If the directory structure is not the same then reference notes 47325.1 and 47343.1 for further information.
If you decide to rename them manually, you MUST use ALTER DATABASE RENAME FILE <oldname> TO <newname> after the standby database is mounted to rename the database files and redo log files.
If needed, copy the Standby Controlfile that you created FROM the production DB to the appropriate location on the standby DB according to your init.ora file
$ cd
$ cp standby.ctl /u03/app/oradata/FGUARD/control01.ctl
$ cp standby.ctl /u04/app/oradata/FGUARD/control02.ctl
$ cp standby.ctl /u05/app/oradata/FGUARD/control03.ctl

·         Connect as sysdba.
create spfile from pfile;
·         Bring the database in nomount mode first.
startup nomount;
·         Mount the standby database.
alter database mount standby database;

If you decided to rename the database files manually, you MUST use ALTER DATABASE RENAME FILE <oldname> TO <newname> after the standby database is mounted.

Step 7: Place the Standby Database in Managed Recovery Mode

·         Issue the following command to bring the standby database in managed recover mode (start log apply services).
alter database recover managed standby database disconnect from session;
NOTE: The example includes the DISCONNECT FROM SESSION option so that log apply services run in a background session.
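
If standby redo logs have been configured, an alternative (assuming 10g or later) is real-time apply, which applies redo as it arrives in the standby redo logs instead of waiting for a log switch:

alter database recover managed standby database using current logfile disconnect from session;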


Step 8: Monitor the Log Transport services and Log Apply Services

·         With the protection mode we are using, Maximum Performance, archiving of redo logs to the remote standby location does not occur until after a log switch. A log switch occurs, by default, when an online redo log becomes full. Issue a few log switches on the primary database.
SQL> alter system switch logfile;
or
SQL> alter system archive log current;

·         Confirm the log files received on the standby archive destination.
·         Check the standby alert log file to see if the new logs have applied to the standby database.
Media Recovery Log /u02/arch/FGUARD/prod_1482.arc
·         Monitor the managed recovery.
select process, status, thread#, sequence#, block#, blocks
     from   v$managed_standby;
or
select sequence#, first_time, next_time
    from v$archived_log order by sequence#;

·         Verify that the new archived redo log was applied

From the standby database, query the V$ARCHIVED_LOG view to verify the archived redo log was applied.
select sequence#, archived, applied
    from v$archived_log order by sequence#;

 SEQUENCE# ARCHIVED APPLIED
---------- -------- -------
       115 YES      YES
       116 YES      YES




Logical Standby Sanity Check

Detect Database Role
select name, database_role from v$database;


Queries for PRIMARY Database
- Show Thread, Archived, Applied
select DEST_ID, THREAD#, SEQUENCE#, ARCHIVED, APPLIED, COMPLETION_TIME from v$archived_log where DEST_ID = 2 order by SEQUENCE#;

  DEST_ID    THREAD#  SEQUENCE# ARC APP COMPLETION_TIME

--------- ---------- ---------- --- --- ------------------
        2          1       1350 YES YES 17/JAN/11 21:35:13
        2          1       1351 YES YES 17/JAN/11 22:03:06
        2          1       1352 YES YES 17/JAN/11 22:17:30
        2          1       1353 YES NO  17/JAN/11 22:37:10
        2          1       1354 YES NO  17/JAN/11 22:37:11
        2          1       1355 YES NO  17/JAN/11 22:44:21
        2          1       1356 YES NO  18/JAN/11 07:45:09


On STANDBY database
ALTER SESSION SET NLS_DATE_FORMAT = 'DD-MON-YY HH24:MI:SS';

- Verify the archived redo log files were registered.
SELECT SEQUENCE#, FIRST_TIME, NEXT_TIME, DICT_BEGIN, DICT_END, TIMESTAMP FROM DBA_LOGSTDBY_LOG ORDER BY SEQUENCE#;

SEQUENCE# FIRST_TIME         NEXT_TIME          DIC DIC TIMESTAMP

--------- ------------------ ------------------ --- --- ------------------
     1353 17-JAN-11 22:17:14 17-JAN-11 22:36:33 NO  NO  17-JAN-11 22:37:14
     1354 17-JAN-11 22:36:33 17-JAN-11 22:37:06 NO  NO  17-JAN-11 22:37:14
     1355 17-JAN-11 22:37:06 17-JAN-11 22:44:19 NO  NO  17-JAN-11 22:44:24
     1356 17-JAN-11 22:44:19 18-JAN-11 07:44:23 NO  NO  18-JAN-11 07:45:12


- Show Threads Progress, Determine how much progress was made through the available logs
SELECT L.SEQUENCE#, L.FIRST_TIME,
       (CASE WHEN L.NEXT_CHANGE# < P.READ_SCN THEN 'YES'
             WHEN L.FIRST_CHANGE# < P.APPLIED_SCN THEN 'CURRENT'
             ELSE 'NO' END) APPLIED
FROM DBA_LOGSTDBY_LOG L, DBA_LOGSTDBY_PROGRESS P
ORDER BY SEQUENCE#;

 SEQUENCE# FIRST_TIME         APPLIED

---------- ------------------ -------
      1353 17-JAN-11 22:17:14 CURRENT
      1354 17-JAN-11 22:36:33 NO
      1355 17-JAN-11 22:37:06 NO
      1356 17-JAN-11 22:44:19 NO


- Verify Current SCN to see if all log file information was applied
SELECT APPLIED_SCN, NEWEST_SCN FROM DBA_LOGSTDBY_PROGRESS;

APPLIED_SCN NEWEST_SCN

----------- ----------
 2998474027 2998474027     


- Show File Location that I received and its status
SELECT FILE_NAME, SEQUENCE#, FIRST_CHANGE#, NEXT_CHANGE#, TIMESTAMP, DICT_BEGIN, DICT_END, THREAD# FROM
DBA_LOGSTDBY_LOG ORDER BY SEQUENCE#;

FILE_NAME

--------------------------------------------------------------------------------
 SEQUENCE# FIRST_CHANGE# NEXT_CHANGE# TIMESTAMP          DIC DIC    THREAD#
---------- ------------- ------------ ------------------ --- --- ----------
C:\ORACLE\PRODUCT\10.2.0\ORADATA\WINORADB_ARCH_STDBY_REDO_LOG\ARCH_001_01353_697041245.LOG
      1353    2998447810   2998451537 17-JAN-11 22:37:14 NO  NO           1

C:\ORACLE\PRODUCT\10.2.0\ORADATA\WINORADB_ARCH_STDBY_REDO_LOG\ARCH_001_01354_697041245.LOG
      1354    2998451537   2998451552 17-JAN-11 22:37:14 NO  NO           1

C:\ORACLE\PRODUCT\10.2.0\ORADATA\WINORADB_ARCH_STDBY_REDO_LOG\ARCH_001_01355_697041245.LOG
      1355    2998451552   2998451783 17-JAN-11 22:44:24 NO  NO           1

C:\ORACLE\PRODUCT\10.2.0\ORADATA\WINORADB_ARCH_STDBY_REDO_LOG\ARCH_001_01356_697041245.LOG
      1356    2998451783   2998472682 18-JAN-11 07:45:12 NO  NO           1


- Show what the Standby is doing, Inspect the process activity for SQL apply operations
column status format a50
column type format a12
select type, high_scn, status from v$logstdby;
SELECT substr(TYPE,1,10) Type, HIGH_SCN, substr(STATUS,1,55) FROM V$LOGSTDBY;

TYPE           HIGH_SCN STATUS
------------ ---------- --------------------------------------------------
COORDINATOR  2998474028 ORA-16116: no work available
READER       2998474028 ORA-16240: Waiting for logfile (thread# 1, sequence# 1358)
BUILDER      2998474025 ORA-16116: no work available
PREPARER     2998474024 ORA-16116: no work available
ANALYZER     2998474025 ORA-16117: processing
APPLIER      2998474005 ORA-16116: no work available
APPLIER      2998474025 ORA-16116: no work available
APPLIER      2998473973 ORA-16116: no work available
APPLIER      2998473978 ORA-16116: no work available


- Show problematic Rows or Events
SELECT EVENT_TIME, STATUS, EVENT FROM DBA_LOGSTDBY_EVENTS ORDER BY EVENT_TIME, COMMIT_SCN;


- Check Coordinator Status
SELECT substr(NAME,1,20) Name, substr(VALUE,1,30) value
FROM V$LOGSTDBY_STATS
WHERE NAME LIKE 'coordinator%' or NAME LIKE 'transactions%';


NAME                 VALUE

-------------------- -------------------
coordinator state    IDLE
transactions ready   1165
transactions applied 1165
coordinator uptime   285



Troubleshooting a Logical Standby

Setting up a Skip Handler for a DDL Statement
The DB_FILE_NAME_CONVERT initialization parameter is not honored once a physical standby database is converted to a logical standby database.
This can be a problem, for example, when adding a non-OMF datafile to the primary database and the datafile paths are different between the primary and standby. This section describes the steps necessary to register a skip handler and provide SQL Apply with a replacement DDL string to execute by converting the path names of the primary database datafiles to the standby datafile path names.

This may or may not be a problem for everyone. For example, if you are using Oracle Managed Files (OMF), SQL Apply will successfully execute DDL statements generated from the primary to CREATE and ALTER tablespaces and their associated system generated path name on the logical standby.

On PRIMARY Database using Oracle Managed Files (OMF)

show parameter db_create_file_dest
NAME                       TYPE        VALUE
-------------------------- ----------- ----------------
db_create_file_dest        string      /u02/oradata

create tablespace data2 datafile size 5m;


On LOGICAL Standby using Oracle Managed Files (OMF)
show parameter db_create_file_dest
NAME                       TYPE        VALUE
-------------------------- ----------- ----------------
db_create_file_dest        string      /u02/oradata

----------------------- alert.log -----------------------
Wed Jan 12 18:45:28 EST 2011
Completed: create tablespace data2 datafile size 5m
---------------------------------------------------------

select tablespace_name, file_name
from dba_data_files
where tablespace_name = 'DATA2';

TABLESPACE_NAME    FILE_NAME
------------------ ---------------------------------------------------------
DATA2              /u02/oradata/FGUARD/datafile/o1_mf_data2_6lwh8q9d_.dbf


On PRIMARY Database using Oracle Managed Files (OMF)
alter tablespace data2 add datafile '/u05/oradata/MODESTO/data02.dbf' size 5m;

On LOGICAL Standby WITHOUT Oracle Managed Files (OMF)
If on the other hand, you attempt to specify a physical path name in the CREATE/ALTER tablespace statement that does not exist on the logical standby, SQL Apply will not succeed in processing the statement and will fail. Whenever SQL Apply encounters an error while applying a SQL statement, it will stop and provide the DBA with an opportunity to correct the statement and restart SQL Apply.

----------------------- alert.log -----------------------
Wed Jan 12 19:59:36 EST 2011
alter tablespace data2 add datafile '/u05/oradata/MODESTO/data02.dbf' size 5m
Wed Jan 12 19:59:36 EST 2011
ORA-1119 signalled during: alter tablespace data2 add datafile '/u05/oradata/MODESTO/data02.dbf' size 5m...
LOGSTDBY status: ORA-01119: error in creating database file '/u05/oradata/MODESTO/data02.dbf'
ORA-27040: file create error, unable to create file
Linux Error: 2: No such file or directory
LOGSTDBY Apply process P004 pid=31 OS id=28497 stopped
Wed Jan 12 19:59:36 EST 2011
Errors in file /u01/app/oracle/admin/turlock/bdump/turlock_lsp0_28465.trc:
ORA-12801: error signaled in parallel query server P004
ORA-01119: error in creating database file '/u05/oradata/MODESTO/data02.dbf'
ORA-27040: file create error, unable to create file
Linux Error: 2: No such file or directory
LOGSTDBY Analyzer process P003 pid=30 OS id=28495 stopped
LOGSTDBY Apply process P005 pid=32 OS id=28499 stopped
LOGSTDBY Apply process P006 pid=33 OS id=28501 stopped
LOGSTDBY Apply process P007 pid=34 OS id=28503 stopped
LOGSTDBY Apply process P008 pid=35 OS id=28505 stopped
---------------------------------------------------------


select event_timestamp, event, status from dba_logstdby_events;

EVENT_TIMESTAMP               EVENT                          Status
----------------------------- ------------------------------ ------------------------------
12-JAN-11 07.59.36.134349 PM  alter tablespace data2 add dat ORA-01119: error in creating d
                              afile '/u05/oradata/MODESTO/da atabase file '/u05/oradata/MOD
                              ta02.dbf' size 5m              ESTO/data02.dbf'
                                                             ORA-27040: file create error,
                                                             unable to create file
                                                             Linux Error: 2: No such file o
                                                             r directory

Fixing it
1. Disable the database guard for this session so we can modify the logical standby.
alter session disable guard;

2. Issue a compensating transaction or statement on the logical standby. For this example, issue the following command with the correct path:
alter tablespace data2 add datafile '/u05/oradata/FGUARD/data02.dbf' size 5m;

3. Re-enable the database guard for this session.
alter session enable guard;

4. Restart logical apply with a clause that will cause the failed transaction to be automatically skipped.
alter database start logical standby apply immediate skip failed transaction;

5. Verify results.
select tablespace_name, file_name
from dba_data_files
where tablespace_name = 'DATA2';

TABLESPACE_NAME    FILE_NAME
------------------ ---------------------------------------------------------
DATA2              /u02/oradata/FGUARD/datafile/o1_mf_data2_6lwh8q9d_.dbf
DATA2              /u05/oradata/FGUARD/data02.dbf


NOTE
It is possible to avoid errors of this nature on the logical standby database by registering a skip handler and providing SQL Apply with a replacement DDL string to execute, converting the path names of the primary database datafiles to the standby datafile path names. The steps to perform this are presented below and should be run on the logical standby database.

1. First, create the PL/SQL 'skip procedure' to handle tablespace DDL transactions.
create or replace procedure sys.handle_tbs_ddl (
      old_stmt  in  varchar2
    , stmt_typ  in  varchar2
    , schema    in  varchar2
    , name      in  varchar2
    , xidusn    in  number
    , xidslt    in  number
    , xidsqn    in  number
    , action    out number
    , new_stmt  out varchar2
) as
begin
  -- --------------------------------------------------------
  -- All primary file specification that contain a directory
  -- '/u05/oradata/FGUARD' should be changed to the
  -- '/u01/another_location/oradata/FGUARD' directory specification.
  -- --------------------------------------------------------
  new_stmt := replace(old_stmt, '/u05/oradata/FGUARD', '/u01/another_location/oradata/FGUARD');
  action := dbms_logstdby.skip_action_replace;
 
  exception
      when others then
          action := dbms_logstdby.skip_action_error;
          new_stmt := null;
end handle_tbs_ddl;
/


2. Stop SQL Apply.
alter database stop logical standby apply;

3.Register the skip procedure with SQL Apply.
execute dbms_logstdby.skip(stmt => 'TABLESPACE', proc_name => 'sys.handle_tbs_ddl');

4.Start SQL Apply.
alter database start logical standby apply immediate;


5. Perform a test.
On PRIMARY Database
alter tablespace data2 add datafile '/u05/oradata/FGUARD/data03.dbf' size 5m;

ON Logical Standby
----------------------- alert.log -----------------------
Wed Jan 12 20:51:58 EST 2011
LOGSTDBY status: ORA-16110: user procedure processing of logical standby apply DDL
LOGSTDBY status: ORA-16202: Skip procedure requested to replace statement
Wed Jan 12 20:51:58 EST 2011
alter tablespace data2 add datafile '/u01/another_location/oradata/FGUARD/data03.dbf' size 5m
Completed: alter tablespace data2 add datafile '/u01/another_location/oradata/FGUARD/data03.dbf' size 5m
---------------------------------------------------------

select tablespace_name, file_name
  from dba_data_files
  where tablespace_name = 'DATA2';

TABLESPACE_NAME    FILE_NAME
------------------ ---------------------------------------------------------
DATA2              /u01/another_location/oradata/FGUARD/data03.dbf



Logical Standby Database Activation (Role Transition)
A database can operate in one of two mutually exclusive modes in an Oracle Data Guard configuration: primary or standby.
Whenever the role is changed between the primary and standby, this is referred to as a role transition.
Role transition plays an important part in Data Guard by providing an interface that allows DBA's to activate a standby database to take over as the primary production database or vice versa.
There are two types of role transitions supported in Oracle: switchover and failover.


Switchover
1- Issue the following statement on the production database to enable receipt of redo from the current standby database:
ALTER DATABASE PREPARE TO SWITCHOVER TO LOGICAL STANDBY;

2- On the current logical standby database, build the LogMiner dictionary and transmit this dictionary to the current production database:
ALTER DATABASE PREPARE TO SWITCHOVER TO PRIMARY;

Depending on the work to be done and the size of the database, the prepare statement may take some time to complete.

3- Verify that the LogMiner Multiversioned Data Dictionary was received by the production database by querying the SWITCHOVER_STATUS column of the V$DATABASE fixed view on the production database.
Initially, the SWITCHOVER_STATUS column shows PREPARING DICTIONARY while the LogMiner Multiversioned Data Dictionary is being recorded in the redo stream. Once this has completed successfully, the column shows PREPARING SWITCHOVER. When the query returns the TO LOGICAL STANDBY value, you can proceed to the next step (a sample query is shown after these steps).

4- When the SWITCHOVER_STATUS column of the V$DATABASE view returns TO LOGICAL STANDBY, convert the production database to a standby by issuing:
ALTER DATABASE COMMIT TO SWITCHOVER TO LOGICAL STANDBY with SESSION SHUTDOWN;

5- Issue the following statement on the old standby database:
ALTER DATABASE COMMIT TO SWITCHOVER TO PRIMARY;
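
As referenced in step 3, the dictionary build progress can be followed with a simple query on the production database (the same query on the current standby should return TO PRIMARY before step 5):
SELECT switchover_status FROM v$database;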


Failover
Copy and register any missing archived redo logs from PROD. Execute the following query on Standby:
COLUMN FILE_NAME FORMAT a55;
SELECT THREAD#, SEQUENCE#, FILE_NAME
FROM DBA_LOGSTDBY_LOG L
WHERE NEXT_CHANGE# NOT IN  (SELECT FIRST_CHANGE#
                              FROM DBA_LOGSTDBY_LOG
                              WHERE L.THREAD# = THREAD#)
ORDER BY THREAD#,SEQUENCE#;

Register any log:
ALTER DATABASE REGISTER LOGICAL LOGFILE '/disk1/oracle/dbs/log-1292880008_7.arc';

Copy and register the online redo logs from the primary database (if possible).
You can safely ignore the error ORA-01289: cannot add duplicate logfile
ALTER DATABASE REGISTER LOGICAL LOGFILE '/disk1/oracle/dbs/online_log1.log';

Turn off the apply delay interval.
ALTER DATABASE STOP LOGICAL STANDBY APPLY;
EXECUTE DBMS_LOGSTDBY.APPLY_UNSET('APPLY_DELAY');
ALTER DATABASE START LOGICAL STANDBY APPLY;

Initiate the failover by issuing the following on the target standby database:
ALTER DATABASE ACTIVATE LOGICAL STANDBY DATABASE FINISH APPLY;
This statement stops the RFS process, applies any remaining redo data, stops SQL Apply, and activates the logical standby database in the production role.

Note: To avoid waiting for the redo in the standby redo log file to be applied prior to performing the failover, omit the FINISH APPLY clause on the statement. Although omitting the FINISH APPLY clause will accelerate failover, omitting the clause will cause the loss of any unapplied redo data in the standby redo log. To gauge the amount of redo that will be lost, query the V$LOGSTDBY_PROGRESS view.
The LATEST_SCN column value indicates the last SCN received from the production database, and the APPLIED_SCN column value indicates the last SCN applied to the standby database. All SCNs between these two values will be lost.
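
A quick way to gauge the potential loss on the logical standby (a minimal sketch using the view mentioned above):
SELECT latest_scn, applied_scn, (latest_scn - applied_scn) AS unapplied_scn_range
  FROM v$logstdby_progress;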



Monitoring Physical Data Guard (Detect Gap)

Perform a query of the V$ARCHIVE_GAP view from the physical standby database:
select * from v$archive_gap;

If this query returns rows, there is a gap. You can usually identify the cause by checking the alert.log file on your Primary DB.

Potential data loss window for a physical standby database (Archived logs not applied on a physical standby database)
--On the standby, Get the sequence number of the last applied archive log.
  select max(sequence#) Last_applied_arch_log from v$archived_log where applied='YES';

-- On the standby, Get the sequence number of the last complete archive log on the standby.
-- This is the last log the standby can apply without receiving additional archive logs from the primary.  
  SELECT min(sequence#)  Last_archive_log_received FROM v$archived_log
    WHERE (sequence#+1) NOT IN (SELECT sequence# FROM v$archived_log)
     AND sequence# > &Last_applied_arch_log;

--Connect to the primary database and obtain the sequence number of the current online log:
select sequence# from v$log where status='CURRENT';

-- The difference between the 2nd query and the 1st query should be 0
-- The difference between the 3rd query and the 1st query is the number of archive logs that the standby
-- database would not be able to recover if the primary host became unavailable


Some GOOD queries to detect GAPS
select process,status,client_process,sequence#,block#,active_agents,known_agents
  from v$managed_standby;
PROCESS STATUS       CLIENT_P  SEQUENCE#     BLOCK# ACTIVE_AGENTS KNOWN_AGENTS
------- ------------ -------- ---------- ---------- ------------- ------------
ARCH    CONNECTED    ARCH              0          0             0            0
ARCH    CONNECTED    ARCH              0          0             0            0
MRP0    WAIT_FOR_GAP N/A            5134          0             0            0
RFS     RECEIVING    ARCH           5454    1637652             0            0
RFS     ATTACHED     ARCH           5456     819100             0            0

Run this at Primary
set pages 1000
set lines 120
column DEST_NAME format a20
column DESTINATION format a35
column ARCHIVER format a10
column TARGET format a15
column status format a10
column error format a15
select DEST_ID,DEST_NAME,DESTINATION,TARGET,STATUS,ERROR from v$archive_dest
where DESTINATION is NOT NULL
/
select ads.dest_id,max(sequence#) "Current Sequence",
max(log_sequence) "Last Archived"
from v$archived_log al, v$archive_dest ad, v$archive_dest_status ads
where ad.dest_id=al.dest_id and al.dest_id=ads.dest_id group by ads.dest_id;

Run this at Standby

select max(al.sequence#) "Last Seq Received" from v$archived_log al
/
select max(al.sequence#) "Last Seq Applied" from v$archived_log al
where applied ='YES'
/
select process,status,sequence# from v$managed_standby
/


select * from v$archive_gap;
   THREAD# LOW_SEQUENCE# HIGH_SEQUENCE#
---------- ------------- --------------
         1          5134           5222

Another Method
Use the following SQL on the standby database (the database must be mounted).

SELECT high.thread#, "LowGap#", "HighGap#"
FROM
(     SELECT thread#, MIN(sequence#)-1 "HighGap#"
      FROM
      (     SELECT a.thread#, a.sequence#
            FROM
            (     SELECT *
                  FROM v$archived_log
            ) a,
            (     SELECT thread#, MAX(sequence#)gap1
                  FROM v$log_history
                  GROUP BY thread#
            ) b
            WHERE a.thread# = b.thread#
            AND a.sequence# > gap1
      )
      GROUP BY thread#
) high,
(     SELECT thread#, MIN(sequence#) "LowGap#"
      FROM
      (     SELECT thread#, sequence#
            FROM v$log_history, v$datafile
            WHERE checkpoint_change# <= next_change#
            AND checkpoint_change# >= first_change#
      )
      GROUP BY thread#
) low
WHERE low.thread# = high.thread#;

If no rows are returned, there is no gap.


Suspend Physical Standby Recovery

To stop managed standby recovery:
-- Cancel protected mode on primary
CONNECT sys/password@primary1 AS SYSDBA
ALTER DATABASE SET STANDBY DATABASE UNPROTECTED;

-- Cancel recovery if necessary
CONNECT sys/password@standby1 AS SYSDBA
RECOVER MANAGED STANDBY DATABASE CANCEL;
ALTER DATABASE OPEN READ ONLY;
The database can subsequently be switched back to recovery mode as follows:
-- Startup managed recovery
CONNECT sys/password@standby1 AS SYSDBA
SHUTDOWN IMMEDIATE
STARTUP NOMOUNT;
ALTER DATABASE MOUNT STANDBY DATABASE;
RECOVER MANAGED STANDBY DATABASE DISCONNECT FROM SESSION;

-- Protect primary database
CONNECT sys/password@primary1 AS SYSDBA
ALTER DATABASE SET STANDBY DATABASE PROTECTED;


Activating A Physical Standby Database

Procedures to Open the DR Database in Query Mode
First cancel recovery (you may get an error message that can safely be ignored):
RECOVER MANAGED STANDBY DATABASE CANCEL;

Open the DB in Read Mode with:
ALTER DATABASE OPEN READ ONLY;
alter tablespace TEMP add tempfile '/oracle/DBA_SCRIPTS/temp.dbf' size 1000m reuse;


Procedures to Put the Database back in DR Mode

Close the DB:
SHUTDOWN IMMEDIATE;

Mount the DB with:
startup nomount;
alter database mount standby database;

Open the DB in Recovery mode with:
recover standby database;
alter database recover managed standby database disconnect from session;


Procedures to Activate a DR Database as a PROD database

Primary Site:
Archive the current online redo log:
ALTER SYSTEM ARCHIVE LOG CURRENT;
Send the ARCH files to the standby site.

Standby Site:
Start the database in nomount and mount it as a standby:
STARTUP NOMOUNT
ALTER DATABASE MOUNT STANDBY DATABASE;
Start recovery mode to apply the latest ARCH log files received from PROD:
RECOVER MANAGED STANDBY DATABASE;
or
RECOVER STANDBY DATABASE;

Check the alert log file to review the last applied logs

Modify your init.ora file. Specifically 
LOG_ARCHIVE_DEST=<Value>
LOG_ARCHIVE_START = TRUE
LOG_ARCHIVE_DEST_STATE = ENABLE
And Comment the parameter:
STANDBY_ARCHIVE_DEST = <Value>

Ensure that your standby database is mounted in EXCLUSIVE mode by executing the following query:
SELECT name,value FROM v$parameter WHERE name='parallel_server';
If the value is TRUE, then the database is not mounted exclusively; if the value is FALSE, then the database is mounted exclusively.

--You may need to rename log files like:
ALTER DATABASE RENAME FILE '/u01/app/oracle/oradata/CCOM/redo1.log' to '/opt/app/oracle/oradata/CCOM/redo01.log';
ALTER DATABASE RENAME FILE '/u01/app/oracle/oradata/CCOM/redo2.log' to '/opt/app/oracle/oradata/CCOM/redo02.log';

Stop recovery and activate the standby database (the ACTIVATE command resets the online redo logs):
RECOVER MANAGED STANDBY DATABASE CANCEL;
or
ALTER DATABASE RECOVER MANAGED STANDBY DATABASE FINISH;
then
ALTER DATABASE ACTIVATE STANDBY DATABASE;

Shut down the NEW PROD instance:
SHUTDOWN IMMEDIATE

Make a COLD BACKUP and check init.ora parameters

Start the new production instance:
STARTUP

Create or Add a tempfile to the temporary tablespace: Ex:
create temporary tablespace TEMP2 TEMPFILE '/oracle/oradata/V901/temp2.dbf' size 100M;
or
alter tablespace TEMP add tempfile '/oracle/app/product/9.2/admin/oradata/temp01.dbf' size 100M;

 

Switchover between Prod and Standby DB

Important Note: These steps MUST be tested and understood because a mistake can result in the loss of the production or standby database.

The main advantage of a graceful switchover or switchback is that it avoids the resetlogs operation, so the source database can resume its role as the standby database almost immediately. When the primary site requires scheduled maintenance, the production database can gracefully switch over to the standby database. The graceful switchover technique is therefore useful for planned site repairs, hardware changes, and O/S upgrades.

Prerequisites of a Graceful Switchover or Switchback

  1. The production DB is shut down normal or immediate, or the instance is down.
  2. No archive logs that have not yet been applied to the standby database have been lost.
  3. All archives are applied to the standby database.
  4. The source database's online redo logs are available and intact.
  5. The target database is still intact and has the same resetlogs version as the source database.
A graceful switchover or switchback will not work when either the production database's online redo logs or the standby database is lost or inaccessible.
 

Steps in detail 
 

Primary Site (current production, becomes the standby):
1. alter system archive log current;
2. alter database backup controlfile to trace noresetlogs;   and send the trace file (the create controlfile script) to the standby site.
3. shutdown immediate   (check the alert.log file for ALTER DATABASE CLOSE NORMAL)
4. Copy the control files and online redo logs to the standby site (do not delete them from prod).
5. Reverse production and standby network connections (if necessary).
6. Once the new primary is open (see the standby-site steps below), validate the existence of all Oracle datafiles and standby controlfiles, and modify your init.ora file to adapt it to the new changes (Appendix A).
7. Copy the standby.ctl received from the new primary, overwriting the existing controlfiles.
8. startup nomount;
   alter database mount standby database;
9. Rename datafiles and log files if necessary (alter database rename file '/path/name' to '/newpath/name';)
   Offline data files if required (alter database datafile 'name' offline;)
10. Initiate recovery:
    alter database recover managed standby database;   or   recover standby database;
11. Oracle starts to apply the archive logs shipped from the new primary (check the alert file).
    PRODUCTION BECOMES STANDBY

Standby Site (current standby, becomes production):
1. The DB is in recover mode.
2. Copy the controlfile script from prod. You will need to modify this file with the appropriate data file and log file path names. Validate the status and existence of all data files (all the online datafiles; datafiles from offline tablespaces and read-only tablespaces will be offlined in the create controlfile script) and log files. Comment out the RECOVER DATABASE and the ALTER DATABASE OPEN statements.
3. Apply the last set of archive log files and cancel recovery.
4. shutdown immediate   (check the alert.log file for ALTER DATABASE CLOSE NORMAL and ALTER DATABASE DISMOUNT)
5. Reverse production and standby network connections (if necessary) and modify your init.ora file to adapt it to the new changes.
6. Execute the create controlfile script:
   startup nomount
   create controlfile ......
7. recover database;   (will recover only online datafiles)
8. Validate the Oracle database.
9. alter database open;   At this point this DB is open in read-write mode.
10. alter database create standby controlfile as '/path/standby.ctl';   and copy that file to the OLD PROD DB.
11. alter system archive log all;   (to send archives to the new standby database)
12. Clients can now reconnect here: the STANDBY becomes PRODUCTION, and the old primary starts to receive its archive log files.


 
Physical standby Switchover Steps

Unlike a failover, a switchover operation is a planned operation. All the archive logs required to bring the standby to the primary's point in time must be available. The primary database's online redo logs must also be available and intact. During a switchover operation, the primary and standby databases switch roles.

NOTE = Standby must be mounted before starting the switchover!!!!

QUICK GUIDE

The Current PROD Site, that will become Standby
select database_role, switchover_status from v$database;    -- Here we would like to see "PRIMARY    TO STANDBY"
alter database commit to switchover to physical standby with session shutdown;
shutdown immediate
startup nomount
alter database mount standby database;
alter system set log_archive_dest_state_2=defer;
ALTER SYSTEM SET fal_client='THIS_STANDBY' SCOPE=BOTH;    --This should be the DR DB (Denver)
ALTER SYSTEM SET fal_server='PROD' SCOPE=BOTH;             --This should be PROD (Falcon)

**** At this point we have 2 standby Databases ******


The Current STDBY Site, that will become primary
select database_role, switchover_status from v$database;    -- Here we would like to see "PHYSICAL STANDBY   TO PRIMARY"
alter database commit to switchover to primary;
shutdown immediate
startup
alter system set fal_client=NULL scope=both;
alter system set fal_server=NULL scope=both;
alter system set log_archive_dest_2='SERVICE=STANDBY reopen=60' scope=both;
alter system set log_archive_dest_state_2=enable scope=both;

OLD PRIMARY SITE
recover managed standby database disconnect


In this FULL Example, the standby database (STDBY) becomes the new primary, and the primary (PROD) becomes the new standby database.
The following are steps for switchover operation:

Step 1: Switchover preparation for formal primary database
·         End all activities on the primary and standby database
·         Check primary database switchover status
SQL> select database_role, switchover_status from v$database
DATABASE_ROLE              SWITCHOVER_STATUS
-------------------------  -----------------------------------
PRIMARY                    TO STANDBY

The SWITCHOVER_STATUS column of v$database can have the following values:
NOT ALLOWED - Either this is a standby database and the primary database has not been switched first, or this is a primary database and there are no standby databases.
SESSIONS ACTIVE - Indicates that there are active SQL sessions attached to the primary or standby database that need to be disconnected before the switchover operation is permitted.
SWITCHOVER PENDING - This is a standby database and the primary database switchover request has been received but not processed.
SWITCHOVER LATENT - The switchover was in pending mode, but did not complete and went back to the primary database.
TO PRIMARY - This is a standby database, with no active sessions, that is allowed to switch over to a primary database.
TO STANDBY - This is a primary database, with no active sessions, that is allowed to switch over to a standby database.
RECOVERY NEEDED - This is a standby database that has not received the switchover request.

During normal operations it is acceptable to see the following values for SWITCHOVER_STATUS on the primary to be SESSIONS ACTIVE or TO STANDBY.
During normal operations on the standby it is acceptable to see the values of NOT ALLOWED or SESSIONS ACTIVE.

If SWITCHOVER_STATUS returns SESSIONS ACTIVE then you should either disconnect all sessions manually or you can use the following statement to close those sessions:
alter database commit to switchover to standby with session shutdown;

·         Initiate the switchover operation on the primary database. Convert Primary DB to standby
SQL> alter system switch logfile;
SQL> alter database commit to switchover to physical standby;
or if you have current sessions connected, you can perform:
SQL> alter database commit to switchover to physical standby with session shutdown;
 
Step 2: Shutdown the primary database and bring IT up as the new standby database
·         Shutdown the primary database normally
SQL> shutdown immediate;

·         Modify the former primary database’s initialization file
-   Add the following two parameters to the primary init.ora file. These two parameters can also be set on the primary database ahead of time for a future switchover operation.
-   fal_client = 'PROD_01'
-   fal_server = 'STDBY'
or
ALTER SYSTEM SET fal_client='PROD' SCOPE=BOTH;        #This DB used on tnsnames.ora
ALTER SYSTEM SET fal_server='STDBY' SCOPE=BOTH;     #The new PROD DB used on tnsnames.ora
-     Remove the following parameters from the Primary init.ora File: log_archive_dest_2 and log_archive_dest_state_2.  Or, just defer if you like
     ALTER SYSTEM SET log_archive_dest_2='' SCOPE=BOTH;

·         Bring the former primary database in mount mode
SQL> startup nomount;
SQL> alter database mount standby database;

 
Step 3: Switchover preparation for the former standby database
·         At this time we have two standby databases; now we will prepare the original standby and convert it to primary. Check the standby database switchover status; if it shows SESSIONS ACTIVE, handle it as we did before.
SQL> select database_role, switchover_status from v$database;
DATABASE_ROLE              SWITCHOVER_STATUS
-------------------------  -----------------------------------
PHYSICAL STANDBY                  TO PRIMARY

·         Initiate the switchover operation on the standby database
SQL> alter database commit to switchover to primary;
or
SQL> alter database commit to switchover to primary with session shutdown;
 
Step 4: Shutdown the standby database and bring IT up as the new primary database
·         Shutdown the standby database
SQL> shutdown immediate;

·         Bring up the former standby database as the new primary database
        SQL> startup;

·         Modify the former standby database’s initialization file
-  fal_client = 'STDBY'
-  fal_server = 'PROD'
-  Add parameters log_archive_dest_2 and log_archive_dest_state_2
alter system set log_archive_dest_2='SERVICE=PROD reopen=60' scope=both;
alter system set log_archive_dest_state_2=enable scope=both;
ALTER SYSTEM SET fal_client='STDBY' SCOPE=BOTH;  #This box when it is in the standby role
ALTER SYSTEM SET fal_server='PROD' SCOPE=BOTH;   #The "original" PROD box
 
Step 5: Add Temp Tablespace
·         Issue the following command to add TEMP tablespace
ALTER TABLESPACE TEMP ADD TEMPFILE '/u02/oradata/prod/temp01.dbf' SIZE 3072M reuse AUTOEXTEND OFF;
 
Step 6: Put the new standby (OLD PROD database) in managed recovery mode
·         Issue the following command on the new standby database.
SQL> alter database recover managed standby database disconnect from session;

Step 7: SWITCH THE LOG FILES A COUPLE OF TIMES on the new primary db
·         Issue the following commands:
     SQL> alter system switch logfile;
     SQL> alter system switch logfile;

Step 8: Change TNS Entry for the New Primary Database
·         Change the TNS entry on all application hosts to point to the new primary
Prod =
   (description =
      (address = (protocol = tcp) (host = server_02) (port = 1522))
      (connect_data = (sid = stdby))
   )


Physical standby Failover Steps

Failover is only performed as a result of an unplanned outage of the primary database.  During a failover, the standby database (prod_02) becomes the new primary database.  It is possible to have data loss. 
The old primary (prod_01) has to be discarded and cannot be used as the new standby database. You need to create a new standby database by backing up the new primary and restoring it on host server_01. The time needed to create the new standby database is a window during which you have no standby database for protection.
After failover operation, you need to modify TNS entry for ‘prod’ to point to the new instance and host name.

Steps
1- Initiate the failover by issuing the following on the target standby database:
ALTER DATABASE RECOVER MANAGED STANDBY DATABASE FINISH FORCE;
Note: Include the FORCE keyword to ensure that the RFS processes on the standby database will fail over without waiting for the network connections to time out through normal TCP timeout processing before shutting down.

2- Convert the physical standby database to the production role:
ALTER DATABASE COMMIT TO SWITCHOVER TO PRIMARY;

3- If the standby database was never opened read-only since the last time it was started, then open the new production database by issuing the following statement:
ALTER DATABASE OPEN;
If the physical standby database has been opened in read-only mode since the last time it was started, shut down the target standby database and restart it:
SHUTDOWN IMMEDIATE;
STARTUP;

Note: In rare circumstances, administrators may wish to avoid waiting for the standby database to complete applying redo in the current standby redo log file before performing the failover. (note: use of Data Guard real-time apply will avoid this delay by keeping apply up to date on the standby database).
If so desired, administrators may issue the ALTER DATABASE ACTIVATE STANDBY DATABASE statement to perform an immediate failover. This statement converts the standby database to the production database, creates a new resetlogs branch, and opens the database.
 
Create or Add a tempfile to the temporary tablespace: Ex:
create temporary tablespace TEMP2 TEMPFILE '/oracle/oradata/V901/temp2.dbf' size 100M;
or
alter tablespace TEMP add tempfile '/oracle/app/product/9.2/admin/oradata/temp01.dbf' size 100M;




Implementation Tips

Here are several tips for implementing Data Guard:
 
Tip #1: Primary Online Redo Logs
The number of redo groups and the size of the redo logs are two key factors in configuring online redo logs. In general, you want to create the fewest groups possible without hampering the log writer process's ability to write redo log information. In a Data Guard environment, the LGWR process may take longer to write to the remote standby sites, so you may need to add additional groups to guarantee that a recycled group is always available to the log writer process; otherwise, you may receive incomplete logs on the standby sites.
The size of the redo logs is determined by the amount of redo that would need to be applied to a standby database during a failover. Small redo logs minimize the standby database lag time, but they cause more frequent log switches and require more redo groups for the switches to occur smoothly. Large redo logs require fewer groups and fewer log switches, but they can increase standby lag time and the potential for data loss. The best way to determine whether the current configuration is satisfactory is to examine the contents of the log writer process's trace file and the database's alert log.
For example, the following message from the alert log may indicate a need for more log groups.
ORA-00394: online log reused while attempting to archive it
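
Before adding groups, it helps to review the current redo configuration and the switch rate; a simple starting point (thresholds depend on your workload):
select group#, thread#, sequence#, bytes/1024/1024 size_mb, members, status from v$log order by group#;
select trunc(first_time) day, count(*) log_switches from v$log_history group by trunc(first_time) order by 1;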
 
Tip #2: Standby Online Redo Logs vs. Standby Archived Redo logs
Online redo logs transferred from the primary database are stored as either standby redo logs or archived redo logs.  Which redo log reception option should we choose?  Here is the comparison chart:
Standby Online Redo Logs - Advantages:
-  Pre-allocated files
-  Can be placed on raw devices
-  Can be duplexed for more protection
-  Improve redo data availability
-  No Data Loss capable
-  Reduce lag time

Standby Archived Redo Logs - Advantages:
-  No extra ARCH process needed
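
If you choose standby redo logs, a typical setup on the standby looks like the sketch below (group numbers and the FGUARD path are illustrative assumptions; size the standby logs the same as the online redo logs and, as a rule of thumb, configure one more standby group per thread than the number of online groups):
ALTER DATABASE ADD STANDBY LOGFILE THREAD 1 GROUP 10 ('/u02/oradata/FGUARD/srl10.log') SIZE 100M;
ALTER DATABASE ADD STANDBY LOGFILE THREAD 1 GROUP 11 ('/u02/oradata/FGUARD/srl11.log') SIZE 100M;
ALTER DATABASE ADD STANDBY LOGFILE THREAD 1 GROUP 12 ('/u02/oradata/FGUARD/srl12.log') SIZE 100M;
ALTER DATABASE ADD STANDBY LOGFILE THREAD 1 GROUP 13 ('/u02/oradata/FGUARD/srl13.log') SIZE 100M;
select group#, thread#, bytes/1024/1024 size_mb, status from v$standby_log;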
 
Tip #3: Enforce Logging
It is recommended that you set the FORCE LOGGING clause to force redo to be generated even for database objects created or altered with NOLOGGING. This is required for a no-data-loss strategy.
Here is the SQL command to set FORCE LOGGING:

SQL> select force_logging from v$database;
FORCE_LOGGING
--------------
NO
 
SQL> alter database force logging;
 
Tip #4: RMAN Backup
A failover operation resets the online redo logs of the new primary, creating a new database incarnation. If you use RMAN with a recovery catalog to back up your database, you need to make the catalog aware of the new incarnation of the target database; otherwise, your RMAN backups will fail.

RMAN> reset database;
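
If a recovery catalog is in use, the sequence is roughly the following (the catalog connect string is an illustrative assumption):
$ rman target / catalog rman/rman@rcat
RMAN> reset database;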
 
Tip #5: Disable Log Transport Services When Standby Database is down
When a standby database or host is down for maintenance, it is advisable to temporarily disable the log transport services for that site. During periods of heavy transaction activity, one behavior observed in Oracle is that when one of the standby databases is down for maintenance, the primary database can temporarily freeze even when the data protection mode is set to rapid mode. To avoid this problem, issue this command on the primary database before bringing down the standby database:
SQL> alter system set log_archive_dest_state_2 = defer;

When the standby database is up again, issue:
SQL> alter system set log_archive_dest_state_2 = enable;
 
Tip #6: Standby Database Upgrade
Steps to upgrade standby database to newer database version:
Step 1: Shutdown both primary and standby databases
Step 2: Install Oracle software on both primary and standby hosts
Step 3: Upgrade the primary database
Step 4: Rebuild standby database from the upgraded primary
 
Tip #7: Data Guard Broker
Starting with Oracle9i R2, the broker has made great improvements. The configuration now supports up to nine standby sites (including logical standby databases). Both Data Guard Manager and the DGMGRL CLI support switchover and failover operations.
 
Tip #8: Using the 'Delay' Option to Protect Against Logical/Physical Corruptions
You can use the delay option (if you have multiple standby sites) to protect against physical or logical corruption of your primary. For instance, standby #1 may run without a delay so it can act as your disaster recovery standby, while standby #2 applies redo with a delay of minutes or hours to allow recovery from a possible physical or logical corruption on your primary database.
SQL> alter database recover managed standby database delay 5 disconnect;
 

Tip #9: Always monitor log apply services and check the alert.log file for errors.
If you are not using Data Guard broker, here is a script to help you to monitor your standby database recover process:
 
$ cat ckalertlog.sh
#!/bin/ksh
####################################################################
## ckalertlog.sh                                                  ##
####################################################################
export EDITOR=vi
export ORACLE_BASE=/u01/app/oracle
export ORACLE_HOME=$ORACLE_BASE/product/10.2.0
export ORACLE_HOME LD_LIBRARY_PATH=$ORACLE_HOME/lib
export TNS_ADMIN=/var/opt/oracle
export ORATAB=/var/opt/oracle/oratab
PATH=$PATH:$ORACLE_HOME:$ORACLE_HOME/bin:/usr/ccs/bin:/bin:/usr/bin:/usr/sbin:/sbin:/usr/openwin/bin:/opt/bin:.; export PATH
DBALIST="primary.dba@company.com,another.dba@company.com"; export DBALIST
 
for SID in `cat $ORACLE_HOME/sidlist`
do
    cd $ORACLE_BASE/admin/$SID/bdump
    if [ -f alert_${SID}.log ]
    then
        mv alert_${SID}.log alert_work.log
        touch alert_${SID}.log
        cat alert_work.log >> alert_${SID}.hist
        grep ORA- alert_work.log > alert.err
    fi
    if [ -f alert.err ] && [ `cat alert.err | wc -l` -gt 0 ]
    then
        mailx -s "${SID} ORACLE ALERT ERRORS" $DBALIST < alert.err
    fi
    rm -f alert.err
    rm -f alert_work.log
done
Place the script in a crontab:
#########################################################
# Check Standby log file
#########################################################
9,19,29,39,49,59 7-17 * * 1-5 /dba/scripts/ckalertlog.sh



Applying Patches with Standby
Beginning with Oracle Database 11.2, Oracle has introduced Standby-First Patch Apply to enable a physical standby to use Redo Apply to support different software patch levels between a primary database and its physical standby database for the purpose of applying and validating Oracle patches in rolling fashion. Patches eligible for Standby-First patching include:
• Patch Set Update (PSU)
• Critical Patch Update (CPU)
• Patch Set Exception (PSE)
• Oracle Database bundled patch
• Oracle Exadata Database Machine bundled patch
• Oracle Exadata Storage Server Software patch
Standby-First Patch Apply is supported for certified software patches for Oracle Database Enterprise Edition Release 2 (11.2) release 11.2.0.1 and later.
Refer to My Oracle Support Note 1265700.1 for more information and the README for each patch to determine if a target patch is certified as being a Standby-First Patch.

For other types of patches or older versions of Oracle, see:
    - How do you apply a Patchset,PSU or CPU in a Data Guard Physical Standby configuration (Doc ID 278641.1)


Procedure to Apply a Patch Set with Data Guard Standby-First Patch Apply (Doc ID 1265700.1)
The types of patches that are candidates for Data Guard Standby-First certification are those listed above; Doc ID 1265700.1 also describes the advantages of this method in detail.

Steps to Perform Data Guard Standby-First Patch Apply

Data Guard Standby-First Patch Apply is performed in the three phases described below. For the three patch methods that OPatch supports in a RAC environment (all-node, minimum-downtime, and rolling), refer to Rolling Patch - OPatch Support for RAC (Doc ID 244241.1).

Another option for patching Oracle Database Homes is Rapid Home Provisioning (RHP). RHP is available with Grid Infrastructure 12.x. A central RHP Server provisions patched homes to target nodes, and orchestrates the switch to the patched home. When patching Oracle Database version 12.x, RHP automatically applies DataPatch unless the Database is a Data guard Standby. Therefore when using RHP in a Data Guard Standby-First Patch Apply process, you must follow Phase 3 Option 2. See How To Setup a Rapid Home Provisioning (RHP) Server and Client (Doc ID 2097026.1) for more details.


Phase 1: Perform Patch Binary Installation on Standby Only

1. Shutdown all standby instances on the standby database using the following commands (if Patch is not RAC Rolling):  shutdown immediate
2. Perform binary installation of the patch on the standby according to the patch README. Do not perform SQL installation (e.g. do not run catbundle.sql or datapatch) for the patch at this time. SQL installation is performed after the primary database and all standby databases have their database home binaries patched to the same level in Phase 3.
3. Restart the standby instances after the patch has been applied to all standby database ORACLE_HOME, as follows:
    If Active Data Guard is used, then start all standby instances using the following command:
        SQL> startup
    If Active Data Guard is not used, then mount all standby instances using the following command:
        SQL> startup mount

4. Restart the media recovery using the following command:
Data Guard configuration managed by SQL*Plus:
SQL> alter database recover managed standby database using current logfile disconnect;

Data Guard configuration managed by Data Guard broker:
The Data Guard broker will automatically restart the media recovery.
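
Before moving to Phase 2 it is worth confirming that redo apply has resumed on the standby. If the configuration is broker-managed, a quick check (a minimal sketch) is:
DGMGRL> show configuration;
DGMGRL> show database '<standby database name>';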


Phase 2: Evaluate Patch on Standby Database

The Oracle recommended best practice and most comprehensive evaluation method is to use Snapshot Standby and Oracle Real Application Testing in the following manner:
    1. Convert the standby database into a snapshot standby (see the Oracle Data Guard Concepts and Administration Guide)
    2. Perform any required SQL installation steps for the patch on the snapshot standby.
    3. Use Oracle Real Application Testing to evaluate stability and performance of the new software using real application workload.
    4. After testing is complete, convert the snapshot standby back to a physical standby. Conversion back to a physical standby will roll back changes made by Oracle Real Application Testing workload replay, and made by SQL installation steps for the patch.

Less comprehensive evaluation can be performed by the following:

    If using the Active Data Guard option, open the standby database in read only mode and stress the standby database by running your read-only workload.
    Leave the standby database in managed recovery mode at the mount state, and monitor for any issues in the standby alert log and trace files.
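
Whichever evaluation method is used, the apply progress can be watched from the standby while it runs on the patched binaries (a minimal sketch; V$DATAGUARD_STATS is available in 11g and later):
SQL> select name, value, time_computed from v$dataguard_stats where name in ('transport lag', 'apply lag');
SQL> select process, status, sequence# from v$managed_standby;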


Phase 3: Complete Patch Installation or Rollback

At this point the patch has been applied only to the standby system binaries; therefore, it is only partially installed. The environment may remain in mixed version state for a maximum of 31 days. To complete patch installation, binary installation must be completed on the primary system, and SQL installation (if necessary for the patch) must be performed.

When installing Oracle Database Proactive Bundle Patch 12.1.0.2.170418 in a rolling Data Guard standby-first manner to a database software home, patch 26112084 must be installed on top of 12.1.0.2.170418. See document 2267842.1 for details.

 There are 3 options for Phase 3.

  • Option 1 - Apply patch to primary
  • Option 2 - Execute Switchover and apply patch to standby
  • Option 3 - Rollback patch from standby
 
Phase 3 Option 1: Apply Patch to Primary Database

The main step in Option 1 is to apply the patch to the primary, which will include performing binary installation in the primary database home, and performing SQL installation against the primary database. Changes made to the primary database during SQL installation will propagate to the standby via redo. Option 1 requires either rolling outage or complete outage of the primary database, depending on the installation method chosen and supported by the patch.

Perform the following steps to complete patch installation on the primary:

  1. If the patch is not RAC Rolling Installable, then restart standby database recovery with the standby in mounted mode. Patches that are listed as RAC Rolling Installable in the patch README can be applied on the primary with the standby performing recovery in read only mode.  However, patches that are not RAC Rolling Installable must stop read only recovery on the standby, bring the standby database to the mount state, and restart recovery prior to applying the patch to the primary database. For example, run the following command on the standby instance that performs media recovery:

    Data Guard configuration managed by SQL*Plus:
    SQL> shutdown immediate
    SQL> startup mount
    SQL> alter database recover managed standby database using current logfile disconnect;

    Data Guard configuration managed by Data Guard broker:
    SQL> shutdown immediate
    SQL> startup mount
  2. Perform binary installation of the patch to the database home on the primary according to the patch README.
  3. If required, perform SQL installation of the patch according to the patch README. This step may be performed only after the primary and all standby databases have been patched to use the same software.
  4. If using Active Data Guard, then restart into Active Data Guard mode:

    Data Guard configuration managed by SQL*Plus:
    SQL> alter database recover managed standby database cancel;
    SQL> alter database open;
    SQL> alter database recover managed standby database using current logfile disconnect;

    Data Guard configuration managed by Data Guard broker:
    SQL> alter database open;

Phase 3 Option 2: Data Guard Switchover and Apply Patch to New Physical Standby

The main steps in Option 2 are to perform Data Guard switchover, perform binary installation in the new standby database home, and perform SQL installation against the new primary database. Option 2 has brief impact to the primary database during Data Guard switchover, but there is no impact to the primary while completing patch installation.

*Note:  For switchover Option 2, the binary patching of the new standby (the original primary) should commence immediately after the switchover, followed by the SQL installation run on the new primary.

 

The main steps in Option 2 are the following:

  • Perform a Data Guard switchover so that the new primary (i.e. old standby) is now running on the patched binaries.
  • Perform binary installation on the new standby (i.e. old primary).
  • Perform SQL installation on the new primary (i.e. old standby). Changes made to the new primary database during SQL installation will propagate to the new standby via redo.
  • Optionally perform a switchover to get back to the original configuration.

Run the following steps to perform Data Guard switchover and complete patch installation:

  1. Execute Data Guard Switchover as described in the Data Guard Concepts and Administration Guide.

    Data Guard configuration managed by SQL*Plus:

    Primary Database:
    SQL> alter database commit to switchover to physical standby with session shutdown;

    Standby Database:
    SQL> alter database commit to switchover to primary with session shutdown;
    SQL> alter database open;

    New Standby Database (Old Primary Database):
    SQL> shutdown immediate
    SQL> startup mount
    SQL> alter database recover managed standby database using current logfile disconnect;

    Data Guard configuration managed by Data Guard broker:
    DGMGRL> switchover to '<standby database name>'
  2. If the patch is not RAC Rolling Installable, then restart standby database recovery with the standby in mounted mode. Patches that are listed as RAC Rolling Installable in the patch README can be applied on the primary with the standby performing recovery in read only mode.  However, patches that are not RAC Rolling Installable must stop read only recovery on the standby, bring the standby database to the mount state, and restart recovery prior to applying the patch to the primary database. For example, run the following command on the standby instance that performs media recovery:

    Data Guard configuration managed by SQL*Plus:
    SQL> shutdown immediate
    SQL> startup mount
    SQL> alter database recover managed standby database using current logfile disconnect;

    Data Guard configuration managed by Data Guard broker:
    SQL> shutdown immediate
    SQL> startup mount
  3. Perform binary installation of the patch to the database home on the standby system according to the patch README.
  4. If required, perform SQL installation of the patch on the primary database according to the patch README.  This step may be performed only after the primary and all standby databases have been patched to use the same software.
  5. If using Active Data Guard, then restart into Active Data Guard mode:

    Data Guard configuration managed by SQL*Plus:
    SQL> alter database recover managed standby database cancel;
    SQL> alter database open;
    SQL> alter database recover managed standby database using current logfile disconnect;

    Data Guard configuration managed by Data Guard broker:
    SQL> alter database open;
  6. Optionally you can perform another Switchover to get your initial Environment back (Primary Database A and Standby Database B)

Phase 3 Option 3: Rolling Back a Patch on a Standby Database

The following procedure describes how to roll back a patch on the standby database:

  1. Cancel media recovery and shut down the standby instances on the standby database using the following command:

    Data Guard configuration managed by SQL*Plus or the Data Guard broker:
    SQL> shutdown immediate
  2. Deinstall the patch as outlined in the “Deinstallation” section of the patch README from all standby database homes.
  3. Restart the standby instances, as follows: 

    If Active Data Guard is used, then start all standby instances using the following command:

    Data Guard configuration managed by SQL*Plus or the Data Guard broker:
    SQL> startup

    If Active Data Guard is not used, then mount all standby instances using the following command:

    Data Guard configuration managed by SQL*Plus or the Data Guard broker:
    SQL> startup mount
  4. Restart the media recovery using the following command:

    Data Guard configuration managed by SQL*Plus:
    SQL> alter database recover managed standby database using current logfile disconnect;

    Data Guard configuration managed by Data Guard broker:
    The Data Guard broker will automatically restart the media recovery.



How do you apply a Patchset,PSU or CPU in a Data Guard Physical Standby configuration (Doc ID 278641.1)

OVERALL STEPS

1. Disable REDO transport on Primary.

2. Shutdown the standby site and apply the patch to the RDBMS binaries as per the README. This includes Patchset/Patch Set Update (PSU)/Critical Patch Update (CPU) patches. Do not run the post-install SQL (catpatch.sql, catbundle, etc.) against the standby itself; those dictionary changes will reach the standby via redo transport. Start the standby site to mount only, do not restart managed recovery.

3. Shutdown the primary site, apply the Patchset/PSU/CPU patch to the RDBMS binaries and patch the RDBMS itself using the instructions in the README (run catpatch/catbundle/catcpu etc).

NOTE: The latest Patchsets for Oracle 11gR2 (11.2.0) require to be installed into a new ORACLE_HOME. So mind to reset your Environment and copy corresponding Files (like SPFILE, Network Files,..) to the new ORACLE_HOME, too. Follow the Database Upgrade Guide for further Details.

4. Start the primary site, re-enable log shipping to the standby.

5. At the standby site start the MRP(managed recovery). RDBMS changes implemented in the Primary Site through catpatch/catbundle/catcpu will also be applied to the standby.

NOTE:
Step 5 should be done immediately after upgrading the database binaries on the standby. This ensures that the data dictionary (CATPROC) version matches the version of the database binaries. If these do not match (e.g. when you upgrade the standby database binaries first and perform a role change on the standby before you upgrade the primary), you may run into severe problems. Having different patch levels in a Data Guard Physical Standby Database environment is not supported anyway, see
Mixed Oracle Version support with Data Guard Redo Transport Services (Doc ID 785347.1)
for further Details and Reference.

6. Checks to perform to ensure the patch has been applied successfully at the primary and standby sites.
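
The note leaves the exact verification up to the DBA; a reasonable minimal set of checks (a sketch, assuming a PSU/CPU that records itself in the registry) is:
-- On both primary and standby hosts, confirm the patch is in the inventory:
$ opatch lsinventory
-- On the primary, confirm the SQL portion was applied:
SQL> select action_time, action, version, id, comments from dba_registry_history order by action_time;
-- On the standby, confirm the redo generated by the catbundle/catcpu run has been received and applied:
SQL> select max(sequence#) from v$archived_log where applied = 'YES';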


1. Disable REDO Transport on Primary

1.1  Disable log shipping using DGMGRL.

If the DG broker is in place, it is mandatory to disable log shipping via the DG broker.

DGMGRL> connect /
DGMGRL> show database verbose plb_prm
Database
Name: plb_prm
Role: PRIMARY
Enabled: YES
Intended State: ONLINE
Instance(s):  plb
Properties:
InitialConnectIdentifier = 'plb_prm_dgmgrl'
..
.
Current status for "plb_prm":
SUCCESS
 
DGMGRL> edit database plb_prm set state='LOG-TRANSPORT-OFF';
Succeeded.
 


1.2 If DG broker not in place

Disable the log_archive_destination used to ship archives from the primary to the standby site using sqlplus.

SQL> alter system set log_archive_dest_state_X=defer scope=both sid='*';


From 11.2 onwards SID='*' is the default, so redo transport is deferred on all nodes; in 10g you must specify SID='*' explicitly.
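
For example, if the standby destination is LOG_ARCHIVE_DEST_2 (the destination number is an assumption; use the one that points to your standby):
SQL> alter system set log_archive_dest_state_2=defer scope=both sid='*';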

2. Apply the patch at the Standby Site 

2.1 If the standby is a RAC environment, then the patch application would occur across all nodes.

2.1.1 Non RAC Environment,

Shutdown ALL processes running from the standby ORACLE_HOME. This includes all listeners, database instances, ASM instances, etc. running from the home to be patched.

$ lsnrctl stop lsnrplb

LSNRCTL for Linux: Version 10.2.0.4.0 - Production on 04-FEB-2010 08:41:29
Copyright (c) 1991, 2007, Oracle. All rights reserved.
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=<host>)(PORT=1666)))
The command completed successfully

$ sqlplus / as sysdba

SQL*Plus: Release 10.2.0.4.0 - Production on Thu Feb 4 08:42:02 2010
Copyright (c) 1982, 2007, Oracle. All Rights Reserved.
Connected to:
Oracle Database 10g Enterprise Edition Release 10.2.0.4.0 - Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options

SQL> shutdown immediate

ORA-01109: database not open
Database dismounted.
ORACLE instance shut down.


2.1.2 RAC Environment: stop all the RAC services that are running from the ORACLE_HOME to be patched.

In this case the RAC database is called plb and the listener running from the ORACLE_HOME to be patched is lsnrplb.

$ srvctl stop database -d plb_std
$ srvctl stop listener -n <host> -l lsnrplb_<host> 
$ srvctl stop listener -n <host> -l lsnrplb_<host>


2.2 The release of OPatch that is supplied with 10.2.0.4 will not be able to apply a PSU as seen through the errors below.

$ pwd
/home/oracle/patches/9119284
$ which opatch
/<path>/db_plb/OPatch/opatch

$ opatch apply

Invoking OPatch 10.2.0.4.2
Oracle Interim Patch Installer version 10.2.0.4.2
Copyright (c) 2007, Oracle Corporation. All rights reserved.
Oracle Home : /u01/oracle/product/10.2.0/db_plb
Central Inventory : /u01/oraInventory
from : /etc/oraInst.loc
OPatch version : 10.2.0.4.2
OUI version : 10.2.0.4.0
OUI location : /u01/oracle/product/10.2.0/db_plb/oui
Log file location : /u01/oracle/product/10.2.0/db_plb/cfgtoollogs/opatch/opatch2010-02-04_08-45-30AM.log

ApplySession failed: Patch ID is null.
System intact, OPatch will not attempt to restore the system

OPatch failed with error code 73


2.2.1 To overcome the patch issue apply the latest copy of OPatch to the home via Patch 6880880.

If you already have a later release of OPatch then it is not necessary to perform this step.

Download Patch 6880880 and follow the install instructions in its README.

$ mv /u01/oracle/product/10.2.0/db_plb/OPatch /u01/oracle/product/10.2.0/db_plb/OPatch_10204
$ pwd
/home/oracle/patches
$ ls
9119284 OPatch

$ cp -rp OPatch/ /u01/oracle/product/10.2.0/db_plb/


2.3 Once the new release of OPatch is in place apply the patch to the Standby Site

Please Note: At the Standby Site only the patching of the binaries is performed; there is no need to run the catupgrade/catcpu/catbundle.sql scripts, as their changes will arrive through redo apply at the Standby Site.

The example below applies PSU Patch 9119284 to a Single Instance Standby Site.

$ opatch apply

Invoking OPatch 11.2.0.1.1
Oracle Interim Patch Installer version 11.2.0.1.1
Copyright (c) 2009, Oracle Corporation. All rights reserved.
Oracle Home : /u01/oracle/product/10.2.0/db_plb
Central Inventory : /u01/oraInventory
from : /etc/oraInst.loc
OPatch version : 11.2.0.1.1
OUI version : 10.2.0.4.0
OUI location : /u01/oracle/product/10.2.0/db_plb/oui
Log file location : /u01/oracle/product/10.2.0/db_plb/cfgtoollogs/opatch/opatch2010-02-04_08-57-57AM.log
Patch history file: /u01/oracle/product/10.2.0/db_plb/cfgtoollogs/opatch/opatch_history.txt

ApplySession applying interim patch '9119284' to OH '/u01/oracle/product/10.2.0/db_plb'
Running prerequisite checks...
Provide your email address to be informed of security issues, install and
initiate Oracle Configuration Manager. Easier for you if you use your My
Oracle Support Email address/User Name.
Visit http://www.oracle.com/support/policies.html for details.
Email address/User Name:
You have not provided an email address for notification of security issues.
Do you wish to remain uninformed of security issues ([Y]es, [N]o) [N]: Y
OPatch detected non-cluster Oracle Home from the inventory and will patch the local system only.
Please shutdown Oracle instances running out of this ORACLE_HOME on the local system.
(Oracle Home = '/u01/oracle/product/10.2.0/db_plb')
Is the local system ready for patching? [y|n]
y
User Responded with: Y
Backing up files and inventory (not for auto-rollback) for the Oracle Home
Backing up files affected by the patch '9119284' for restore. This might take a while...
Backing up files affected by the patch '9119284' for rollback. This might take a while...
Execution of 'sh /home/oracle/patches/9119284/custom/scripts/pre -apply 9119284 ':
Return Code = 0
Patching component oracle.rdbms.rsf, 10.2.0.4.0...
Updating archive file "/u01/oracle/product/10.2.0/db_plb/lib/libgeneric10.a" with "lib/libgeneric10.a/qcodfdef.o"
..
.
Updating jar file "/u01/oracle/product/10.2.0/db_plb/rdbms/jlib/qsma.jar" with "/rdbms/jlib/qsma.jar/oracle/qsma/QsmaFileManager.class"
Copying file to "/u01/oracle/product/10.2.0/db_plb/rdbms/lib/env_rdbms.mk"
..
.
Running make for target idgmgrl
Running make for target ioracle
Running make for target client_sharedlib
Running make for target itnslsnr
Running make for target iwrap
Running make for target genplusso
ApplySession adding interim patch '9119284' to inventory

Verifying the update...
Inventory check OK: Patch ID 9119284 is registered in Oracle Home inventory with proper meta-data.
Files check OK: Files from Patch ID 9119284 are present in Oracle Home.
--------------------------------------------------------------------------------

********************************************************************************
********************************************************************************
** ATTENTION **
** **
** Please note that the Patch Set Update Installation (PSU Deinstallation) **
** is not complete until all the Post Installation (Post Deinstallation) **
** instructions noted in the Readme accompanying this PSU, have been **
** successfully completed. **
** **
********************************************************************************
********************************************************************************
--------------------------------------------------------------------------------

Execution of 'sh /home/oracle/patches/9119284/custom/scripts/post -apply 9119284 ':

Return Code = 0
The local system has been patched and can be restarted.

OPatch succeeded.


2.4 Start the Standby Site database to mount and restart the listener(s).

2.4.1 If the environment is a single instance (non-RAC standby)

Start the listener:

$ lsnrctl start lsnrplb
LSNRCTL for Linux: Version 10.2.0.4.0 - Production on 04-FEB-2010 09:06:02
Copyright (c) 1991, 2007, Oracle. All rights reserved.
Starting /u01/oracle/product/10.2.0/db_plb/bin/tnslsnr: please wait...
TNSLSNR for Linux: Version 10.2.0.4.0 - Production
System parameter file is /u01/oracle/product/10.2.0/db_plb/network/admin/listener.ora
Log messages written to /u01/oracle/product/10.2.0/db_plb/network/log/lsnrplb.log
Listening on: (DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=<host>)(PORT=1666)))
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=<host>)(PORT=1666)))
STATUS of the LISTENER
------------------------
Alias lsnrplb
Version TNSLSNR for Linux: Version 10.2.0.4.0 - Production
Start Date 04-FEB-2010 09:06:02
Uptime 0 days 0 hr. 0 min. 0 sec
Trace Level off
Security ON: Local OS Authentication
SNMP OFF
Listener Parameter File /u01/oracle/product/10.2.0/db_plb/network/admin/listener.ora
Listener Log File /u01/oracle/product/10.2.0/db_plb/network/log/lsnrplb.log
Listening Endpoints Summary...
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=<host>)(PORT=1666)))
Services Summary...
Service "plb_std_DGMGRL" has 1 instance(s).
Instance "plb_std", status UNKNOWN, has 1 handler(s) for this service...
The command completed successfully

Start the database:

$ sqlplus / as sysdba

SQL*Plus: Release 10.2.0.4.0 - Production on Thu Feb 4 15:12:24 2010
Copyright (c) 1982, 2007, Oracle. All Rights Reserved.
Connected to:
Oracle Database 10g Enterprise Edition Release 10.2.0.4.0 - Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options

SQL> startup mount
ORACLE instance started.
Total System Global Area 268435456 bytes
Fixed Size 1266968 bytes
Variable Size 138414824 bytes
Database Buffers 125829120 bytes
Redo Buffers 2924544 bytes
Database mounted.

If the DG broker is in place, change the state to APPLY-OFF via the broker to prevent it from automatically restarting the MRP, for example as shown below.
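
For example (on 10g brokers the equivalent state name is 'LOG-APPLY-OFF'):
DGMGRL> connect /
DGMGRL> edit database plb_std set state='APPLY-OFF';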

2.4.2 If this is a RAC standby

$ srvctl start listener -n <host> -l lsnrplb_<host> 
$ srvctl start listener -n <host> -l lsnrplb_<host> 
$ srvctl start database -d plb_std -o mount

3. Apply the Patch to the Primary.

Please Note: the primary site may also require that Opatch be upgraded in the same way as it was in the Standby Site via Patch 6880880.

Change to the patch directory at the Primary Site and apply the PSU here

$ cd 9119284/
$ pwd
/home/oracle/patches/9119284


3.1 Stop all processes running from the home being patched.

This will include listeners and databases etc.


3.1.1 If this is a single instance Primary (non-RAC)

$ lsnrctl stop lsnrplb

LSNRCTL for Linux: Version 10.2.0.4.0 - Production on 04-FEB-2010 09:06:28
Copyright (c) 1991, 2007, Oracle. All rights reserved.
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=<host>)(PORT=1666)))
The command completed successfully

$ sqlplus / as sysdba

SQL*Plus: Release 10.2.0.4.0 - Production on Thu Feb 4 09:06:36 2010
Copyright (c) 1982, 2007, Oracle. All Rights Reserved.
Connected to:
Oracle Database 10g Enterprise Edition Release 10.2.0.4.0 - Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options

SQL> shutdown immediate
Database closed.
Database dismounted.
ORACLE instance shut down.

SQL> exit
Disconnected from Oracle Database 10g Enterprise Edition Release 10.2.0.4.0 - Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options


3.1.2 For a RAC Primary

$ srvctl stop listener -n <host> -l lsnrplb_<host>
$ srvctl stop listener -n <host> -l lsnrplb_<host>
$ srvctl stop database -d plb

3.2 Apply the patch to the Primary Site binaries.

As per the standby site this example is applying Patchset Update 9119284 to a single instance Primary Site.

$ opatch apply

Invoking OPatch 11.2.0.1.1
Oracle Interim Patch Installer version 11.2.0.1.1
Copyright (c) 2009, Oracle Corporation. All rights reserved.
Oracle Home : /u01/oracle/product/10.2.0/db_plb
Central Inventory : /u01/oraInventory
from : /etc/oraInst.loc
OPatch version : 11.2.0.1.1
OUI version : 10.2.0.4.0
OUI location : /u01/oracle/product/10.2.0/db_plb/oui
Log file location : /u01/oracle/product/10.2.0/db_plb/cfgtoollogs/opatch/opatch2010-02-04_09-10-28AM.log
Patch history file: /u01/oracle/product/10.2.0/db_plb/cfgtoollogs/opatch/opatch_history.txt

ApplySession applying interim patch '9119284' to OH '/u01/oracle/product/10.2.0/db_plb'
Running prerequisite checks...
Provide your email address to be informed of security issues, install and
initiate Oracle Configuration Manager. Easier for you if you use your My
Oracle Support Email address/User Name.
Visit http://www.oracle.com/support/policies.html for details.
Email address/User Name:
You have not provided an email address for notification of security issues.

Do you wish to remain uninformed of security issues ([Y]es, [N]o) [N]: Y

OPatch detected non-cluster Oracle Home from the inventory and will patch the local system only.
Please shutdown Oracle instances running out of this ORACLE_HOME on the local system.
(Oracle Home = '/u01/oracle/product/10.2.0/db_plb')
Is the local system ready for patching? [y|n]
y
User Responded with: Y
Backing up files and inventory (not for auto-rollback) for the Oracle Home
Backing up files affected by the patch '9119284' for restore. This might take a while...
Backing up files affected by the patch '9119284' for rollback. This might take a while...
Execution of 'sh /home/oracle/patches/9119284/custom/scripts/pre -apply 9119284 ':
Return Code = 0
Patching component oracle.rdbms.rsf, 10.2.0.4.0...
Updating archive file "/u01/oracle/product/10.2.0/db_plb/lib/libgeneric10.a" with "lib/libgeneric10.a/qcodfdef.o"
..
.
Updating jar file "/u01/oracle/product/10.2.0/db_plb/rdbms/jlib/qsma.jar" with "/rdbms/jlib/qsma.jar/oracle/qsma/QsmaFileManager.class"
..
.
Running make for target client_sharedlib
Running make for target idgmgrl
Running make for target ioracle
Running make for target client_sharedlib
Running make for target itnslsnr
Running make for target iwrap
Running make for target genplusso
ApplySession adding interim patch '9119284' to inventory
Verifying the update...
Inventory check OK: Patch ID 9119284 is registered in Oracle Home inventory with proper meta-data.
Files check OK: Files from Patch ID 9119284 are present in Oracle Home.
--------------------------------------------------------------------------------
********************************************************************************
********************************************************************************
** ATTENTION **
** **
** Please note that the Patch Set Update Installation (PSU Deinstallation) **
** is not complete until all the Post Installation (Post Deinstallation) **
** instructions noted in the Readme accompanying this PSU, have been **
** successfully completed. **
** **
********************************************************************************
********************************************************************************
--------------------------------------------------------------------------------

Execution of 'sh /home/oracle/patches/9119284/custom/scripts/post -apply 9119284 ':
Return Code = 0
The local system has been patched and can be restarted.

OPatch succeeded.

3.3 Upgrade/Patch the RDBMS and dictionary objects.

In a RAC environment this is performed from one instance only, with the remaining cluster instances shut down. The example below once again shows patch application to a single instance (non-RAC) Primary Site, applying PSU 9119284.

Change to the $ORACLE_HOME/rdbms/admin directory where the patch has been applied and perform the following.

$ cd $ORACLE_HOME/rdbms/admin
$ sqlplus / as sysdba

SQL*Plus: Release 10.2.0.4.0 - Production on Thu Feb 4 09:25:09 2010
Copyright (c) 1982, 2007, Oracle. All Rights Reserved.
Connected to:
Oracle Database 10g Enterprise Edition Release 10.2.0.4.0 - Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options

SQL> startup restrict   ===========> Follow the README to determine the mode in which the database must be opened.

ORACLE instance started.
Total System Global Area 268435456 bytes
Fixed Size 1266968 bytes
Variable Size 138414824 bytes
Database Buffers 125829120 bytes
Redo Buffers 2924544 bytes
Database mounted.
Database opened.

SQL> @catbundle psu apply
PL/SQL procedure successfully completed.
..
.
SQL> COMMIT;
Commit complete.
SQL> SPOOL off
SQL> SET echo off
Check the following log file for errors:
/u01/oracle/product/10.2.0/db_plb/cfgtoollogs/catbundle/catbundle_PSU_PLB_APPLY_2010Feb04_09_25_28.log


 

Note: If this were a normal Patchset you would run:

SQL>@catupgrade

If this were a Critical Patch Update you would run:

SQL>@catcpu


4. Re-establish the Data Guard environment at the Primary Site.

4.1.1 In a Single Instance Primary restart processes running from the patched ORACLE_HOME.

This will include the listener, database instances, ASM instance and any other processes that were previously running from this ORACLE_HOME.

$ lsnrctl start lsnrplb

LSNRCTL for Linux: Version 10.2.0.4.0 - Production on 04-FEB-2010 09:45:43
Copyright (c) 1991, 2007, Oracle. All rights reserved.
Starting /u01/oracle/product/10.2.0/db_plb/bin/tnslsnr: please wait...
TNSLSNR for Linux: Version 10.2.0.4.0 - Production
System parameter file is /u01/oracle/product/10.2.0/db_plb/network/admin/listener.ora
Log messages written to /u01/oracle/product/10.2.0/db_plb/network/log/lsnrplb.log
Listening on: (DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=<host>)(PORT=1666)))
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=<host>)(PORT=1666)))
STATUS of the LISTENER
------------------------
Alias lsnrplb
Version TNSLSNR for Linux: Version 10.2.0.4.0 - Production
Start Date 04-FEB-2010 09:45:43
Uptime 0 days 0 hr. 0 min. 0 sec
Trace Level off
Security ON: Local OS Authentication
SNMP OFF
Listener Parameter File /u01/oracle/product/10.2.0/db_plb/network/admin/listener.ora
Listener Log File /u01/oracle/product/10.2.0/db_plb/network/log/lsnrplb.log
Listening Endpoints Summary...
(DESCRIPTION=(ADDRESS=(PROTOCOL=tcp)(HOST=<host>)(PORT=1666)))
Services Summary...
Service "plb_prm_DGMGRL" has 1 instance(s).
Instance "plb", status UNKNOWN, has 1 handler(s) for this service...
The command completed successfully

4.1.2 If this were a RAC environment, restart the listeners on each node that run from the ORACLE_HOME being patched, using srvctl.

$ srvctl start listener -n <host> -l lsnrplb_<host> 
$ srvctl start listener -n <host> -l lsnrplb_<host>

4.1.3. Force the Primary to register its services with the listener.

$ sqlplus / as sysdba

SQL*Plus: Release 10.2.0.4.0 - Production on Thu Feb 4 09:45:50 2010
Copyright (c) 1982, 2007, Oracle. All Rights Reserved.
Connected to:
Oracle Database 10g Enterprise Edition Release 10.2.0.4.0 - Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options

SQL> alter system register;

System altered.

4.2.1 In a single instance (non-RAC) disable restricted session to allow end user connectivity.

SQL> alter system disable restricted session;
System altered.
SQL> exit

Disconnected from Oracle Database 10g Enterprise Edition Release 10.2.0.4.0 - Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options

4.2.2 In a RAC Primary Site restart the primary database and all its instances

$ srvctl stop database -d plb
$ srvctl start database -d plb

If there are additional RAC services used to connect to this database, these will need to be restarted using srvctl:

$ srvctl start service -d plb

4.3 Re-enable log shipping to the Standby Site.

This allows the RDBMS changes made by running catupgrade/catbundle/catcpu to be applied to the standby.

4.3.1 If you are using a Data Guard Broker configuration

$ dgmgrl

DGMGRL for Linux: Version 10.2.0.4.0 - Production
Copyright (c) 2000, 2005, Oracle. All rights reserved.
Welcome to DGMGRL, type "help" for information.

DGMGRL> connect /
Connected.

DGMGRL> edit database plb_prm set state='ONLINE';
Succeeded.
DGMGRL> exit

4.3.2 If you are using sqlplus, re-enable the archive destination used to ship archives from the Primary Site to the Standby Site:

SQL> alter system set log_archive_dest_state_X=enable scope=both sid='*';

Where X is the number of the destination used for shipping redo to the standby site.
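
For example, assuming the standby destination is number 2 (query v$archive_dest first if you are unsure which destination ships redo to the standby):

SQL> select dest_id, destination, status from v$archive_dest where status <> 'INACTIVE';
SQL> alter system set log_archive_dest_state_2=enable scope=both sid='*';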

5. The Primary will then resume shipping, and media recovery at the Standby Site will apply the changes made by the patch application at the Primary Site.

In the alert log at the Standby you will see the logs generated while running the cat scripts being shipped and applied to the standby:

Thu Feb 4 09:45:57 2010
RFS[1]: Archived Log: '/oradata/flash_recovery_area/PLB_STD/archivelog/2010_02_04/o1_mf_1_79_5pmz52s2_.arc'
RFS[1]: Archived Log: '/oradata/flash_recovery_area/PLB_STD/archivelog/2010_02_04/o1_mf_1_76_5pmz55l4_.arc'
RFS[1]: Archived Log: '/oradata/flash_recovery_area/PLB_STD/archivelog/2010_02_04/o1_mf_1_77_5pmz55oo_.arc'
RFS[1]: Archived Log: '/oradata/flash_recovery_area/PLB_STD/archivelog/2010_02_04/o1_mf_1_78_5pmz561c_.arc'
Thu Feb 4 09:46:24 2010
Media Recovery Log /oradata/flash_recovery_area/PLB_STD/archivelog/2010_02_04/o1_mf_1_76_5pmz55l4_.arc
Media Recovery Log /oradata/flash_recovery_area/PLB_STD/archivelog/2010_02_04/o1_mf_1_77_5pmz55oo_.arc
Media Recovery Log /oradata/flash_recovery_area/PLB_STD/archivelog/2010_02_04/o1_mf_1_78_5pmz561c_.arc
Thu Feb 4 09:46:44 2010
Media Recovery Log /oradata/flash_recovery_area/PLB_STD/archivelog/2010_02_04/o1_mf_1_79_5pmz52s2_.arc
Media Recovery Waiting for thread 1 sequence 80


6. To check that the patch has been applied successfully at the primary and standby sites, perform the following.

In this case it was the PSU Patch 9119284.

Please note: currently, 10.2.0.4 PSUs do not update the header information for tools like sqlplus, nor the version reported by the database. These remain at 10.2.0.4.0 and are NOT updated to, for example, 10.2.0.4.3 after the PSU is applied. You will only be able to see that the PSU has been applied through the inventory (opatch lsinventory) or the registry$history view.

$ opatch lsinventory

Invoking OPatch 11.2.0.1.1
Oracle Interim Patch Installer version 11.2.0.1.1
Copyright (c) 2009, Oracle Corporation. All rights reserved.
Oracle Home : /u01/oracle/product/10.2.0/db_plb
Central Inventory : /u01/oraInventory
from : /etc/oraInst.loc
OPatch version : 11.2.0.1.1
OUI version : 10.2.0.4.0
OUI location : /u01/oracle/product/10.2.0/db_plb/oui
Log file location : /u01/oracle/product/10.2.0/db_plb/cfgtoollogs/opatch/opatch2010-02-04_10-01-18AM.log
Patch history file: /u01/oracle/product/10.2.0/db_plb/cfgtoollogs/opatch/opatch_history.txt
Lsinventory Output file location : /u01/oracle/product/10.2.0/db_plb/cfgtoollogs/opatch/lsinv/lsinventory2010-02-04_10-01-18AM.txt
--------------------------------------------------------------------------------
Installed Top-level Products (2):
Oracle Database 10g 10.2.0.1.0
Oracle Database 10g Release 2 Patch Set 3 10.2.0.4.0
There are 2 products installed in this Oracle Home.
Interim patches (1) :
Patch 9119284 : applied on Thu Feb 04 10:15:51 EST 2010
Unique Patch ID: 11999265
Created on 17 Dec 2009, 03:32:08 hrs PST8PDT
Bugs fixed:
6418420, 7835247, 7207654, 7592346, 6724797, 7936993, 7331867, 9093300
..
.
7523787, 6029179, 5231155, 6455659
--------------------------------------------------------------------------------

OPatch succeeded.


In the case of a PSU, examine the registry$history view.

SQL> select ACTION_TIME,ACTION,NAMESPACE,VERSION,BUNDLE_SERIES,ID from registry$history;
ACTION_TIME
---------------------------------------------------------------------------
ACTION NAMESPACE
------------------------------ ------------------------------
VERSION BUNDLE_SERIES ID
------------------------------ ------------------------------ ----------
04-FEB-10 09.26.25.026334 AM
APPLY SERVER
10.2.0.4 PSU 3



NOTE: If you are using the Data Guard Broker, you should either disable the Data Guard Broker configuration
      DGMGRL> disable configuration;
      or stop the Data Guard Broker (set dg_broker_start=false) during the upgrade.
      You can enable the Data Guard Broker configuration again
      DGMGRL> enable configuration;
      or restart the Data Guard Broker (set dg_broker_start=true) once the upgrade has completed successfully.
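
A minimal sqlplus sketch of the second alternative (run it on each database that has the broker enabled):

SQL> alter system set dg_broker_start=false scope=both;
-- ... perform the upgrade ...
SQL> alter system set dg_broker_start=true scope=both;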
     
The following is the generic procedure for applying a patch set or interim patch when a physical standby database is in place:

1. Log in to the oracle account on both the PRIMARY and STANDBY hosts and make sure the environment is set to the correct ORACLE_HOME and ORACLE_SID.

2. On both the PRIMARY and STANDBY host uncompress and untar the downloaded patch set / interim patch file into a new directory.

3. Shut down the existing Oracle Server instance on the PRIMARY host with immediate priority.
Stop all listeners, agents and other processes running against the ORACLE_HOME. If using Real Application Clusters perform this step on all nodes.
shutdown immediate
% agentctl stop
% lsnrctl stop

4. Cancel managed recovery on the STANDBY database.
recover managed standby database cancel;

5. Shutdown the STANDBY instance on the STANDBY host. Stop all listeners, agents and other processes running against the ORACLE_HOME.
If using Real Application Clusters perform this step on all nodes.
shutdown immediate
% agentctl stop
% lsnrctl stop

6. Run the Installer and install the patchset on both PRIMARY and STANDBY host.
% ./runInstaller
If this is an interim patch, run opatch per the patch README.
If using Real Application Clusters, be sure the install has propagated to the other nodes if using private ORACLE_HOMEs.
Please see the Patch readme for specific instructions.

7. Once the patchset/patch has been installed on all hosts/nodes, start the STANDBY listener on the STANDBY host.
% lsnrctl start

8. Startup nomount the STANDBY database.
% sqlplus "/ as sysdba"
startup nomount

9. Mount the STANDBY database.
alter database mount standby database;

10. Place the STANDBY database in managed recovery mode.
recover managed standby database nodelay disconnect;

11. Startup the PRIMARY instance on the primary host.
% sqlplus "/ as sysdba"
startup migrate

12. Ensure that remote archiving to the STANDBY database is functioning correctly by switching logfiles on the PRIMARY and verifying that v$archive_dest.status is valid.
If you are not performing remote archiving make note of the current archive log sequence.
alter system archive log current;
select dest_id, status from v$archive_dest;

13. On the PRIMARY instance run the following script:
@?/rdbms/admin/catpatch.sql
For the interim patch, run any scripts as outlined in the README.

14. Once the catpatch.sql script / patch SQL scripts complete, make note of the current log sequence and issue the following command:
alter system archive log current;

15. Verify the STANDBY database has been recovered to the log sequence from step 12.
select max(sequence#) from v$log_history;

16. On the PRIMARY instance run the following command:
alter system disable restricted session;

17. Complete the remainder of the "Post Install Actions" from the Patch Set readme on the primary host.
Please note that it is not necessary to shut down the STANDBY in conjunction with the PRIMARY during the "Post Install Actions".

18. Once all "Post Install Actions" have been completed, verify the STANDBY database has been recovered to the last archive log produced by the PRIMARY. On the PRIMARY:
select max(sequence#) from v$archived_log;

On the STANDBY:
select max(sequence#) from v$log_history;



Resolving Problems
After adding a datafile to the primary database, recovery of the standby database fails with the following error messages:
ORA-01157: cannot identify/lock data file 16 - see DBWR trace file
ORA-01110: data file 16: '/oracle/oradata/FGUARD/undotbs02.dbf'
ORA-27037: unable to obtain file status

Problem Explanation:
The datafiles do not exist on the standby database.  
 
Solution Description: 
Create the datafile(s) on the standby database.  When the files exist, recovery can continue.
The datafiles are not automatically created on the standby site when STANDBY_FILE_MANAGEMENT is set to MANUAL; applying the redo alone does not create the new datafile for you.
With the standby started in mount mode, the create datafile command is:
     alter database create datafile '/home/orahome/data/721/users02.dbf';
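
To avoid this happening again, you can set STANDBY_FILE_MANAGEMENT to AUTO on the standby (assuming the datafile paths map cleanly between primary and standby, via db_file_name_convert if needed), so that new datafiles are created automatically during redo apply:

SQL> alter system set standby_file_management=AUTO scope=both;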


Pausing/Starting from PROD
alter system set log_archive_dest_state_2='defer';
alter system set log_archive_dest_state_2='enable';

Getting 'Restarting dead background process QMN0' in the Alert Log File
If you get many of these messages, just perform the following:
alter system set aq_tm_processes=0 scope=both;


Gap Detected
If there is a gap in the archived log files, then you need to perform the following:
1- Copy the archived logs that don't exist on the DR box and register them (see the sketch below)
2- Apply them by using the following command:
SQL> alter database recover managed standby database disconnect from session;
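
A sketch for identifying and registering the missing logs on the standby (the archived log path is illustrative only; repeat the register command for each copied file):

SQL> select thread#, low_sequence#, high_sequence# from v$archive_gap;
SQL> alter database register logfile '/oradata/arch/arch_1_5007.arc';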

If you see errors on the Alert.log file like:
Fetching gap sequence for thread 1, gap sequence 5007-5060
Trying FAL server: PROD_FGUARD
Wed May 31 10:19:41 2006
Failed to request gap sequence. Thread #: 1, gap sequence: 5007-5060
All FAL server has been attempted.
Wed May 31 10:21:28 2006
Restarting dead background process QMN0

Then try with :
RECOVER AUTOMATIC STANDBY DATABASE;
If you get :
ORA-01153: an incompatible media recovery is active

Then stop/restart DR and try the last command again:
startup nomount;
alter database mount standby database;
RECOVER AUTOMATIC STANDBY DATABASE;

After recovery is done, then:
alter database recover managed standby database disconnect from session;


Recovering After a Network Failure
The primary database may eventually stall if the network problem is not fixed in a timely manner, because the primary database will not be able to switch to an online redo log that has not been archived. You can issue the following SQL query to determine whether the primary database stalled because it was not able to switch to an online redo log:
SELECT decode(COUNT(*),0,'NO','YES') "switch_possible"
FROM V$LOG
WHERE ARCHIVED='YES';

If the output from the query displays "Yes," a log switch is possible; if the output displays "No," a log switch is not possible.

The V$ARCHIVE_DEST view contains the network error and identifies which standby database cannot be reached. On the primary database, issue the following SQL statement for the archived log destination that experienced the network failure. For example:
SELECT DEST_ID, STATUS, ERROR FROM V$ARCHIVE_DEST WHERE DEST_ID = 2;

DEST_ID    STATUS     ERROR
---------- ---------  ----------------------------
        2  ERROR      ORA-12224: TNS:no listener

The query results show there are errors archiving to the standby database, and identify the cause of the error as TNS:no listener. You should check whether the listener on the standby site is started. If the listener is stopped, then start it.
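
For example, on the standby host (the listener name lsnrplb comes from the earlier example in this document; substitute your standby listener name):

$ lsnrctl status lsnrplb
$ lsnrctl start lsnrplb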

If you cannot solve the network problem quickly, and if the physical standby database is specified as a mandatory destination, try to prevent the database from stalling by doing one of the following:
# Disable the mandatory archive destination:
ALTER SYSTEM SET LOG_ARCHIVE_DEST_STATE_2 = DEFER;

When the network problem is resolved, you can enable the archive destination again:
ALTER SYSTEM SET LOG_ARCHIVE_DEST_STATE_2 = ENABLE;

# Change the archive destination from mandatory to optional:
ALTER SYSTEM SET LOG_ARCHIVE_DEST_2 = 'SERVICE=standby1 OPTIONAL REOPEN=60';

When the network problem is resolved, you can change the archive destination from optional back to mandatory:
ALTER SYSTEM SET LOG_ARCHIVE_DEST_2 = 'SERVICE=standby1 MANDATORY REOPEN=60';





Synchronize a GAP on the STANDBY when the required archived logs are lost
Scenario:
All archived logs were removed from the primary database. The standby had lagged far behind the primary, many of the archived logs required to close the gap had been removed, and no backup of them was available.
To close the gap you need to create an incremental backup that contains all changes since the last SCN recorded on the standby database.

Implementation Steps

1- Cancel Recovery on Standby
SQL> alter database recover managed standby database cancel;

If you try to recover from the standby you will get:
SQL> recover standby database;
ORA-00279: change 4146871739 generated at 12/31/2008 11:39:03 needed for thread 1
ORA-00289: suggestion : Z:\ORACLE\ORADATA\SATI\ARCHIVE\1_205_674755717.ARC
ORA-00280: change 4146871739 for thread 1 is in sequence #205

2-Check Standby Database current_scn
SQL> select current_scn from v$database;
CURRENT_SCN
-----------
4146871738

3- Create a Primary Database Incremental Backup FROM this SCN and a Control File for Standby
rman target sys/pass@PROD
backup incremental from scn 4146871738 database FORMAT 'Z:\BACKUP\FOR_STANDBY_%U' tag 'FORSTANDBY';
backup current controlfile for standby format 'Z:\BACKUP\FORSTDBYCTRL.bck';

4- Transfer The Incremental Backup Sets to the Standby Server
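
For example, from a Windows command prompt on the primary (the share \\standby_host\backup is hypothetical; any copy method such as robocopy, scp or a shared drive works just as well):

copy Z:\BACKUP\FOR_STANDBY_* \\standby_host\backup\
copy Z:\BACKUP\FORSTDBYCTRL.bck \\standby_host\backup\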

5-Restore controlfile on the Standby
rman target sys/pass@STANDBY
RESTORE STANDBY CONTROLFILE FROM 'Z:\BACKUP\FORSTDBYCTRL.BCK';

6-Catalog the Incremental Backups on The Standby Server
Note that for the catalog command to succeed you will need to move the backups to within the Flash Recovery Area.
When you execute the catalog command, RMAN will ask whether you want to catalog the new files; answer YES.

catalog start with 'Z:\FRA\SATISTD\BACKUPSET';
searching for all files that match the pattern Z:\FRA\SATISTD\BACKUPSET
List of Files Unknown to the Database
=====================================
File Name: Z:\FRA\SATISTD\BACKUPSET\FOR_STANDBY_A7K471DJ_1_1
File Name: Z:\FRA\SATISTD\BACKUPSET\FOR_STANDBY_A8K471DK_1_1
File Name: Z:\FRA\SATISTD\BACKUPSET\FOR_STANDBY_A9K471EF_1_1
File Name: Z:\FRA\SATISTD\BACKUPSET\FOR_STANDBY_AAK471GL_1_1
Do you really want to catalog the above files (enter YES or NO)? yes
cataloging files...
cataloging done

7- Recover the Database and Clean Up the Flashback Logs
RMAN> recover database noredo;

If Flashback Database is enabled on the standby, toggle it off and on so that the flashback logs are recreated for the newly restored standby controlfile, then restart managed recovery:
SQL> alter database flashback off;
Database altered.
SQL> alter database flashback on;
Database altered.
SQL> alter database recover managed standby database disconnect from session;
Database altered.

If more archived logs were created on the primary after the SCN-based incremental backup finished, you can copy them over and recover the standby database using the command: "recover standby database".

8- Enable the broker at both sites and check
When the broker is enabled again it takes over responsibility for managing the configuration and resynchronizes both sites:
SQL> alter system set dg_broker_start=true scope=both;
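
To check, connect with dgmgrl and display the configuration; once the status shows SUCCESS the standby is being managed and resynchronized by the broker again:

$ dgmgrl
DGMGRL> connect /
DGMGRL> show configuration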



Using Flashback with Data Guard
If a logical mistake happens, we can address it with the Flashback techniques introduced in Oracle Database 10g, even in a Data Guard environment.
In the case of “Flashback Table To Timestamp” or “Flashback Table To Before Drop”, there is nothing special to take into account regarding the Standby Database.
It will simply replicate these actions accordingly.
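
For example (a sketch reusing the SCOTT schema from the scenario below; FLASHBACK TABLE ... TO TIMESTAMP requires row movement to be enabled on the table, and TO BEFORE DROP requires the recycle bin):

SQL> alter table scott.dept enable row movement;
SQL> flashback table scott.dept to timestamp systimestamp - interval '15' minute;
SQL> flashback table scott.dept to before drop;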

If we use “Flashback Database” instead, the Standby Database needs special treatment. The following shows how to do that:
DGMGRL> show configuration
Configuration
 Name:                mycf
 Enabled:             YES
 Protection Mode:     MaxAvailability
 Databases:
 prima - Primary database
 physt - Physical standby database
Fast-Start Failover: DISABLED
Current status for "mycf":
SUCCESS

This is an 11g database, but the technique shown works the same way with 10g. Both prima and physt have Flashback Database enabled and are generating flashback logs:
SQL> connect sys/oracle@prima as sysdba
SQL> select database_role,flashback_on from v$database;
DATABASE_ROLE    FLASHBACK_ON
---------------- ------------------
PRIMARY          YES

SQL> connect sys/oracle@physt as sysdba
SQL> select database_role,flashback_on from v$database;
DATABASE_ROLE    FLASHBACK_ON
---------------- ------------------
PHYSICAL STANDBY YES

Now we perform the "error":
SQL> select * from scott.dept;
SQL> drop user scott cascade;

The redo is transmitted with SYNC to the Standby Database and applied there with Real-Time Apply. In other words:
the logical mistake has already reached the Standby Database. We could have configured an apply delay on the standby to address such scenarios, but that is somewhat “old fashioned”; the modern way is to use Flashback. The reason is that, in case of a disaster hitting the Primary Site, an apply delay would cause a longer failover time.
I will now flashback the Primary to get back Scott:
SQL> shutdown immediate
SQL> startup mount
SQL> flashback database to timestamp systimestamp - interval '15' minute;
SQL> alter database open resetlogs;
SQL> select * from scott.dept;

There he is again! Up to this point, nothing was different from a Flashback Database operation without Data Guard. But now my Standby Database is no longer able to do Redo Apply, because it is “in the future of the Primary Database”.
I need to flash it back to a point shortly before the present time of the Primary in order to restart Redo Apply successfully:
DGMGRL> show configuration
Configuration
 Name:                mycf
 Enabled:             YES
 Protection Mode:     MaxAvailability
 Databases:
 prima - Primary database
 physt - Physical standby database
Fast-Start Failover: DISABLED
Current status for "mycf":
Warning: ORA-16607: one or more databases have failed
DGMGRL> show database physt statusreport
STATUS REPORT
 INSTANCE_NAME   SEVERITY ERROR_TEXT
 *      ERROR ORA-16700: the standby database has diverged
                         from the primary database
 *      ERROR ORA-16766: Redo Apply is stopped

Note that the statusreport clause of the show database command is a new feature of 11g. In 10g, you need to look into the broker logfile to find this problem.
SQL> connect sys/oracle@prima as sysdba
SQL> select resetlogs_change# from v$database;
RESETLOGS_CHANGE#
-----------------
 294223

SQL> connect sys/oracle@physt as sysdba
SQL> flashback database to scn 294221;
Flashback complete.

Note: I subtracted 2 from the resetlogs change number above to make sure that we get to an SCN before the primary's resetlogs point, and therefore before the error.

DGMGRL> show configuration
Configuration
 Name:                mycf
 Enabled:             YES
 Protection Mode:     MaxAvailability
 Databases:
 prima - Primary database
 physt - Physical standby database
Fast-Start Failover: DISABLED
Current status for "mycf":
Warning: ORA-16607: one or more databases have failed

DGMGRL> edit database physt set state=apply-on;
Succeeded.

DGMGRL> show configuration
Configuration
 Name:                mycf
 Enabled:             YES
 Protection Mode:     MaxAvailability
 Databases:
 prima - Primary database
 physt - Physical standby database
Fast-Start Failover: DISABLED
Current status for "mycf":
SUCCESS

Simple as that!!!


Monitor Data Guard
select 'Last applied  : ' Logs, to_char(next_time,'DD-MON-YY:HH24:MI:SS') Time 
   from v$archived_log 
   where sequence# = (select max(sequence#) from v$archived_log where applied='YES') 
union 
select 'Last received : ' Logs, to_char(next_time,'DD-MON-YY:HH24:MI:SS') Time 
   from v$archived_log 
  where sequence# = (select max(sequence#) from v$archived_log);    


LOGS             TIME 
---------------- ------------------ 
Last applied  :  16-JUL-09:09:24:16 
Last received :  16-JUL-09:09:28:36 


select NAME Name, VALUE Value, UNIT Unit
   from v$dataguard_stats
union 
select null,null,' ' from dual
union
select null,null,'Time Computed: '||MIN(TIME_COMPUTED)
   from v$dataguard_stats;

NAME                   VALUE                  UNIT 
---------------------- ---------------------- ----------------------------------- 
apply finish time      +00 00:02:07.2         day(2) to second(1) interval 
apply lag              +00 00:01:59           day(2) to second(0) interval 
estimated startup time 16                     second 
standby has been open  N 
transport lag          +00 00:00:00           day(2) to second(0) interval 
 
                                              Time Computed: 16-JUL-2009 09:33:16 

select to_char(max(last_time),'DD-MON-YYYY HH24:MI:SS') "Redo onsite" 
  from v$standby_log;

Redo onsite 
-------------------- 
16-JUL-2009 09:42:44 
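
On the standby, v$managed_standby shows the RFS and MRP background processes and which log sequence is currently being received or applied; it is a useful complement to the queries above:

SQL> select process, status, thread#, sequence#, block#, blocks from v$managed_standby;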



Great Resources

Monitoring a Data Guard Configuration (Doc ID 2064281.1)
https://support.oracle.com/epmos/faces/DocumentDisplay?id=2064281.1

Data Guard Physical Standby Setup Using the Data Guard Broker in Oracle Database 12c Release 1
https://oracle-base.com/articles/12c/data-guard-setup-using-broker-12cr1#enable-broker

Flashback Database
https://oracle-base.com/articles/10g/flashback-10g#flashback_database

Using Flashback in a Data Guard Environment
https://uhesse.com/2010/08/06/using-flashback-in-a-data-guard-environment/

Flashback Recovery
https://tamimdba.wordpress.com/tag/db_flashback_retention_target/