To avoid kernel contention, a minimum of four LUNs (Oracle ASM disks) of equal size and performance is recommended for each disk group.
Do you plan to have more than 4 disk groups? 1 TB HDDs are usually faster than older 500 GB drives.
Oracle Doc 4 Disks Per Diskgroup
For the private network, 10 Gigabit Ethernet is highly recommended; the minimum requirement is 1 Gigabit Ethernet.
Underscores must not be used in a host or domain name, according to RFC 952 (DoD Internet host table specification). The same applies to Net, Host, Gateway, and Domain names.
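The naming rule above can be checked before installation. The following is a minimal sketch, assuming the usual letter-digit-hyphen rule (RFC 952 as relaxed by RFC 1123, which also allows a leading digit); the function name is_valid_hostname is hypothetical and is not part of any Oracle tool.

```python
import re

# One DNS label: letters, digits and hyphens only, no leading or trailing
# hyphen, and therefore no underscore anywhere.
_LABEL = re.compile(r"[A-Za-z0-9]([A-Za-z0-9-]*[A-Za-z0-9])?")


def is_valid_hostname(name: str) -> bool:
    """Return True if every label of the host name follows the rule."""
    if not name or len(name) > 253:
        return False
    labels = name.rstrip(".").split(".")
    return all(len(label) <= 63 and _LABEL.fullmatch(label) is not None
               for label in labels)


if __name__ == "__main__":
    for candidate in ("rac-node1.example.com", "rac_node1.example.com"):
        print(candidate, is_valid_hostname(candidate))
```

Running it against every public, VIP and SCAN name planned for the cluster flags names such as rac_node1 that contain an underscore.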
The VIPs and SCAN VIPs must be on the same subnet as the public interface. For additional information see the Understanding SCAN VIP white paper.
The default gateway must be on the same subnet as the VIPs (including SCAN VIPs) to prevent VIP start/stop/failover issues. With 11gR2 this is detected and reported by the OUI; if the check is ignored, the VIPs will fail to start and the installation itself will fail.
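The subnet requirements for the VIPs, SCAN VIPs and default gateway can be verified with simple address arithmetic. This is a sketch only, using Python's standard ipaddress module; the function name off_subnet and the sample addresses are assumptions made for illustration.

```python
import ipaddress


def off_subnet(public_cidr, addresses):
    """Return the addresses that are NOT inside the public interface's subnet.

    public_cidr is the public interface address with its prefix length,
    for example "192.0.2.11/24".
    """
    network = ipaddress.ip_interface(public_cidr).network
    return [a for a in addresses if ipaddress.ip_address(a) not in network]


if __name__ == "__main__":
    # Hypothetical values: node VIP, default gateway, and a misplaced SCAN VIP.
    print(off_subnet("192.0.2.11/24",
                     ["192.0.2.21", "192.0.2.1", "198.51.100.5"]))
```

An empty result means every VIP, SCAN VIP and the gateway share the public subnet.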
It is recommended that the SCAN name (11gR2 and above) resolve via DNS to a minimum of 3 IP addresses round-robin regardless of the size of the cluster. For additional information see the Understanding SCAN VIP white paper.
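Whether the SCAN name resolves to at least three addresses can be checked from any node. The sketch below uses the standard socket.getaddrinfo call; the helper names are hypothetical, and it assumes a single lookup returns all of the round-robin records, which is the usual DNS behaviour but depends on the resolver configuration.

```python
import socket


def resolved_addresses(name, resolver=socket.getaddrinfo):
    """Return the distinct IP addresses that a name resolves to, sorted."""
    infos = resolver(name, None)
    # Each entry is (family, type, proto, canonname, sockaddr);
    # sockaddr[0] is the IP address as a string.
    return sorted({info[4][0] for info in infos})


def scan_has_enough_addresses(name, minimum=3, resolver=socket.getaddrinfo):
    """Return True if the name resolves to at least `minimum` addresses."""
    return len(resolved_addresses(name, resolver)) >= minimum
```

The resolver parameter exists only so the check can be exercised without a live DNS server; in normal use it is left at its default.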
To avoid name resolution issues, ensure that the HOSTS files and DNS contain both the VIP and the public host names. SCAN must NOT be in the HOSTS file, because the HOSTS file can only represent a 1:1 host-to-IP mapping.
The network interfaces must have the same name on all nodes (e.g. eth1 -> eth1 in support of the VIP and eth2 -> eth2 in support of the private interconnect).
Network Interface Card (NIC) names must not contain a period (".").
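The two interface naming rules above lend themselves to a quick consistency check across nodes. This is an illustrative sketch; the input layout (a dictionary of node name to role-to-interface mapping) and the function name interface_name_problems are assumptions, and the interface names would have to be collected from each node by other means.

```python
def interface_name_problems(nodes):
    """Return a list of naming problems found in the cluster layout.

    nodes maps a node name to a {role: interface_name} dictionary,
    e.g. {"node1": {"public": "eth1", "private": "eth2"}}.
    """
    problems = []
    # Rule 1: NIC names must not contain a period.
    for node, roles in sorted(nodes.items()):
        for role, ifname in sorted(roles.items()):
            if "." in ifname:
                problems.append(f"{node}: {role} interface '{ifname}' contains '.'")
    # Rule 2: each role must use the same interface name on every node.
    all_roles = sorted({role for roles in nodes.values() for role in roles})
    for role in all_roles:
        names = {roles.get(role) for roles in nodes.values()}
        if len(names) > 1:
            problems.append(f"{role}: interface name differs across nodes")
    return problems
```

An empty list means the layout satisfies both rules.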
Using Jumbo Frames for the private interconnect is a recommended best practice for enhanced performance of cache fusion operations. Reference: Document 341788.1
Use non-routable network addresses for the private interconnect: Class A: 10.0.0.0 to 10.255.255.255, Class B: 172.16.0.0 to 172.31.255.255, Class C: 192.168.0.0 to 192.168.255.255. Refer to RFC 1918 and Document 338924.1 for additional information.
Make sure network interfaces are configured correctly in terms of speed, duplex, etc. Various tools exist to monitor and test the network: ethtool, iperf, netperf, spray and tcp. See Document 563566.1.
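The three non-routable ranges listed above correspond to the CIDR blocks 10.0.0.0/8, 172.16.0.0/12 and 192.168.0.0/16. A minimal sketch of a membership test follows; the function name is_rfc1918 is hypothetical. The ranges are listed explicitly rather than relying on ipaddress's is_private attribute, which also covers other reserved blocks.

```python
import ipaddress

# The three private address blocks defined in RFC 1918.
RFC1918_NETWORKS = [
    ipaddress.ip_network("10.0.0.0/8"),       # 10.0.0.0 - 10.255.255.255
    ipaddress.ip_network("172.16.0.0/12"),    # 172.16.0.0 - 172.31.255.255
    ipaddress.ip_network("192.168.0.0/16"),   # 192.168.0.0 - 192.168.255.255
]


def is_rfc1918(address):
    """Return True if the IPv4 address lies in one of the RFC 1918 blocks."""
    ip = ipaddress.ip_address(address)
    return any(ip in network for network in RFC1918_NETWORKS)
```

For example, is_rfc1918("172.31.255.255") is true while is_rfc1918("172.32.0.1") is false, since the second range ends at 172.31.255.255.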
To prevent the public network or the private interconnect from becoming a single point of failure, Oracle highly recommends configuring a redundant set of public network interface cards (NICs) and private interconnect NICs on each cluster node. See Document 787420.1. Starting with 11.2.0.2, Oracle Grid Infrastructure can provide redundancy and load balancing for the private interconnect (NOT the public network); this is the preferred method of NIC redundancy for full 11.2.0.2 stacks (the 11.2.0.2 Database must be used). More information can be found in Document 1210883.1.
NOTE: If using the 11.2.0.2 Redundant Interconnect/HAIP feature, at present it is REQUIRED that all interconnect interfaces be placed on separate subnets. If the interfaces are all on the same subnet and the cable is pulled from the first NIC in the routing table, a rebootless restart or node reboot will occur. See Document 1481481.1 for a technical description of this requirement.
For more predictable hardware discovery, place HBA and NIC cards in the same corresponding slot on each server in the Grid.
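The separate-subnet requirement for HAIP interconnect interfaces can be verified by grouping the interfaces by their network. This is a sketch under assumed inputs: the dictionary of interface name to "address/prefix" and the function name shared_subnets are hypothetical.

```python
import ipaddress
from collections import defaultdict


def shared_subnets(interfaces):
    """Return {subnet: [interface names]} for subnets used by more than one NIC.

    interfaces maps an interface name to its address with prefix length,
    e.g. {"eth2": "192.168.10.1/24", "eth3": "192.168.11.1/24"}.
    """
    by_subnet = defaultdict(list)
    for name, cidr in sorted(interfaces.items()):
        subnet = str(ipaddress.ip_interface(cidr).network)
        by_subnet[subnet].append(name)
    return {subnet: names for subnet, names in by_subnet.items()
            if len(names) > 1}
```

An empty dictionary means every interconnect interface sits on its own subnet; any entry names the interfaces that would need to be moved.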
The use of a switch (or redundant switches) is required for the private network (crossover cables are NOT supported).
Dedicated redundant switches are highly recommended for the private interconnect, because deploying the private interconnect on a shared switch (even when using a VLAN) may expose the interconnect links to congestion and instability in the larger IP network topology. If deploying the interconnect on a VLAN, there should be a 1:1 mapping of VLAN to non-routable subnet, and the interconnect should not span multiple VLANs (tagged) or multiple switches. Deployment concerns in this environment include Spanning Tree loops when the larger IP network topology changes, asymmetric routing that may cause packet flooding, and lack of fine-grained monitoring of the VLAN/port. Reference Bug 9761210.
If deploying the cluster interconnect on a VLAN, review the considerations in the Oracle RAC and Clusterware Interconnect Virtual Local Area Networks (VLANs) white paper.
Consider using InfiniBand on the interconnect for workloads that have high volume requirements. InfiniBand can also improve performance by lowering latency. When InfiniBand is in place, the RDS protocol can be used to further reduce latency. See Document 751343.1 for additional details.
In 12.1.0.1, IPv6 is supported for the public network, but IPv4 must be used for the private network. Starting with 12.2.0.1, IPv6 is fully supported for both the public and private interfaces. Please see the Oracle Database IPv6 State of Direction white paper for details.
For Grid Infrastructure 11.2.0.2, multicast traffic must be allowed on the private network for the 230.0.1.0 subnet. Patch 9974223 (included in GI PSU 11.2.0.2.1 and above) for Oracle Grid Infrastructure 11.2.0.2 enables multicasting on the 224.0.0.251 multicast address on the private network. Multicast must be allowed on the private network for one of these two addresses (assuming the patch has been applied). Additional information, as well as a program to test multicast functionality, is provided in Document 1212703.1.
(Doc ID 1367153.1)
The ocssd.log file shows that the node rebooted because it cannot access a majority of voting disks.
Solution: Fix the problem with the voting disk. Make sure that voting disks are available and accessible by user oracle or grid or any user who owns CRS or GI HOME. If the voting disk is not in ASM, use "dd if= of=/dev/null bs=1024 count=10240" to test the accessibility.
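The dd test above reads 10240 blocks of 1024 bytes and discards them, which only proves the device can be opened and read by the current user. A rough Python equivalent is sketched below for cases where a scripted check is preferred; the function name read_test is hypothetical, and the path must be supplied by the caller, exactly as with dd.

```python
def read_test(path, block_size=1024, count=10240):
    """Read up to `count` blocks of `block_size` bytes and discard them.

    Returns the number of bytes read. Raises OSError (for example
    PermissionError or FileNotFoundError) if the device cannot be
    opened or read by the current user.
    """
    total = 0
    # buffering=0 gives unbuffered reads directly against the file or device.
    with open(path, "rb", buffering=0) as device:
        for _ in range(count):
            chunk = device.read(block_size)
            if not chunk:  # end of file or device reached
                break
            total += len(chunk)
    return total
```

Run it as the user who owns the CRS or GI home; an exception, or a byte count lower than expected, points to the same accessibility problem the dd command would reveal.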
The ocssd.log of the surviving node shows a member kill request escalated to a node kill request.
Cause: Starting with 11.1, the inability to evict a database or ASM instance at the database level means that CRS gets involved and tries to kill the problem instance. This is a member kill request. If CRS cannot kill the problem instance, then CRS reboots the node because the member kill request is escalated to a node kill request.
A good reference for the VIPs involved in RAC: RAC VIP Concepts
If you are using Oracle RAC, no matter how many nodes you have, you need to know where the log files are located.
The Cluster Ready Services Daemon (crsd) Log Files
Log files for the CRSD process (crsd) can be found in the following directories:
Oracle Cluster Registry (OCR) Log Files
The Oracle Cluster Registry (OCR) records log information in the following location:
Cluster Synchronization Services (CSS) Log Files
You can find CSS information that the OCSSD generates in log files in the following locations:
Event Manager (EVM) Log Files
Event Manager (EVM) information generated by evmd is recorded in log files in the following locations:
RACG Log Files
The Oracle RAC high availability trace files are located in the following two locations:
Core files are in the sub-directories of the log directories. Each RACG executable has a sub-directory assigned exclusively for that executable. The name of the RACG executable sub-directory is the same as the name of the executable.
The table below lists the locations of the log files:
Oracle Clusterware log files
Cluster Ready Services Daemon (crsd) Log Files:
Cluster Synchronization Services (CSS):
Event Manager (EVM) information generated by evmd:
Oracle RAC RACG:
Oracle RAC 11g Release 2 log files
Clusterware alert log:
Disk Monitor daemon:
OCRDUMP, OCRCHECK, OCRCONFIG, CRSCTL:
Cluster Time Synchronization Service:
Grid Interprocess Communication daemon:
Oracle High Availability Services daemon:
Cluster Ready Services daemon:
Grid Plug and Play daemon:
Multicast Domain Name Service daemon:
Event Manager daemon:
RAC RACG (only used if pre-11.1 database is installed):
Cluster Synchronization Service daemon:
HA Service Daemon Agent:
HA Service Daemon CSS Agent:
HA Service Daemon ocssd Monitor Agent:
HA Service Daemon Oracle Root Agent:
CRS Daemon Oracle Agent:
CRS Daemon Oracle Root Agent:
Grid Naming Service daemon:
/oracle - binaries for the database software 50GB - 100GB (enough for the current binaries as well as upgrade or download if necessary)
/oracle_crs - binaries for the grid infrastructure 50GB - 100GB (enough for the current binaries as well as upgrade or download if necessary)
/ora01 - at least 100GB for each server in the cluster, mounted to all servers
These are GoldenGate named databases
These are ACFS based mounts