Supercluster App Domain RAC setup not able to start after ZFS storage rebooted.

11gR2 Grid Infrastructure on Solaris local containers, cssd process fails to start with following symptoms:

** root.sh output

CRS-2674: Start of 'ora.cssd' on 'racnode1' failed
..
CRS-5804: Communication error with agent process
CRS-4000: Command Start failed, or completed with errors.
The exlusive mode cluster start failed, see Clusterware alert log for more information
Initial cluster configuration failed.  See /oracle_crs/app/11.2.0/grid/cfgtoollogs/crsconfig/rootcrs_racnode1.log for details

** $GRID_HOME/cfgtoollogs/crsconfig/rootcrs_<nodename>.log

2017-03-09 22:33:38: CRS-2674: Start of 'ora.cssd' on 'racnode1' failed
..
2017-03-09 22:33:38: CRS-5804: Communication error with agent process
2017-03-09 22:33:38: CRS-4000: Command Start failed, or completed with errors.
2017-03-09 22:33:38: CSS startup failed with return code 1
..
2017-03-09 22:33:51: The exlusive mode cluster start failed, see Grid Infrastructure alert log for more

** $GRID_HOME/log/<nodename>/cssd/ocssd.log

2017-03-09 22:33:30.467: [    CSSD][1]clssscSetPrivEnv: Setting priority to 4
2017-03-09 22:33:30.491: [    CSSD][1]clssscSetPrivEnv: unable to set priority to 4
2017-03-09 22:33:30.491: [    CSSD][1]SLOS: cat=-2, opn=scls_set_priority_realtime, dep=1, loc=setsched
unable to escalate to real time
..
2017-03-09 22:33:30.491: [    CSSD][1](:CSSSC00011:)clssscExit: A fatal error occurred during initialization




Cause

Oracle Clusterware requires additional privileges that are not provided by default.

see below Oracle White Paper for :

RUNNING ORACLEƂ® REAL APPLICATION CLUSTERS ON ORACLE SOLARIS ZONE CLUSTERS

http://www.oracle.com/technetwork/articles/servers-storage-admin/o11-062-rac-solariszonescluster-429206.pdf


Solution

Halt local container, execute the following on global zone and restart the local container:

<local-container-name> =============> Zone Name , you can see "zoneadm list -civ

zonecfg -z <local-container-name>
set limitpriv="default,proc_priocntl,proc_clock_highres,sys_time"
commit
exit
zoneadm -z <local-container-name> reboot

Once local container is restarted, refer to note 942166.1 or note 969254.1 to proceed.



To list Privilege Set, execute "ppriv -l <zone>" in either global or local zone.


Ref:

Grid Infrastructure cssd fails to start on Solaris local containers (zones) (Doc ID 1340694.1)

Comments

Popular posts from this blog

Bare Metal Restore Procedure for Compute Nodes on an Exadata Environment

[INS-40718] Single Client Access Name (SCAN): could not be resolved. ( LDOMS & Zones)