sge_shadowd

Language: en

Version: $Date: 2007/11/08 23:04:23 $ (fedora - 06/07/09)

Section: 8 (Administrator commands)

NAME

ge_shadowd - Grid Engine shadow master daemon

SYNOPSIS

ge_shadowd

DESCRIPTION

ge_shadowd is a lightweight process that can be run on so-called shadow master hosts in a Grid Engine cluster to detect failure of the current Grid Engine master daemon and to start up a new master daemon on the host on which ge_shadowd runs. If multiple shadow daemons are active in a cluster, they run a protocol which ensures that only one of them will start up a new master daemon.

Hosts suitable for use as shadow master hosts must have shared root read/write access to the directory $GE_ROOT/$GE_CELL/common as well as to the master daemon spool directory (by default $GE_ROOT/$GE_CELL/spool/qmaster). The names of the shadow master hosts must be listed in the file $GE_ROOT/$GE_CELL/common/shadow_masters.
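
As a rough illustration of the last requirement, the following Python sketch checks whether a host name appears in the contents of a shadow_masters file. It assumes the file lists one host name per line and that names compare case-insensitively; neither detail is stated above, and the function name is hypothetical.

```python
def is_shadow_master(hostname: str, shadow_masters_text: str) -> bool:
    """Return True if hostname appears in the shadow_masters contents.

    Assumptions (not taken from this page): one host name per line,
    blank lines ignored, comparison is case-insensitive.
    """
    wanted = hostname.strip().lower()
    listed = {
        line.strip().lower()
        for line in shadow_masters_text.splitlines()
        if line.strip()  # skip blank lines
    }
    return wanted in listed
```

In practice the text would be read from the shadow_masters file in the cell's common directory before being passed to the function.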

RESTRICTIONS

ge_shadowd may only be started by root.

ENVIRONMENT VARIABLES

GE_ROOT
Specifies the location of the Grid Engine standard configuration files.
GE_CELL
If set, specifies the default Grid Engine cell. To address a Grid Engine cell, ge_shadowd uses (in order of precedence):
The name of the cell specified in the environment variable GE_CELL, if it is set.
The name of the default cell, i.e. default.
GE_DEBUG_LEVEL
If set, specifies that debug information should be written to stderr. The value also determines the level of detail of the generated debug information.
GE_QMASTER_PORT
If set, specifies the TCP port on which the Grid Engine master daemon is expected to listen for communication requests. Most installations will instead use a services map entry for the service "sge_qmaster" to define that port.
GE_DELAY_TIME
This variable controls how long ge_shadowd pauses if a takeover bid fails. This value is used only when there are multiple ge_shadowd instances contending to become the master. The default is 600 seconds.
GE_CHECK_INTERVAL
This variable controls the interval at which ge_shadowd checks the heartbeat file (60 seconds by default).
GE_GET_ACTIVE_INTERVAL
This variable controls the interval after which a ge_shadowd instance tries to take over when the heartbeat file has not changed.
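
The cell precedence and the documented defaults can be summarised in a short Python sketch. The class and function names are hypothetical, and parsing the interval variables as whole seconds is an assumption. Because no default for GE_GET_ACTIVE_INTERVAL is given above, the sketch leaves it unset rather than guessing one.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class ShadowdSettings:
    cell: str
    delay_time: int                      # seconds to pause after a failed takeover bid
    check_interval: int                  # seconds between heartbeat file checks
    get_active_interval: Optional[int]   # no default stated on this page


def _int_or(env: Mapping[str, str], name: str, default: Optional[int]) -> Optional[int]:
    """Parse env[name] as an integer, falling back to default if unset or empty."""
    raw = env.get(name)
    return int(raw) if raw else default


def load_settings(env: Mapping[str, str] = os.environ) -> ShadowdSettings:
    """Collect the settings described above from an environment mapping."""
    return ShadowdSettings(
        # GE_CELL wins if set; otherwise the default cell named "default".
        cell=env.get("GE_CELL") or "default",
        delay_time=_int_or(env, "GE_DELAY_TIME", 600),
        check_interval=_int_or(env, "GE_CHECK_INTERVAL", 60),
        get_active_interval=_int_or(env, "GE_GET_ACTIVE_INTERVAL", None),
    )
```

Passing a plain dictionary instead of os.environ makes it easy to check a planned configuration before exporting the variables for the daemon.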

FILES

 
 <ge_root>/<cell>/common
                Default configuration directory
 <ge_root>/<cell>/common/shadow_masters
                Shadow master hostname file.
 <ge_root>/<cell>/spool/qmaster
                Default master daemon spool directory
 <ge_root>/<cell>/spool/qmaster/heartbeat
                The heartbeat file.
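
The layout above can be expressed as a small Python helper that builds the four paths from a root directory and a cell name. This is only a sketch: the function name and dictionary keys are hypothetical, and it covers only the default spool location, which an installation may have changed.

```python
from pathlib import PurePosixPath
from typing import Dict


def shadowd_paths(ge_root: str, cell: str = "default") -> Dict[str, PurePosixPath]:
    """Build the default file locations listed above for one cell."""
    common = PurePosixPath(ge_root) / cell / "common"
    qmaster_spool = PurePosixPath(ge_root) / cell / "spool" / "qmaster"
    return {
        "common": common,
        "shadow_masters": common / "shadow_masters",
        "qmaster_spool": qmaster_spool,
        "heartbeat": qmaster_spool / "heartbeat",
    }
```

For example, shadowd_paths("/opt/ge") places the heartbeat file at /opt/ge/default/spool/qmaster/heartbeat, where /opt/ge is just an illustrative root directory.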
 

SEE ALSO

Grid Engine Installation and Administration Guide. See for a full statement of rights and permissions.