Skip to main content

6.5 Example of Operations for Replacing a System Board


6.5 Example of Operations for Replacing a System Board
This section describes an example of the operations for replacing a system board (PSB) as shown in Figure 6-10.
Figure 6-10  Example of Operations for Replacing a System Board
Figure 6-10  Example of Operations for Replacing a System Board
The operation for replacing a PSB comprises the combined operations of PSB release and incorporation. When the dynamic reconfiguration operation is not possible due to the operation status of the physical partition, replace the PSB after powering off the physical partition.
Note - Suppose that the PPAR DR function of a physical partition is disabled, and the physical partition is operating. In such a case, you cannot release a PSB by specifying the -c disconnect or -c unassign option, or incorporate a PSB by specifying the -c configure option.
The operation example here assumes that the physical partition allows dynamic reconfiguration. The operation example of the SPARC M10-4S presented here uses XSCF shell commands and the ldm command of Logical Domains Manager, which is management software of Oracle VM Server for SPARC. The same operations are also used for the SPARC M12-2S.

  1. Log in to the XSCF.
  2. Execute the showbbstatus command to confirm that the XSCF in the PSB to be replaced is not the master XSCF.
Note - If the XSCF of the PSB to be replaced is the master XSCF, execute the switchscf command to switch the XSCF from the active state to the standby state. Confirm that the XSCF has been switched and rebooted before you release the PSB.
  1. Connect to the control domain console of the physical partition.
  2. Execute the ldm list-domain command to check the operation status of the logical domain.
  1. The following example shows that the control domain, two guest domains, and one root domain are operating.
# ldm list-domain
NAME STATE FLAGS CONS VCPU MEMORY UTIL UPTIME
primary active -n-cv- UART 14 8G 64% 2h 54m
guest0 active -n---- 5000 16 8G 42% 2h 54m
guest1 active -n---- 5001 16 8G 11% 2h 54m
domain01 active -n---- 5002 16 8G 7.3% 2h 54m
  1. Execute the ldm list-devices command with the -a option specified to check the resource usage status.
  1. In the following example, the -a option is specified to display all resources bound to the logical domain and all resources that are not bound.
# ldm list-devices -a
CORE
ID %FREE CPUSET
0 0 (0, 1)
4 0 (8, 9)
8 0 (16, 17)
(Omitted)
184 0 (368, 369)
188 100 (376, 377)
512 100 (1024, 1025)
516 100 (1032, 1033)
520 100 (1040, 1041)
524 100 (1048, 1049)
(Omitted)
VCPU
PID %FREE PM
0 0 no
1 0 no
8 0 no
9 0 no
(Omitted)
369 0 no
376 100 ---
377 100 ---
1024 100 ---
1025 100 ---
1032 100 ---
1033 100 ---
(Omitted)
  1. Execute the ldm list-io command to check the usage status of the I/O devices.
    In the following example, primary, which is a control domain, and domain01, which is a root domain, use I/O devices.
# ldm list-io
NAME TYPE BUS DOMAIN STATUS
---- ---- --- ------ ------

PCIE0 BUS PCIE0 primary IOV
PCIE1 BUS PCIE1 primary IOV
PCIE2 BUS PCIE2 primary IOV
PCIE3 BUS PCIE3 primary IOV
PCIE8 BUS PCIE8 domain01 IOV
PCIE9 BUS PCIE9 domain01 IOV
PCIE10 BUS PCIE10 domain01 IOV
PCIE11 BUS PCIE11 domain01 IOV
/BB0/CMUL/NET0 PCIE PCIE0 primary OCC
/BB0/CMUL/NET2 PCIE PCIE0 primary OCC
/BB0/CMUL/SASHBA PCIE PCIE0 primary OCC
/BB0/PCI0 PCIE PCIE1 primary EMP
/BB0/PCI1 PCIE PCIE1 primary EMP
(Omitted)
  1. Among the services provided by the root domain, release all I/O devices on the PSB to be deleted.
  1. In the following example, domain01, which is a root domain, is stopped, and of the PCIe root complexes owned by domain01, those related to the PSB to be deleted are released.
Note - To reconfigure dynamically, the ldm stop-domain command to stop the root domain is not required. Start operation from the I/O device release.
# ldm stop-domain domain01
# ldm remove-io PCIE11 domain01
# ldm remove-io PCIE10 domain01
# ldm remove-io PCIE9 domain01
# ldm remove-io PCIE8 domain01
# ldm unbind-domain domain01
  1. Execute the ldm list-io command again to check the usage status of the I/O devices.
# ldm list-io
NAME TYPE BUS DOMAIN STATUS
---- ---- --- ------ ------

PCIE0 BUS PCIE0 primary IOV
PCIE1 BUS PCIE1 primary IOV
PCIE2 BUS PCIE2 primary IOV
PCIE3 BUS PCIE3 primary IOV
PCIE8 BUS PCIE8
PCIE9 BUS PCIE9
PCIE10 BUS PCIE10
PCIE11 BUS PCIE11
/BB0/CMUL/NET0 PCIE PCIE0 primary OCC
/BB0/CMUL/NET2 PCIE PCIE0 primary OCC
/BB0/CMUL/SASHBA PCIE PCIE0 primary OCC
/BB0/PCI0 PCIE PCIE1 primary EMP
/BB0/PCI1 PCIE PCIE1 primary EMP
(Omitted)
  1. Execute the deleteboard command with the -c disconnect option specified to release the PSB from the physical partition.
  1. In the following example, PSB 01-0 is released from the physical partition and assigned to physical partition 0.
XSCF> deleteboard -c disconnect 01-0
  1. Execute the showresult command to check the exit status of the deleteboard command just executed.
  1. In the following example, exit status of 0 is returned, indicating normal termination of the deleteboard command.
XSCF> showresult
0
  1. Execute the showboards command to check the PSB status.
XSCF> showboards -p 0
PSB PPAR-ID(LSB) Assignment Pwr Conn Conf Test Fault
---- ------------ ----------- ---- ---- ---- ------- --------

00-0 00(00) Assigned y y y Passed Normal
01-0 00(01) Assigned n n n Passed Normal
  1. Execute the replacefru command to replace the PSB.
XSCF> replacefru
Note - For details on PSB replacement using the replacefru command, see the Service Manual for your server.
  1. Execute the showboards command to check the PSB status.
XSCF> showboards -p 0
PSB PPAR-ID(LSB) Assignment Pwr Conn Conf Test Fault
---- ------------ ----------- ---- ---- ---- ------- --------

00-0 00(00) Assigned y y y Passed Normal
01-0 00(01) Assigned n n n Passed Normal
  1. Execute the addboard command with the -c configure option specified to incorporate the PSB into the physical partition.
  1. In the following example, PSB 01-0 is incorporated into physical partition 0.
XSCF> addboard -c configure -p 0 01-0
Note - If you omit the -m option or specify bind=resource for the -m option when you execute the addboard command, the resource assignment may not be returned to its state before the deleteboard command was executed.
If the amount of the resources of the PSB such as CPU and memory before replacement is different from that after replacement, you cannot return the resource assignment to its original state.
Resources that cannot not be returned to their original assignment become free. In such cases, use the ldm command to assign resources to the logical domain again.
  1. Execute the showresult command to check the exit status of the addboard command just executed.
  1. In the following example, exit status of 0 is returned, indicating normal termination of the addboard command.
XSCF> showresult
0
  1. Execute the showboards command to check the PSB status.
XSCF> showboards -p 0
PSB PPAR-ID(LSB) Assignment Pwr Conn Conf Test Fault
---- ------------ ----------- ---- ---- ---- ------- --------

00-0 00(00) Assigned y y y Passed Normal
01-0 00(01) Assigned y y y Passed Normal
  1. Execute the ldm list-domain command on the control domain console of the physical partition to confirm that the operation status of the logical domain has not changed after the addition of the PSB.
  2. Start the root domain to resume its service.