[Case Study] IBM SVC Node Internal Disk Replacement Procedure



I. Fault Symptoms


Monitoring reported a device fault. The failed SVC node stopped serving I/O, and the cause was still to be confirmed.


II. Problem Confirmation



1. Log in to the management GUI and check the error messages

image002.png


image003.png


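The check confirmed that the node's internal disk had failed after a long period of operation under heavy read/write I/O.

2. Confirm the failed node's details

I/O group and node: io_grp1, lvgsvc4

SVC version: 6.4.1.6

Machine type and model: 2145-CF8

Failed disk: PN 42D0673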

image004(1).jpg


image005.png


image006(1).jpg


3. Confirm the device's physical location

Log in to the device from the command line and confirm the details:

image007(1).jpg


image008(1).jpg


image009(1).jpg


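Once the details are confirmed, prepare the replacement disk and replace it according to the procedure (keeping extra spare disks on hand is recommended in case of further damage).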
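The node check above can also be scripted. The following is a minimal sketch, assuming the output of the SVC CLI `lsnode -delim ,` command has been saved as text. The sample rows (including the partner node name `lvgsvc3`), the reduced column set, and the helper names are illustrative assumptions, not output captured from this case.

```python
import csv
import io


def parse_delimited(text, delim=","):
    """Parse header-plus-rows CLI output into a list of dicts."""
    reader = csv.DictReader(io.StringIO(text.strip()), delimiter=delim)
    return [dict(row) for row in reader]


def find_offline_nodes(rows):
    """Return the rows whose status column is anything other than 'online'."""
    return [r for r in rows if r.get("status", "").lower() != "online"]


# Hypothetical sample; a real lsnode listing has more columns.
SAMPLE = """
id,name,status,IO_group_name
1,lvgsvc3,online,io_grp1
2,lvgsvc4,offline,io_grp1
"""

if __name__ == "__main__":
    for node in find_offline_nodes(parse_delimited(SAMPLE)):
        print(node["name"], node["IO_group_name"], node["status"])
```

A comma is used as the delimiter here because some fields in a real listing, such as iSCSI names, can themselves contain colons.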


III. Replacement Procedure



1. Make sure that the node cover is in place and fully closed.

   Enter maintenance mode: power off the node.

image010(1).jpg

2. Touch the static-protective package that contains the drive to any unpainted metal surface on the node; then, remove the drive from the package and place it on a static-protective surface.

3. Make sure that the disk-drive handle is in the open (unlocked) position.

image011.png

4. Align the drive assembly with the guide rails in drive bay 4 for SAN Volume Controller 2145-CF8 nodes.

image012.png

image013.png

5. Gently push the drive assembly into the bay until the drive stops.

6. Install the service controller.


image014.png


7. Make sure that all cables, adapters, and other components are installed and seated correctly and that you have not left loose tools or parts inside the node. Make sure that all internal cables are correctly routed. If you disconnected the Fibre Channel and Ethernet cables, make sure that each cable is reconnected to the same port from which it was removed.

8. Turn on the node. When you turn on the node, use the node rescue procedure to install the SAN Volume Controller software on the new disk.

   Completing the node rescue when the node boots

1)  Turn off the node.

2)  Press and hold the left and right buttons on the front panel.

3)  Press the power button.

4)  Continue to hold the left and right buttons until the node-rescue-request symbol is displayed on the front panel.

Results

Figure 1. Node rescue display

image015.gif


The node rescue request symbol displays on the front panel display until the node starts to boot from the service controller. If the node rescue request symbol displays for more than two minutes, go to the hardware boot MAP to resolve the problem. When the node rescue starts, the service display shows the progress or failure of the node rescue operation.

9. Add the node back into the cluster.

10. Log in to the management console and close the event.

Run the fix procedure:

image016.png


image017.png


After confirming that the node disk replacement is complete, click OK. The device alert is then cleared.

image018.png
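Step 9 above adds the node back into the cluster, which can be done from the command line as well as from the GUI. The following is a minimal sketch that only assembles the command text, assuming the `svctask addnode` syntax of the SVC 6.x CLI (`-panelname`, `-name`, `-iogrp`). The panel name `000000` is a placeholder rather than a value from this case, and the exact syntax should be checked against the CLI reference for the installed code level before use.

```python
import shlex


def build_addnode_command(panel_name, io_group, node_name=None):
    """Build an `svctask addnode` command line as a list of arguments."""
    if not panel_name or not io_group:
        raise ValueError("panel_name and io_group are required")
    args = ["svctask", "addnode", "-panelname", panel_name]
    if node_name:
        # Optional: give the node a name when it is added.
        args += ["-name", node_name]
    args += ["-iogrp", io_group]
    return args


if __name__ == "__main__":
    # "000000" is a placeholder panel name, not taken from this case.
    cmd = build_addnode_command("000000", "io_grp1", node_name="lvgsvc4")
    print(shlex.join(cmd))
```

Building the command as an argument list and quoting it with `shlex.join` keeps the string safe to paste into an SSH session.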



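IV. Summary


When replacing an SVC node's local disk, pay attention to the following three points:

1. Confirm which node holds the failed disk and where it is located.

2. Know how to start node rescue mode.

3. Watch the communication links during recovery. If the process stalls for more than 2 minutes, check the communication links, including the FC links and the device's internal links.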
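The two-minute rule in point 3 can be written down as a simple decision function. This is a minimal sketch under stated assumptions: the `RescueObservation` type, its fields, and the returned action strings are hypothetical names introduced for illustration, and the elapsed time is assumed to be measured by the operator.

```python
from dataclasses import dataclass

STALL_LIMIT_SECONDS = 120  # the two-minute threshold from the procedure


@dataclass
class RescueObservation:
    seconds_in_current_state: float  # time since the display last changed
    progressing: bool                # True once rescue progress is shown


def next_action(obs: RescueObservation) -> str:
    """Decide whether to keep waiting or to start checking the links."""
    if obs.progressing:
        return "wait"
    if obs.seconds_in_current_state > STALL_LIMIT_SECONDS:
        return "check FC and internal links, then follow the hardware boot MAP"
    return "wait"


if __name__ == "__main__":
    print(next_action(RescueObservation(30, False)))
    print(next_action(RescueObservation(150, False)))
```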


For more information, visit the official Antute website: www.antute.com.cn
