Hello guys,
For a start, we have several SUSE server installed with Oracle as DB with ASM. Our SUSE server is configured with multipath for disk but yesterday, we faced a problem during our system test where having an where we having a few of disk offline (view from multipath -ll) making the Oracle DB goes down. Here some result from multipath -ll:
3600507680c80034690000000000000d9 dm-21 IBM,2145
size=250G features=‘1 queue_if_no_path’ hwhandler=‘0’ wp=rw
-+- policy='service-time 0' prio=10 status=active |- 3:0:2:54 sdm 8:192 failed faulty offline <<---- one of the disk down |- 15:0:0:54 sdbc 67:96 active ready running |- 9:0:0:54 sdah 66:16 active ready running
- 21:0:3:54 sdbx 68:176 active ready running
3600507680c80034690000000000000d8 dm-20 IBM,2145
size=250G features=‘1 queue_if_no_path’ hwhandler=‘0’ wp=rw
-+- policy='service-time 0' prio=50 status=active |- 2:0:1:53 sdl 8:176 failed faulty offline <<---- one of the disk down |- 14:0:0:53 sdbb 67:80 active ready running |- 8:0:3:53 sdag 66:0 active ready running
- 20:0:2:53 sdbw 68:160 active ready running
3600507680c80034690000000000000e0 dm-28 IBM,2145
size=150G features=‘1 queue_if_no_path’ hwhandler=‘0’ wp=rw
-+- policy='service-time 0' prio=50 status=active |- 4:0:1:61 sdt 65:48 failed faulty offline <<---- one of the disk down |- 16:0:3:61 sdbj 67:208 active ready running |- 10:0:1:61 sdao 66:128 active ready running
- 22:0:2:61 sdce 69:32 active ready running
3600507680c80034690000000000000cf dm-11 IBM,2145
size=5.0G features=‘1 queue_if_no_path’ hwhandler=‘0’ wp=rw
-+- policy='service-time 0' prio=10 status=active |- 0:0:2:44 sdc 8:32 failed faulty offline <<---- one of the disk down |- 12:0:2:44 sdas 66:192 active ready running |- 6:0:1:44 sdx 65:112 active ready running
- 18:0:2:44 sdbn 68:16 active ready running
from “Warn” message:
Aug 10 15:31:41 xxx822 multipathd: 3600507680c8003469000000000000088: sda - tur checker reports path is down
Aug 10 15:31:41 xxx822 multipathd: 3600507680c8003469000000000000086: sdc - tur checker reports path is down
Aug 10 15:31:41 xxx822 multipathd: 3600507680c8003469000000000000092: sdp - tur checker reports path is down
Aug 10 15:31:41 xxx822 kernel: sd 0:0:3:15: rejecting I/O to offline device
Aug 10 15:31:41 xxx822 kernel: sd 0:0:3:17: rejecting I/O to offline device
Aug 10 15:31:41 xxx822 kernel: sd 0:0:3:16: rejecting I/O to offline device
Aug 10 15:31:41 xxx822 kernel: sd 3:0:0:20: rejecting I/O to offline device
Aug 10 15:31:41 xxx822 kernel: sd 1:0:0:27: rejecting I/O to offline device
Aug 10 15:31:41 xxx822 kernel: sd 0:0:3:15: rejecting I/O to offline device
What happened with the other path that configured? Why does the SUSE not picking up the other path and run as usual?
We hope we can solve this problem.