I installed ComicCafe on OMV and had been running it continuously for about a month without any trouble, when today OMV suddenly sent me the following error mail. (Three of them.)
This is an automatically generated mail message from mdadm
running on OMV-NODE804

A Fail event had been detected on md device /dev/md0.

It could be related to component device /dev/sdh.

Faithfully yours, etc.

P.S. The /proc/mdstat file currently contains the following:

Personalities : [raid6] [raid5] [raid4]
md0 : active raid6 sdc[0] sdl[9](F) sdg[8](F) sdk[7] sdf[6] sdm[5] sdj[4] sdi[3] sdh[2](F) sdd[1]
      23441080320 blocks super 1.2 level 6, 512k chunk, algorithm 2 [10/7] [UU_UUUUU__]

unused devices: <none>
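The /proc/mdstat snapshot in a mail like this can also be read mechanically. The following is only a minimal sketch: the function names failed_members and missing_count are made up for illustration, and it assumes the usual mdstat layout in which failed members carry an (F) suffix and the status line contains a field such as [10/7] (expected members / members currently up).

```shell
# Sketch only: both helper names are hypothetical, not part of mdadm or OMV.
# Each function reads mdstat-style text on stdin.

# Print the name of every member flagged (F), one per line.
failed_members() {
  grep -o '[a-z0-9]\{1,\}\[[0-9]\{1,\}\](F)' | sed 's/\[.*//'
}

# Print how many members are missing, taken from the first "[total/up]" field.
missing_count() {
  awk 'match($0, /\[[0-9]+\/[0-9]+\]/) {
         split(substr($0, RSTART + 1, RLENGTH - 2), n, "/")
         print n[1] - n[2]
         exit
       }'
}
```

On a live system these would be fed with "failed_members < /proc/mdstat" and "missing_count < /proc/mdstat". RAID6 keeps working with at most two members missing, so a result above 2 from missing_count means the array can no longer stay up.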
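(F) presumably means Fail, so three HDDs have failed??? Seriously... RAID6 only survives two failed members, so if three drives have really died, that's the end, isn't it... At the same time, SMART error mails had arrived as well. (Two of them.)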
This email was generated by the smartd daemon running on:

   host name: OMV-NODE804
  DNS domain: local
  NIS domain: (none)

The following warning/error was logged by the smartd daemon:

Device: /dev/disk/by-id/ata-WDC_WD30EZRX-00D8PB0_WD-WCC4N3KS2AFV [SAT], unable to open device

For details see host's SYSLOG.

You can also use the smartctl utility for further investigation.
Another email message will be sent in 24 hours if the problem persists.

Googling "unable to open device" suggested that trouble with cabling or the like is more likely than an actual HDD failure. I was out at the time, so I went straight home and started on recovery.
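First I tried a reboot, but after it came back up the RAID was not recognized... The devices reported in the mails were four, sdg, sdk, sdl and sdh, and checking their serial numbers against the notes from when I built the machine, they do not seem to be four drives hanging off the same cable. Thinking this was bad, I checked the state of the RAID with the following command: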
cat /proc/mdstat
Personalities : [raid6] [raid5] [raid4]
md0 : inactive sdc[0](S) sdg[9](S) sdh[8](S) sdl[7](S) sdi[6](S) sdf[5](S) sdj[4](S) sdm[3](S) sdk[2](S) sdd[1](S)
      29301355120 blocks super 1.2

unused devices: <none>
Huh!? For some reason every drive is now marked (S)... Looking it up, the S apparently stands for spare. I dug into the situation further with another command.
mdadm -E /dev/sd[cdfgihjklm]
/dev/sdc:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : d1ec112e:6d291ee3:25845efa:d6957742
  Update Time : Sat Dec 19 22:22:17 2015
  Checksum : b585aa10 - correct
  Events : 131
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 0
  Array State : AA.AAAAA.. ('A' == active, '.' == missing)
/dev/sdd:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : 5740654e:ae81ac6a:eec7e781:af1c6a60
  Update Time : Sat Dec 19 22:22:17 2015
  Checksum : 2e3207a - correct
  Events : 131
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 1
  Array State : AA.AAAAA.. ('A' == active, '.' == missing)
/dev/sdf:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : 92bf4726:5304522d:6db5ab24:f8368453
  Update Time : Sat Dec 19 22:22:17 2015
  Checksum : 33492a25 - correct
  Events : 131
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 5
  Array State : AA.AAAAA.. ('A' == active, '.' == missing)
/dev/sdg:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : 33585a1d:96046870:fb9f752f:3b49e3dd
  Update Time : Sat Dec 19 17:29:20 2015
  Checksum : 2a37b3a - correct
  Events : 122
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 9
  Array State : AAAAAAAAAA ('A' == active, '.' == missing)
/dev/sdh:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : c0cdad1d:12c70ac3:bee5bd83:1ace06ca
  Update Time : Sat Dec 19 17:29:20 2015
  Checksum : 96057de4 - correct
  Events : 122
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 8
  Array State : AAAAAAAAAA ('A' == active, '.' == missing)
/dev/sdi:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : e05b06ac:92271d9d:e2e73405:afbefa7d
  Update Time : Sat Dec 19 22:22:17 2015
  Checksum : 33d2a3e0 - correct
  Events : 131
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 6
  Array State : AA.AAAAA.. ('A' == active, '.' == missing)
/dev/sdj:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : 879cda62:7469e943:8dd7e491:da943dec
  Update Time : Sat Dec 19 22:22:17 2015
  Checksum : 8c65ec3d - correct
  Events : 131
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 4
  Array State : AA.AAAAA.. ('A' == active, '.' == missing)
/dev/sdk:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : 264a91d5:2c04c3e4:6a5f8654:64d23e6d
  Update Time : Sat Dec 19 17:29:20 2015
  Checksum : e3a1b554 - correct
  Events : 122
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 2
  Array State : AAAAAAAAAA ('A' == active, '.' == missing)
/dev/sdl:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : 277d4802:06ad5ffa:7d9907de:70522c67
  Update Time : Sat Dec 19 22:22:17 2015
  Checksum : a95b8ff8 - correct
  Events : 131
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 7
  Array State : AA.AAAAA.. ('A' == active, '.' == missing)
/dev/sdm:
  Magic : a92b4efc
  Version : 1.2
  Feature Map : 0x0
  Array UUID : 8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
  Name : OMV-NODE804:RAID6 (local to host OMV-NODE804)
  Creation Time : Sun Sep 27 00:41:43 2015
  Raid Level : raid6
  Raid Devices : 10
  Avail Dev Size : 5860271024 (2794.40 GiB 3000.46 GB)
  Array Size : 23441080320 (22355.16 GiB 24003.67 GB)
  Used Dev Size : 5860270080 (2794.39 GiB 3000.46 GB)
  Data Offset : 262144 sectors
  Super Offset : 8 sectors
  State : clean
  Device UUID : 7b8bfeed:db4b8436:0fafa8da:f59b317a
  Update Time : Sat Dec 19 22:22:17 2015
  Checksum : e0dc9c34 - correct
  Events : 131
  Layout : left-symmetric
  Chunk Size : 512K
  Device Role : Active device 3
  Array State : AA.AAAAA.. ('A' == active, '.' == missing)
At a glance, nothing looks wrong at all. Incidentally, when a disk really is broken you apparently get a message along the lines of "mdadm: No md superblock detected on /dev/sda."
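One field that is easy to miss in an mdadm -E dump is the Events counter, which mdadm uses to judge which members are up to date. In the dump above, sdg, sdh and sdk stopped at 122 while the other seven are at 131. The sketch below pulls that comparison out of the text. It is only an illustration: the function names member_events and stale_members are invented here, and it assumes the usual mdadm -E layout where each member starts with a "/dev/...:" header at the beginning of a line.

```shell
# Sketch only: helper names are hypothetical. Input is `mdadm -E` text on stdin.

# Print "device events role" for each member found in the dump.
member_events() {
  awk '
    /^\/dev\//                     { dev = $1; sub(/:$/, "", dev) }
    $1 == "Events"                 { events[dev] = $3 }
    $1 == "Device" && $2 == "Role" { print dev, events[dev], $NF }
  '
}

# Print "device events" for members whose counter is below the highest one seen.
stale_members() {
  member_events | awk '
    { dev[NR] = $1; ev[NR] = $2 + 0; if (ev[NR] > max) max = ev[NR] }
    END { for (i = 1; i <= NR; i++) if (ev[i] < max) print dev[i], ev[i] }
  '
}
```

On the real machine this would be run as root, for example "mdadm -E /dev/sd[cdfgihjklm] | stale_members". Members that lag behind are the ones a plain assemble treats as out of date, which is the situation the --force option of assemble mode is meant for.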
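Just to be safe I also checked the SMART status and so on, but found nothing abnormal. Fortunately a complete backup should still be sitting on the previous HDDs, so I decided to try forcibly reassembling the RAID.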
mdadm --assemble --scan --force
The command finished in an instant, and the RAID came back safely.
cat /proc/mdstat
Personalities : [raid6] [raid5] [raid4]
md0 : active raid6 sdc[0] sdg[9] sdh[8] sdl[7] sdi[6] sdf[5] sdj[4] sdm[3] sdk[2] sdd[1]
      23441080320 blocks super 1.2 level 6, 512k chunk, algorithm 2 [10/10] [UUUUUUUUUU]

cat /etc/mdadm/mdadm.conf
# mdadm.conf
#
# Please refer to mdadm.conf(5) for information about this file.
#

# by default, scan all partitions (/proc/partitions) for MD superblocks.
# alternatively, specify devices to scan, using wildcards if desired.
# Note, if no DEVICE line is present, then "DEVICE partitions" is assumed.
# To avoid the auto-assembly of RAID devices a pattern that CAN'T match is
# used if no RAID devices are configured.
DEVICE partitions

# auto-create devices with Debian standard permissions
CREATE owner=root group=disk mode=0660 auto=yes

# automatically tag new arrays as belonging to the local system
HOMEHOST <system>

# definitions of existing MD arrays
ARRAY /dev/md0 metadata=1.2 name=OMV-NODE804:RAID6 UUID=8002b7d3:4e1bb8d3:66b534b1:e7e9a90d
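After a forced assemble like this, the [10/10] [UUUUUUUUUU] part of /proc/mdstat is what shows that every member is back. As a rough sketch, that check could be scripted as below. The function name all_members_up is hypothetical, and the only assumption is that each running array prints a status field made up of U (up) and _ (missing) characters.

```shell
# Sketch only: hypothetical helper. Reads mdstat-style text on stdin.
# Exit status 0 only if at least one [UU...] status field is present
# and none of the status fields contains "_".
all_members_up() {
  awk '
    match($0, /\[[U_]+\]/) {
      seen = 1
      if (index(substr($0, RSTART, RLENGTH), "_") > 0) bad = 1
    }
    END { exit ((seen && !bad) ? 0 : 1) }
  '
}
```

Usage would be something like "all_members_up < /proc/mdstat && echo OK". An inactive array prints no status field at all, so it is reported as not healthy too.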
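Lately I have been doing nothing but CalcNote implementation work: the UPS I bought is still sitting unused, and I have not applied a single OMV update. This made me realize I need to set aside time and put more effort into running the NAS as well. Still, I am really glad the data was not lost.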