I have a semi-test server running the setup from the subject line. Overnight it fell apart:
Aug 4 04:39:53 NAS kernel: ahcich0: Timeout on slot 31 port 0
Aug 4 04:39:53 NAS kernel: ahcich0: is 00000000 cs 00000000 ss 80000001 rs 80000001 tfd 40 serr 00000000 cmd 0000c017
Aug 4 04:39:53 NAS kernel: (ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 10 3b fa 40 c2 01 00 00 00 00
Aug 4 04:39:53 NAS kernel: (ada0:ahcich0:0:0:0): CAM status: Command timeout
Aug 4 04:39:53 NAS kernel: (ada0:ahcich0:0:0:0): Retrying command
Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096
Aug 4 04:43:02 NAS kernel: ahcich0: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096
Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096
Aug 4 04:43:02 NAS kernel: ahcich0: Timeout on slot 1 port 0
Aug 4 04:43:02 NAS kernel: ahcich0: is 00000000 cs 00000002 ss 00000000 rs 00000002 tfd 80 serr 00000000 cmd 0000c117
Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout
Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): Retrying command
Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096
Aug 4 04:43:02 NAS kernel: ahcich0: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096
Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096
Aug 4 04:43:02 NAS kernel: ahcich0: Timeout on slot 2 port 0
Aug 4 04:43:02 NAS kernel: ahcich0: is 00000000 cs 00000004 ss 00000000 rs 00000004 tfd 80 serr 00000000 cmd 0000c217
Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout
Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): Error 5, Retries exhausted
Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096
Aug 4 04:43:02 NAS kernel: ahcich0: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096
Aug 4 04:43:02 NAS kernel: swap_pager: indefinite wait buffer: bufobj: 0, blkno: 139501, size: 4096
Aug 4 04:43:02 NAS kernel: ahcich0: Timeout on slot 3 port 0
Aug 4 04:43:02 NAS kernel: ahcich0: is 00000000 cs 00000008 ss 00000000 rs 00000008 tfd 80 serr 00000000 cmd 0000c317
Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 00 00 00
Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): CAM status: Command timeout
Aug 4 04:43:02 NAS kernel: (aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
Aug 4 04:43:02 NAS kernel: ada0 at ahcich0 bus 0 scbus0 target 0 lun 0
Aug 4 04:43:02 NAS kernel: ada0: <WDC WD40EFRX-68N32N0 82.00A82> s/n WD-WCC7K2UANUAZ detached
Aug 4 04:43:02 NAS kernel: swap_pager: I/O error - pagein failed; blkno 139501,size 4096, error 6
Aug 4 04:43:02 NAS kernel: vm_fault: pager read error, pid 329 (devd)
Aug 4 04:43:02 NAS kernel: swap_pager: I/O error - pagein failed; blkno 175717,size 4096, error 6
Aug 4 04:43:02 NAS kernel: vm_fault: pager read error, pid 329 (devd)
Aug 4 04:43:02 NAS kernel: swap_pager: I/O error - pagein failed; blkno 175717,size 4096, error 6
and then about 200 MB of logs in which that last line just keeps repeating. Long story short, it turned out a disk simply dropped off; whether the disk died or the motherboard is flaky, no idea yet. The server rebooted and reported that the setup from the subject is degraded.
Nothing terrible, it seems: if the disk is dead, put in another one, clone the gpart partitioning from the old disk to the new one, copy the EFI partition by hand, write the boot code, set up swap on one of the partitions, and so on. Question 1: how can this be done without all that hellish manual labor? :)
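For reference, the manual routine I mean looks roughly like the sketch below. It only prints the commands (a dry run) and does not touch any disk. The function name plan_replace is made up for illustration, and the partition layout is an assumption, not something read off this box: p1 = efi, p2 = freebsd-boot, p3 = freebsd-swap, with /boot/gptzfsboot assumed as the partition boot code. Adjust the indices and the boot code to whatever `gpart show` reports on the surviving disk.

```shell
#!/bin/sh
# Dry-run sketch: prints the replacement commands, executes nothing.
# ASSUMED layout (hypothetical): p1 = efi, p2 = freebsd-boot, p3 = freebsd-swap.
# ASSUMED boot code: /boot/gptzfsboot (use /boot/gptboot for a UFS root).

plan_replace() {
    src="$1"   # surviving disk whose layout is cloned, e.g. ada1
    dst="$2"   # new blank disk, e.g. ada0

    # Clone the GPT layout; -F wipes any existing table on the target.
    echo "gpart backup ${src} | gpart restore -F ${dst}"
    # Copy the EFI system partition block for block.
    echo "dd if=/dev/${src}p1 of=/dev/${dst}p1 bs=1m"
    # Write the protective MBR and the boot code into partition 2.
    echo "gpart bootcode -b /boot/pmbr -p /boot/gptzfsboot -i 2 ${dst}"
    # Bring the swap partition on the new disk online.
    echo "swapon /dev/${dst}p3"
}
```

Calling `plan_replace ada1 ada0` prints the four commands for review; once they look right for the actual layout, the output can be piped to `sh`. Re-adding the new disk to the degraded setup itself is deliberately left out here.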