Bug #26797

Fix error when replacing a failed disk

Added by Patrick M. Hausen about 1 year ago. Updated about 1 year ago.

Status: Resolved
Priority: Critical
Assignee: William Grzybowski
Category: OS
Target version:
Seen in:
Severity: New
Reason for Closing:
Reason for Blocked:
Needs QA: No
Needs Doc: Yes
Needs Merging: Yes
Needs Automation: No
Support Suite Ticket: n/a
Hardware Configuration:
ChangeLog Required: No

Description

Hey, guys!

When one of my disks repeatedly reported SMART errors, I decided to replace all of them.
These are the steps I tried:

1. Offline one of the disks in my pool. This works; please see the screenshot taken after the offline.
2. Insert the new disk into the system.
3. Try to replace the disk. This gives an error message; please see the two screenshots.
4. Try the same thing a second time. This gives a warning that there is a corrupt GPT table. After checking the "force" option, the replacement works. See the last screenshot for the action taken.

Kind regards,
Patrick
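
For readers following along outside the GUI, the steps reported above can be sketched as a command sequence. This is a minimal sketch and not the FreeNAS middleware's actual code path: the function name, the pool name "tank", and the device names "ada4p2" and "ada4" are all hypothetical, and the sketch assumes the whole new disk is handed to ZFS, whereas the GUI normally partitions the disk first. It only builds the argument lists and does not run anything.

```python
def manual_replace_commands(pool, old_member, new_disk, wipe_partition_table=False):
    """Build (but do not run) the commands for a manual disk swap.

    All names passed in are placeholders chosen by the caller.
    """
    # Step 1: take the failing member offline before pulling the disk.
    commands = [["zpool", "offline", pool, old_member]]

    if wipe_partition_table:
        # Optional: discard any existing (possibly corrupt) partition
        # table on the NEW disk. This is destructive.
        commands.append(["gpart", "destroy", "-F", new_disk])

    # Final step: ask ZFS to resilver onto the new disk.
    commands.append(["zpool", "replace", pool, old_member, new_disk])
    return commands


# Hypothetical pool and device names, for illustration only.
for argv in manual_replace_commands("tank", "ada4p2", "ada4",
                                    wipe_partition_table=True):
    print(" ".join(argv))
```

Returning argument lists instead of executing them keeps the sketch safe to run anywhere and makes the order of operations easy to review before anything touches a disk.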

Replace-2nd-attempt.png (54.1 KB) Patrick M. Hausen, 11/21/2017 05:08 PM
Replace-Error-2.png (186 KB) Patrick M. Hausen, 11/21/2017 05:08 PM
Replace-Error-1.png (337 KB) Patrick M. Hausen, 11/21/2017 05:08 PM
State-after-offline.png (96.3 KB) Patrick M. Hausen, 11/21/2017 05:08 PM

Associated revisions

Revision ba13c6af (diff)
Added by William Grzybowski about 1 year ago

fix(notifier): revert piece of code removed by mistake in c5cc98f3

Ticket: #26797

Revision 986ee48f (diff)
Added by William Grzybowski about 1 year ago

fix(notifier): revert piece of code removed by mistake in c5cc98f3

Ticket: #26797
(cherry picked from commit ba13c6af01b7ad1e3de5b9c73f9765730ecbebcb)

Revision 626a996a (diff)
Added by William Grzybowski about 1 year ago

fix(notifier): revert piece of code removed by mistake in c5cc98f3

Ticket: #26797
(cherry picked from commit ba13c6af01b7ad1e3de5b9c73f9765730ecbebcb)

Revision 4891a564 (diff)
Added by William Grzybowski about 1 year ago

fix(notifier): revert piece of code removed by mistake in c5cc98f3

Ticket: #26797

History

#1 Updated by Sean Fagan about 1 year ago

This looks like a middleware issue to me: is it not handling the case where it has to destroy the partition table and then create a new one?

#2 Updated by Patrick M. Hausen about 1 year ago

Some more info:

1. The replacement disks are factory new and empty. When I insert them, this is all I get:

ada4 at ahcich4 bus 0 scbus4 target 0 lun 0
ada4: <WDC WD2002FYPS-02W3B0 04.01G01> s/n WD-WCAVY5604577 detached
(ada4:ahcich4:0:0:0): Periph destroyed
ada4 at ahcich4 bus 0 scbus4 target 0 lun 0
ada4: <TOSHIBA HDWQ140 FJ1M> ATA8-ACS SATA 3.x device
ada4: Serial Number 677FK0OUFPBE
ada4: 300.000MB/s transfers (SATA 2.x, UDMA5, PIO 8192bytes)
ada4: Command Queueing enabled
ada4: 3815447MB (7814037168 512 byte sectors)

2. On the second attempt to replace the disk, this is additionally logged:

GEOM: ada4: the primary GPT table is corrupt or invalid.
GEOM: ada4: using the secondary instead -- recovery strongly advised.
GEOM_MIRROR: Device mirror/swap0 launched (2/2).
GEOM_ELI: Device mirror/swap0.eli created.
GEOM_ELI: Encryption: AES-XTS 128
GEOM_ELI:     Crypto: hardware

I have replaced all 4 disks now, but I have one spare on the shelf, so I could do another exchange to gather more debugging information if needed.

Kind regards,
Patrick
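
The GEOM warning quoted in the comment above is a useful marker when reproducing this issue. As a minimal sketch, the snippet below scans log text for that exact message and reports the affected disks. The function name and the regular expression are illustrative assumptions and not part of FreeNAS; the sample lines are copied from the log excerpt in this ticket.

```python
import re

# Matches the kernel message quoted in this ticket and captures the disk name.
GPT_WARNING = re.compile(
    r"^GEOM: (?P<disk>\w+): the primary GPT table is corrupt or invalid\."
)


def disks_with_corrupt_primary_gpt(log_text):
    """Return disk names, in first-seen order, that logged the GPT warning."""
    disks = []
    for line in log_text.splitlines():
        match = GPT_WARNING.match(line.strip())
        if match and match.group("disk") not in disks:
            disks.append(match.group("disk"))
    return disks


sample = """\
ada4: Command Queueing enabled
GEOM: ada4: the primary GPT table is corrupt or invalid.
GEOM: ada4: using the secondary instead -- recovery strongly advised.
GEOM_MIRROR: Device mirror/swap0 launched (2/2).
"""

print(disks_with_corrupt_primary_gpt(sample))  # prints ['ada4']
```

Running this against the messages captured after the first replace attempt would show whether the warning is already present before the second attempt.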

#3 Updated by Dru Lavigne about 1 year ago

  • Assignee changed from Release Council to William Grzybowski

#4 Updated by William Grzybowski about 1 year ago

  • Status changed from Unscreened to Screened
  • Priority changed from No priority to Critical
  • Target version set to 11.1

#5 Updated by William Grzybowski about 1 year ago

  • Status changed from Screened to Ready For Release
  • Target version changed from 11.1 to 11.1-RC2

#6 Updated by Dru Lavigne about 1 year ago

  • Subject changed from "11.1-RC1 - replacing a failed disk requires a second attempt" to "Fix error when replacing a failed disk"

#7 Updated by Bonnie Follweiler about 1 year ago

  • Needs QA changed from Yes to No
  • QA Status Test Passes FreeNAS added
  • QA Status deleted (Not Tested)

#8 Updated by Dru Lavigne about 1 year ago

  • Target version changed from 11.1-RC2 to 11.1-RC3

#9 Updated by Dru Lavigne about 1 year ago

  • Status changed from Ready For Release to Resolved
