Project

General

Profile

Feature #27700

Generate warning when a scrub is paused

Added by Joshua Sirrine over 1 year ago. Updated about 1 year ago.

Status:
Done
Priority:
No priority
Assignee:
Vladimir Vinogradenko
Category:
Middleware
Target version:
Estimated time:
(Total: 0.00 h)
Severity:
New
Reason for Closing:
Reason for Blocked:
Needs QA:
No
Needs Doc:
No
Needs Merging:
No
Needs Automation:
No
Support Suite Ticket:
n/a
Hardware Configuration:

Related projects 1 project

Description

Starting with 11.1, scrubs can be paused with the "zpool scrub -p <poolname>". TrueNAS customers using this may pause a scrub indefinitely, which would be bad for obvious reasons.

For this reason, we need to have a warning generated in the WebGUI when a zpool scrub is paused, along with ensuring the associated 3AM daily email reminds them of this condition to prevent a customer from pausing a scrub for extremely long periods of time (months?).

If possible, this would be great to add to 11.1 since it has not been released to TrueNAS yet.

Side notes: Historically, TrueNAS customers have been really bad about scheduling scrubs, performing them, etc and I can totally see customers pausing a scrub and forgetting about it for far longer than what would be a "best practice". Years ago we started generating a default scrub schedule because customers were often not scrubbing their zpools ever. This is just a natural extension of "make sure scrubs are performed regularly".


Subtasks

Feature #30468: Add pause attribute to py-libzfs scrub stateDoneWilliam Grzybowski

Associated revisions

Revision 761c0df5 (diff)
Added by Vladimir Vinogradenko over 1 year ago

feat(alert): More alert schedules (not only "run each X minutes")

Ticket: #27700

Revision 38e22a88 (diff)
Added by Vladimir Vinogradenko over 1 year ago

feat(alert): Generate warning when a scrub is paused.

Ticket: #27700

Revision 529d9d9c (diff)
Added by Vladimir Vinogradenko over 1 year ago

feat(alert): Generate warning when a scrub is paused.

Ticket: #27700

Revision 6bc0f1da (diff)
Added by Vladimir Vinogradenko over 1 year ago

feat(alert): Generate warning when a scrub is paused.

Ticket: #27700

Revision aac47fab (diff)
Added by Dru Lavigne about 1 year ago

Note that a paused scrub will trigger an alert.
Ticket: #27700

Revision b798fbf0 (diff)
Added by Dru Lavigne about 1 year ago

Note that a paused scrub will trigger an alert.
Ticket: #27700

History

#1 Updated by Dru Lavigne over 1 year ago

  • 1 added project (FreeNAS)
  • Status changed from Untriaged to 50
  • Target version set to 11.2-BETA1

Ash: as per discussion at 11.1 training meeting, can you get a concensus from support what this alert should look like (eg yellow flashing light when scrub is paused which isn't removed until scrub restarts, or an alert that is displayed X hours/days after puase, or something else)? We can then pass to William to implement.

#2 Updated by Ash Gokhale over 1 year ago

Will comply; support team, Is a yellow light sufficient to indicate paused scrubs are pending? Do we need something more insistent?

#3 Updated by Joshua Sirrine over 1 year ago

The way we discussed in the meeting it was envisioned a yellow warning light in the WebGUI with the daily warnings emails passing this on as well at 3AM. Warning comes up when a zpool scrub is paused, and warning clears itself when the scrub is in progress.

Thanks.

#4 Updated by Ash Gokhale over 1 year ago

  • Status changed from 50 to 51

#5 Updated by Dru Lavigne over 1 year ago

  • Status changed from 51 to Unscreened
  • Assignee changed from Ash Gokhale to William Grzybowski

William: please load balance.

#6 Updated by William Grzybowski over 1 year ago

  • Assignee changed from William Grzybowski to Vladimir Vinogradenko

#7 Updated by Dru Lavigne over 1 year ago

  • Status changed from Unscreened to Not Started

#8 Updated by Dru Lavigne over 1 year ago

  • 1 deleted project (FreeNAS)
  • Project changed from TrueNAS to FreeNAS
  • Category changed from Middleware to Middleware
  • Hide from ChangeLog deleted (No)
  • Support Department Priority deleted (0)

#9 Updated by Vladimir Vinogradenko over 1 year ago

  • Status changed from Not Started to In Progress

#10 Updated by Vladimir Vinogradenko over 1 year ago

  • Status changed from In Progress to Done

#11 Updated by Dru Lavigne over 1 year ago

  • Subject changed from Generate warning when a scrub is paused. to Generate warning when a scrub is paused
  • Needs Merging changed from Yes to No

#12 Updated by Dru Lavigne about 1 year ago

  • Status changed from Done to Ready for Testing

#13 Updated by Bonnie Follweiler about 1 year ago

18520
18523

I am in the process of testing this

I started the scrub, the pause it. In Scrub Status:
Scrub

Status: Unknown
I will leave it like this to see if I get an email at 3 am and if I get a warning that it is paused.

Is there a way for the Scrub Status to say "Paused"?

#14 Updated by Bonnie Follweiler about 1 year ago

18535

In the New GU, it doesn't indicate that it is paused.

#15 Updated by Bonnie Follweiler about 1 year ago

18544
18547
18550
18553

Test Failed in FreeNAS-11.2-MASTER-201806130421
There was no email sent. zpool status shows the scrub paused but I didn't get an email nor did the alert "light" change to yellow in either the legacy or new UI

#16 Updated by William Grzybowski about 1 year ago

Bonnie Follweiler wrote:

Test Failed in FreeNAS-11.2-MASTER-201806130421
There was no email sent. zpool status shows the scrub paused but I didn't get an email nor did the alert "light" change to yellow in either the legacy or new UI

The alert is only supposed to run at 3am.

#17 Updated by Vladimir Vinogradenko about 1 year ago

William, we've adjusted system clock, there really is a problem, investigating

#18 Updated by Bonnie Follweiler about 1 year ago

  • Status changed from Failed Testing to Passed Testing

Passed Test in FreeNAS-11.2-MASTER-201806130421

#19 Updated by Bonnie Follweiler about 1 year ago

  • Needs QA changed from Yes to No

#20 Updated by Dru Lavigne about 1 year ago

#21 Updated by Dru Lavigne about 1 year ago

  • Status changed from Passed Testing to Done
  • Needs Doc changed from Yes to No

Also available in: Atom PDF