Project

General

Profile

Bug #77106

Fix SMART service startup on systems with lots of disks

Added by David Johnson almost 3 years ago. Updated over 2 years ago.

Status:
Ready for Testing
Priority:
No priority
Assignee:
Vladimir Vinogradenko
Category:
Middleware
Target version:
Seen in:
Severity:
New
Reason for Closing:
Reason for Blocked:
Needs QA:
No
Needs Doc:
No
Needs Merging:
No
Needs Automation:
No
Support Suite Ticket:
n/a
Hardware Configuration:
ChangeLog Required:
No

Description

During some disk swapping the S.M.A.R.T. service required being turned back on. When I navigate to the services screen and opt to turn the service on after a short ~5 second wait a pop-up box appears notifying me that the S.M.A.R.T. service failed to start. An alert appears on the upper right informing me of such. If I then navigate to a different screen and back again the service is unexpectedly listed as running, despite it previously saying it failed to start. Looking at the processes confirms it is running.

  1. ps aux | grep smartd
    root 8393 0.0 0.0 10500 6004 - I Fri18 0:00.45 smartd

Errors from /var/log/debug.log are found below

smartd_daemon alert.png (26.2 KB) smartd_daemon alert.png David Johnson, 02/24/2019 03:06 PM
S.M.A.R.T. listed as running.png (120 KB) S.M.A.R.T. listed as running.png David Johnson, 02/24/2019 03:06 PM
S.M.A.R.T. starting as off.png (120 KB) S.M.A.R.T. starting as off.png David Johnson, 02/24/2019 03:06 PM
S.M.A.R.T. failing to start.png (314 KB) S.M.A.R.T. failing to start.png David Johnson, 02/24/2019 03:06 PM
Screen Shot 2019-03-14 at 10.11.11 AM.png (75.8 KB) Screen Shot 2019-03-14 at 10.11.11 AM.png Jeff Ervin, 03/14/2019 07:26 AM
Screen Shot 2019-03-14 at 10.09.57 AM.png (126 KB) Screen Shot 2019-03-14 at 10.09.57 AM.png Jeff Ervin, 03/14/2019 07:26 AM
Screen Shot 2019-03-14 at 10.12.07 AM.png (129 KB) Screen Shot 2019-03-14 at 10.12.07 AM.png Jeff Ervin, 03/14/2019 07:26 AM
Screen Shot 2019-03-14 at 10.14.03 AM.png (117 KB) Screen Shot 2019-03-14 at 10.14.03 AM.png Jeff Ervin, 03/14/2019 07:26 AM
Screen Shot 2019-03-14 at 10.15.52 AM.png (117 KB) Screen Shot 2019-03-14 at 10.15.52 AM.png Jeff Ervin, 03/14/2019 07:26 AM
Screen Shot 2019-03-14 at 10.18.18 AM.png (127 KB) Screen Shot 2019-03-14 at 10.18.18 AM.png Jeff Ervin, 03/14/2019 07:26 AM
Screen Shot 2019-03-14 at 10.19.36 AM.png (127 KB) Screen Shot 2019-03-14 at 10.19.36 AM.png Jeff Ervin, 03/14/2019 07:26 AM
Screen Shot 2019-03-14 at 10.21.43 AM.png (123 KB) Screen Shot 2019-03-14 at 10.21.43 AM.png Jeff Ervin, 03/14/2019 07:26 AM
Screen Shot 2019-03-14 at 10.22.29 AM.png (122 KB) Screen Shot 2019-03-14 at 10.22.29 AM.png Jeff Ervin, 03/14/2019 07:26 AM
Screen Shot 2019-03-14 at 10.24.58 AM.png (136 KB) Screen Shot 2019-03-14 at 10.24.58 AM.png Jeff Ervin, 03/14/2019 07:26 AM
55948
55956
55957
55965
61014
61023
61032
61041
61050
61059
61068
61077
61086
61095

Related issues

Copied to FreeNAS - Bug #79104: Fix SMART service startup on systems with lots of disksDone

History

#1 Updated by Dru Lavigne almost 3 years ago

  • Category changed from GUI (new) to Middleware
  • Assignee changed from Release Council to William Grzybowski
  • Private changed from No to Yes

#3 Updated by William Grzybowski almost 3 years ago

  • Assignee changed from William Grzybowski to Vladimir Vinogradenko
  • Target version changed from Backlog to 11.2-U3

#4 Updated by Vladimir Vinogradenko almost 3 years ago

  • Status changed from Unscreened to Blocked
  • Reason for Blocked set to Waiting for feedback

David, if you stop the SMART service via GUI now, will it stop?

If you start it again, will it behave the same way? (say it failed to start but really starts)

#5 Updated by David Johnson almost 3 years ago

Vladimir Vinogradenko wrote:

David, if you stop the SMART service via GUI now, will it stop?

If you start it again, will it behave the same way? (say it failed to start but really starts)

I just logged on and have no errors in the console. The SMART service is listed as on in the UI but smartd does not appear in response to `ps aux | grep smart` which was surprising.

Turning the service off and then back on again results in the same error as before. And it looks like the smartd process started up again

dorothy# ps aux | grep smartd
root       31817   0.0  0.0   10500   6072  -  Is   18:39       0:00.27 /usr/local/sbin/smartd -n -i 1800 -c /usr/local/etc/smartd.conf -p /var/run/smartd-daemon.pid

Let me know if there is any other information I can provide.

#6 Updated by Bug Clerk almost 3 years ago

  • Status changed from Blocked to In Progress

#7 Updated by Vladimir Vinogradenko over 2 years ago

(David, please ignore message below, it is for our QA department)

Acceptance Criteria

You'll need a machine with lots of disks, like tn11.

Open a terminal and type

while true; do date; ps ax|grep smart; cat /var/run/smartd-daemon.pid; sleep 1; done

It'll start showing something every second.

If the output is like

Sun Mar  3 16:03:07 EST 2019
40967  1  R+       0:00.00 grep smart
cat: /var/run/smartd-daemon.pid: No such file or directory

It means SMART is not running

If the output is like

Sun Mar  3 16:03:23 EST 2019
41098  -  DLs      0:00.03 /usr/local/sbin/smartd -n -i 1800 -c /usr/local/etc/smartd.conf -p /var/run/smartd-daemon.pid
41101  1  S+       0:00.00 grep smart
cat: /var/run/smartd-daemon.pid: No such file or directory

It means SMART is initializing

If the output is like

Sun Mar  3 16:03:41 EST 2019
41098  -  Ss       0:02.28 /usr/local/sbin/smartd -n -i 1800 -c /usr/local/etc/smartd.conf -p /var/run/smartd-daemon.pid
41227  1  S+       0:00.00 grep smart
41098

It means SMART is running

Go to services. Stop SMART service. Start SMART service. It should say SMART is running in a few seconds.

Stop SMART while it is still initializing (output #2). It should stop both in UI and output should start being like #1.

Stop SMART while it is running. It should stop both in UI and output should start being like #1.

Change SMART options while it is still initializing. It should change its PID (41098 in "41098 - Ss 0:02.28 /usr/local/sbin/smartd ...").

Change SMART options while it is running. Again, it should change its PID.

Do above with both old UI and new UI.

#8 Updated by Dru Lavigne over 2 years ago

  • File deleted (bug_report.log)

#9 Updated by Dru Lavigne over 2 years ago

  • Status changed from In Progress to Ready for Testing
  • Private changed from Yes to No

#10 Updated by Dru Lavigne over 2 years ago

  • Subject changed from S.M.A.R.T. Service Reports as Failed to Start but does, in fact, start to Fix SMART service startup on systems with lots of disks
  • Reason for Blocked deleted (Waiting for feedback)
  • Needs Doc changed from Yes to No
  • Needs Merging changed from Yes to No

#13 Updated by Bug Clerk over 2 years ago

  • Copied to Bug #79104: Fix SMART service startup on systems with lots of disks added

#15 Updated by Dru Lavigne over 2 years ago

  • File deleted (smartd.conf)

#17 Updated by Jeff Ervin over 2 years ago

  • Status changed from Passed Testing to Ready for Testing

Also available in: Atom PDF