SETI@Home Wow!-Event 2019

Page 6

Modular

Diamond Member
Jul 1, 2005
5,027
67
91
Good point, I'll check. I can't find an event manager, but I did find an Event Viewer; is that what you meant?
Critical - Event ID 41 Kernel-Power - The system has rebooted without cleanly shutting down first. This error could be caused if the system stopped responding, crashed, or lost power unexpectedly.
After rebooting there is a small window open saying Windows has encountered an error. Not much help there.



I can't remember the number of rails it has, I'll check later, along with getting my DMM out to check the 12v rail readings. I do remember it's an old Corsair 650w PSU, more info later....[update] It's a Corsair TX650w, it's got a single 12v rail. Apparently its MTBF is ~11.4 yrs, I wonder when I got it? I think maybe I got it when I first built my C2 rig (2007)! lol, although it might have been a bit later.

DMM readings from my PSU whilst running SETI on CPU (10 threads) & GPU :-
12v line - wire1 11.95v, wire 2 11.96v, wire 3 11.97v (small plug to mbrd), also seen at the main ATX plug (different WU maybe?) And a steady 12v (11.97-12.03v) to the GPU plug, I watched it on & off over several minutes whilst also looking at GPU-Z to see GPU load.
5v line - 4.86v (main ATX plug), 3.3v line - 3.32v (main ATX plug).
That good ole PSU is chugging along nicely
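
For reference, the readings above can be compared against the usual ATX tolerance of +/-5% on the positive rails. This is only a rough sketch: the 5% window is an assumption added here (not something stated in the post), the names `READINGS`, `check_rail` and `check_all` are made up for illustration, and a multimeter only shows steady-state voltage.

```python
# Readings copied from the post above. The +/-5% window is the commonly
# quoted ATX tolerance for the +12V, +5V and +3.3V rails (assumption).
READINGS = {
    "12V": (12.0, [11.95, 11.96, 11.97, 12.03]),
    "5V": (5.0, [4.86]),
    "3.3V": (3.3, [3.32]),
}
TOLERANCE = 0.05


def check_rail(nominal, volts, tolerance=TOLERANCE):
    """Return (low, high, ok); ok is True if every reading is inside the window."""
    low = nominal * (1 - tolerance)
    high = nominal * (1 + tolerance)
    return low, high, all(low <= v <= high for v in volts)


def check_all(readings=READINGS):
    """Map each rail name to True/False depending on whether it is in range."""
    return {rail: check_rail(nominal, volts)[2]
            for rail, (nominal, volts) in readings.items()}


if __name__ == "__main__":
    for rail, (nominal, volts) in READINGS.items():
        low, high, ok = check_rail(nominal, volts)
        status = "OK" if ok else "OUT OF RANGE"
        print(f"{rail}: allowed {low:.3f}-{high:.3f} V, measured {volts}: {status}")
```

All three rails land inside the window with these numbers, which matches the "chugging along nicely" impression as far as DC voltage goes.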

I wonder if it's some sort of problem relating to when the monitors go to sleep.....
Kernel Power = PSU is likely bad.


A multimeter doesn't really test a PSU; ideally you need a load tester and a scope to see what's happening. If the circuitry is wearing out, excessive ripple on the output will cause instability and harm your components over time.

I had this exact issue with a PSU about 6 months ago. Oddly, it was a Corsair HX750. Random reboots with Kernel Power errors as well. Corsair is amazing, and they RMA'd the PSU. All has been well and no reboots or Kernel Power errors.
 

Kiska

Golden Member
Apr 4, 2012
1,025
291
136
I feel a great disturbance in the Force (seti's server).
<snip>

zoomed-in
<snip>
Happened since 1112 UTC (approx)

At least the server recovered

Code:
/usr/share/munin/plugins# time ./seti
multigraph results_setiathomev8_in_progress
inProgress.value 5812407
multigraph results_setiathomev8_rts
rts.value 662272
multigraph results_setiathome_AP
rts.value 0
inProgress.value 6888
multigraph workunits_setiathomev8
validation.value 4769138
assimilation.value 59
deletion.value 67
deletion_result.value 1
multigraph workunits_setiathome_AP
validation.value 8391
assimilation.value 0
deletion.value 0
deletion_result.value 0
multigraph transitioner_setiathome
backlog.value 0.000833333333333333

real    0m0.277s
user    0m0.168s
sys     0m0.044s
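
If anyone wants to post-process that output, here is a minimal sketch of a parser for the munin multigraph format shown above (a `multigraph <name>` line followed by `<field>.value <number>` lines). The function name `parse_munin_multigraph` and the `SAMPLE` constant are made up for this example; the sample is just a shortened copy of the output above.

```python
SAMPLE = """\
/usr/share/munin/plugins# time ./seti
multigraph results_setiathomev8_in_progress
inProgress.value 5812407
multigraph results_setiathomev8_rts
rts.value 662272
multigraph results_setiathome_AP
rts.value 0
inProgress.value 6888
multigraph transitioner_setiathome
backlog.value 0.000833333333333333

real    0m0.277s
"""


def parse_munin_multigraph(text):
    """Return {graph_name: {field: value}} from munin multigraph plugin output."""
    graphs = {}
    current = None
    for raw in text.splitlines():
        parts = raw.strip().split(None, 1)
        if len(parts) != 2:
            continue  # blank line or a single token
        key, rest = parts
        if key == "multigraph":
            current = rest.strip()
            graphs[current] = {}
        elif current is not None and key.endswith(".value"):
            # Strip the ".value" suffix to get the field name.
            graphs[current][key[:-len(".value")]] = float(rest)
        # Anything else (shell prompt, "real 0m0.277s", ...) is ignored.
    return graphs


if __name__ == "__main__":
    for graph, fields in parse_munin_multigraph(SAMPLE).items():
        print(graph, fields)
```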
 

ao_ika_red

Golden Member
Aug 11, 2016
1,679
715
136
At least the server recovered
Mine is still having problem in uploading, but it's slowly improving, as expected.
And my goodness, get some sleep, Kiska.
 

Assimilator1

Elite Member
Nov 4, 1999
24,120
507
126
Kernel Power = PSU is likely bad.


A multimeter doesn't really test a PSU; ideally you need a load tester and a scope to see what's happening. If the circuitry is wearing out, excessive ripple on the output will cause instability and harm your components over time.

I had this exact issue with a PSU about 6 months ago. Oddly, it was a Corsair HX750. Random reboots with Kernel Power errors as well. Corsair is amazing, and they RMA'd the PSU. All has been well and no reboots or Kernel Power errors.
Good point, we have an oscilloscope at work (I'm a mechanic), I wonder if our kit has enough fidelity to test 12v PSU ripple....

Interestingly, it hasn't rebooted since I probed the power plug. I've had this happen once before, many years ago (with an Athlon XP rig, I think). Back then it turned out that the ATX plug had one pin that was making poor contact and overheating; probing it, and so pushing on it, gave it better contact and stopped the problem temporarily. I'd better check all the plugs!
 

StefanR5R

Elite Member
Dec 10, 2016
5,684
8,249
136
I'm pining for the old times.
Dual E5-2696 v4:
August 2017, running MBv8_8.05r3345_avx_linux64: 93,100 PPD
August 2018, still running MBv8_8.05r3345_avx_linux64: 80,400 PPD
August 2019, running MBv8_8.22r4008_avx2_intel_x86_64-pc-linux-gnu: 75,800 PPD
 

StefanR5R

Elite Member
Dec 10, 2016
5,684
8,249
136
Of course I tested both 8.05r3345_avx and 8.22r4008_avx2 on current tasks, and the latter is slightly faster. 8.05r3345_avx is making about 74,000 PPD on the dual E5-2696 v4 these days.

That is, for the same hardware + application version, PPD have regressed proportionally from 2017 through 2018 to 2019.

(PS, processor clocks were the same between the two application versions: Most cores were in a linear distribution between 2.6 and 2.7 GHz, the rest near or at 2.8 GHz. 2.6 GHz is the all-core AVX turbo, 2.8 GHz is the all-core non-AVX turbo. Meaning that utilization of the vector units by either of the two applications is moderate.)
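
To put numbers on the regression for the same application version (8.05r3345_avx), here is a quick sketch of the year-over-year change. The 2017 and 2018 figures are the August values quoted earlier in the thread, and the 2019 figure is the "about 74,000" estimate above, so treat the result as approximate. The names `PPD_BY_YEAR`, `percent_change` and `yearly_changes` are just for this example.

```python
# PPD on the dual E5-2696 v4 with 8.05r3345_avx; 2019 is an approximate figure.
PPD_BY_YEAR = {2017: 93_100, 2018: 80_400, 2019: 74_000}


def percent_change(old, new):
    """Relative change from old to new, in percent (negative means a drop)."""
    return (new - old) / old * 100.0


def yearly_changes(ppd_by_year):
    """Percent change between each pair of consecutive years."""
    years = sorted(ppd_by_year)
    return {(a, b): percent_change(ppd_by_year[a], ppd_by_year[b])
            for a, b in zip(years, years[1:])}


if __name__ == "__main__":
    for (a, b), change in yearly_changes(PPD_BY_YEAR).items():
        print(f"{a} -> {b}: {change:+.1f}%")
    total = percent_change(PPD_BY_YEAR[2017], PPD_BY_YEAR[2019])
    print(f"2017 -> 2019: {total:+.1f}%")
```

That works out to roughly -13.6% from 2017 to 2018, -8.0% from 2018 to 2019, and about -20.5% overall.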
 

biodoc

Diamond Member
Dec 29, 2005
6,270
2,238
136
That is, for the same hardware + application version, PPD have regressed proportionally from 2017 through 2018 to 2019.

Can part of it be explained by kernel patches and microcode updates for mitigating security issues like Spectre/Meltdown/ZombieLoad?
 

Assimilator1

Elite Member
Nov 4, 1999
24,120
507
126
I see my client is running an Astropulse unit, the points from this still count for SETI right? I'm googling it but I haven't found an answer yet.........

Hmm, I might be running Lunatics v0.43; that's the installable file I have, anyway. I couldn't find matching numbers in the task manager, though. How do I find out what version I'm actually running?
 

StefanR5R

Elite Member
Dec 10, 2016
5,684
8,249
136
That is, for the same hardware + application version, PPD have regressed proportionally from 2017 through 2018 to 2019.
I believe this is due to the continued improvement of GPU application versions, and their increasing adoption. I failed to find a discussion of it in the setiathome forums.
 

Pokey

Platinum Member
Oct 20, 1999
2,766
457
126
OK, I am getting ready to show my ignorance now:
My Pisces team has several GPU Users Group members and they are killing it.
I have been looking around the SETI@home hosts pages and am seeing folks with as many as 48 and 64 Nvidia cards associated with one processor. How, pray tell, does that work? My web searches have been fruitless.
 

StefanR5R

Elite Member
Dec 10, 2016
5,684
8,249
136
As far as I know, this is not for performance, but for deeper work buffers. They certainly have cards and application versions which take less than a minute to complete one task. Hence, SETI@home's server-side enforced limit of 100 tasks in progress per GPU makes for very shallow work queues.

Hence they configure their clients to tell the server that there are a lot more GPUs in it than there are in reality.

In addition, there is a client-side limit of 1000 runnable tasks, after which the client would no longer request more new work. With this limit in mind, it makes sense to tell the server that there are e.g. 11 GPUs in the host. People who want more than this need to recompile the client from patched sources.
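
A rough sketch of the arithmetic behind this, using the two limits mentioned above (100 tasks in progress per reported GPU on the server side, 1000 runnable tasks on the client side). The task time of 60 seconds and the single real GPU are assumptions picked for illustration ("less than a minute" means 60 s is an upper bound), and the function names are made up.

```python
SERVER_LIMIT_PER_GPU = 100     # tasks in progress per reported GPU (from the post)
CLIENT_RUNNABLE_LIMIT = 1000   # client stops requesting work beyond this (from the post)


def max_gpu_tasks(reported_gpus):
    """Most GPU tasks the server will hand out for this many reported GPUs."""
    return reported_gpus * SERVER_LIMIT_PER_GPU


def buffer_minutes(reported_gpus, real_gpus, seconds_per_task):
    """How long a full GPU queue lasts when spread over the real GPUs."""
    return max_gpu_tasks(reported_gpus) * seconds_per_task / real_gpus / 60.0


def reaches_client_limit(reported_gpus):
    """True if the server-side allowance alone reaches the client-side limit."""
    return max_gpu_tasks(reported_gpus) >= CLIENT_RUNNABLE_LIMIT


if __name__ == "__main__":
    # Assumed values: one real GPU, 60 s per task.
    for reported in (1, 11):
        minutes = buffer_minutes(reported, real_gpus=1, seconds_per_task=60)
        print(f"{reported} reported GPU(s): {max_gpu_tasks(reported)} tasks, "
              f"about {minutes:.0f} min of work, "
              f"client limit reached: {reaches_client_limit(reported)}")
```

With one honest GPU and one-minute tasks, the queue is at most about 100 minutes deep; reporting 11 GPUs lifts the server-side allowance to 1100 tasks, which is past the point where the stock client stops asking for more.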
 

pututu

Member
Jul 1, 2017
148
224
116
I tried to compile the source code but got stuck, even though I installed all the needed dependencies first. From what I know, to get the client to receive more than 1000 tasks you need to change the constant in one of the files and recompile. See my earlier post here: https://setiathome.berkeley.edu/forum_thread.php?id=84427 .
 