Is this a bug? Why is it my fault in that the server will not give me any more WUs?

Message boards : SETI@home Enhanced : Is this a bug? Why is it my fault in that the server will not give me any more WUs?

Profile CElliott
Volunteer tester
Joined: 16 Aug 05
Posts: 71
Credit: 50,937,947
RAC: 118,248
Message 41451 - Posted: 23 Oct 2011, 16:28:11 UTC

Last night the following error occurred when the client tried to resume four WUs. It had resumed tens of WUs without error earlier the same day, and tens more afterward, but these four failed. Now the server has assigned me a very low quota and will not send any more WUs: this machine processes 210 WUs per day, but the quota is 170. According to the stderr output, the app was looking in two different slots for the same WU. Is this correct?

22-Oct-2011 22:28:26 [SETI@home Beta Test] [sched_op_debug] Reason: Unrecoverable error for result 12jl11aa.17391.43833.3.14.122_0 ( - exit code -5 (0xfffffffb))
22-Oct-2011 22:28:26 [SETI@home Beta Test] Computation for task 12jl11aa.17391.43833.3.14.122_0 finished
22-Oct-2011 22:28:26 [SETI@home Beta Test] Output file 12jl11aa.17391.43833.3.14.122_0_0 for task 12jl11aa.17391.43833.3.14.122_0 absent
22-Oct-2011 22:28:26 [SETI@home Beta Test] [cpu_sched] Starting 12jl11aa.17391.43833.3.14.121_0(resume)
22-Oct-2011 22:28:26 [SETI@home Beta Test] Restarting task 12jl11aa.17391.43833.3.14.121_0 using setiathome_v7 version 697
22-Oct-2011 22:28:28 [SETI@home Beta Test] [sched_op_debug] Deferring communication for 1 min 0 sec
22-Oct-2011 22:28:28 [SETI@home Beta Test] [sched_op_debug] Reason: Unrecoverable error for result 12jl11aa.17391.43833.3.14.121_0 ( - exit code -5 (0xfffffffb))
22-Oct-2011 22:28:28 [SETI@home Beta Test] Computation for task 12jl11aa.17391.43833.3.14.121_0 finished
22-Oct-2011 22:28:28 [SETI@home Beta Test] Output file 12jl11aa.17391.43833.3.14.121_0_0 for task 12jl11aa.17391.43833.3.14.121_0 absent
22-Oct-2011 22:28:28 [SETI@home Beta Test] [cpu_sched] Starting 12jl11aa.17391.43833.3.14.120_0(resume)
22-Oct-2011 22:28:28 [SETI@home Beta Test] Restarting task 12jl11aa.17391.43833.3.14.120_0 using setiathome_v7 version 697
22-Oct-2011 22:28:30 [SETI@home Beta Test] [sched_op_debug] Deferring communication for 1 min 0 sec
22-Oct-2011 22:28:30 [SETI@home Beta Test] [sched_op_debug] Reason: Unrecoverable error for result 12jl11aa.17391.43833.3.14.120_0 ( - exit code -5 (0xfffffffb))
22-Oct-2011 22:28:30 [SETI@home Beta Test] Computation for task 12jl11aa.17391.43833.3.14.120_0 finished
22-Oct-2011 22:28:30 [SETI@home Beta Test] Output file 12jl11aa.17391.43833.3.14.120_0_0 for task 12jl11aa.17391.43833.3.14.120_0 absent
22-Oct-2011 22:28:30 [SETI@home Beta Test] [cpu_sched] Starting 12jl11aa.17391.43833.3.14.119_0(resume)
22-Oct-2011 22:28:30 [SETI@home Beta Test] Restarting task 12jl11aa.17391.43833.3.14.119_0 using setiathome_v7 version 697
22-Oct-2011 22:28:31 [SETI@home Beta Test] [sched_op_debug] Deferring communication for 1 min 0 sec
22-Oct-2011 22:28:31 [SETI@home Beta Test] [sched_op_debug] Reason: Unrecoverable error for result 12jl11aa.17391.43833.3.14.119_0 ( - exit code -5 (0xfffffffb))
22-Oct-2011 22:28:31 [SETI@home Beta Test] Computation for task 12jl11aa.17391.43833.3.14.119_0 finished
22-Oct-2011 22:28:31 [SETI@home Beta Test] Output file 12jl11aa.17391.43833.3.14.119_0_0 for task 12jl11aa.17391.43833.3.14.119_0 absent
22-Oct-2011 22:28:31 [SETI@home Beta Test] [cpu_sched] Starting 12jl11aa.16936.43833.3.14.112_0(resume)
22-Oct-2011 22:28:31 [SETI@home Beta Test] Restarting task 12jl11aa.16936.43833.3.14.112_0 using setiathome_v7 version 697
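Editor's sketch: the log above reports each failure as `exit code -5 (0xfffffffb)`. Those are the same value: 0xfffffffb is -5 read as a signed 32-bit two's-complement integer. The short Python sketch below pulls the failed task names out of such log lines and checks that equivalence. The regular expression and the helper names (`to_signed32`, `failed_tasks`) are illustrative assumptions and not part of BOINC; the sample lines are copied from the post.

```python
import re

# Two sample lines copied from the log in the post above.
LOG = """\
22-Oct-2011 22:28:26 [SETI@home Beta Test] [sched_op_debug] Reason: Unrecoverable error for result 12jl11aa.17391.43833.3.14.122_0 ( - exit code -5 (0xfffffffb))
22-Oct-2011 22:28:28 [SETI@home Beta Test] [sched_op_debug] Reason: Unrecoverable error for result 12jl11aa.17391.43833.3.14.121_0 ( - exit code -5 (0xfffffffb))
"""

# Hypothetical pattern written for this log format only.
ERROR_RE = re.compile(
    r"Unrecoverable error for result (?P<task>\S+) "
    r"\( - exit code (?P<code>-?\d+) \((?P<hex>0x[0-9a-fA-F]+)\)\)"
)


def to_signed32(value):
    """Interpret an unsigned 32-bit value as a signed two's-complement integer."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value & 0x80000000 else value


def failed_tasks(log_text):
    """Return (task name, decimal exit code, hex code read as signed) per error line."""
    results = []
    for match in ERROR_RE.finditer(log_text):
        results.append(
            (
                match["task"],
                int(match["code"]),
                to_signed32(int(match["hex"], 16)),
            )
        )
    return results


for task, code, from_hex in failed_tasks(LOG):
    print(task, code, from_hex)
```

This only confirms that the decimal and hexadecimal codes agree; it does not say what exit code -5 means for the application.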

____________

Claggy
Volunteer tester
Joined: 29 May 06
Posts: 656
Credit: 6,133,614
RAC: 15,758
Message 41455 - Posted: 23 Oct 2011, 17:20:24 UTC - in response to Message 41451.

Across your 4 most productive hosts you have approximately 14,711 tasks in progress. That is almost half of the 31,770 results in progress. How are you going to report an interesting inconclusive result when you have that many results to look through?
At the rate you're going, you are either going to get banned or get Anonymous platform use banned here.

Claggy

Raistmer
Volunteer tester
Joined: 18 Aug 05
Posts: 965
Credit: 5,361,329
RAC: 11,854
Message 41629 - Posted: 16 Dec 2011, 7:24:23 UTC

Your top host still uses r365, which was deprecated and had to be updated to r390 more than a week ago.
Please, do upgrade!

This is not connected with your first post, but staying up to date is the duty of a beta tester using the anonymous platform here.
____________

Claggy
Volunteer tester
Joined: 29 May 06
Posts: 656
Credit: 6,133,614
RAC: 15,758
Message 41631 - Posted: 16 Dec 2011, 17:31:24 UTC - in response to Message 41629.

Your top host still uses r365, which was deprecated and had to be updated to r390 more than a week ago.
Please, do upgrade!


Raistmer, you haven't supplied an Nvidia r390 app yet.

Claggy

Raistmer
Volunteer tester
Joined: 18 Aug 05
Posts: 965
Credit: 5,361,329
RAC: 11,854
Message 41632 - Posted: 16 Dec 2011, 22:07:43 UTC - in response to Message 41631.

Your top host still uses r365, which was deprecated and had to be updated to r390 more than a week ago.
Please, do upgrade!


Raistmer, you haven't supplied an Nvidia r390 app yet.

Claggy

Ah, right, sorry. Over the last few weeks I have been too distracted by other, completely orthogonal tasks to stay on top of SETI development.
I still need to restore the NV build environment for the NV build.

____________

Fred J. Verster
Volunteer tester
Joined: 3 May 10
Posts: 88
Credit: 1,594,385
RAC: 7
Message 41633 - Posted: 16 Dec 2011, 22:53:33 UTC - in response to Message 41632.
Last modified: 16 Dec 2011, 23:11:35 UTC

I'm a bit late, too. I noticed my Q6600+GTX470 host showing almost equal CPU times and run times.

I've set puls_find_period_iterations to 1 and run 2 instances per GPU.
Result
GTX470 and rev.390.
The rev.390 app does work well!

I will change this to 1, which makes more sense when looking at those times.

(On my 2 5870 GPUs, I had a few -177 errors, but that's the 0.39 installer,
probably due to running 1 per GPU, or a too-high <flops>2 e+11</flops>?)

I also wanted to try a few Catalyst drivers on my i7 (2600) + 2 ATI 5870s host.
Since 11.10 and 11.11 lacked OpenCL support, 11.6 with AMD APP (SDK) 2.5
'gives' the best of both: OpenCL support and compatibility with some video
editing programs that also work with it. I also tried it at PrimeGrid, which uses 0.79 CPU and 1 GPU! Load is ~65%. The first GPU (0) always has the highest load and
throughput.

I should make a SETI Beta account for this host. It runs the 0.39 installer, with AstroPulse for ATI HD5 GPU (rev.521) added, which takes quite some time.
I have to try changing the cmd-line parameters.
____________

Fred J. Verster
Volunteer tester
Joined: 3 May 10
Posts: 88
Credit: 1,594,385
RAC: 7
Message 41634 - Posted: 17 Dec 2011, 0:30:04 UTC - in response to Message 41633.

The GPU load is about 60% with rev.390. Since this host also has a SETI@home
account, setting 1 MB WU per GPU for Beta also means 1 for SETI@home.

I crashed a few MB WUs when I upped the GPU core/shader clock from 650 MHz
to 725 MHz. Memory speed has to be adjusted by the same percentage.

And both rev.365 and rev.390 are compatible and run nicely 1 + 1 on a
GTX470, now running at stock (default) settings.

I'll change them both to 1 per GPU.
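
Editor's sketch: the post above mentions raising the core/shader clock from 650 MHz to 725 MHz and adjusting memory speed by the same percentage. The arithmetic can be worked through as below. The helper names are illustrative, and the 1000 MHz memory clock is a hypothetical placeholder because the post does not give the actual memory clock.

```python
def percent_increase(old, new):
    """Percentage change going from old to new."""
    return (new - old) / old * 100.0


def scale_by_same_ratio(value, old, new):
    """Scale value by the same ratio as the change from old to new."""
    return value * new / old


core_old_mhz = 650.0  # from the post
core_new_mhz = 725.0  # from the post
memory_mhz = 1000.0   # hypothetical value, not stated in the post

increase = percent_increase(core_old_mhz, core_new_mhz)
new_memory = scale_by_same_ratio(memory_mhz, core_old_mhz, core_new_mhz)

print(f"core clock increase: {increase:.1f}%")
print(f"memory clock scaled by the same ratio: {new_memory:.0f} MHz")
```

This follows the poster's rule of thumb as stated; whether a given card is stable at the scaled clocks is a separate question, and the post itself reports crashed WUs at 725 MHz.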

TRuEQ & TuVaLu
Volunteer tester
Joined: 28 Jan 11
Posts: 532
Credit: 1,383,581
RAC: 4,799
Message 41686 - Posted: 16 Jan 2012, 8:01:21 UTC
Last modified: 16 Jan 2012, 8:02:49 UTC

.ops
____________
TRuEQ & TuVaLu


Copyright © 2013 University of California

AstroPulse is funded in part by the NSF through grant AST-0307956