Quorum, Replication.

log in

Advanced search

Message boards : SETI@home Enhanced : Quorum, Replication.

1 · 2 · 3 · Next
Author Message
Profile Keith T.
Volunteer tester
Avatar
Send message
Joined: 9 Feb 07
Posts: 129
Credit: 25,809
RAC: 0
Message 35101 - Posted: 26 Oct 2008, 13:21:24 UTC
Last modified: 26 Oct 2008, 13:23:26 UTC

I re-enabled work requests on my old PII-233 for this project after reading the News:


October 22, 2008
We've switched workunit generation to single redundancy mode for testing purposes.

I was a bit surprised to get a Task which was created using the old replication and quorum rules.

My PII was almost always the third to report it's results, and therefore doing mostly unnecessary work.

My faster Athlon XP currently has a LTD for this project of -1.1 million.

On furher investigation, I noticed that my most recent task http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=4555349 was created on 10 Oct 2008 i.e before the change in policy, so I assume the there are still a lot of tasks in the pipeline like this before we get the new ones.
____________

Josef W. Segur
Volunteer tester
Send message
Joined: 14 Oct 05
Posts: 1020
Credit: 1,495,473
RAC: 222
Message 35102 - Posted: 26 Oct 2008, 14:08:21 UTC - in response to Message 35101.

A lot, yes. 148,151 in the "Ready to send" queue, which should last until next year. There were around 40000 MB after the old AP work was cancelled, then the mb_splitter ran again Oct 20 and added the rest.

Joe

Profile speedimic
Volunteer tester
Avatar
Send message
Joined: 15 Feb 07
Posts: 16
Credit: 145,803
RAC: 0
Message 35116 - Posted: 27 Oct 2008, 19:40:46 UTC

Just came back to beta with one machine - after 1.5y of absence.
Hope that helps to go through that quicker... :)
____________
mic.


Richard Haselgrove
Volunteer tester
Send message
Joined: 3 Jan 07
Posts: 972
Credit: 2,070,629
RAC: 18
Message 35133 - Posted: 30 Oct 2008, 11:26:02 UTC
Last modified: 30 Oct 2008, 11:45:46 UTC

Given the current state of Astropulse testing (Linux and Windows apps seem to have no chance of validating against each other), I've decided to implement Plan B again.

Got an allocation from the residual replication=3 backlog, so let's see if we can shift them pronto.

WinterKnight, if you're watching: this should demonstrate your TDCF/workfetch point perfectly. My Xeon is showing a TDCF of 0.9163 following the super-slow-mo AP stock test: that should come down to approaching 0.15 or below with an opti app on an exclusive diet of shorties. Watch for a repeating pattern of finish one/request three/report one.

Edit - well, a whole batch of noisy (30 spike) tasks slightly distorts the picture - TDCF down to 0.7648 already - but I don't think it invalidates the principle. And just for a moment I was afraid it was a bug in the Lunatics Beta app I'm testing, but they're validating against stock, so I think it's OK.

Josef W. Segur
Volunteer tester
Send message
Joined: 14 Oct 05
Posts: 1020
Credit: 1,495,473
RAC: 222
Message 35143 - Posted: 31 Oct 2008, 3:51:28 UTC

There are now 32512 non-redundant WUs at the end of the "Ready to send" queue, 1417147 through 1449658. Maybe plan B could get us there by the end of November, but it looks like AstroPulse is coming back to life.

Joe

Profile JLDun
Volunteer tester
Avatar
Send message
Joined: 23 May 07
Posts: 66
Credit: 17,824
RAC: 0
Message 35145 - Posted: 31 Oct 2008, 5:01:02 UTC - in response to Message 35143.

I'm doing my part- I have mine set to do Enhanced only (through the webpage settings).
____________

Winterknight
Volunteer tester
Send message
Joined: 15 Jun 05
Posts: 693
Credit: 246,694
RAC: 0
Message 35171 - Posted: 2 Nov 2008, 3:30:12 UTC - in response to Message 35133.

Only just got here, been at son's place, and he is only occasional cruncher, so not attached Beta, so couldn't post from his computer.
I'll keep watching. My LTD here is -404k (from 600+k), and am reluctant to manually adjust, considering my views on the scheduler, so nowhere near downloading yet.

Richard Haselgrove
Volunteer tester
Send message
Joined: 3 Jan 07
Posts: 972
Credit: 2,070,629
RAC: 18
Message 35192 - Posted: 3 Nov 2008, 22:13:53 UTC

Looks like all the IR=3 tasks have been cancelled. Got a whole new batch about an hour ago, starting with WU 1417796, showing IR=1.

Josef W. Segur
Volunteer tester
Send message
Joined: 14 Oct 05
Posts: 1020
Credit: 1,495,473
RAC: 222
Message 35193 - Posted: 3 Nov 2008, 22:26:48 UTC
Last modified: 3 Nov 2008, 22:36:58 UTC

The experiment is finally in progress, all the work with initial replication 3 has been cancelled. Last night they also added 15872 new WUs to the 32512 I noted Friday. Those (WUs 1417147 to 1465530) show the tabular data indicating which host has been given the task. The splitter seems to be running again and WUs above 1465530 do not show the table, as is supposed to happen with true Adaptive Replication.

[edit] Last WU as of a minute ago was 1482169.

Joe

zombie67 [MM]
Volunteer tester
Avatar
Send message
Joined: 18 May 06
Posts: 256
Credit: 9,498,161
RAC: 60,610
Message 35196 - Posted: 3 Nov 2008, 23:05:27 UTC

I am getting tasks cancelled by the server when computation had already started, or even completed. Why can't the nicer kind of server-side cancellation be used, where they are cancelled only if no computation has started?
____________

Richard Haselgrove
Volunteer tester
Send message
Joined: 3 Jan 07
Posts: 972
Credit: 2,070,629
RAC: 18
Message 35197 - Posted: 3 Nov 2008, 23:47:29 UTC - in response to Message 35196.

I am getting tasks cancelled by the server when computation had already started, or even completed. Why can't the nicer kind of server-side cancellation be used, where they are cancelled only if no computation has started?

I got some of those too, but I got full credit for the work I'd already done. WU 1384574.

zombie67 [MM]
Volunteer tester
Avatar
Send message
Joined: 18 May 06
Posts: 256
Credit: 9,498,161
RAC: 60,610
Message 35198 - Posted: 4 Nov 2008, 0:41:23 UTC - in response to Message 35197.

I am getting tasks cancelled by the server when computation had already started, or even completed. Why can't the nicer kind of server-side cancellation be used, where they are cancelled only if no computation has started?

I got some of those too, but I got full credit for the work I'd already done. WU 1384574.


Ah! When I originally posted, I was awarded 0. But they have since been changed to actual credits. Must have been a delay for some reason. Thanks!
____________

Urs Echternacht
Volunteer tester
Send message
Joined: 18 Jan 06
Posts: 808
Credit: 9,049,358
RAC: 18,571
Message 35202 - Posted: 4 Nov 2008, 2:52:26 UTC

After returning the first results that had

minimum quorum 1
initial replication 1
the initial replication gets 2 and a second result waits to be sent out, while my returnee (<-example) is pending.
So, single redundancy mode works ?
____________
_\|/_
Urs

Profile JLDun
Volunteer tester
Avatar
Send message
Joined: 23 May 07
Posts: 66
Credit: 17,824
RAC: 0
Message 35204 - Posted: 4 Nov 2008, 4:00:18 UTC - in response to Message 35198.
Last modified: 4 Nov 2008, 4:24:35 UTC

When I originally posted, I was awarded 0. But they have since been changed to actual credits.


Going to see how this applies to WU 1369534 Task 4562572; it also was server-aborted in the middle of crunching when I was doing an upload of another result.

[edit]Forgot the quotey...[/edit]
____________

Richard Haselgrove
Volunteer tester
Send message
Joined: 3 Jan 07
Posts: 972
Credit: 2,070,629
RAC: 18
Message 35207 - Posted: 4 Nov 2008, 9:47:18 UTC - in response to Message 35202.

Likewise. All 8 of the single-redundancy tasks I've returned so far have been sent back out for checking (well, put at the back of the queue for sending out...). That seems like a very high sample rate.

Looks like we're going to have to re-earn our "reliable" spurs the hard way.

Father Ambrose
Volunteer tester
Send message
Joined: 1 May 07
Posts: 543
Credit: 4,746,273
RAC: 4,091
Message 35208 - Posted: 4 Nov 2008, 10:10:47 UTC
Last modified: 4 Nov 2008, 10:34:11 UTC

Just checked mine all awaiting resends I think i'll NNT and finish what I have at the moment.

I see the server page states last look 136,663 awaiting to be sent.

Michael

Urs Echternacht
Volunteer tester
Send message
Joined: 18 Jan 06
Posts: 808
Credit: 9,049,358
RAC: 18,571
Message 35218 - Posted: 5 Nov 2008, 2:03:51 UTC
Last modified: 5 Nov 2008, 2:04:18 UTC

Now, i can see both hosts (P3, T7200) receive credit with single-redundancy tasks, while all wus before the outage are still pending.

The P3 usually fails on a wu every few days, so, maybe the reaction of the server can be seen on that host then.
____________
_\|/_
Urs

Richard Haselgrove
Volunteer tester
Send message
Joined: 3 Jan 07
Posts: 972
Credit: 2,070,629
RAC: 18
Message 35219 - Posted: 5 Nov 2008, 2:33:16 UTC

Agreed - all single-redundancy results reported before Tuesday's maintenance outage were sent for re-validation, but all results after the outage have been accepted for credit without validation. Methinks they flipped a switch somewhere....

Father Ambrose
Volunteer tester
Send message
Joined: 1 May 07
Posts: 543
Credit: 4,746,273
RAC: 4,091
Message 35225 - Posted: 5 Nov 2008, 9:55:57 UTC

Having upload problems, All WU's returned after outage valid, WU's from Before outage still pending.

Michael.

Winterknight
Volunteer tester
Send message
Joined: 15 Jun 05
Posts: 693
Credit: 246,694
RAC: 0
Message 35227 - Posted: 5 Nov 2008, 13:59:56 UTC

By definition I would suspect all hosts on here to be 'honest'. So the question is do we need:

faulty hosts,
old clients,
third party clients,
third party clients with inflated benchmarks,

to test out this one replication/quorum.

1 · 2 · 3 · Next

Message boards : SETI@home Enhanced : Quorum, Replication.


Return to SETI@home/AstroPulse Beta main page


Copyright © 2013 University of California

AstroPulse is funded in part by the NSF through grant AST-0307956