AstroPulse v4.33 Errors and reporting


Message boards : AstroPulse : AstroPulse v4.33 Errors and reporting

Profile JLDun
Volunteer tester
Joined: 23 May 07
Posts: 66
Credit: 17,824
RAC: 0
Message 34115 - Posted: 5 Jul 2008, 10:25:25 UTC - in response to Message 34082.

Something I 'should' have said originally...
Reported RDCF is... 1.349718

(RDCF=Result Duration Correction Factor)
____________

Profile Gary Charpentier
Volunteer tester
Joined: 9 Apr 07
Posts: 956
Credit: 534,898
RAC: 2,619
Message 34127 - Posted: 6 Jul 2008, 5:26:40 UTC - in response to Message 33952.

Updating as the W/U is now overdue
Shows 66.770% cpu time 714:36:00 to finish 306:07:30.
714:36/66.77% = 1070.24 less 714.6 = 355.64 left
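The arithmetic above can be sketched in a few lines of Python. This is only a rough check, assuming progress is linear in CPU time (which the posts in this thread suggest is not exactly true); the function names are made up for illustration and are not part of BOINC.

```python
def parse_hms(text):
    """Convert an 'H:MM:SS' string (hours may exceed 24) to hours as a float."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours + minutes / 60 + seconds / 3600


def estimate_hours(cpu_hours, percent_done):
    """Linear extrapolation: total = elapsed / fraction done.

    Returns (estimated total hours, estimated remaining hours).
    """
    if not 0 < percent_done <= 100:
        raise ValueError("percent_done must be in (0, 100]")
    total = cpu_hours / (percent_done / 100)
    return total, total - cpu_hours


# Figures quoted in the post: 714:36:00 of CPU time at 66.770% done.
cpu = parse_hms("714:36:00")
total, remaining = estimate_hours(cpu, 66.770)

# BOINC Manager's own "to finish" figure from the same post, for comparison.
manager_remaining = parse_hms("306:07:30")

print(f"extrapolated total: {total:.2f} h, remaining: {remaining:.2f} h")
print(f"manager says remaining: {manager_remaining:.2f} h")
```

Comparing the two "remaining" figures shows the gap being discussed: the straight-line extrapolation and the Manager's estimate differ by roughly two days.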


From now about 21.9 days if the time to complete is correct. But time to finish plus time spent running doesn't add up to the time spent running divided by the percent done. Something is going on there. Perhaps it is why the work unit got sent to this host in the first place.

The DCF in the client_state.xml file is the same for the Seti Beta project as on the web site. You didn't mention to look for the project in the file as it wasn't the first project.

Gary with what you gave
316 hrs/30% * 100 = 1052.8 hrs or 43 days

Using Explorer, find the "client_state.xml" file in the BOINC folder; you can search it for <duration_correction_factor> to find the "active" number on your computer.
Please use Notepad to prevent stray information from affecting the client_state.xml file.
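For anyone who prefers a script to Notepad, here is a minimal read-only sketch of the same lookup. It assumes each project in client_state.xml is a <project> element containing <master_url> and <duration_correction_factor>; the sample text below is invented for illustration (the second project URL is a placeholder) and a real file holds far more. Since several projects can be listed, the sketch keys the result by project URL rather than taking the first match.

```python
import xml.etree.ElementTree as ET

# Invented, heavily trimmed stand-in for a real client_state.xml.
SAMPLE = """<client_state>
<project>
    <master_url>http://example.org/other_project/</master_url>
    <duration_correction_factor>1.000000</duration_correction_factor>
</project>
<project>
    <master_url>http://setiweb.ssl.berkeley.edu/beta/</master_url>
    <duration_correction_factor>1.120672</duration_correction_factor>
</project>
</client_state>"""


def dcf_by_project(xml_text):
    """Return {master_url: duration_correction_factor} for every project found."""
    root = ET.fromstring(xml_text)
    factors = {}
    for project in root.iter("project"):
        url = project.findtext("master_url")
        dcf = project.findtext("duration_correction_factor")
        if url is not None and dcf is not None:
            factors[url.strip()] = float(dcf)
    return factors


factors = dcf_by_project(SAMPLE)
for url, dcf in factors.items():
    print(f"{url}: DCF {dcf}")
```

To try it on a real machine you would read the file's text and pass it to dcf_by_project; the script only reads, so it cannot disturb the file.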



Al:
That would be the correct host. Presently showing cpu time 315:58:xx, percent 30.015, to complete 525:45:xx with a DCF of 1.120672. Not sure where to find the DCF on the computer side but that's what's on the website.

Gary

Gary

Thank you for hanging in there. I presume that HostID 22531 is the one you are talking about.

Do you have an idea of the DCF and runtime numbers, CPU Time, Percent complete and estimated run time from BOINC Manager?

Al

Got a W/U that looks like it won't finish until about 6 days past its deadline.
http://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=1147085
Looks like Astropulse should have never sent this machine a w/u as it's been running 24/7 on it. [uh maybe 23.75/7 -- ME's got to be rebooted daily] Before it downloaded the W/U it had an Einstein w/u in the queue that hadn't started yet that said it would take about 7 days to complete. Obviously the time to complete needs some revision on the server side before Astropulse goes live.

I'm going to let the w/u finish as I think I'll still be done before another result comes back. And also I may be providing a very slow machine data point for the w/u scheduling side of the world. And maybe a Windows ME data point too.


Oh, on the Mac side of the house the 4.33 testing goes on. Had one w/u fail immediately after a reboot
http://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=1145206
Got another crunching.







____________

Idefix
Volunteer tester
Joined: 18 Jan 06
Posts: 36
Credit: 8,641
RAC: 0
Message 34130 - Posted: 6 Jul 2008, 14:10:05 UTC - in response to Message 33814.

Hi,

after more than 1 year not crunching for SETI beta I wanted to give Astropulse a try and downloaded my first AP workunit.

Finished after 268 hours. Initial estimates were around 185 hours (DCF climbed from 1 to 1.437656, Athlon XP 2000+, 256 MB RAM, Win98 SE).

But as you can see, my wingmen already got their credits (but I guess the credits were granted manually).

The already granted credits of my successful wingman were removed from the "Workunit details" page (but not from the "Computer summary" page) shortly after I started crunching. After this result passed a proper validation the result got 1,748.80 credits again. The computer summary now shows 3,498 total credits although this computer crunched only this task. That's a nice present for my wingman ... ;-)

Regards,
Carsten

____________

Odysseus
Volunteer tester
Joined: 11 Feb 06
Posts: 529
Credit: 163,915
RAC: 97
Message 34131 - Posted: 6 Jul 2008, 20:43:23 UTC

I just got an odd set of messages reporting an AP task:

Sun 6 Jul 14:29:50 2008|SETI@home Beta Test|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
Sun 6 Jul 14:29:59 2008||Project communication failed: attempting access to reference site
Sun 6 Jul 14:30:00 2008|SETI@home Beta Test|Scheduler request failed: Server returned nothing (no headers, no data)
Sun 6 Jul 14:30:01 2008||Access to reference site succeeded - project servers may be temporarily down.
Sun 6 Jul 14:31:00 2008|SETI@home Beta Test|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
Sun 6 Jul 14:31:06 2008|SETI@home Beta Test|Scheduler request succeeded: got 0 new tasks
Sun 6 Jul 14:31:06 2008|SETI@home Beta Test|Message from server: Completed result ap_23ap08aa_B0_P1_00305_20080619_14751.wu_2 refused: result already reported as success

The WU was cleared from my BOINC Manager (v5.10.45) on the second attempt; the result indeed seems to have been received OK, awaiting quorum at the moment.
____________

Josef W. Segur
Volunteer tester
Joined: 14 Oct 05
Posts: 1016
Credit: 1,494,521
RAC: 328
Message 34133 - Posted: 7 Jul 2008, 3:39:25 UTC - in response to Message 34131.

I just got an odd set of messages reporting an AP task:

...
Sun 6 Jul 14:30:00 2008|SETI@home Beta Test|Scheduler request failed: Server returned nothing (no headers, no data)
...
Sun 6 Jul 14:31:06 2008|SETI@home Beta Test|Scheduler request succeeded: got 0 new tasks
Sun 6 Jul 14:31:06 2008|SETI@home Beta Test|Message from server: Completed result ap_23ap08aa_B0_P1_00305_20080619_14751.wu_2 refused: result already reported as success

The WU was cleared from my BOINC Manager (v5.10.45) on the second attempt; the result indeed seems to have been received OK, awaiting quorum at the moment.

It happens now and then that the first Scheduler reply doesn't get back to the host. I wonder if it might be better to replace the "refused" with "wasn't needed".
Joe

Profile Keith T.
Volunteer tester
Joined: 9 Feb 07
Posts: 129
Credit: 25,809
RAC: 0
Message 34135 - Posted: 7 Jul 2008, 7:11:03 UTC - in response to Message 34130.
Last modified: 7 Jul 2008, 7:17:27 UTC

Hi,
after more than 1 year not crunching for SETI beta I wanted to give Astropulse a try and downloaded my first AP workunit.

Finished after 268 hours. Initial estimates were around 185 hours (DCF climbed from 1 to 1.437656, Athlon XP 2000+, 256 MB RAM, Win98 SE).

But as you can see, my wingmen already got their credits (but I guess the credits were granted manually).

The already granted credits of my successful wingman were removed from the "Workunit details" page (but not from the "Computer summary" page) shortly after I started crunching. After this result passed a proper validation the result got 1,748.80 credits again. The computer summary now shows 3,498 total credits although this computer crunched only this task. That's a nice present for my wingman ... ;-)

Regards,
Carsten


My Athlon XP 2200+ also completed a task yesterday http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=3933742

Final DCF 1.406594
Final CPU time 238:04:21 (857061.4 seconds)
Claimed credit 2197.95529762032
Granted credit 1265.22276477842

Previous task took 1108548 seconds, using 4.30
New run time is ~ 77.3% of the 4.30 run time (about 22.7% less)

[edit]Task was inside deadline by ~ 36 hours. It did go into "high priority" a few times, usually when the computer was off for more than 12 hours. Resource share was about 33%.[/edit]
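A quick way to check the comparison between the two run times quoted above, as a small Python sketch. The variable names are only illustrative; the two inputs are the CPU times from this post.

```python
# CPU seconds quoted in the post for the same host.
old_seconds = 1108548.0   # previous task, app version 4.30
new_seconds = 857061.4    # this task, app version 4.33

# Fraction of the old run time that the new version needs.
runtime_ratio = new_seconds / old_seconds

# Share of the run time that was saved.
time_saved = 1 - runtime_ratio

# Throughput factor: how many times faster the new version is.
speedup = old_seconds / new_seconds

print(f"new/old run time: {runtime_ratio:.1%}")
print(f"time saved:       {time_saved:.1%}")
print(f"speed-up factor:  {speedup:.3f}x")
```

The three numbers describe the same change from different angles, which is why a "77.3%" figure can be read either as a run-time ratio or, mistakenly, as a speed-up.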

Profile Keith T.
Volunteer tester
Joined: 9 Feb 07
Posts: 129
Credit: 25,809
RAC: 0
Message 34136 - Posted: 7 Jul 2008, 7:27:36 UTC - in response to Message 34127.

Updating as the W/U is now overdue
Shows 66.770% cpu time 714:36:00 to finish 306:07:30.
714:36/66.77% = 1070.24 less 714.6 = 355.64 left


http://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=1147085

Good Luck. The new wingman is running an Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz [x86 Family 6 Model 15 Stepping 11].

I hope you get credit even if he reports first.

Profile Keith T.
Volunteer tester
Joined: 9 Feb 07
Posts: 129
Credit: 25,809
RAC: 0
Message 34137 - Posted: 7 Jul 2008, 7:34:53 UTC - in response to Message 34099.

Jim et al

Various programs that allow multiple users to access the same "file" follow different rules about what happens when the file is open and someone touches it. Any Seti application follows the Common Sense Rules. Virus scanners tend to ignore those rules: IF they trap a virus, they want to ensure they have it LOCKED! Then it cannot do any damage. That is one reason why people tell you to exclude BOINC and any Project Applications and/or directories.

I hope that makes more sense...

Al

I run a relatively old version of Symantec Antivirus. I'll see if I can exclude files.

I'm not sure I am comfortable with this explanation. Windows apps should be able to retry or somehow get around trying to read a file that is being virus scanned. Admitting that I know nothing about AP, why should it be any different than other Windows BOINC apps that don't have this problem?

Thanks,
Jim

I run a virus scan. I'm not sure how to figure out if it looks at the BOINC folder. Even so, why would I get this error? My other BOINC apps don't seem to mind.

Thanks,
Jim


Some virus scanners may lock files while they are scanning them.
The AP application probably needs exclusive access to its files.

Which scanner do you use?

In most virus scanners it is possible to exclude certain folders or files from scanning.




Thanks Al, I knew I had read about some BOINC projects having issues with some Virus Scanners, but I did not know the full details.

I use AVG Free ver 7.5 but I do not run regular scans of the whole HDD.

Juha
Volunteer tester
Joined: 18 Jun 08
Posts: 32
Credit: 20,837
RAC: 2
Message 34138 - Posted: 7 Jul 2008, 16:51:35 UTC

Just completed my first Astropulse workunit.

309 hours with 1.2 GHz AMD Duron.

-Juha

Profile Richard U
Volunteer tester
Joined: 6 May 07
Posts: 69
Credit: 496,888
RAC: 90
Message 34141 - Posted: 9 Jul 2008, 2:16:53 UTC - in response to Message 34137.
Last modified: 9 Jul 2008, 2:18:24 UTC

Jim et al

Various programs that allow multiple users to access the same "file" follow different rules about what happens when the file is open and someone touches it. Any Seti application follows the Common Sense Rules. Virus scanners tend to ignore those rules: IF they trap a virus, they want to ensure they have it LOCKED! Then it cannot do any damage. That is one reason why people tell you to exclude BOINC and any Project Applications and/or directories.

I hope that makes more sense...

Al

cut.


OK I have done some testing re the locked files, first with all Boinc files and directories excluded from real time scanning, and then with real time scanning turned off. (Vista/OneCare) The results are consistent in the fact that none of these procedures had any effect on the 'Can't delete previous state file; The system cannot find the file specified. (0x2)'

I have 4 applications running at all times and all of these write to the same file. Is this what is locking the file and disallowing its deletion? Or is one of the running apps trying to delete the file while another app has it in a rewrite state?

The other concern: is this behaviour causing or adding to any of the errors, both in Seti and AP?
____________
Richard U

Josef W. Segur
Volunteer tester
Joined: 14 Oct 05
Posts: 1016
Credit: 1,494,521
RAC: 328
Message 34143 - Posted: 9 Jul 2008, 3:53:44 UTC - in response to Message 34141.

OK I have done some testing re the locked files, first with all Boinc files and directories excluded from real time scanning, and then with real time scanning turned off. (Vista/OneCare) The results are consistent in the fact that none of these procedures had any effect on the 'Can't delete previous state file; The system cannot find the file specified. (0x2)'

I have 4 applications running at all times and all of these write to the same file. Is this what is locking the file and disallowing its deletion? Or is one of the running apps trying to delete the file while another app has it in a rewrite state?

The other concern: is this behaviour causing or adding to any of the errors, both in Seti and AP?

Those messages refer to the BOINC client_state.xml and client_state_prev.xml files, not files written by applications. Only the single boinc.exe instance touches those files.

What's involved is a safe writing procedure: BOINC should have 3 state files just after writing the new state info, then it deletes the oldest and renames the remaining two. If there wasn't any client_state_prev.xml to delete, it means you were at least temporarily operating with less safety than intended.
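The write, delete, rename sequence described above can be sketched in Python as follows. This is an illustration of the general pattern only, not BOINC's actual code; the temporary file name client_state_next.xml and the function name are assumptions made for the example.

```python
import os
import tempfile


def write_state(directory, contents):
    """Write a new state file using a three-file rotation.

    Returns True if an older backup existed to delete, False otherwise
    (the situation the "Can't delete previous state file" message reports).
    """
    current = os.path.join(directory, "client_state.xml")
    previous = os.path.join(directory, "client_state_prev.xml")
    pending = os.path.join(directory, "client_state_next.xml")  # assumed name

    # 1. Write the new state to its own file and force it to disk,
    #    so the existing files stay intact if the write fails.
    with open(pending, "w", encoding="utf-8") as handle:
        handle.write(contents)
        handle.flush()
        os.fsync(handle.fileno())

    # 2. Delete the oldest copy. A missing file is not fatal, but it means
    #    there was no backup during the previous cycle.
    try:
        os.remove(previous)
        had_previous = True
    except FileNotFoundError:
        had_previous = False

    # 3. Rename the remaining two: current becomes the backup,
    #    the freshly written file becomes current.
    if os.path.exists(current):
        os.rename(current, previous)
    os.rename(pending, current)
    return had_previous


# Small demonstration in a throwaway directory.
with tempfile.TemporaryDirectory() as demo_dir:
    results = [write_state(demo_dir, text) for text in ("a", "b", "c")]
    with open(os.path.join(demo_dir, "client_state.xml")) as handle:
        current_text = handle.read()
    with open(os.path.join(demo_dir, "client_state_prev.xml")) as handle:
        previous_text = handle.read()
```

In the demonstration the first two writes find no backup to delete, which mirrors the harmless but less safe state described in the post; from the third write on, a backup always exists.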

BOINC updates that info very often, for instance each time an application checkpoints, so having four applications running does increase the rate of client_state updates.
Joe

Profile [KWSN]John Galt 007
Volunteer tester
Joined: 13 Jul 07
Posts: 6
Credit: 520,875
RAC: 0
Message 34304 - Posted: 21 Jul 2008, 18:05:12 UTC

I know that 4.33 is old now, but is the Mac app going to take 370+ hours for a G4 like mine? Or is it the debugging that is going on that causes the long run times?

BTW, I tried 4.34 this morning and got 2 compute errors right away. Here is one of them. I notified Dotsch, and hopefully he can pinpoint the problem with 4.34 on PPC.
____________




Jim Wilkins
Volunteer tester
Joined: 1 Nov 06
Posts: 55
Credit: 344,829
RAC: 0
Message 34305 - Posted: 21 Jul 2008, 18:48:04 UTC - in response to Message 34304.

How did you get a Mac app? I don't see it in the apps list.

Jim

I know that 4.33 is old now, but is the Mac app going to take 370+ hours for a G4 like mine? Or is it the debugging that is going on that causes the long run times?

BTW, I tried 4.34 this morning and got 2 compute errors right away. Here is one of them. I notified Dotsch, and hopefully he can pinpoint the problem with 4.34 on PPC.

Profile [KWSN]John Galt 007
Volunteer tester
Joined: 13 Jul 07
Posts: 6
Credit: 520,875
RAC: 0
Message 34306 - Posted: 21 Jul 2008, 19:19:52 UTC - in response to Message 34305.

How did you get a Mac app? I don't see it in the apps list.

Jim

I know that 4.33 is old now, but is the Mac app going to take 370+ hours for a G4 like mine? Or is it the debugging that is going on that causes the long run times?

BTW, I tried 4.34 this morning and got 2 compute errors right away. Here is one of them. I notified Dotsch, and hopefully he can pinpoint the problem with 4.34 on PPC.



I have been beta testing for Dotsch; it's not ready for prime time yet. PM him for more info.
____________




Jim Wilkins
Volunteer tester
Joined: 1 Nov 06
Posts: 55
Credit: 344,829
RAC: 0
Message 34307 - Posted: 21 Jul 2008, 19:27:37 UTC - in response to Message 34306.

Thanks,

Jim

How did you get a Mac app? I don't see it in the apps list.

Jim

I know that 4.33 is old now, but is the Mac app going to take 370+ hours for a G4 like mine? Or is it the debugging that is going on that causes the long run times?

BTW, I tried 4.34 this morning and got 2 compute errors right away. Here is one of them. I notified Dotsch, and hopefully he can pinpoint the problem with 4.34 on PPC.



I have been beta testing for Dotsch; it's not ready for prime time yet. PM him for more info.

Profile Gary Charpentier
Volunteer tester
Joined: 9 Apr 07
Posts: 956
Credit: 534,898
RAC: 2,619
Message 34334 - Posted: 23 Jul 2008, 4:44:53 UTC - in response to Message 34136.

Updating as the W/U is now overdue
Shows 66.770% cpu time 714:36:00 to finish 306:07:30.
714:36/66.77% = 1070.24 less 714.6 = 355.64 left


http://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=1147085

Good Luck. The new wingman is running an Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz [x86 Family 6 Model 15 Stepping 11].

I hope you get credit even if he reports first.


He reported a long time ago. Understand manual credits are being given. Anyway final update, now at 99.351% done showing about 7 hours to go and crunching for 1104 hours. It will be done by the time I wake up.

We will see what happens to the DCF after it reports!

Gary


____________

Profile Gary Charpentier
Volunteer tester
Joined: 9 Apr 07
Posts: 956
Credit: 534,898
RAC: 2,619
Message 34342 - Posted: 23 Jul 2008, 18:06:11 UTC - in response to Message 34334.

Updating as the W/U is now overdue
Shows 66.770% cpu time 714:36:00 to finish 306:07:30.
714:36/66.77% = 1070.24 less 714.6 = 355.64 left


http://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=1147085

Good Luck. The new wingman is running an Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz [x86 Family 6 Model 15 Stepping 11].

I hope you get credit even if he reports first.


He reported a long time ago. Understand manual credits are being given. Anyway final update, now at 99.351% done showing about 7 hours to go and crunching for 1104 hours. It will be done by the time I wake up.

We will see what happens to the DCF after it reports!

Gary



DCF is up to 1.747061 from 1.120672.

Got credit, but I see the claimed credit for this work unit varies from 669.38 up to 2,167.15, a factor of over 3! Something had better be done about getting the counts correct before Astropulse goes live on the main site or the complaints will be flying.

I also see we have regular seti work units going out again.

Gary
____________

Josef W. Segur
Volunteer tester
Joined: 14 Oct 05
Posts: 1016
Credit: 1,494,521
RAC: 328
Message 34353 - Posted: 24 Jul 2008, 4:55:39 UTC - in response to Message 34342.

...
Got credit, but I see the claimed credit for this work unit varies from 669.38 up to 2,167.15, a factor of over 3! Something had better be done about getting the counts correct before Astropulse goes live on the main site or the complaints will be flying.
...
Gary

AstroPulse 4.34 and 4.35 report exactly 1.61485e+15 fpops_cumulative for completed WUs, which converts to a claim of 1869.04 credits. The fpops are proportionally reduced if the app exits early, as for 30 pulses found.
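The conversion can be reproduced with a short Python sketch. It assumes the usual cobblestone scale of 100 credits for one day of work at 1 GFLOPS; that assumption is not stated in the post, but it does give the quoted 1869.04 for the quoted fpops figure. The function name is made up for illustration.

```python
SECONDS_PER_DAY = 86400
CREDITS_PER_GFLOPS_DAY = 100  # assumed cobblestone scale


def claimed_credit(fpops_cumulative):
    """Convert a floating point operation count into claimed credit."""
    gflops_days = fpops_cumulative / 1e9 / SECONDS_PER_DAY
    return gflops_days * CREDITS_PER_GFLOPS_DAY


# Figure reported by AstroPulse 4.34 and 4.35 for a completed WU.
full_wu_credit = claimed_credit(1.61485e15)
print(f"claimed credit: {full_wu_credit:.2f}")
```

Because the claim is a straight multiple of the operation count, a task that exits early with proportionally fewer fpops would claim proportionally less credit under the same assumption.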
Joe

Profile Gary Charpentier
Volunteer tester
Joined: 9 Apr 07
Posts: 956
Credit: 534,898
RAC: 2,619
Message 34354 - Posted: 24 Jul 2008, 6:10:55 UTC - in response to Message 34353.

...
Got credit, but I see the claimed credit for this work unit varies from 669.38 up to 2,167.15, a factor of over 3! Something had better be done about getting the counts correct before Astropulse goes live on the main site or the complaints will be flying.
...
Gary

AstroPulse 4.34 and 4.35 report exactly 1.61485e+15 fpops_cumulative for completed WUs, which converts to a claim of 1869.04 credits. The fpops are proportionally reduced if the app exits early, as for 30 pulses found.
Joe


Good to know. I was very much wondering as that machine of mine just completed a MB work unit http://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=1212923 and all three machines reporting claimed the same credit.

Gary


____________



Copyright © 2013 University of California

AstroPulse is funded in part by the NSF through grant AST-0307956