SETI applications for NVIDIA GPU improvement - how you can help

Message boards : Number crunching : SETI applications for NVIDIA GPU improvement - how you can help
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 14 · Next

AuthorMessage
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1810103 - Posted: 18 Aug 2016, 9:47:25 UTC

New anticipated default for kernel target time is 60ms.

Try to run (both ATi and NV) with -tt 60 for some time and report if GUI lags too high for your particular host.
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1810103 · Report as offensive     Reply Quote
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 33458
Credit: 79,922,639
RAC: 80
Germany
Message 1810130 - Posted: 18 Aug 2016, 13:22:07 UTC

See my bench and coment at Lunatics.
With each crime and every kindness we birth our future.
ID: 1810130 · Report as offensive     Reply Quote
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1810135 - Posted: 18 Aug 2016, 13:47:55 UTC - in response to Message 1810130.  

See my bench and coment at Lunatics.

Done with request for further testing :)
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1810135 · Report as offensive     Reply Quote
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1814655 - Posted: 2 Sep 2016, 16:15:41 UTC

Here are new rev builds, please test
https://cloud.mail.ru/public/M7kP/4f6CqSGhB
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1814655 · Report as offensive     Reply Quote
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1814659 - Posted: 2 Sep 2016, 16:22:01 UTC - in response to Message 1814655.  

Is there anything in particular we are looking for?
ID: 1814659 · Report as offensive     Reply Quote
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1814661 - Posted: 2 Sep 2016, 16:27:54 UTC - in response to Message 1814659.  

Is there anything in particular we are looking for?

just general performance and usability. It's new RC builds so if all OK they will go to main unchanged.
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1814661 · Report as offensive     Reply Quote
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1814663 - Posted: 2 Sep 2016, 16:36:44 UTC - in response to Message 1814661.  

28 Computation errors right away.

Not sure if I caused it or if it was from the new app.
ID: 1814663 · Report as offensive     Reply Quote
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1814666 - Posted: 2 Sep 2016, 16:40:01 UTC - in response to Message 1814663.  

ID: 1814666 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14511
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1814670 - Posted: 2 Sep 2016, 17:00:29 UTC
Last modified: 2 Sep 2016, 17:39:43 UTC

Thanks for the warning - I suspended all tasks, and let them loose one at a time.

Resuming a task that had been running under r3500:
http://setiathome.berkeley.edu/result.php?resultid=5133129501

Starting a new task from scratch:
http://setiathome.berkeley.edu/result.php?resultid=5133135420

Both went through the wisdom and CL compiler stages OK, but crashed (like Zalster's) with a memory access exception as soon as the real work started. That sounds like a program problem to me.

NV version, running on device 1, GTX 750 Ti

Edit - observation while cleaning BIN and Wisdom files prior to reverting to r3500. Machine has an intel_gpu in addition to the NV cards - but I haven't updated the intel_gpu app - it's still at r3430, and inactive (no tasks).

But the NV apps create

r3500_IntelRCoreTMi54690CPU350GHz_x86.wisdom
r3522_IntelRCoreTMi54690CPU350GHz_x86.wisdom

Harmless, but curious. Any suggestion why?

BOINC's OpenCL detections are

02/09/2016 18:07:59 | | OpenCL: NVIDIA GPU 0: GeForce GTX 970 (driver version 350.12, device version OpenCL 1.2 CUDA, 4096MB, 3839MB available, 4087 GFLOPS peak)
02/09/2016 18:07:59 | | OpenCL: NVIDIA GPU 1: GeForce GTX 750 Ti (driver version 350.12, device version OpenCL 1.2 CUDA, 2048MB, 1967MB available, 1639 GFLOPS peak)
02/09/2016 18:07:59 | | OpenCL: Intel GPU 0: Intel(R) HD Graphics 4600 (driver version 10.18.10.3621, device version OpenCL 1.2, 1298MB, 1298MB available, 192 GFLOPS peak)
02/09/2016 18:07:59 | | OpenCL CPU: Intel(R) Core(TM) i5-4690 CPU @ 3.50GHz (OpenCL driver vendor: Intel(R) Corporation, driver version 3.0.1.10878, device version OpenCL 1.2 (Build 76413))

Edit 2 - r3500 has been generating that CPU wisdom file all along, timestamped at the start of every task. Just hadn't bothered to look for it before. It's 3 KB, 27 lines.
ID: 1814670 · Report as offensive     Reply Quote
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1814711 - Posted: 2 Sep 2016, 19:43:57 UTC
Last modified: 2 Sep 2016, 19:44:41 UTC

thanks
at least one bug fixed new rev in build
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1814711 · Report as offensive     Reply Quote
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1814713 - Posted: 2 Sep 2016, 19:53:40 UTC - in response to Message 1814670.  


Edit 2 - r3500 has been generating that CPU wisdom file all along, timestamped at the start of every task. Just hadn't bothered to look for it before. It's 3 KB, 27 lines.

Sure, app uses FFTW lib too.
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1814713 · Report as offensive     Reply Quote
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1814723 - Posted: 2 Sep 2016, 21:03:51 UTC

https://cloud.mail.ru/public/2F9e/1xm2varnV
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1814723 · Report as offensive     Reply Quote
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1814731 - Posted: 2 Sep 2016, 22:08:53 UTC - in response to Message 1814723.  

ID: 1814731 · Report as offensive     Reply Quote
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1814798 - Posted: 3 Sep 2016, 10:19:28 UTC - in response to Message 1814731.  

Ok, so far so good.

Stderr is showing the correct GPUs being used

http://setiathome.berkeley.edu/result.php?resultid=5133497505

http://setiathome.berkeley.edu/result.php?resultid=5133485089

http://setiathome.berkeley.edu/result.php?resultid=5133491214

http://setiathome.berkeley.edu/result.php?resultid=5133491133

Fftlength=8,pass=3:Tune: sum=5539.19(ms); min=3.181(ms); max=757(ms); mean=240.8(ms); s_mean=208.8; sleep=210(ms); delta=2602; N=23; usual
Fftlength=8,pass=4:Tune: sum=3912.48(ms); min=3.409(ms); max=330.2(ms); mean=170.1(ms); s_mean=138.9; sleep=135(ms); delta=2340; N=23; usual
Fftlength=8,pass=5:Tune: sum=2917.61(ms); min=3.404(ms); max=250.4(ms); mean=126.9(ms); s_mean=104.9; sleep=105(ms); delta=2077; N=23; usual

Do you see any lags?
If not consider to add -tt 1500 to effectively disable kernel size tuning.

currently you provide period_iterations_num 1, but kernel size exceed default 15ms so adaptation to 15ms-long kernel starts and effective iterations num starts to increase too.
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1814798 · Report as offensive     Reply Quote
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1814806 - Posted: 3 Sep 2016, 11:14:22 UTC

And another update: https://cloud.mail.ru/public/LFuj/uqJQnKiMc
Hope this one will go on beta servers later.
Version to 8.18 now be warned.
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1814806 · Report as offensive     Reply Quote
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1814816 - Posted: 3 Sep 2016, 13:54:39 UTC - in response to Message 1814806.  

Version to 8.18 now be warned.


Wild guess this is actually more for beta identification as we are still 8.12 here at main?

I'll test it here in a bit.

Some early morning chores here (8 hours behind you Raistmer) still before I can sit down and play with this
ID: 1814816 · Report as offensive     Reply Quote
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1814822 - Posted: 3 Sep 2016, 15:12:22 UTC - in response to Message 1814798.  

Ok, so far so good.

Stderr is showing the correct GPUs being used

http://setiathome.berkeley.edu/result.php?resultid=5133497505

http://setiathome.berkeley.edu/result.php?resultid=5133485089

http://setiathome.berkeley.edu/result.php?resultid=5133491214

http://setiathome.berkeley.edu/result.php?resultid=5133491133

Fftlength=8,pass=3:Tune: sum=5539.19(ms); min=3.181(ms); max=757(ms); mean=240.8(ms); s_mean=208.8; sleep=210(ms); delta=2602; N=23; usual
Fftlength=8,pass=4:Tune: sum=3912.48(ms); min=3.409(ms); max=330.2(ms); mean=170.1(ms); s_mean=138.9; sleep=135(ms); delta=2340; N=23; usual
Fftlength=8,pass=5:Tune: sum=2917.61(ms); min=3.404(ms); max=250.4(ms); mean=126.9(ms); s_mean=104.9; sleep=105(ms); delta=2077; N=23; usual

Do you see any lags?
If not consider to add -tt 1500 to effectively disable kernel size tuning.

currently you provide period_iterations_num 1, but kernel size exceed default 15ms so adaptation to 15ms-long kernel starts and effective iterations num starts to increase too.



No lags, I have not tried -tt1500 but I can try with new build r3525. I'll run it with original commandline for a little while, then with the changes you suggested. I'll also decrease the number of instances to 1 work unit per card. These were with 3 per card.
ID: 1814822 · Report as offensive     Reply Quote
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1814841 - Posted: 3 Sep 2016, 17:31:08 UTC - in response to Message 1814806.  
Last modified: 3 Sep 2016, 17:41:33 UTC

And another update: https://cloud.mail.ru/public/LFuj/uqJQnKiMc

1)VHAR
http://setiathome.berkeley.edu/result.php?resultid=5134840328
WU true angle range is : 12.356051
Run time 2 min 24 sec
CPU time 12 sec

2)VLAR
http://setiathome.berkeley.edu/result.php?resultid=5134840293
WU true angle range is : 0.009254
Run time 9 min 10 sec
CPU time 9 min 7 sec


http://setiathome.berkeley.edu/result.php?resultid=5134840099
WU true angle range is : 0.011647
Run time 9 min 8 sec
CPU time 9 min 5 sec


3) Normal AR
http://setiathome.berkeley.edu/result.php?resultid=5134800057
WU true angle range is : 0.416899
Run time 4 min 19 sec
CPU time 4 min 16 sec

http://setiathome.berkeley.edu/result.php?resultid=5134807062
WU true angle range is : 0.416928
Run time 4 min 20 sec
CPU time 4 min 17 sec


With original commandlines
1) VHAR
http://setiathome.berkeley.edu/result.php?resultid=5134821117
WU true angle range is : 2.086420
Run time 2 min 23 sec
CPU time 26 sec


http://setiathome.berkeley.edu/result.php?resultid=5134821116
WU true angle range is : 2.043501
Run time 2 min 25 sec
CPU time 19 sec


2) VLARs
http://setiathome.berkeley.edu/result.php?resultid=5134766659
WU true angle range is : 0.022368
Run time 7 min 46 sec
CPU time 7 min 44 sec


http://setiathome.berkeley.edu/result.php?resultid=5134820892
WU true angle range is : 0.015644
Run time 8 min 5 sec
CPU time 8 min 3 sec

http://setiathome.berkeley.edu/result.php?resultid=5134820937
WU true angle range is : 0.015644
Run time 8 min 4 sec
CPU time 8 min 2 sec


3) Normal AR
http://setiathome.berkeley.edu/result.php?resultid=5134766599
WU true angle range is : 0.416902
Run time 4 min 19 sec
CPU time 4 min 17 sec

http://setiathome.berkeley.edu/result.php?resultid=5134766724
WU true angle range is : 0.432514
Run time 4 min 6 sec
CPU time 2 min 38 sec


With -tt 1500 no -period_iterations_num
1) VHAR
http://setiathome.berkeley.edu/result.php?resultid=5134813343
WU true angle range is : 97.183501
Run time 2 min 24 sec
CPU time 7 sec


2) VLARs
http://setiathome.berkeley.edu/result.php?resultid=5134840108
WU true angle range is : 0.011647
Run time 7 min 26 sec
CPU time 7 min 24 sec

http://setiathome.berkeley.edu/result.php?resultid=5134840018
WU true angle range is : 0.011647
Run time 7 min 31 sec
CPU time 7 min 29 sec


3) Normal AR
http://setiathome.berkeley.edu/result.php?resultid=5134821120
WU true angle range is : 0.447831
Run time 3 min 58 sec
CPU time 2 min 54 sec

http://setiathome.berkeley.edu/result.php?resultid=5134807348
WU true angle range is : 0.416928
Run time 4 min 17 sec
CPU time 3 min 2 sec


single work units seem faster with the -tt 1500 than the -period_iterations_num 1

If you want times I can post but I figured you would want to see single runs


Edit..

Looking at 3 at a time

-tt 1500

http://setiathome.berkeley.edu/result.php?resultid=5134840207[url]
WU true angle range is : 0.006972
Run time 21 min 25 sec
CPU time 18 min 36 sec

http://setiathome.berkeley.edu/result.php?resultid=5134840104
WU true angle range is : 0.011647
Run time 21 min 30 sec
CPU time 18 min 48 sec

Original commandline
http://setiathome.berkeley.edu/result.php?resultid=5134487699
WU true angle range is : 0.024645
Run time 22 min 32 sec
CPU time 20 min 29 sec


http://setiathome.berkeley.edu/result.php?resultid=5134487700
WU true angle range is : 0.009254
Run time 20 min 50 sec
CPU time 19 min 27 sec

So not really seeing any advantage to -tt 1500 when running more than 1 work unit per GPU
ID: 1814841 · Report as offensive     Reply Quote
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6324
Credit: 106,370,077
RAC: 121
Russia
Message 1814843 - Posted: 3 Sep 2016, 17:37:12 UTC - in response to Message 1814841.  

To get max you should specify both -period_iterations_num 1 and -tt 1500
First one set initial increase and second ensures that app will not change it.
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1814843 · Report as offensive     Reply Quote
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1814845 - Posted: 3 Sep 2016, 17:41:57 UTC - in response to Message 1814843.  
Last modified: 3 Sep 2016, 17:43:49 UTC

Ok, I will try it with both, thanks

edit.

Any particular order in the commandline?

-period_iterations_num then -tt

or

-tt then -period_iterations_num ?
ID: 1814845 · Report as offensive     Reply Quote
Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 14 · Next

Message boards : Number crunching : SETI applications for NVIDIA GPU improvement - how you can help


 
©2022 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.