Astropulse 7,00 released for Linux 32&64, Win 32&64, Win32+AMD/NVIDIA/Intel GPU

Message boards : News : Astropulse 7,00 released for Linux 32&64, Win 32&64, Win32+AMD/NVIDIA/Intel GPU
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 35 · Next

AuthorMessage
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 51457 - Posted: 4 Jul 2014, 20:18:07 UTC - in response to Message 51453.  
Last modified: 4 Jul 2014, 20:22:52 UTC

Is there any way (predefined symbols) to tell at (OpenCL) compile time what the maximum workgroup size or shared data size is?

I would hope that OpenCL will eventually become a more general language where the compilers will do what is necessary to generate functional code regardless of what the inner workings of the GPU are.


Well, usually they do, in NULL provided as size, for example.
But that particular kernel requires precise number of threads and precise amount of shared memory so it failed on some devices.

Unfortunately, fix I proposed earlier will bot work - kernel referenced inside host code.
So, for now such GPUs out of testing.
Will provide 7.01 in next few days I hope.

GPU FLOPS on app page show that only subset of devices suffer from this issue.
ID: 51457 · Report as offensive
Profile Eric J Korpela
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 15 Mar 05
Posts: 1547
Credit: 27,183,456
RAC: 0
United States
Message 51458 - Posted: 4 Jul 2014, 20:37:24 UTC

The MacOS CPU and ATI GPU versions are out there now as well.
ID: 51458 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 51460 - Posted: 4 Jul 2014, 21:30:47 UTC - in response to Message 51458.  

The MacOS CPU and ATI GPU versions are out there now as well.

Great!
Meantime 2:0 Brazilia leads :D
ID: 51460 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 51463 - Posted: 4 Jul 2014, 22:09:56 UTC
Last modified: 4 Jul 2014, 22:24:51 UTC

OMG, again!

http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=17271762

That famous crash after computation finish still here :(

EDIT:
normal finish:

TWIN_FFA OCL_ZERO_COPY USE_OPENCL USE_OPENCL_NV OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2488
GPU device synched
01:53:48 (3156): called boinc_finish


crash:

TWIN_FFA OCL_ZERO_COPY USE_OPENCL USE_OPENCL_NV OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2488


Unhandled Exception Detected...


So, crash inside finish routine?...
ID: 51463 · Report as offensive
Profile Eric J Korpela
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 15 Mar 05
Posts: 1547
Credit: 27,183,456
RAC: 0
United States
Message 51471 - Posted: 5 Jul 2014, 0:25:04 UTC - in response to Message 51463.  

  403de6:       8b 0d c0 b5 4e 00       mov    0x4eb5c0,%ecx
  403dec:       8b 15 c8 eb 4c 00       mov    0x4cebc8,%edx
  403df2:       8b 04 91                mov    (%ecx,%edx,4),%eax
  403df5:       50                      push   %eax
  403df6:       ff 15 b8 90 4a 00       call   *0x4a90b8


That seems odd to begin with. It appear to be reading into an array of ints starting at address 0x4eb5c0 (5158336) to get element 0x4cebc8 (5041096) which is at address 0x18264E0 (25322720). These numbers appear to be set at compile time, not at load time,

0x4eb5c0 appears a lot of times in the code. It looks like its a statically allocated structure. The value of the first 4 byte element, possibly a boolean is tested a lot. That's about all I can tell without a symbol table.
ID: 51471 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 2 Jul 13
Posts: 505
Credit: 5,019,318
RAC: 0
United States
Message 51474 - Posted: 5 Jul 2014, 2:08:46 UTC

ID: 51474 · Report as offensive
Urs Echternacht
Volunteer tester
Avatar

Send message
Joined: 18 Jan 06
Posts: 1038
Credit: 18,734,730
RAC: 0
Germany
Message 51475 - Posted: 5 Jul 2014, 2:23:34 UTC - in response to Message 51458.  
Last modified: 5 Jul 2014, 2:52:26 UTC

The MacOS CPU and ATI GPU versions are out there now as well.


And the other two OpenCL apps for MacOSX will follow.

Important : One of the licensing files currently sent to Macs (and eventually to everyone else) is an old version and needs to be updated : astropulse_7.00_COPYRIGHT.txt
_\|/_
U r s
ID: 51475 · Report as offensive
Josef W. Segur
Volunteer tester

Send message
Joined: 14 Oct 05
Posts: 1137
Credit: 1,848,733
RAC: 0
United States
Message 51478 - Posted: 5 Jul 2014, 3:41:48 UTC

I presume the 1400.00 credit grants are from the --max_granted_credit argument. I'm very curious just how far David's drunken walk statistics is really staggering on the high side. For my host, 94.07 is the lowest I've seen.
                                                                   Joe
ID: 51478 · Report as offensive
Profile Eric J Korpela
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 15 Mar 05
Posts: 1547
Credit: 27,183,456
RAC: 0
United States
Message 51479 - Posted: 5 Jul 2014, 4:15:50 UTC - in response to Message 51478.  

Yep, it's the "traditional" settling in period.
ID: 51479 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 51480 - Posted: 5 Jul 2014, 9:19:57 UTC

With tasks grouping by outcome and app type we have good tool for testing outcome analyze. But adding third line - grouping by plan class/app version/device would be even more powerful. To discriminate between CPU/ATi/NV/iGPU results and between different SIMD levels in CPU.

Could we have such improvement in site design in the future?
ID: 51480 · Report as offensive
Profile TomTom
Volunteer tester
Avatar

Send message
Joined: 4 Dec 13
Posts: 98
Credit: 10,262,771
RAC: 0
France
Message 51482 - Posted: 5 Jul 2014, 12:48:17 UTC

All my AstroPulse v7 v7.00 (opencl_nvidia_100) tasks failed for Computer 69586 with the same error as task 17284516
WU's on first
ID: 51482 · Report as offensive
Urs Echternacht
Volunteer tester
Avatar

Send message
Joined: 18 Jan 06
Posts: 1038
Credit: 18,734,730
RAC: 0
Germany
Message 51483 - Posted: 5 Jul 2014, 15:06:31 UTC - in response to Message 51482.  

All my AstroPulse v7 v7.00 (opencl_nvidia_100) tasks failed for Computer 69586 with the same error as task 17284516

Just for info, from OpenCL :
#define CL_INVALID_BINARY                           -42

INFO: can't open binary kernel file: D:\Boinc/projects/setiweb.ssl.berkeley.edu_beta\AstroPulse_Kernels_r2488.cl_QuadroFX880M.bin_V6_TWIN_FFA_33495, continue with recompile...
Error : Building Program (source, clBuildProgram):main kernels: not OK code -42
ptxas : error : Entry function 'GPU_fetch_array_kernel_twin_1D_cl' uses too much shared data (0x4034 bytes, 0x4000 max)

_\|/_
U r s
ID: 51483 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 2 Jul 13
Posts: 505
Credit: 5,019,318
RAC: 0
United States
Message 51489 - Posted: 5 Jul 2014, 21:41:45 UTC - in response to Message 51483.  

All my AstroPulse v7 v7.00 (opencl_nvidia_100) tasks failed for Computer 69586 with the same error as task 17284516

Just for info, from OpenCL :
#define CL_INVALID_BINARY                           -42

INFO: can't open binary kernel file: D:\Boinc/projects/setiweb.ssl.berkeley.edu_beta\AstroPulse_Kernels_r2488.cl_QuadroFX880M.bin_V6_TWIN_FFA_33495, continue with recompile...
Error : Building Program (source, clBuildProgram):main kernels: not OK code -42
ptxas : error : Entry function 'GPU_fetch_array_kernel_twin_1D_cl' uses too much shared data (0x4034 bytes, 0x4000 max)

I'm seeing a few of these;
GeForce GT 240 http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=17289630
GeForce GTX 275 http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=17283660
GeForce GTS 360M http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=17289814
Quadro FX 880M http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=17283665
GeForce 9300 GE http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=17272049

Here's an ATI error, http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=17271825
ID: 51489 · Report as offensive
Profile Eric J Korpela
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 15 Mar 05
Posts: 1547
Credit: 27,183,456
RAC: 0
United States
Message 51490 - Posted: 5 Jul 2014, 23:30:23 UTC

ID: 51490 · Report as offensive
Thomas
Volunteer tester

Send message
Joined: 21 Sep 13
Posts: 223
Credit: 407,183
RAC: 0
France
Message 51491 - Posted: 6 Jul 2014, 7:34:31 UTC

Is it normal that these work units of 2013 are referenced in 7.00 version ?



So far, no problem for me with my GPU NVIDIA GT 640 :)
13 work units returned without problems
ID: 51491 · Report as offensive
Profile Mike
Volunteer tester
Avatar

Send message
Joined: 16 Jun 05
Posts: 2530
Credit: 1,074,556
RAC: 0
Germany
Message 51492 - Posted: 6 Jul 2014, 8:58:22 UTC

Yes it is normal.

Thats Seti V7 work.
This is Astropulse forum.
With each crime and every kindness we birth our future.
ID: 51492 · Report as offensive
Thomas
Volunteer tester

Send message
Joined: 21 Sep 13
Posts: 223
Credit: 407,183
RAC: 0
France
Message 51495 - Posted: 6 Jul 2014, 9:39:00 UTC - in response to Message 51492.  

Yes it is normal.

Thats Seti V7 work.
This is Astropulse forum.

OMG ! You're absolutely right Mike !
I'm aware that this is a thread dedicated to AP version 7
It's getting too hot in the South of France and I lose my head
Sorry to all for my mistake - I'm a stupid big noob :(
However, for my 13 work units successfully treated, I spoke to AP version 7.
ID: 51495 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 29 May 06
Posts: 1037
Credit: 8,440,339
RAC: 0
United Kingdom
Message 51497 - Posted: 6 Jul 2014, 13:20:15 UTC

Can't say i'm that impressed with the scheduler's decisions, while my Linux running C2D T8100 has been getting a mix of the four app_versions, (none of which have reached their 11 validations):

Application details for host 68093

My i7-2600K/HD7770 host on the otherhand because of the four day difference in release days has managed to complete it's 11 validations for the windows_intelx86 app, it's only managed to get three sse Wu's, and only one sse2 Wu,
now the windows_intelx86 app has completed it's 11 validations, the scheduler seems more inclined to send more and more work for that app version, rather than send work from app_versions that haven't completed their 11 validations,
I expect it'll send them, but not very frequently, and in the meantime lots of hosts will be running a sub-optimal app for a substantial time,

Application details for host 45274

AstroPulse v7 7.00 windows_intelx86  
Number of tasks completed 11 
Max tasks per day 44 
Number of tasks today 12 
Consecutive valid tasks 11 
Average processing rate 16.60 GFLOPS 
Average turnaround time 2.25 days 

AstroPulse v7 7.00 windows_x86_64 (sse2) 
Number of tasks completed 0 
Max tasks per day 33 
Number of tasks today 1 
Consecutive valid tasks 0 
Average turnaround time 0.00 days 

AstroPulse v7 7.00 windows_intelx86 (sse) 
Number of tasks completed 1 
Max tasks per day 34 
Number of tasks today 0 
Consecutive valid tasks 1 
Average processing rate 103.67 GFLOPS 
Average turnaround time 0.96 days 


Claggy
ID: 51497 · Report as offensive
BetelgeuseFive
Volunteer tester

Send message
Joined: 3 Jun 12
Posts: 64
Credit: 2,532,468
RAC: 0
Netherlands
Message 51498 - Posted: 6 Jul 2014, 16:39:18 UTC

No problems here on my GTX-750.
Still waiting for the first task with blanking (I sure hated these 'wasting' resources) so I can see how much improvement there is.
Are there any other changes in the new v7 version or is it just the improved handling of tasks with blanking ?

Tom
ID: 51498 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 51501 - Posted: 6 Jul 2014, 18:42:24 UTC - in response to Message 51482.  

All my AstroPulse v7 v7.00 (opencl_nvidia_100) tasks failed for Computer 69586 with the same error as task 17284516


Look earlier posts in this thread.
Bug already acknowledged and fix committed into sources.
ID: 51501 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 . . . 35 · Next

Message boards : News : Astropulse 7,00 released for Linux 32&64, Win 32&64, Win32+AMD/NVIDIA/Intel GPU


 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.