[H]ard|Forum

Go Back   [H]ard|Forum > Bits & Bytes > Distributed Computing

Reply
 
Thread Tools Search this Thread
  #1  
Old 10-10-2009, 03:12 PM
.:Burn:. [H]Lite, 60 Days
 
.:Burn:. is online now
why do nvidia cards fold so much better than ati?

as the title says... why does nvidia fold better than ati?
  #2  
Old 10-10-2009, 04:20 PM
Vaulter98c [H]ard|DCer of the Month - October 2009, 1.5 Years
 
Vaulter98c is offline
NVidia and ATI cards both have what we refer to as shaders, however, the way that NV cards do their computing is differant than how the ATI cards do, hence the large numbers on the ATI cards vs Nvidia cards

The client for the NV cards is just more efficient, its better written, and at the end of the day, despite all the differences in the 2 architecture (killed the sp on that one) it comes down to the NV client is better written than the ATI one

we can only hope that with the rumored release of the next GPU core in the coming months this issue is fixed, but until then, NV will stay on top

Thats the laymens version, im sure someone could get more in depth
__________________
Gamer Rig::Q66 @2.4 - 680iLT Mb - 2x G4saurus GTX295's QuadSLI/Physx/Folding - XFi Xtrm Gamer - 4G DDR2 800 (Wintec) - 500GB +74GB Raptor - all hook'd to the 1080P monitor, T240HD
Gaming Netbook-ASUS N10j ::Atom N270 @ 2.1 ghz | Nv 9300m GS (OC'd) | 2G Ram | 320 GB 7200.3 Hdd | b/g/n wireless + Gigabit Ethernet | HDMI + SPDIF HD outputs and so much more...
---------------------------------------------------------------------------------------------------------------
FOLDING :: 4x 88GT's | 3x GTX260 | 4x 98GX2's | 2x 295's | Q66 | E83 | E2180 | Phn1x3 | PS3
FOLDING :: 19 GPU / 4 CPU clients | Estimated 108,000 PPD for the [H]orde, Coming Soon!!!
Heat - Vaulter98c (26) | Ebay - Octavious98c (28 w/ 34 pos) (5 NOV 09) | NO NEG REVIEWS
  #3  
Old 10-10-2009, 04:31 PM
Atech [H]ardness Supreme, 2.4 Years
 
Atech is offline
Quote:
Originally Posted by Vaulter98c View Post
NVidia and ATI cards both have what we refer to as shaders, however, the way that NV cards do their computing is differant than how the ATI cards do, hence the large numbers on the ATI cards vs Nvidia cards

The client for the NV cards is just more efficient, its better written, and at the end of the day, despite all the differences in the 2 architecture (killed the sp on that one) it comes down to the NV client is better written than the ATI one

we can only hope that with the rumored release of the next GPU core in the coming months this issue is fixed, but until then, NV will stay on top

Thats the laymens version, im sure someone could get more in depth
Much better than your flawed guess :
http://forum.beyond3d.com/showthread.php?t=50539
__________________
i7 920 / BFG 285GTX / 2 x 21" CRT
P6T Deluxe OC Palm / 12 GB DDR3
X-Fi Ti Camp / Gigaworks S750
PhysX P1 PCI-E / Killer NiC M1
2x80GB(RAID 0) / 2x1TB
PC - XP Pro 32bit/Vista Ultimate x64/Windows 7 Ultimate x64
Dell D630 - Windows 7 Buisness x64 / PC-BSD 7.1
  #4  
Old 10-10-2009, 05:37 PM
Zero82z [H]ard|DCer of the Month - July 2009, 5.8 Years
 
Zero82z is offline
Quote:
Originally Posted by Atech View Post
Much better than your flawed guess :
http://forum.beyond3d.com/showthread.php?t=50539
It's not a flawed guess at all. He is right in saying that the difference is due to the fact that the ATI client is poorly optimized. It is designed for the HD2000 and HD3000 series and hasn't been updated to take advantage of architectural improvements in the 4000 and 5000 series that could lead to significant increases in performance. They are also using Brook+ and have not transitioned to OpenCL yet which should give another boost in performance once it's fully supported on ATI cards simply because Brook+ seems to be much more difficult to work with.

So to make a long story short, it's basically because the necessary resources and manpower to properly optimize the ATI client haven't been dedicated to the task, either because the project doesn't have the resources or because they are needed elsewhere.
__________________
Desktop: Q9550 C1 @3.8GHz w/Xigmatek HDT-S1283, ASUS P5Q Pro, 4GB G.Skill PC2-8000, Visiontek HD4870@790/4000, Audigy 2 ZS, WD 640GB, Samsung 1TB, Seagate 250GB IDE, Corsair HX520, CM Centurion 590, Dell S2209W, Samsung SyncMaster 750s, Vista Home Premium 64-bit SP1.
Folding/Secondary: E2140 @2.93GHz w/Thermalright SI-120, Foxconn P45 ELA, 2GB Crucial PC2-5300, EVGA GTX260 216SP Superclocked@710/1512/2106, WD 80GB, Antec NeoHE 500W, CM Centurion 5, Dell 1701FP, Ubuntu 9.04 64-bit.
Laptop: HP DV2404, A64 X2 TK-53 1.7GHz, 2GB DDR2-667, GeForce 6150SE, Seagate 160GB, 8X DVD-RW, 14.1" WXGA LCD, Vista Home Premium 32-bit SP1.

[H]ard|OCP Unofficial IRC Channel: irc://irc.dp.cx:6667 #hardocp
  #5  
Old 10-10-2009, 05:44 PM
jeremyshaw 2[H]4U, 3 Months
 
jeremyshaw is online now
Here's why:
ATi has several big traditional bruteforce processing (for lack of a better word) units, and lott 'o tiny MADD (math) units. All are counted as shaders.

nVidia has many bruteforce processing units, and the MADD units are just aren't counted as shader cores.
That is why you always have to divide the ATi SP count by 5 to get a better reading of actuall useable power (the way nVidia counts it).

Simply put, the nVidia architecture is just faster. The ATi cards (dunno about the current generation) is slower.

How DX11/DirectCompute may change this, we don't know. But right now, the new ATi architecture is actually limited by the current GPU client, as DX11 adds two new shader operations (on top of Geometry, Pixel, and Vertex), and that may change things - however, I'm not betting on it. I still belive, based on the limited performance gain compared to what is out there, and ATi's stated concentration on Graphics, any current nVidia GPU will probably still outdo any optimised ATi client, unless if F@H can run like a video game (resource wise), and not as a parrell processing application.

Dunno if that made sense.
__________________
GIVE UP everybody! S[h]eep herders are amongst us now!
Seriously, dreadlocks are HOT.
Yes, I underclock on principal.
Toki-o Hot-el
  #6  
Old 10-10-2009, 05:44 PM
jeremyshaw 2[H]4U, 3 Months
 
jeremyshaw is online now
Also, didn't ATi shift off of the Brook+ a while back? I thought it died with the R600 generation?
__________________
GIVE UP everybody! S[h]eep herders are amongst us now!
Seriously, dreadlocks are HOT.
Yes, I underclock on principal.
Toki-o Hot-el
  #7  
Old 10-10-2009, 05:46 PM
Atech [H]ardness Supreme, 2.4 Years
 
Atech is offline
Quote:
Originally Posted by Zero82z View Post
It's not a flawed guess at all. He is right in saying that the difference is due to the fact that the ATI client is poorly optimized. It is designed for the HD2000 and HD3000 series and hasn't been updated to take advantage of architectural improvements in the 4000 and 5000 series that could lead to significant increases in performance. They are also using Brook+ and have not transitioned to OpenCL yet which should give another boost in performance once it's fully supported on ATI cards simply because Brook+ seems to be much more difficult to work with.

So to make a long story short, it's basically because the necessary resources and manpower to properly optimize the ATI client haven't been dedicated to the task, either because the project doesn't have the resources or because they are needed elsewhere.
That is just a lame excuse for an architectual problem:
http://theovalich.wordpress.com/2008...uture-reveale/
http://foldingforum.org/viewtopic.ph...tart=0#p103025

Like I have said before...NVIDIA is generations ahead of AMD on GPGPU.
__________________
i7 920 / BFG 285GTX / 2 x 21" CRT
P6T Deluxe OC Palm / 12 GB DDR3
X-Fi Ti Camp / Gigaworks S750
PhysX P1 PCI-E / Killer NiC M1
2x80GB(RAID 0) / 2x1TB
PC - XP Pro 32bit/Vista Ultimate x64/Windows 7 Ultimate x64
Dell D630 - Windows 7 Buisness x64 / PC-BSD 7.1
  #8  
Old 10-10-2009, 05:53 PM
jeremyshaw 2[H]4U, 3 Months
 
jeremyshaw is online now
Quote:
Originally Posted by Atech View Post
That is just a lame excuse for an architectual problem:
http://theovalich.wordpress.com/2008...uture-reveale/
http://foldingforum.org/viewtopic.ph...tart=0#p103025

Like I have said before...NVIDIA is generations ahead of AMD on GPGPU.
You didn't say that, you linked to a flame war on B3D (Charlie, that ahole is believed to get most of his info from there).

Besides, I think my earlier post describes it clearly. (no lost love).
__________________
GIVE UP everybody! S[h]eep herders are amongst us now!
Seriously, dreadlocks are HOT.
Yes, I underclock on principal.
Toki-o Hot-el
  #9  
Old 10-10-2009, 05:58 PM
Atech [H]ardness Supreme, 2.4 Years
 
Atech is offline
Quote:
Originally Posted by jeremyshaw View Post
You didn't say that, you linked to a flame war on B3D (Charlie, that ahole is believed to get most of his info from there).

Besides, I think my earlier post describes it clearly. (no lost love).
The links can be found in my first link...but I guess people are getting more and more lazy...
__________________
i7 920 / BFG 285GTX / 2 x 21" CRT
P6T Deluxe OC Palm / 12 GB DDR3
X-Fi Ti Camp / Gigaworks S750
PhysX P1 PCI-E / Killer NiC M1
2x80GB(RAID 0) / 2x1TB
PC - XP Pro 32bit/Vista Ultimate x64/Windows 7 Ultimate x64
Dell D630 - Windows 7 Buisness x64 / PC-BSD 7.1
  #10  
Old 10-10-2009, 06:01 PM
jeremyshaw 2[H]4U, 3 Months
 
jeremyshaw is online now
Quote:
Originally Posted by Atech View Post
The links can be found in my first link...but I guess people are getting more and more lazy...
Yeah, I found it on the second page, after the only Folder stepped in

Last edited by jeremyshaw; 10-10-2009 at 07:29 PM..
__________________
GIVE UP everybody! S[h]eep herders are amongst us now!
Seriously, dreadlocks are HOT.
Yes, I underclock on principal.
Toki-o Hot-el
  #11  
Old 10-10-2009, 06:04 PM
Zero82z [H]ard|DCer of the Month - July 2009, 5.8 Years
 
Zero82z is offline
Quote:
Originally Posted by Atech View Post
That is just a lame excuse for an architectual problem:
http://theovalich.wordpress.com/2008...uture-reveale/
http://foldingforum.org/viewtopic.ph...tart=0#p103025

Like I have said before...NVIDIA is generations ahead of AMD on GPGPU.
I think it's funny that you would even imagine that you have read more about this subject than I have and that you assume that there are any links you can post that I haven't already seen and read long ago.

The fact is that nothing you have posted contradicts my statements in any way, and you are misinterpreting the currently flawed implementation of the F@H GPU client on ATI video cards as an issue with the architecture, when in reality it is merely a matter of the client not being properly designed to take advantage of the strengths of ATI's different GPU design.

As evidenced by the fact that ATI GPUs take less of a performance hit when performing calculations involving larger proteins, ATI's architecture actually has more brute force power than nVidia's architecture, since most of the calculations in question are simple MADD operations rather than the other transcendental operations that only 20% of ATI's SPs are capable of handling. The problem is that the ATI cores are still stuck in "R600 mode" as it is called by some, and it doesn't make use of the LDS that was added to RV770 which would mitigate many of the "calculate twice" issues that currently plague the client.

Another issue is that the system used to benchmark ATI workunits is using an RV670 GPU, so the points allocation is also geared towards people with those cards and not newer RV770 and RV870 GPUs which are much improved when it comes to GPGPU applications.
__________________
Desktop: Q9550 C1 @3.8GHz w/Xigmatek HDT-S1283, ASUS P5Q Pro, 4GB G.Skill PC2-8000, Visiontek HD4870@790/4000, Audigy 2 ZS, WD 640GB, Samsung 1TB, Seagate 250GB IDE, Corsair HX520, CM Centurion 590, Dell S2209W, Samsung SyncMaster 750s, Vista Home Premium 64-bit SP1.
Folding/Secondary: E2140 @2.93GHz w/Thermalright SI-120, Foxconn P45 ELA, 2GB Crucial PC2-5300, EVGA GTX260 216SP Superclocked@710/1512/2106, WD 80GB, Antec NeoHE 500W, CM Centurion 5, Dell 1701FP, Ubuntu 9.04 64-bit.
Laptop: HP DV2404, A64 X2 TK-53 1.7GHz, 2GB DDR2-667, GeForce 6150SE, Seagate 160GB, 8X DVD-RW, 14.1" WXGA LCD, Vista Home Premium 32-bit SP1.

[H]ard|OCP Unofficial IRC Channel: irc://irc.dp.cx:6667 #hardocp
  #12  
Old 10-10-2009, 06:06 PM
jeremyshaw 2[H]4U, 3 Months
 
jeremyshaw is online now
Quote:
Originally Posted by Zero82z View Post
I think it's funny that you would even imagine that you have read more about this subject than I have and that you assume that there are any links you can post that I haven't already seen and read long ago.

The fact is that nothing you have posted contradicts my statements in any way, and you are misinterpreting the currently flawed implementation of the F@H GPU client on ATI video cards as an issue with the architecture, when in reality it is merely a matter of the client not being properly designed to take advantage of the strengths of ATI's different GPU design.

As evidenced by the fact that ATI GPUs take less of a performance hit when performing calculations involving larger proteins, ATI's architecture actually has more brute force power than nVidia's architecture, since most of the calculations in question are simple MADD operations rather than the other transcendental operations that only 20% of ATI's SPs are capable of handling. The problem is that the ATI cores are still stuck in "R600 mode" as it is called by some, and it doesn't make use of the LDS that was added to RV770 which would mitigate many of the "calculate twice" issues that currently plague the client.

oopps!

I really need to get data on the new ATi GPUs. Guess I'l hold off buying that gts250, for now.
__________________
GIVE UP everybody! S[h]eep herders are amongst us now!
Seriously, dreadlocks are HOT.
Yes, I underclock on principal.
Toki-o Hot-el
  #13  
Old 10-10-2009, 06:07 PM
Atech [H]ardness Supreme, 2.4 Years
 
Atech is offline
Quote:
Originally Posted by Zero82z View Post
I think it's funny that you would even imagine that you have read more about this subject than I have and that you assume that there are any links you can post that I haven't already seen and read long ago.

The fact is that nothing you have posted contradicts my statements in any way, and you are misinterpreting the currently flawed implementation of the F@H GPU client on ATI video cards as an issue with the architecture, when in reality it is merely a matter of the client not being properly designed to take advantage of the strengths of ATI's different GPU design.

As evidenced by the fact that ATI GPUs take less of a performance hit when performing calculations involving larger proteins, ATI's architecture actually has more brute force power than nVidia's architecture, since most of the calculations in question are simple MADD operations rather than the other transcendental operations that only 20% of ATI's SPs are capable of handling. The problem is that the ATI cores are still stuck in "R600 mode" as it is called by some, and it doesn't make use of the LDS that was added to RV770 which would mitigate many of the "calculate twice" issues that currently plague the client.
No amount of "optimizing" drivers/software can create the cache AMD GPU's lack and NVIDIA's GPU have. ..which means that AMD GPU's have to do more work (due to being unable to store the data) compared to NVIDIA GPU's...like it or not.
__________________
i7 920 / BFG 285GTX / 2 x 21" CRT
P6T Deluxe OC Palm / 12 GB DDR3
X-Fi Ti Camp / Gigaworks S750
PhysX P1 PCI-E / Killer NiC M1
2x80GB(RAID 0) / 2x1TB
PC - XP Pro 32bit/Vista Ultimate x64/Windows 7 Ultimate x64
Dell D630 - Windows 7 Buisness x64 / PC-BSD 7.1
  #14  
Old 10-10-2009, 06:13 PM
Zero82z [H]ard|DCer of the Month - July 2009, 5.8 Years
 
Zero82z is offline
Quote:
Originally Posted by Atech View Post
No amount of "optimizing" drivers/software can create the cache AMD GPU's lack and NVIDIA's GPU have. ..which means that AMD GPU's have to do more work (due to being unable to store the data) compared to NVIDIA GPU's...like it or not.
The problem isn't that ATI GPUs can't store "enough" data, it's that they aren't storing "any" data at all right now since F@H doesn't use the LDS. And a single step of a single GPU workunit doesn't require a particularly large amount of data storage, especially not with the small proteins that are currently being used for most of the workunits that are in the wild right now. Each shader unit (set of 4 standard FPUs and one special-function unit) has a 16KB LDS in RV770 and 32KB in RV870, which is more than enough to give a significant performance boost to overall work production speed.
__________________
Desktop: Q9550 C1 @3.8GHz w/Xigmatek HDT-S1283, ASUS P5Q Pro, 4GB G.Skill PC2-8000, Visiontek HD4870@790/4000, Audigy 2 ZS, WD 640GB, Samsung 1TB, Seagate 250GB IDE, Corsair HX520, CM Centurion 590, Dell S2209W, Samsung SyncMaster 750s, Vista Home Premium 64-bit SP1.
Folding/Secondary: E2140 @2.93GHz w/Thermalright SI-120, Foxconn P45 ELA, 2GB Crucial PC2-5300, EVGA GTX260 216SP Superclocked@710/1512/2106, WD 80GB, Antec NeoHE 500W, CM Centurion 5, Dell 1701FP, Ubuntu 9.04 64-bit.
Laptop: HP DV2404, A64 X2 TK-53 1.7GHz, 2GB DDR2-667, GeForce 6150SE, Seagate 160GB, 8X DVD-RW, 14.1" WXGA LCD, Vista Home Premium 32-bit SP1.

[H]ard|OCP Unofficial IRC Channel: irc://irc.dp.cx:6667 #hardocp
  #15  
Old 10-10-2009, 11:31 PM
Vaulter98c [H]ard|DCer of the Month - October 2009, 1.5 Years
 
Vaulter98c is offline
Thank you for defending me Zero, I couldnt think of anyone Id rather have on my side

/hiding now
__________________
Gamer Rig::Q66 @2.4 - 680iLT Mb - 2x G4saurus GTX295's QuadSLI/Physx/Folding - XFi Xtrm Gamer - 4G DDR2 800 (Wintec) - 500GB +74GB Raptor - all hook'd to the 1080P monitor, T240HD
Gaming Netbook-ASUS N10j ::Atom N270 @ 2.1 ghz | Nv 9300m GS (OC'd) | 2G Ram | 320 GB 7200.3 Hdd | b/g/n wireless + Gigabit Ethernet | HDMI + SPDIF HD outputs and so much more...
---------------------------------------------------------------------------------------------------------------
FOLDING :: 4x 88GT's | 3x GTX260 | 4x 98GX2's | 2x 295's | Q66 | E83 | E2180 | Phn1x3 | PS3
FOLDING :: 19 GPU / 4 CPU clients | Estimated 108,000 PPD for the [H]orde, Coming Soon!!!
Heat - Vaulter98c (26) | Ebay - Octavious98c (28 w/ 34 pos) (5 NOV 09) | NO NEG REVIEWS
Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -5. The time now is 08:44 PM.


Valid XHTML 1.0 Transitional

Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.
Copyright 2000 - 2009 KB Networks, Inc.