![]() |
|
#1
|
|||
|
|||
|
why do nvidia cards fold so much better than ati?
as the title says... why does nvidia fold better than ati?
|
|
#2
|
|||
|
|||
|
NVidia and ATI cards both have what we refer to as shaders, however, the way that NV cards do their computing is differant than how the ATI cards do, hence the large numbers on the ATI cards vs Nvidia cards
The client for the NV cards is just more efficient, its better written, and at the end of the day, despite all the differences in the 2 architecture (killed the sp on that one) it comes down to the NV client is better written than the ATI one we can only hope that with the rumored release of the next GPU core in the coming months this issue is fixed, but until then, NV will stay on top Thats the laymens version, im sure someone could get more in depth
|
|
#3
|
|||
|
|||
|
Quote:
:http://forum.beyond3d.com/showthread.php?t=50539
|
|
#4
|
|||
|
|||
|
Quote:
So to make a long story short, it's basically because the necessary resources and manpower to properly optimize the ATI client haven't been dedicated to the task, either because the project doesn't have the resources or because they are needed elsewhere.
|
|
#5
|
|||
|
|||
|
Here's why:
ATi has several big traditional bruteforce processing (for lack of a better word) units, and lott 'o tiny MADD (math) units. All are counted as shaders. nVidia has many bruteforce processing units, and the MADD units are just aren't counted as shader cores. That is why you always have to divide the ATi SP count by 5 to get a better reading of actuall useable power (the way nVidia counts it). Simply put, the nVidia architecture is just faster. The ATi cards (dunno about the current generation) is slower. How DX11/DirectCompute may change this, we don't know. But right now, the new ATi architecture is actually limited by the current GPU client, as DX11 adds two new shader operations (on top of Geometry, Pixel, and Vertex), and that may change things - however, I'm not betting on it. I still belive, based on the limited performance gain compared to what is out there, and ATi's stated concentration on Graphics, any current nVidia GPU will probably still outdo any optimised ATi client, unless if F@H can run like a video game (resource wise), and not as a parrell processing application. Dunno if that made sense.
|
|
#6
|
|||
|
|||
|
Also, didn't ATi shift off of the Brook+ a while back? I thought it died with the R600 generation?
|
|
#7
|
|||
|
|||
|
Quote:
http://theovalich.wordpress.com/2008...uture-reveale/ http://foldingforum.org/viewtopic.ph...tart=0#p103025 Like I have said before...NVIDIA is generations ahead of AMD on GPGPU.
|
|
#8
|
|||
|
|||
|
Quote:
Besides, I think my earlier post describes it clearly. (no lost love ).
|
|
#9
|
|||
|
|||
|
The links can be found in my first link...but I guess people are getting more and more lazy...
|
|
#10
|
|||
|
|||
|
Quote:
![]() Last edited by jeremyshaw; 10-10-2009 at 07:29 PM..
|
|
#11
|
|||
|
|||
|
Quote:
The fact is that nothing you have posted contradicts my statements in any way, and you are misinterpreting the currently flawed implementation of the F@H GPU client on ATI video cards as an issue with the architecture, when in reality it is merely a matter of the client not being properly designed to take advantage of the strengths of ATI's different GPU design. As evidenced by the fact that ATI GPUs take less of a performance hit when performing calculations involving larger proteins, ATI's architecture actually has more brute force power than nVidia's architecture, since most of the calculations in question are simple MADD operations rather than the other transcendental operations that only 20% of ATI's SPs are capable of handling. The problem is that the ATI cores are still stuck in "R600 mode" as it is called by some, and it doesn't make use of the LDS that was added to RV770 which would mitigate many of the "calculate twice" issues that currently plague the client. Another issue is that the system used to benchmark ATI workunits is using an RV670 GPU, so the points allocation is also geared towards people with those cards and not newer RV770 and RV870 GPUs which are much improved when it comes to GPGPU applications.
|
|
#12
|
|||
|
|||
|
Quote:
oopps! I really need to get data on the new ATi GPUs. Guess I'l hold off buying that gts250, for now.
|
|
#13
|
|||
|
|||
|
Quote:
|
|
#14
|
|||
|
|||
|
The problem isn't that ATI GPUs can't store "enough" data, it's that they aren't storing "any" data at all right now since F@H doesn't use the LDS. And a single step of a single GPU workunit doesn't require a particularly large amount of data storage, especially not with the small proteins that are currently being used for most of the workunits that are in the wild right now. Each shader unit (set of 4 standard FPUs and one special-function unit) has a 16KB LDS in RV770 and 32KB in RV870, which is more than enough to give a significant performance boost to overall work production speed.
|
|
#15
|
|||
|
|||
|
Thank you for defending me Zero, I couldnt think of anyone Id rather have on my side
![]() /hiding now
|
![]() |
| Thread Tools | Search this Thread |
|
|