NVIDIA driver stops responding... looking for the 'why'

Pilotito

Distinguished
Mar 10, 2012
37
0
18,530
OK, I'll try to keep it short. First, these are my specs:

Intel i5-750
Motherboard Asus P7P55D Deluxe
8 GB RAM DDR3 1600 MHz
3 HDs (320 GB - 1 TB - 500 GB)
DVDRW Optical Drive
Power Supply Thermaltake TR2 RX-850 W0319RU (v2.3)
MSI N580GTX
Windows 7 Ultimate 64-bit

The problem is that, at random, I get a warning saying that my display driver has stopped responding and has recovered.

It has happened while playing Battlefield 3, League of Legends, and Batman: Arkham City. Sometimes it doesn't happen at all.

When it happens, the game is usually forced to minimize and I can't switch back to it, so I have to close the application (except League of Legends, where the screen goes black for a second and then recovers by itself).

Before this brand-new MSI video card, I bought an EVGA GTX 580, and I had to return it because it gave me precisely that random error message when playing. I didn't notice the problem with BF3 back then, but I got it when playing League of Legends.

After I returned the EVGA thinking it was flawed somehow, I installed the new MSI GTX 580 and here comes the oddity... the problem persisted when playing League of Legends! So I thought it was an application-related issue.

Some time later, I bought a new LG LED monitor and started playing the same games in HD. That was when the problem showed up in Batman: Arkham City and BF3 as well.

I can't believe I just bought two faulty video cards from two different vendors. It makes no sense. I think I can rule out the RAM, because I recently replaced my 2x2GB 1333 MHz modules with the current 2x4GB 1600 MHz ones.

I haven't overclocked any component of my PC. I use MSI Afterburner, and when the problem appears I can see every value in the graph drop while, at the same time, GPU usage spikes to 100%.

I was thinking that maybe the PSU is somehow not feeding my video card properly, but I really don't know, I'm just guessing.

My PSU is a couple of years old, but I've never had any weird problem besides this one. I do know that it has four 12 V rails rated at 18 A each. I noticed that recent versions of the same PSU come with a single unified 12 V rail.
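For reference, a minimal sketch of the arithmetic behind those rail figures. The 12 V / 18 A per-rail numbers are the ones quoted above; the 75 W / 75 W / 150 W values are the PCI Express limits for the slot, a 6-pin plug and an 8-pin plug. How this particular PSU maps its connectors onto its rails is not known here, so nothing below says which rail feeds the video card, and the combined 12 V limit of a multi-rail unit is normally lower than the simple sum printed by the script.

```python
# Rough 12 V rail arithmetic for a multi-rail PSU.
# Rail figures (4 rails, 12 V at 18 A each) come from the post above.
# Which connector sits on which rail is UNKNOWN for this unit; this only
# shows the per-rail ceiling and the current each PCIe power path may draw.

RAIL_VOLTS = 12.0
RAIL_AMPS = 18.0
RAIL_COUNT = 4

# PCI Express power limits in watts (slot, 6-pin plug, 8-pin plug).
PCIE_LIMITS_W = {"slot": 75.0, "6-pin": 75.0, "8-pin": 150.0}


def rail_watts(volts: float = RAIL_VOLTS, amps: float = RAIL_AMPS) -> float:
    """Maximum power one rail can deliver at its rated current."""
    return volts * amps


def amps_needed(watts: float, volts: float = RAIL_VOLTS) -> float:
    """Current drawn at the given voltage for a given power."""
    return watts / volts


if __name__ == "__main__":
    per_rail = rail_watts()
    print(f"Per rail: {per_rail:.0f} W, simple sum: {per_rail * RAIL_COUNT:.0f} W")
    for name, limit in PCIE_LIMITS_W.items():
        print(f"{name}: up to {limit:.0f} W = {amps_needed(limit):.2f} A")
```

If two heavily loaded connectors happened to share one 18 A rail, their combined current would be the number to compare against the rail rating.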

So I suppose my question is... could my PSU be the reason behind the cited problems?
 
Deleted member 217926

Guest
It very well could be the power supply. The Thermaltake TR2 is a smoking bag of crap on a good day so it really would not surprise me. If you can try the computer with a different power supply you can confirm if that is the problem.

Have you tried uninstalling the driver and reinstalling it?
 

Pilotito

Yes, I tried the whole uninstall/reinstall thingy weeks ago, to no avail.

Today I ran the OCCT GPU test on my video card for about 10 minutes: no errors, and a maximum temperature of 70°C.

However, when I try to test my PSU with OCCT, the test ends quickly with an error because my CPU reaches 85°C, so it stops for safety reasons.

I have no way to test with another PSU except buying a new one, but since I'm not sure my PSU is the problem, I'm hesitant to make such a purchase. I need to be sure.

What should I do next?
 

Pilotito

Hi there.

I'm using the 295 drivers right now, and the 285 before that.

The problem doesn't seem to be related to the drivers.

Yesterday I ran a little test: I played BF3 for about two minutes with OCCT monitoring in the background, and my CPU went over 90°C... is that even normal?
 
Deleted member 217926

Guest
No, those temps are too high. You also mentioned your CPU reaching 85C, and that should never happen even with stock cooling as long as you are not overclocking it.

What case do you have? Sounds like you have very poor to no airflow. Use some monitoring software and note your max temps while stressing the computer. Then try taking the side off and do the same thing and report the results.

Have you cleaned the computer with compressed air lately?
 

Pilotito

Yes, I've been doing some research and these values are too high!

Today I tested BF3 for about 2 minutes and, according to OCCT, my core temps reached:

CPUTIN 98°C
Core #0 88°C
Core #1 97°C
Core #2 82°C
Core #3 99°C

Meanwhile, my GPU only reached 60°C.

When idle, the cores stay between 35°C and 40°C.
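A quick way to keep track of readings like these is to check them against a limit. The sketch below uses the core temperatures listed above; the 85°C limit is an assumption borrowed from the earlier reply saying that 85C "should never happen", not an official Intel figure.

```python
# Flag CPU core readings above a chosen limit.
# Readings are the ones reported in the post above; LIMIT_C is an ASSUMED
# threshold taken from the earlier reply, not a manufacturer specification.

LOAD_READINGS_C = {
    "Core #0": 88,
    "Core #1": 97,
    "Core #2": 82,
    "Core #3": 99,
}
LIMIT_C = 85


def over_limit(readings: dict[str, int], limit: int = LIMIT_C) -> list[str]:
    """Return the names of sensors whose reading exceeds the limit."""
    return [name for name, temp in readings.items() if temp > limit]


if __name__ == "__main__":
    hot = over_limit(LOAD_READINGS_C)
    print(f"Hottest core: {max(LOAD_READINGS_C.values())} C")
    print(f"Over {LIMIT_C} C: {', '.join(hot) if hot else 'none'}")
```

Running the same check on a second set of readings (for example with the case side panel off) makes the before/after comparison straightforward.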

My case is a Thermaltake; it's pretty big and has sufficient airflow.

I'm buying a new cooler for my i5 this week. I honestly don't know if this is the cause of my problem, but it definitely must be addressed.

I'll try with the side off and check back here.
 
Deleted member 217926

Guest
Sounds like the cooler you have now is not seated correctly. Unless it is completely clogged with dust and dirt. I hate the stock Intel cooler, it's easy to break and can be hard to tell if it's mounted correctly.

If you are going with aftermarket cooling then the Hyper 212+ or Hyper 212 Evo are good low cost options. They will also fit with most RAM even RAM with high heat spreaders.
 

Pilotito

Well here I am reporting the news.

I just installed a new cooler for my CPU, a Zalman CNPS5 Performa.

Doing a load test, I noticed how much the temperature went down. It's amazing. Now I never reach 50°C under full load.

Also, I noticed something about the power connection to my motherboard. The 8-pin CPU power connector (EATX12V) was only partially connected: the PSU was plugged in with just 4 pins instead of the 8 available. Weird, I mean, it was working that way, but maybe not enough power was being delivered to the system?

Well, I'll run some tests over the next few days and report any news.
 

Pilotito

Well, I played BF3 for 15 minutes and the problem appeared again, and this time I was even able to see some green artifacts.

Same pattern, sudden 100% GPU spike and all the other values going down.

I'm out of options right now.

I can't believe both the EVGA and the MSi are flawed or damaged. Makes no sense.

I don't know what else to do.
 

Pilotito

I know but I can't be sure. The only way is to buy a new PSU and if the problem persists I'll be even more pissed.
 

Pilotito

Wow that review destroys the PSU lol.

Before the two 580 video cards, I was using a GTX 550 Ti and I never had any kind of problem.

Today I ran two video stress tests using OCCT for over 15 minutes, and not a single error was detected.

Later I ran the PSU test, which stresses both CPU and GPU, for over 10 minutes: no errors whatsoever.

I cleaned the registry using CCleaner and modified the TdrDelay parameter in the registry, to no avail.
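For anyone who wants to check what TdrDelay is currently set to, here is a minimal sketch. It assumes the value lives under HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\GraphicsDrivers as a DWORD, and that Windows falls back to a 2-second timeout when the value is absent, as described in Microsoft's TDR documentation. The helper names are made up for this example, and the registry read only works on Windows.

```python
# Read the TdrDelay value (GPU timeout, in seconds) from the registry.
# Windows-only: winreg is part of the standard library on Windows.
# When the value is absent, the documented default of 2 seconds is assumed.

import sys

TDR_KEY = r"SYSTEM\CurrentControlSet\Control\GraphicsDrivers"
DEFAULT_TDR_DELAY_S = 2


def effective_tdr_delay(stored):
    """Return the delay in effect: the stored value, or the default if unset."""
    return DEFAULT_TDR_DELAY_S if stored is None else int(stored)


def read_tdr_delay():
    """Return the stored TdrDelay DWORD, or None if unset or not on Windows."""
    if sys.platform != "win32":
        return None
    import winreg

    try:
        with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, TDR_KEY) as key:
            value, _value_type = winreg.QueryValueEx(key, "TdrDelay")
            return value
    except FileNotFoundError:
        return None


if __name__ == "__main__":
    print(f"TdrDelay in effect: {effective_tdr_delay(read_tdr_delay())} s")
```

Raising the delay only gives the driver more time before Windows resets it; it does not address whatever makes the GPU hang in the first place.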
 
Deleted member 217926

Guest
The GTX580 stresses your power supply a whole lot more than a GTX550 Ti. As you can see from the review stressing that power supply is a bad thing that can damage components and cause instability and crashes.
 

Pilotito

Yeah, I got that, but I can't understand why the tests don't generate any errors when they stress the system a lot more than games like BF3. In fact, BF3 crashes with a suspicious sudden spike to 100% GPU usage, but before that the average usage was about 75%-80% and the temps were around 60°C. On the other hand, the GPU test keeps GPU usage at almost 100% all the time, and the temps get to around 80°C.

I'll continue testing. I've lowered some clocks a bit with MSI Afterburner.

I'll report back as soon as I have news.
 

Pilotito

Well after 4 or 5 matches of BF3 the problem appeared again.

According to the Event Viewer, this is what always happens:

At some random point, in the Application log:

'A request to disable the Desktop Window Manager was made by process (4)', with Desktop Window Manager as the source, event ID 9010.

At the same time, Desktop Window Manager also reports, under event ID 9013, that it can't be restarted because an application is preventing it.

4 seconds later, the classic 'nvlddmkm' error appears in the System log under event ID 4101, listed as a 'warning'.

It's always the same pattern.
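To confirm that the pattern really repeats every time, the sequence can be searched for in a list of event records. This is only a sketch: the `Event` tuple, the source strings and the sample timestamps are hypothetical, and a real export from the Event Viewer would need its own parsing before it fits this shape.

```python
# Find the sequence described above in a list of event records:
# Desktop Window Manager events 9010 and 9013, followed within a short
# window by an nvlddmkm event 4101. Source names and sample data are
# HYPOTHETICAL placeholders, not a real Event Viewer export.

from datetime import datetime, timedelta
from typing import NamedTuple


class Event(NamedTuple):
    time: datetime
    source: str
    event_id: int


def find_tdr_sequences(events, window=timedelta(seconds=10)):
    """Return nvlddmkm 4101 events preceded by DWM 9010 and 9013 within `window`."""
    events = sorted(events, key=lambda e: e.time)
    hits = []
    for ev in events:
        if ev.source != "nvlddmkm" or ev.event_id != 4101:
            continue
        # Collect the DWM event IDs logged shortly before this driver reset.
        recent_dwm_ids = {
            e.event_id
            for e in events
            if e.source == "Desktop Window Manager"
            and timedelta(0) <= ev.time - e.time <= window
        }
        if {9010, 9013} <= recent_dwm_ids:
            hits.append(ev)
    return hits


if __name__ == "__main__":
    t0 = datetime(2012, 3, 10, 21, 0, 0)  # hypothetical timestamp
    sample = [
        Event(t0, "Desktop Window Manager", 9010),
        Event(t0, "Desktop Window Manager", 9013),
        Event(t0 + timedelta(seconds=4), "nvlddmkm", 4101),
        Event(t0 + timedelta(minutes=30), "nvlddmkm", 4101),  # no DWM events nearby
    ]
    for hit in find_tdr_sequences(sample):
        print(f"TDR sequence ending at {hit.time.isoformat()}")
```

A 4101 entry that shows up without the two DWM events nearby would be worth a separate look, since it would not match the pattern described here.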

I understand that the cause of such error could be virtually ANYTHING.

Should I think I bought 2 flawed 580 videocards? Sigh...

 
Deleted member 217926

Guest
You KNOW you have a flawed power supply that can be dangerous to your components. A bad power supply can also cause general system instability. So what's logical? Two bad graphics cards in a row? Or a bad PSU causing all your problems? ;)

The power supply is the single most important part of a computer. Since you were willing to fork out $500 or better for a GPU don't you think it's a good idea to protect that investment with a good quality power supply?
 

Pilotito

Yeah, I agree with you, but I'm gonna be REALLY pissed if I buy a new PSU and the problem still persists. I can almost feel that I won't resolve this particular issue that way. But well, I guess I should give it a try.

BTW, what PSU should I pick? I was reading that Corsair makes nice ones. What about the wattage for my specs? Maybe the Corsair TX850M 850W Enthusiast? Do I really need 850W?
 
Deleted member 217926

Guest
You only need 850 watts if you plan on adding a second GTX580. For a single GTX580 a quality 550-600w PSU will be fine.
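A back-of-the-envelope sketch of where figures like these come from. The CPU and GPU numbers are the published TDPs (95 W for the i5-750, 244 W for the GTX 580); the allowance for the rest of the system and the 30% headroom are assumed round numbers, not measurements, so treat the result as a ballpark only.

```python
# Back-of-the-envelope PSU sizing for the build in this thread.
# CPU and GPU figures are published TDPs (i5-750: 95 W, GTX 580: 244 W).
# REST_OF_SYSTEM_W and HEADROOM are ASSUMED round numbers, not measurements.

CPU_TDP_W = 95
GPU_TDP_W = 244
REST_OF_SYSTEM_W = 80   # assumed: motherboard, RAM, 3 HDDs, optical drive, fans
HEADROOM = 1.3          # assumed 30% margin over the estimated peak draw


def recommended_psu_watts(gpu_count: int) -> float:
    """Estimated peak draw times headroom, for a given number of GPUs."""
    peak = CPU_TDP_W + REST_OF_SYSTEM_W + gpu_count * GPU_TDP_W
    return peak * HEADROOM


if __name__ == "__main__":
    for gpus in (1, 2):
        print(f"{gpus} x GTX 580: about {recommended_psu_watts(gpus):.0f} W")
```

With these assumptions the one-card and two-card estimates land in the same ballpark as the 550-600 W and 850 W figures above. Wattage alone says nothing about the quality of the power being delivered.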

Seasonic makes the best power supplies. They actually make many of the power supplies for Corsair, XFX, PC Power and Cooling and quite a few other good brands.

Everything you ever wanted to know about power supply makers.

http://www.realhardtechx.com/index_archivos/Page541.htm

A few brands always stand out as not having bad models. Corsair, Seasonic, PC Power and Cooling, XFX, Enermax, Silverstone, OCZ and Antec are almost always high quality.

This is on great sale and is modular.

http://www.newegg.com/Product/Product.aspx?Item=N82E16817151095
 

Pilotito

Thanks for your guide. As you say, a 550-600w PSU should be fine, so how is it possible that my Thermaltake can't handle it, if that is the case? I mean, I know it's a faulty PSU according to the reviews, but damn, it's rated 200+ W above what I really need.

Also, I'm trying to find any other case where somebody solved a similar problem by replacing the PSU. I can't find any! Every time it's almost the same story: a faulty video card. There are plenty of similar topics, and almost every time it's the GPU.

Now I'm depressed lol.
 
Deleted member 217926

Guest
The problem with the TR2 is ripple and noise. It does not put out "clean" power. Hell it could even be the power supply damaging the video cards. HardwareSecrets and jonnyguru are the 2 best power supply review sites on the net. If they call a unit dangerous I believe them. I would replace it in my system even if I was not having any problems.
 

Pilotito

Thanks again. Some points to clarify my case.

In the case of the EVGA I noticed the problem the very same day I installed it when playing League of Legends. That's why I returned it.

The same happened with the MSI, the very same day I got it, but since Skyrim and BF3 were running well I thought it was software related. The problems with BF3 started about a month and a half later, when I bought the LG LED monitor and started to play at 1360x768. Before that I had a crappy monitor, so I used to play BF3 at 1024x768.

The funny part is that NOW League of Legends works fine, apart from a couple of times in about a month when the same error happened, which is a lot, lot less than before. It used to fail the most (daily) when I was playing at 1024x768, and now I'm playing it at 1080p. How does that make any sense?
 
This could have no relevance to your issue at all, but I believe it does.

When I first built my computer, it was pretty awesome. 965BE, 6950, 8GB DDR2, etc. It played everything flawlessly for quite some time, until one day I got the exact same problem you seem to be having. When doing something graphics-intensive, it would run fine for a few minutes, then the screen would go black and come back, saying "AMD driver has stopped responding and has recovered." I played LoL like this for a good month until I got tired of it happening. My friend gave me his "broken" Nvidia 8800GT, which I fixed, and I thought it solved the problem, but lo and behold it still happened. I figured the card was actually broken and took out my fully working 8800GTX (what the 6950 replaced), and the problem still persisted. I even swapped power supplies and most of the parts in the computer.

Raging about my computer one day while staring at the insides of it, I figured "might as well" and took 2 RAM sticks out of my computer, so I was running on 4GB, and lo and behold THE PROBLEM WAS GONE. It never happened again after that. Since then I have used that 965 in another build, but I thought this might be helpful to you.