GTX 660: VIDEO_TDR_ERROR (bugcheck: stop 0x116), black screen + sound loop when playing games.

uncannytranny

Dec 19, 2013
Video of it happening: http://youtu.be/QRGtxdn3zlM?t=2m05s

Some of the errors:

(crash dump mini diagnosis)

On Tue 4/8/2014 1:51:43 AM GMT your computer crashed
crash dump file: C:\Windows\Minidump\040714-14570-01.dmp
This was probably caused by the following module: nvlddmkm.sys (nvlddmkm+0x9BBE2C)
Bugcheck code: 0x116 (0xFFFFFA800B04A010, 0xFFFFF88011E09E2C, 0xFFFFFFFFC000009A, 0x4)
Error: VIDEO_TDR_ERROR
file path: C:\Windows\system32\drivers\nvlddmkm.sys
product: NVIDIA Windows Kernel Mode Driver, Version 335.23
company: NVIDIA Corporation
description: NVIDIA Windows Kernel Mode Driver, Version 335.23
Bug check description: This indicates that an attempt to reset the display driver and recover from a timeout failed.
A third party driver was identified as the probable root cause of this system error. It is suggested you look for an update for the following driver: nvlddmkm.sys (NVIDIA Windows Kernel Mode Driver, Version 335.23 , NVIDIA Corporation).
Google query: NVIDIA Corporation VIDEO_TDR_ERROR
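For anyone who wants to read the raw numbers rather than the summary, here is a rough Python sketch that splits the bugcheck line above into its four parameters. The labels follow Microsoft's documented meaning for bug check 0x116 (TDR recovery context, address inside the responsible driver, NTSTATUS of the last failed operation, internal data). The function names and the tiny NTSTATUS lookup table are my own and only cover the value seen here, so treat it as an illustration, not a diagnostic tool.

```python
import re

# Line copied from the minidump summary above.
BUGCHECK_LINE = ("Bugcheck code: 0x116 (0xFFFFFA800B04A010, "
                 "0xFFFFF88011E09E2C, 0xFFFFFFFFC000009A, 0x4)")
MODULE_OFFSET = 0x9BBE2C  # from "nvlddmkm+0x9BBE2C" in the same summary

# Hand-picked lookup for illustration only; not a complete NTSTATUS table.
KNOWN_NTSTATUS = {0xC000009A: "STATUS_INSUFFICIENT_RESOURCES"}


def parse_bugcheck(line):
    """Return (bugcheck_code, [param1, param2, param3, param4]) as integers."""
    values = [int(v, 16) for v in re.findall(r"0x[0-9A-Fa-f]+", line)]
    return values[0], values[1:]


def decode_0x116(params, module_offset):
    """Label the four 0x116 parameters (hypothetical helper)."""
    status = params[2] & 0xFFFFFFFF  # drop the 64-bit sign extension
    return {
        "tdr_recovery_context": hex(params[0]),
        "address_in_driver": hex(params[1]),
        # Address minus the reported offset gives the driver's load address.
        "driver_base_estimate": hex(params[1] - module_offset),
        "ntstatus": hex(status),
        "ntstatus_name": KNOWN_NTSTATUS.get(status, "unknown"),
        "internal_data": hex(params[3]),
    }


code, params = parse_bugcheck(BUGCHECK_LINE)
info = decode_0x116(params, MODULE_OFFSET)
for key, value in info.items():
    print(f"{key}: {value}")
```

The third parameter is the interesting one: it is the status of the last operation that failed before the driver reset gave up. On its own it does not say whether the card, the driver, or something else is at fault.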

(crash dump big diagnosis) http://pastebin.com/GvXaRMYs

DirectX errors: DirectX function "GetDeviceRemovedReason" failed with DXGI_ERROR_DEVICE_HUNG



I've had my main 660 since August of last year. I bought the second one in February and the issue has been plaguing me ever since. Unfortunately the second one was an Amazon warehouse item, and cannot be returned or refunded.

GPU1 = old GTX 660
GPU2 = new GTX 660

GPU1 alone = no crashes ever
GPU1 primary + GPU2 secondary = black screen, hard crash with looping sound requiring a hard shutdown (only in demanding games like BF4, never in Source games)
GPU2 alone = display driver crashes every so often, with symptoms similar to the hard crash but without the actual hard crash
GPU2 primary + GPU1 secondary = black-screen hard crashes with sound loop, more frequent, even in light games including Source
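To spell out what the four configurations above suggest, here is a small Python sketch that records each result and looks for a card that is present in every failing setup and absent from every clean one. The severity scale and the `suspects` function are made up for this post, and the result only narrows things down to a card; it cannot tell a hardware fault apart from a driver or power problem that shows up only with that card installed.

```python
# Observations from the four configurations listed above.
# severity: 0 = no crashes, 1 = recoverable driver crash, 2 = hard crash.
observations = [
    {"cards": ("GPU1",), "severity": 0},
    {"cards": ("GPU1", "GPU2"), "severity": 2},
    {"cards": ("GPU2",), "severity": 1},
    {"cards": ("GPU2", "GPU1"), "severity": 2},
]


def suspects(observations):
    """Cards present in every failing setup and in no crash-free setup."""
    failing = [set(o["cards"]) for o in observations if o["severity"] > 0]
    passing = [set(o["cards"]) for o in observations if o["severity"] == 0]
    common = set.intersection(*failing) if failing else set()
    cleared = set().union(*passing) if passing else set()
    return common - cleared


print(sorted(suspects(observations)))
```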


If anyone here has any clue as to what I need to do to fix this issue, I'd greatly appreciate it.

Specs:

AMD FX-8320 - error free, OC'd to 4.3GHz, 24hr Prime95 stable, averaging under 50°C with occasional peaks of 51 or 52°C
GA-990FXA-UD3 - no failing parts
2x4GB Kingston HyperX Blu DDR3 1600MHz - tested 100% functional with memtest
2TB Seagate Barracuda 7200RPM SATA III 64MB
1TB Hitachi Deskstar 7200RPM SATA II 32MB
OCZ ModXStream Pro 700W - unlikely to be the issue
And of course, 2x EVGA GeForce GTX 660 in SLI. No issues with the SLI bridge.