Tom's Hardware > Forum > Homebuilt Systems > General Homebuilt > Troubleshooting a relatively new build

Troubleshooting a relatively new build

Forum Homebuilt Systems : General Homebuilt - Troubleshooting a relatively new build

Tom's Hardware: Over 1.4 million members in 6 different countries available to answer all your high-tech questions. Sign up now! Its free!
Word :    Username :           
 

I started putting my first PC together in May, and it's been a lot of fun.

* CPU: Intel Core i7 920
* Motherboard: Asus X58 P6T
* Cooler: Cooler Master Hyper N520
* RAM: OCZ Gold 3 x 2GB 1600MHz (6 GB)
* GPU: BFG NVIDIA GeForce 260 OC Maxcore (Core 216)
* PSU: PC Power & Cooling 750W Quad Red Silencer
* Case: Antec Nine Hundred Two
* HDD: 1TB Samsung Spinpoint F3 and 500GB WD Caviar
* OS: Windows 7 RTM 64-Bit

Wow, it's fast (not overclocked)!!! I'm not a hard-core gamer, but I had to pick up a couple games to test the muscle in this baby. It plays Crysis, Batman Arkham Asylum, and Resident Evil 5 great.

And I love Windows 7.

It runs great most of the time. But every once in a while, it freezes. No blue screen or anything, but the 21" Samsung LCD immediately displays a "Not Optimum Mode Recommended mode: 1650x1080 60Hz" message.

Initially, this seemed to happen when I left it on overnight, so I thought it was a standby thing. So I turned off all standby modes, but it didn't solve the problem. I also played with S1 versus S3 standby in the BIOS, but it didn't help.

I thought it might be the monitor, so I tried my old Dell FP, and it would also lose signal in the middle of the night.

I googled, and saw someone had a similar issue (which remains unsolved), but he pointed out that it's probably not a monitor issue, as he notices his keyboard caps-lock indicator doesn't light up when he's in this state. That's what I'm seeing to.

And then, occasionally, it would do this same thing WHILE I WAS USING THE COMPUTER. This was annoying, but it was revealing--I was playing a game, and the audio seemed to loop. So it's definitely not just a display issue, or USB issue... it seems to be locked up.

As I first mentioned, this is my first build, so I need your help in troubleshooting. I've tried unplugging everything and checking every cable and plugging it all back in tight. And I'm monitoring my CPU and GPU temps, and they are okay. And my computer is not dusty.

The frequency of these issues is: if I leave it running overnight, it happens about every other night; when I'm actually using my computer, it's probably locked up on me about 5 or 6 times since May.

Oh, and it's not the OS--I've reinstalled Windows 7 several times in moving from RC to RTM.

Can you guys assist me in troubleshooting this issue? Could it be some other kind of overheating that I'm not monitoring? Northbridge or something? Does this sound like a Motherboard issue? I never lose power, so does that eliminate power supply issues?

Also, I had this problem before and after changing CPU coolers, and before and after adding the 1TB HD.

Thank you in advance!

Z

Sponsored Links
Register or log in to remove.

The first thing to do is try to find a way to duplicate the error. Find something that is going to cause it every time.

prime95:
Instructions
http://www.playtool.com/pages/prime95/prime95.html

DL
http://www.mersenne.org/freesoft.htm

memtest
http://www.memtest.org/

HWMonitor
http://www.cpuid.com/hwmonitor.php

Sleep mode troubleshooting
http://www.vistax64.com/tutorials/ [...] blems.html

------------------------------ My Guide to choosing parts
A big list of recommended parts
Troubleshooting Guide w/links
Reply to Proximon

In addition, eliminate heat as a possible cause. Please use CPUID Hardware Monitor (or RealTemp) to monitor temps on your cores, gpu. You can get them here:

http://forums.tweaktown.com/f69/la [...] ols-30530/

Reply to Twoboxer

I have run some stress tests earlier, and they seemed fine--memtest, prime95, furmark, etc... but I haven't run them in a few months now, and I never ran anything except memtest for more than maybe 45 minutes--they all seemed to stabilize.

I do always use RivaTuner and Everest for monitoring, using the Gadget/Widget thing to view. Currently, while not doing much, my GPU is at 41C and my CPU cores are between 32C and 36C. When playing 3D games, GPU goes up to high 50s and CPU cores go up to the 40s.

Current Ambient temp is 34C. Current System temp is 34C.

Let's see, the last time I really checked temps was after installing this new cooler in June. I've got some numbers here somewhere... no, can't find 'em. I'll rerun prime later, and run memtest overnight and see if it doesn't freeze.

Z

Reply to zinzan

We do not use RivaTuner and Everest for monitoring, so it would be helpful if you took the time to DL the requested utilities and check temps.

Reply to Twoboxer

Okay. I have those already--just had the everest numbers at hand via the Win7 Gadget.

No problem. I'll get back to you later tonight.

Reply to zinzan

Here are some test results. I didn't get a chance to run everything--I have a newborn at home, and my time was kinda divided. :-)

First, I ran Prime95 for 15 minutes with the following settings:

http://webpages.charter.net/dean_money/img/prime95_settings.jpg

And the HWMonitor results were:

http://webpages.charter.net/dean_money/img/HWMonitor_after_Prime95.jpg

Ambient temp in the room was about 22C.

Then I ran a furmark stability test for 10 minutes, but I screwed up the screenshots. The GPU got as hot as 67.

But I wanted the screenshots, so I reran for 15 minutes. This time, it only peaked at 62C:

http://webpages.charter.net/dean_money/img/furmark_after_15mins.jpg

http://webpages.charter.net/dean_money/img/HWMonitor_after_furmark.jpg

I then kicked off memtest86+ and went to bed. Woke up this morning and it said the test was complete, no errors, everything passed.

How do my temps look? Any ideas?


Message edited by zinzan on 09-25-2009 at 05:22:21 PM
Reply to zinzan

Oh, one more note. I've tried multiple BIOS settings. On the most recent one now. This problem exists on all BIOS versions tested.

Z

Reply to zinzan

Your temps are very solid, fine. So that's not the issue.

You tried two monitors, so losing synch with the monitor's resolution is not the issue.

Corrupt/confused Drivers? Seems not to be, since you say you have re-installed the OS several times and you have had the same issue across the installs.

Personally, I'd run Prime95 for several hours semi-observed and then continuing overnight with "Detect Rounding Errors" checked. Its unlikely to fail, but then again these failures are pretty obscure.

If that fails, I guess I'm liking the mobo for this problem.

Reply to Twoboxer

Okay, sounds good. Blend mode is okay?

But if it just freezes after several hours, will we actually learn anything? Does Prime95 or HWMonitor create a log that will indicate anything?

Reply to zinzan

I want to make sure we can't make it fail consistently after, say, 1 hr 14 min of Prime95. If that were to happen, and it were not heat related, that might tell us something.

That's why the first run semi-observed, for much longer than the 15 minutes you did.

The rest of the run is for . . . having something to do overnight lol. You're right, prolly won't learn anything - but may if it runs long enough to get a "rounding" error.

Reply to Twoboxer

I just ran it for just under 3 hours. Similar peak temps on the CPU--high 50s. Didn't crash or freeze. But I had to stop it because I am sick and can't stay up to semi-monitor it. Really don't like cooking my CPUs for 11 hours, unmonitored...

but I just realized I forgot to turn on Detect Rounding Errors. I'll run it again tomorrow for several hours.

Z

Reply to zinzan

OK Z.

If it doesn't approach the "fry" zone after 3 hours, if you didn't notice any gradual climb in max temp after an hour or more, overnight is safe.

Be interesting to see it run without "losing the monitor".

Reply to Twoboxer

I ran it today with Detect rounding errors on. Checked it after 3 hours. Everything okay--CPU temps maxed at 59C. Checked it after 4 hours, same results. Sometime before 5 hours, freeze occurred as mentioned in original post.

Any other ideas? Or time to try to RMA to Asus?

Z

Reply to zinzan

Disable EIST and C1 state in bios. The new core i5s/i7s crash when idle... probably an undervolting issue. I'm not sure if its just a Win7 issue, but thats what I run as well. I've reported it to intel.

Reply to bishyaler

I can give that a try.

Can you point me to a reference somewhere on this problem?

Also, let me make sure I understand what I'll be losing by disabling these features. I'll be losing automatic down-clocking which will use more power, right? And I'll be losing the turbo feature of the i7 when only using 1 core?

Z

Message quoted 1 times
Message edited by zinzan on 09-28-2009 at 04:24:50 AM
Reply to zinzan

zinzan wrote :

I can give that a try.

Can you point me to a reference somewhere on this problem?

Also, let me make sure I understand what I'll be losing by disabling these features. I'll be losing automatic down-clocking which will use more power, right? And I'll be losing the turbo feature of the i7 when only using 1 core?

Z



There's no reference, I could only find a few google hits on similar problems. I discovered the fix myself after a few days of frustrating troubleshooting. Right, no power saving, no turbo mode.

Reply to bishyaler

Well, the cpu was hardly idle when this shutdown occurred. It was running 8 threads of Prime95.

I guess I'd put the problem to Asus and Intel. Having similar shutdowns at idle (after several hours) and in Prime95 (after 5 hours) says to me that one of the two devices has an issue. I'd like to hear what they say.

Reply to Twoboxer

Good point about the freeze (not shutdown) during Prime95.

When you say "put the problem to Asus and Intel", do you mean for me to open tickets with them through their support?

Z

Reply to zinzan

Okay.

So I talked to Intel, and they basically said it doesn't sound like a processor issue. They said the issue would be less sporadic/more constant.

I called Asus, and they said it could very well be a motherboard issue. They have issued an RMA.

Hopefully, this will resolve the problem.

I forget... I won't have to reinstall my software when I replace the motherboard, will I? I'll just have to reauthenticate Windows 7?

Z

Reply to zinzan

Those are two pretty reasonable responses from the Big Guys. Can't wait to see what happens next.

Please let us know.

Reply to Twoboxer

Just got the replacement mobo from Asus yesterday. Hope to put it in tonight. Unfortunately, I'm out of town all weekend, starting tomorrow. I guess I'll just play with it a few hours tonight, then leave it running, and hopefully it will still be going on Sunday night when I return home.

Z

Reply to zinzan

We'll be here. Hopefully the new mobo will fix it.

Reply to Twoboxer

Well, I've installed the replacement motherboard. It is clearly a used board--maybe that's not a surprise. But I was disappointed to see old thermal paste all over the CPU socket cap and the metal spring clip holding it in place. I lifted it out, and as soon as I did, a "ball" of the dried thermal paste fell into the socket. Not the middle pin area, but in the tall pin area around the outside of the socket.

http://www.cpu-world.com/Sockets/L_LGA1366-top.jpg

I was tempted to just return it to Asus right then, but unfortunately, I had already pulled my board out and cleaned the CPU and HSF and just wanted to get this tested. I took some compressed air, and blew the ball of thermal paste out. It didn't go easily, and left a little residue--I was weary of bending the pins, so I just dropped a little ArctiClean in the area, let it dry, then blew it again with the compressed air.

It's running okay. Temperatures seem about the same as before, but I haven't stressed it yet. I'll run a few tests tonight, then will leave it to run over the weekend. Hopefully, it will still be running when I get home, and I'll run some further stress tests on Sunday.

Lesson learned--thoroughly check out the replacement part BEFORE uninstalling your current parts!!!

I'll send an update later.

Z

Reply to zinzan

Well, damn! It was locked up when I got home last night--didn't have the Samsung "not optimum mode... recommended mode..." error dancing around the screen, but the monitor was black, and the computer was still on. USB keyboard didn't seem to do anything (caps lock indicator wouldn't come on). I rebooted, messed around with the computer for a couple hours, then left it on.

Same thing this morning. *sigh*

Any ideas of how to log this error? Could it be a keyboard issue? I might try switching back to a PS2 keyboard...

Z

Reply to zinzan

Sleep mode issues can be a number of things. If you look at this link, I think I included a link in it for sleep mode issues:
http://www.tomshardware.com/forum/ [...] leshooting

------------------------------ My Guide to choosing parts
A big list of recommended parts
Troubleshooting Guide w/links
Reply to Proximon

Proximon,

If you recall, I've had this issue occur WHILE RUNNING PRIME95 overnight. It doesn't seem like a standby issue, but I could be wrong. I think I mentioned that I thought it might be a sleep/standby issue in the past, so I turned off all sleep options in Windows, and also tried S1 instead of S3 in the BIOS.

Any other ideas, or any way to log exactly what the computer is doing when it freezes up?

Surely this must be fixable, or at least avoidable. I'm using quality components.

Z

Reply to zinzan

Are we talking about the same issue? or a different one?

You could have more than one thing wrong you know ;)

------------------------------ My Guide to choosing parts
A big list of recommended parts
Troubleshooting Guide w/links
Reply to Proximon

Quote :

It runs great most of the time. But every once in a while, it freezes. No blue screen or anything, but the 21" Samsung LCD immediately displays a "Not Optimum Mode Recommended mode: 1650x1080 60Hz" message.

Initially, this seemed to happen when I left it on overnight, so I thought it was a standby thing. So I turned off all standby modes, but it didn't solve the problem. I also played with S1 versus S3 standby in the BIOS, but it didn't help.

I thought it might be the monitor, so I tried my old Dell FP, and it would also lose signal in the middle of the night.

I googled, and saw someone had a similar issue (which remains unsolved), but he pointed out that it's probably not a monitor issue, as he notices his keyboard caps-lock indicator doesn't light up when he's in this state. That's what I'm seeing to.

And then, occasionally, it would do this same thing WHILE I WAS USING THE COMPUTER. This was annoying, but it was revealing--I was playing a game, and the audio seemed to loop. So it's definitely not just a display issue, or USB issue... it seems to be locked up.



I'm talking about this.

After another day and a half of using the computer, it has frozen up two times again, with the "Not Optimum Mode..." message on the Samsung monitor.

Z

Reply to zinzan

We had someone last month with a tough case like this, and it turned out they needed a firmware update to their graphics card.

Since the CPU and RAM and MB seem less likely now, I would be looking at the GPU, PSU, and even the HD.

------------------------------ My Guide to choosing parts
A big list of recommended parts
Troubleshooting Guide w/links
Reply to Proximon

Yeah, I'm starting to lean towards the GPU and PSU myself. I've updated the firmware drivers on the GPU, so I don't think it's that. I've even tried rolling back to older versions. No luck.

I just tried switching to PS/2 keyboard and mouse, and that didn't solve the problem.

I'm thinking of trying to RMA the GPU, or at least opening a case with them, telling them I've tried swapping out the motherboard, monitor, keyboard, and mouse.

Oh, and I'm also thinking I'm going to keep my original motherboard and send back this refurbished RMA replacement. It didn't solve the problem, and the system temp seems to run a little higher.

Z

P.S. Oh, and I don't think it's the HD. I had this problem with my original 74GB WD Raptor, and have since decomissioned it (using it as an external eSata drive, not currently connected) and replaced it with a 1TB Samsung Spinpoint F3. I am still using the same 500GB secondary storage HD (WD Caviar).


Message edited by zinzan on 10-15-2009 at 12:25:10 AM
Reply to zinzan

*sigh*

I went ahead and bought a second video card (same as before, BFG NVIDIA GeForce 260 OC). Figured I'd use this to test my machine before RMA'ing my current video card, and after everything was resolved, I'd have a backup GPU and a second card to use for SLI down the road.

Unfortunately, I again found my machine "stuck" (display signal lost, USB keyboard unresponsive) when left on overnight. Power was still on, as usual.

So, to summarize:
* I've updated motherboard BIOS.
* I've tried various video card drivers, including the latest.
* I've tried a different monitor, video card, hard drive, and keyboard and mouse (USB and PS2).
* I've turned off standby in Windows, and tried S3, S1, and Auto setting in BIOS.
* Temps seem good.
* Memory tests always pass.
* I've unplugged and replugged in power cables (obviously necessary when changing components).

That pretty much just leaves CPU and power supply, right? I mean, I'm discounting a loose cable, since I've checked, and have obviously had to unplug and replug them all in testing different video cards, motherboard, etc.

Does this sound more like a PSU or CPU issue now? PC Power & Cooling's warranty says I can send it in (at my expense), and they will repair or replace at their discretion (if they find a problem). I did try to open a case with Intel, but they pushed me to trying a different motherboard first. Since that didn't work, maybe they'll now accept an RMA on the CPU, or they might push me towards further analysis on the power supply.

Z

Reply to zinzan

If it never locks up when in use, I would lean towards the PSU myself.

------------------------------ My Guide to choosing parts
A big list of recommended parts
Troubleshooting Guide w/links
Reply to Proximon

Proximon wrote :

If it never locks up when in use, I would lean towards the PSU myself.



It has done so while in use. See post at 09-27-2009 at 06:31:19 PM.

Z


Message edited by zinzan on 10-20-2009 at 11:54:44 PM
Reply to zinzan

Ah yes, sorry.

Could this be temperature related? I know your sensors are all reporting fine, but I'm talking about the PSU temp. Could the PSU be getting warm when this happens?

A combination of rising ambient temps (computer has been on several hours) and variations in the temp of the room?


------------------------------ My Guide to choosing parts
A big list of recommended parts
Troubleshooting Guide w/links
Reply to Proximon

Room is pretty stable, around 22-23C. It's in an open room in my house downstairs.

This PSU (PC Power & Cooling 750W Quad Red Silencer) doesn't seem to share the temperature, as far as I can tell--I don't think there's a fan pin lead to the motherboard, like there was on my previous Enermax 525W. How would I obtain the PSU temp on this Quad Red Silencer?

Z

Reply to zinzan

There's no easy way. You could monitor exhaust temps with something, and if they were higher after a crash that would at least help point the way.

A PSU can take a long time to heat up, and also to cool down, and so can cause issues after hours of running, or faster with a few degrees ambient temp increase.

I've been wanting one of these toys, maybe for Christmas ;)
http://www.amazon.com/gp/product/B [...] 07NNN3M0WZ

------------------------------ My Guide to choosing parts
A big list of recommended parts
Troubleshooting Guide w/links
Reply to Proximon
Tom's Hardware > Forum > Homebuilt Systems > General Homebuilt > Troubleshooting a relatively new build
Go to:

There are 750 identified and unidentified users. To see the list of identified users, Click here.

Sponsored links
  • Ask the community now
  • Publish
Ad
They won a badge
Join us in greeting them