Linux Lock-ups?

Zorak

Distinguished
Mar 20, 2006
505
0
18,990
In my experience, Linux has been a pretty stable platform (i've been able to leave my machine on for days), but I have noticed that on a few different occasions it has locked up for some unknown reason and I am forced to do a hard reset.

I don't know if I am doing anything wrong here, and I am not sure whether there is a more graceful way to get out of system lockups than what I have been doing previously. Any help is really appreciated.

Also, i did read an article about some interupt key patterns to use if the system locks up, but this last time even that didn't work (neither <ctrl><alt> backspace nor <alt> <prt scr> REISUB worked for me).

I wrote this post because I found it pretty disturbing that all i was doing was watching a couple of videos on youtube and the system just ground to a halt. I sincerely hope this is a stupid configuration error on my part as opposed to some sort of hardware malady, because as I understand it, Linux will only do this when something has gone horribly wrong. On a related note, there may very well be a problem with the linux version of the firefox flash plugin since it has caused firefox to lock up several times ( but never system wide before).

Thanks guys.

-Zorak
 

linux_0

Splendid
Flash player is notoriously unstable and will lock up all the time.

You are correct however that a full system wide lock up could potentially be something far more serious.
 

knightrous

Distinguished
Jan 27, 2006
271
0
18,780
This might be way off, but I had a similar issue in Ubuntu when I updated my Nvidia drivers for my 6600GT, suddenly my system would freeze up randomly, or so I thought. Music would keep playing in the background, but the whole computer looked frozen and wouldn't respond to any commands. Hard reset was the only way to fix it. When I installed Ubuntu 7.10 a few weeks after the problem arrived, it stopped happening... No idea why, but the problem stopped for me :D
 

Zorak

Distinguished
Mar 20, 2006
505
0
18,990
Well, knightrous, on a couple of occasions i have had the lockup with music still playing in the background (and it could very well be the nvidia driver as i have an nvidia card myself) but this last time, the music from the youtube video kept replaying about the last 20 milliseconds again and again and the entire system locked up and wouldn't respond to any of the interrupt combinations i know.

Is there anything I could do to check my system aside from just running memtest86+ to check for ram problems (i.e. any other good diagnostic software)?

Thanks

-Zorak
 

linux_0

Splendid
prime 95, memtest, compile the kernel a few times and run some linux benchmarks.

Also make sure nothing is shorting in your system and that none of your components are overheating.

Your GPU, CPU and chipset may very well lock up at the hardware level if they overheat or if there is a hardware bug or a severe driver bug.

If I remember correctly you have a huge Antec 900 or better with a number of fans so it may seem unlikely that it is a heat issue but if any of the heatsinks have come loose, if there is insufficient or too much heatsink compound or if a component is getting very hot in a certain spot it can still crash.

If X11 locks up but the music keeps playing normally and you can still ping the server from another PC and maybe ssh to it or connect to open ports then that is most likely just a GUI crash and if you can ssh in you should be able to kill the GUI / X11 and restart it.

That is where the magic sysrq keys come in which you have to enable ( check my PM from yesterday on how to enable them ).

If the hardware has not locked up and your kernel hasn't suffered a kernel panic you should always be able to use the sysrq keys.

Also install sensors and check your temps and voltages.

GL :)
 

Zorak

Distinguished
Mar 20, 2006
505
0
18,990
I know this isn't a real benchmark, but on a couple of occasions after this lockup I installed and played the Crysis single player demo for several hours (its fun messing around) and I didn't experience any crashes due to overheating or the like while I did that. Additionally I have been able to do the same with no ill effect in the Unreal Tournament 3 demo. I would think that since windows is less stable over all, if I had a loose heatsink or too much thermal compound, i'd have the same issues there, but I don't. I guess as soon as I get the chance I will try running a few of the benches that you suggested ( i have like 3 tests and 2 projects coming up :( )

Next time I experience a lockup in Linux (which i hope doesn't happen soon), I will try pinging my machine from another box and maybe i can gracefully kill X remotely since I can't do it via <ctrl><alt> backspace or via a virtual terminal.

also, about compiling the kernel, would i just grab a copy from kernel.org, untar it to a temporary spot and just type make? I just want to make sure that I won't inadvertently replace my current kernel with a new "hand-compiled" kernel because I have no experience compiling kernels and I don't want to have to spend time fixing my machine when I really need it for an upcoming assignment.

Also, an a somewhat unrelated note, audacious media player just broke on me last night for no reason! I am hoping that it was just a bug in an update or something because it is strange when you have a large playlist of files with random select and repeat options enabled and it "randomly" elects to play the same song again and again...

Anyways, thanks guys.

-Zorak
 

linux_0

Splendid
To compile the kernel just grab the latest full source from kernel.org currently http://kernel.org/pub/linux/kernel/v2.6/linux-2.6.23.1.tar.bz2


F = Full Source

extract it

make menuconfig

use tab to exit from the configuration menu and let it save the config when prompted

make

sensors

make clean

make

sensors

rinse and repeat :)


you need to install and configure sensors if you don't already have'em



here's some benchmarks to google:

mysql-bench

postgresql

tscp

POV-Ray

lame-mp3

gzip

bzip2

oggenc

http://lbs.sourceforge.net/


GL :)
 

Zorak

Distinguished
Mar 20, 2006
505
0
18,990
Thanks man, I think i will go ahead and give it a try sometime this weekend. Also, to answer your question, Knightrous, no I wasn't using Ktorrent or Amarok (i haven't even really used any KDE programs besides K3b :D )

-Zorak
 


That's not a complete lockup, that's X locking up. Keyboard, mouse, and video are controlled by X, so the music still playing tells me it's not the whole machine locking up. If you have a second machine, try sshing into the "locked" one. Chances are you can do this as you would normally. Kill the X process, rmmod the GPU driver (nvidia), modprobe the driver, restart xdm/gdm/kdm and you're good to go.

I have had this happen on my machine when playing ETQW and the aforementioned steps work well. Just beware that if you restart X without rmmoding and modprobing the GPU driver and X locked up because of the GPU driver, you actually will lock up the system (your ssh link will die.)

Is there anything I could do to check my system aside from just running memtest86+ to check for ram problems (i.e. any other good diagnostic software)?

Thanks

-Zorak

Check to see that you are putting enough volts to your RAM in the BIOS and making sure that it is running at the appropriate speed and timings. If you are OCing, then return the clocks and volts to normal.