DebuggingKernelSuspend

Differences between revisions 2 and 3
Revision 2 as of 2007-02-23 20:59:51
Size: 2710
Editor: rtg
Comment:
Revision 3 as of 2007-02-23 21:00:45
Size: 2714
Editor: rtg
Comment:
Deletions are marked like this. Additions are marked like this.
Line 22: Line 22:

This page describes how to debug Suspend to RAM/Resume problems on your computer. Do not confuse this with Suspend to disk.

Suspend and Resume use facilities within your BIOS called ACPI, or Advanced Configuration and Power Interface. Linux provides an ACPI subsystem that manages the suspend and resume process. The usual problem occurs when resuming, and normally the culprit is a device driver that does not recover from a powered down state. If your computer successfully performs a suspend, then it is quite likely any resume problems are due to another device driver and not the ACPI subsystem.

Resume problems are difficult to debug since there is no non-volatile storage available at the time resume is bringing up your computer. The only hardware on a PC motherboard that retains information across power cycles is the real time clock (RTC). For those that just have to know, read Documentation/power/s2ram.txt in your kernel sources. The implementation of suspend/resume debug trace is in drivers/base/power/trace.c.

Caveat Emptor: Using the following debug suggestions will radically change the values in your RTC chip, so much so that your file system will think it has been eons since the last fsck. You can avoid a long fsck delay by using 'tune2fs'. For example, 'tune2fs -i 0 /dev/sda1' disables fsck on boot.

NTP is not sufficient to recover the correct time of day. By default NTP is configured to to only correct for about an hour drift. You must use the 'date' command to get the RTC value within the drift tolerance.

In order to stimulate your suspend/resume process, enter the following commands (as root):

sync; echo 1 > /sys/power/pm_trace; echo mem > /sys/power/state

At this point your computer should enter the suspend state within a few seconds. Usually the power LED will slowly flash when in the suspended state. When that has happened, initiate the resume process by pressing the power button. Observe closely if the disk light comes on briefly. This indicates that resume has begun. If resume fails to complete, then press the power button until the computer turns off. Power on your computer making sure that it loads the same kernel that exhibited the resume problem. Start a console and enter:

dmesg > dmesg.txt

You can edit this file and find line similar to these:

[ 11.323206] Magic number: 0:798:264

[ 11.323257] hash matches drivers/base/power/resume.c:46

There may well be another 'hash matches' line beyond that. If so, then you are in luck because it is the likely culprit. For example:

hash matches device i2c-9191

The only way to prove this is to remove the module prior to initiating suspend. Repeat as needed...

DebuggingKernelSuspend (last edited 2019-01-28 21:01:56 by penalvch)