[Nix-dev] Freeze on boot, bug in nouveau or related module

Eike eike at eknet.org
Mon Jan 12 21:55:19 CET 2015


Dear all,

I have run into a problem after updating my server to the latest 14.12
release (14.12.298.0667587). I lost the server after a reboot and it
turned out that it freezed during boot while running

    systemd-udevd[1382]: worker [1389] failed while handling '/devices/pci0000:00/0000:00:03.0/0000:02:00.0'

This device is the grahpic card. A few lines before, there is

    BUG: unable to handle kernel paging request at ffffc90012d93000

right after loading the fbcon module. This did not happen with
14.12.231.139ead2, although the kernel version was the same.

I use the generated hardware-configuration.nix unchanged. My first naive
guess was that a memory chip is broken, but booting the old version
always works, which makes this unlikely imho. A log excerpt is below
this mail.

Has someone seen this before? I'm not sure what caused this error. I
would think either the nouveau or only the fbcon module…?

Any thoughts on this are much welcome! I can provide more information,
if needed.


Thanks and regards
Eike


-- Reboot --
Jan 11 12:40:52 skyros systemd-journal[1365]: Runtime journal is using 8.0M (max allowed 300.3M, trying to leave 450.5M free of 2.9G available → current limit 300.3M).
Jan 11 12:40:53 skyros systemd-journal[1365]: Permanent journal is using 1.6G (max allowed 4.0G, trying to leave 4.0G free of 1.2T available → current limit 4.0G).
Jan 11 12:40:53 skyros systemd-journal[1365]: Time spent on flushing to /var is 661.705ms for 2 entries.
Jan 11 12:40:53 skyros kernel: Initializing cgroup subsys cpuset
Jan 11 12:40:53 skyros kernel: Initializing cgroup subsys cpu
Jan 11 12:40:53 skyros kernel: Initializing cgroup subsys cpuacct
Jan 11 12:40:53 skyros kernel: Linux version 3.14.27 (nixbld at localhost) (gcc version 4.8.3 (GCC) ) #1-NixOS SMP Thu Jan 1 00:00:01 UTC 1970
Jan 11 12:40:54 skyros kernel: Command line: BOOT_IMAGE=(md/0)/nix/store/w0ml58bivg2jyczpjm66rl3c7vklnbkj-linux-3.14.27/bzImage systemConfig=/nix/store/gjci7k6kix4ygdgvzpvybavpazn9h4x6-nixos-14.12.298.0667587 init=/nix/store/gjci7k6kix4ygdgvzpvybavpazn9h4x6-nixos-14.12.298.0667587/init loglevel=4
Jan 11 12:40:54 skyros kernel: e820: BIOS-provided physical RAM map:
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x0000000000000000-0x000000000009ebff] usable
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x000000000009ec00-0x000000000009ffff] reserved
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x00000000000e0000-0x00000000000fffff] reserved
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x0000000000100000-0x00000000bf77ffff] usable
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x00000000bf780000-0x00000000bf78dfff] ACPI data
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x00000000bf78e000-0x00000000bf7cffff] ACPI NVS
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x00000000bf7d0000-0x00000000bf7dffff] reserved
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x00000000bf7ec000-0x00000000bfffffff] reserved
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x00000000fee00000-0x00000000fee00fff] reserved
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x00000000ffc00000-0x00000000ffffffff] reserved
Jan 11 12:40:54 skyros kernel: BIOS-e820: [mem 0x0000000100000000-0x000000033fffffff] usable
Jan 11 12:40:54 skyros kernel: NX (Execute Disable) protection: active
Jan 11 12:40:54 skyros kernel: SMBIOS 2.5 present.
Jan 11 12:40:54 skyros kernel: DMI: MSI MS-7522/MSI X58 Pro (MS-7522)  , BIOS V8.14B8 11/09/2012
<....>
Jan 11 12:40:54 skyros kernel: nouveau  [     DRM] 0xC592: Parsing digital output script table
Jan 11 12:40:54 skyros kernel: nouveau  [     DRM] MM: using M2MF for buffer copies
Jan 11 12:40:54 skyros kernel: nouveau  [     DRM] Setting dpms mode 3 on TV encoder (output 3)
Jan 11 12:40:54 skyros kernel: nouveau 0000:02:00.0: No connectors reported connected with modes
Jan 11 12:40:54 skyros kernel: [drm] Cannot find any crtc or sizes - going 1024x768
Jan 11 12:40:54 skyros kernel: nouveau  [     DRM] allocated 1024x768 fb: 0x9000, bo ffff88032a10b400
Jan 11 12:40:54 skyros kernel: fbcon: nouveaufb (fb0) is primary device
Jan 11 12:40:54 skyros kernel: BUG: unable to handle kernel paging request at ffffc90012d93000
Jan 11 12:40:56 skyros kernel: IP: [<ffffffffa0b5379b>] nouveau_bo_wr32+0x1b/0x30 [nouveau]
Jan 11 12:40:56 skyros kernel: PGD 33300f067 PUD 333030067 PMD 32a2f2067 PTE 0
Jan 11 12:40:56 skyros kernel: Oops: 0002 [#1] SMP
Jan 11 12:40:56 skyros kernel: Modules linked in: nouveau(+) fbcon bitblit softcursor font tileblit video ttm drm_kms_helper drm agpgart i2c_algo_bit i2c_core evdev mac_hid gpio_ich mxm_wmi coretemp crc32c_intel serio_raw tpm_infineon r8169 tpm_tis acpi_cpufreq mii wmi processor tpm microcode snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore thermal_sys button shpchp hwmon nf_conntrack_ftp lpc_ich nf_conntrack i7core_edac edac_core configs loop cpufreq_ondemand kvm_intel kvm ipv6 autofs4 ext4 crc16 jbd2 mbcache raid1 md_mod sd_mod crc_t10dif crct10dif_generic crct10dif_common uhci_hcd ahci libahci libata ehci_pci ehci_hcd scsi_mod usbcore atkbd usb_common libps2 i8042 serio rtc_cmos unix dm_mod
Jan 11 12:40:56 skyros kernel: CPU: 2 PID: 1389 Comm: systemd-udevd Not tainted 3.14.27 #1-NixOS
Jan 11 12:40:56 skyros kernel: Hardware name: MSI MS-7522/MSI X58 Pro (MS-7522)  , BIOS V8.14B8 11/09/2012
Jan 11 12:40:56 skyros kernel: task: ffff88032e094c80 ti: ffff88032e6ec000 task.ti: ffff88032e6ec000
Jan 11 12:40:56 skyros kernel: RIP: 0010:[<ffffffffa0b5379b>]  [<ffffffffa0b5379b>] nouveau_bo_wr32+0x1b/0x30 [nouveau]
Jan 11 12:40:56 skyros kernel: RSP: 0000:ffff88032e6ed378  EFLAGS: 00010246
Jan 11 12:40:56 skyros kernel: RAX: ffffc90012d83000 RBX: 0000000000002180 RCX: 0000000000000002
Jan 11 12:40:56 skyros kernel: RDX: 0000000020000000 RSI: ffffc90012d93000 RDI: ffff88032a10b800
Jan 11 12:40:56 skyros kernel: RBP: ffff88032e6ed3b0 R08: 0000000000000400 R09: 0000000000000001
Jan 11 12:40:56 skyros kernel: R10: ffff8800ba945e00 R11: ffff8800ba945f00 R12: 0000000000000001
Jan 11 12:40:56 skyros kernel: R13: 0000000000000000 R14: ffff8800bb057300 R15: 0000000000000860
Jan 11 12:40:56 skyros kernel: FS:  00007f6c523757c0(0000) GS:ffff88033fc40000(0000) knlGS:0000000000000000
Jan 11 12:40:56 skyros kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 11 12:40:56 skyros kernel: CR2: ffffc90012d93000 CR3: 000000032d031000 CR4: 00000000000007e0
Jan 11 12:40:56 skyros kernel: Stack:
Jan 11 12:40:56 skyros kernel:  ffffffffa0b4f3bd 0000008100000000 ffff8800bb057300 0000000000000080
Jan 11 12:40:56 skyros kernel:  0000000000000100 ffff88032d29dc00 0000000000000081 ffff88032e6ed3f0
Jan 11 12:40:56 skyros kernel:  ffffffffa0b61f66 0000000000000400 ffff8800ba8c4c00 ffff88032d1bd800
Jan 11 12:40:56 skyros kernel: Call Trace:
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0b4f3bd>] ? nouveau_dma_wait+0xfd/0x600 [nouveau]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0b61f66>] nv04_fbcon_imageblit+0x206/0x310 [nouveau]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0b5d975>] nouveau_fbcon_imageblit+0xf5/0x100 [nouveau]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0abdc9f>] bit_putcs+0x2ef/0x54c [bitblit]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0b6c96e>] ? nouveau_hw_load_state_palette+0x10e/0x120 [nouveau]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0a8029a>] ? drm_fb_helper_setcmap+0x26a/0x450 [drm_kms_helper]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0ac3341>] ? get_color.isra.17+0x31/0x140 [fbcon]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0ac43d8>] fbcon_putcs+0x128/0x160 [fbcon]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0abd9b0>] ? bit_cursor+0x660/0x660 [bitblit]
Jan 11 12:40:56 skyros kernel:  [<ffffffff8132c6ae>] do_update_region+0xfe/0x170
Jan 11 12:40:56 skyros kernel:  [<ffffffff8132e758>] redraw_screen+0x1d8/0x240
Jan 11 12:40:56 skyros kernel:  [<ffffffff8132f0c1>] vc_do_resize+0x481/0x4b0
Jan 11 12:40:56 skyros kernel:  [<ffffffff8132f10f>] vc_resize+0x1f/0x30
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0ac6dae>] fbcon_init+0x33e/0x530 [fbcon]
Jan 11 12:40:56 skyros kernel:  [<ffffffff8132cd48>] visual_init+0xb8/0x120
Jan 11 12:40:56 skyros kernel:  [<ffffffff8132f4e3>] do_bind_con_driver+0x163/0x330
Jan 11 12:40:56 skyros kernel:  [<ffffffff8132fc94>] do_take_over_console+0x114/0x1c0
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0ac2cf2>] do_fbcon_takeover+0x52/0xa0 [fbcon]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0ac7ad3>] fbcon_event_notify+0x6c3/0x770 [fbcon]
Jan 11 12:40:56 skyros kernel:  [<ffffffff811ff6fb>] ? __kernfs_create_file+0x9b/0xc0
Jan 11 12:40:56 skyros kernel:  [<ffffffff8108624c>] notifier_call_chain+0x4c/0x70
Jan 11 12:40:56 skyros kernel:  [<ffffffff8108654d>] __blocking_notifier_call_chain+0x4d/0x70
Jan 11 12:40:56 skyros kernel:  [<ffffffff81086586>] blocking_notifier_call_chain+0x16/0x20
Jan 11 12:40:56 skyros kernel:  [<ffffffff812c0fdb>] fb_notifier_call_chain+0x1b/0x20
Jan 11 12:40:56 skyros kernel:  [<ffffffff812c3026>] register_framebuffer+0x1e6/0x320
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0a816cf>] drm_fb_helper_initial_config+0x32f/0x510 [drm_kms_helper]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0b5dc87>] nouveau_fbcon_init+0x107/0x140 [nouveau]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0b4de47>] nouveau_drm_load+0x4b7/0x8a0 [nouveau]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0a30a6b>] drm_dev_register+0x7b/0x160 [drm]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0a32f10>] drm_get_pci_dev+0xa0/0x220 [drm]
Jan 11 12:40:56 skyros kernel:  [<ffffffff812a5d1b>] ? __pci_set_master+0x2b/0x90
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0b4e43a>] nouveau_drm_probe+0x20a/0x290 [nouveau]
Jan 11 12:40:56 skyros kernel:  [<ffffffff812aacf5>] local_pci_probe+0x45/0xa0
Jan 11 12:40:56 skyros kernel:  [<ffffffff812ac0b1>] pci_device_probe+0xd1/0x130
Jan 11 12:40:56 skyros kernel:  [<ffffffff8134f66f>] driver_probe_device+0x7f/0x250
Jan 11 12:40:56 skyros kernel:  [<ffffffff8134f913>] __driver_attach+0x93/0xa0
Jan 11 12:40:56 skyros kernel:  [<ffffffff8134f880>] ? __device_attach+0x40/0x40
Jan 11 12:40:56 skyros kernel:  [<ffffffff8134d863>] bus_for_each_dev+0x63/0xa0
Jan 11 12:40:56 skyros kernel:  [<ffffffff8134f17e>] driver_attach+0x1e/0x20
Jan 11 12:40:56 skyros kernel:  [<ffffffff8134edb8>] bus_add_driver+0x178/0x230
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0bd9000>] ? 0xffffffffa0bd8fff
Jan 11 12:40:56 skyros kernel:  [<ffffffff8134ff54>] driver_register+0x64/0xf0
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0bd9000>] ? 0xffffffffa0bd8fff
Jan 11 12:40:56 skyros kernel:  [<ffffffff812aa68b>] __pci_register_driver+0x4b/0x50
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0a331aa>] drm_pci_init+0x11a/0x130 [drm]
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0bd9000>] ? 0xffffffffa0bd8fff
Jan 11 12:40:56 skyros kernel:  [<ffffffffa0bd9043>] nouveau_drm_init+0x43/0x45 [nouveau]
Jan 11 12:40:56 skyros kernel:  [<ffffffff81002182>] do_one_initcall+0x112/0x170
Jan 11 12:40:56 skyros kernel:  [<ffffffff81086558>] ? __blocking_notifier_call_chain+0x58/0x70
Jan 11 12:40:56 skyros kernel:  [<ffffffff810ceb05>] load_module+0x1aa5/0x21f0
Jan 11 12:40:56 skyros kernel:  [<ffffffff810cb1a0>] ? m_show+0x1e0/0x1e0
Jan 11 12:40:56 skyros kernel:  [<ffffffff810cb431>] ? copy_module_from_fd.isra.53+0x121/0x180
Jan 11 12:40:56 skyros kernel:  [<ffffffff810cf38e>] SyS_finit_module+0x7e/0x80
Jan 11 12:40:56 skyros kernel:  [<ffffffff814634a9>] system_call_fastpath+0x16/0x1b
Jan 11 12:40:56 skyros kernel: Code: 55 48 89 c7 48 89 e5 e8 84 b3 73 e0 5d c3 66 90 66 66 66 66 90 f6 87 40 02 00 00 80 48 8b 87 30 02 00 00 89 f6 48 8d 34 b0 75 05 <89> 16 c3 66 90 55 89 d7 48 89 e5 e8 25 b4 73 e0 5d c3 0f 1f 00
Jan 11 12:40:56 skyros kernel: RIP  [<ffffffffa0b5379b>] nouveau_bo_wr32+0x1b/0x30 [nouveau]
Jan 11 12:40:57 skyros kernel:  RSP <ffff88032e6ed378>
Jan 11 12:40:57 skyros kernel: CR2: ffffc90012d93000
Jan 11 12:40:57 skyros kernel: ---[ end trace 10727e405c2fe506 ]---
Jan 11 12:40:57 skyros systemd-journal[1365]: Journal started
Jan 11 12:40:57 skyros systemd-modules-load[1364]: Inserted module 'kvm_intel'
Jan 11 12:40:57 skyros systemd-udevd[1382]: starting version 217
Jan 11 12:40:57 skyros systemd-modules-load[1364]: Inserted module 'cpufreq_ondemand'
Jan 11 12:40:57 skyros systemd-udevd[1382]: specified group 'input' unknown
Jan 11 12:40:52 skyros systemd-modules-load[1364]: Inserted module 'loop'
Jan 11 12:40:52 skyros systemd-modules-load[1364]: Inserted module 'configs'
Jan 11 12:40:52 skyros systemd-modules-load[1364]: Inserted module 'nf_conntrack_ftp'
Jan 11 12:40:52 skyros systemd-modules-load[1364]: Inserted module 'snd_pcm_oss'
Jan 11 12:40:53 skyros systemd-modules-load[1364]: Inserted module 'microcode'
Jan 11 12:40:54 skyros systemd-udevd[1382]: worker [1389] terminated by signal 9 (Killed)
Jan 11 12:40:54 skyros systemd-udevd[1382]: worker [1389] failed while handling '/devices/pci0000:00/0000:00:03.0/0000:02:00.0'

--
gpg: AD7AC35E
finger print: 137F BB0B 1639 D25F DC5D  E59C B412 C5F5 AD7A C35E


More information about the nix-dev mailing list