|
other: 0.861
graphic: 0.860
semantic: 0.855
performance: 0.850
debug: 0.836
KVM: 0.827
permissions: 0.820
PID: 0.759
network: 0.733
device: 0.728
vnc: 0.717
socket: 0.699
files: 0.630
boot: 0.608
kernel NULL pointer in -virtual (-server) kernel
When stress testing eucalyptus we have run into this oops inside VMs
[ 82.907577] BUG: unable to handle kernel NULL pointer dereference at 0000000000000358
[ 82.908842] IP: [<ffffffff813982e8>] sym_int_sir+0x2a8/0x750
[ 82.909773] PGD 0
[ 82.910110] Thread overran stack, or stack corrupted
[ 82.910870] Oops: 0000 [#1] SMP
[ 82.911407] last sysfs file: /sys/devices/virtual/block/ram9/uevent
We launched 18 instances, 2 of them failed this way. The instances run with 192M of memory. With 6 VM launches on a single node all at the same time the host is under heavy load.
This occurred in 20100323 lucid x86_64 uec-image instance.
ProblemType: Bug
AlsaDevices: Error: command ['ls', '-l', '/dev/snd/'] failed with exit code 2: ls: cannot access /dev/snd/: No such file or directory
AplayDevices: Error: [Errno 2] No such file or directory
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory
CurrentDmesg:
Date: Wed Mar 24 22:06:32 2010
DistroRelease: Ubuntu 10.04
Frequency: Once a day.
Lsusb: Bus 001 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
MachineType: Bochs Bochs
Package: linux-image-2.6.32-16-virtual 2.6.32-16.25
PciMultimedia:
ProcCmdLine: root=/dev/sda1 console=ttyS0
ProcEnviron:
LANG=en_US.UTF-8
SHELL=/bin/bash
ProcVersionSignature: User Name 2.6.32-16.25-server
Regression: No
Reproducible: No
SourcePackage: linux
TestedUpstream: No
Uname: Linux 2.6.32-16-server x86_64
dmi.bios.date: 01/01/2007
dmi.bios.vendor: Bochs
dmi.bios.version: Bochs
dmi.chassis.type: 1
dmi.chassis.vendor: Bochs
dmi.modalias: dmi:bvnBochs:bvrBochs:bd01/01/2007:svnBochs:pnBochs:pvr:cvnBochs:ct1:cvr:
dmi.product.name: Bochs
dmi.sys.vendor: Bochs
I saw this https://bugzilla.redhat.com/show_bug.cgi?id=560114 which links to http://<email address hidden>/msg08927.html
just for the record, ubuntu-bug was run on a different instance (same kernel/filesystem, but not the one that failed)
Hi Scott,
This bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? Can you try with the latest development release of Ubuntu? ISO CD images are available from http://cdimage.ubuntu.com/releases/ .
If it remains an issue, could you run the following command from a Terminal (Applications->Accessories->Terminal). It will automatically gather and attach updated debug information to this report.
apport-collect -p linux 546458
Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.
Thanks in advance.
[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]
This is still a concern, and it is not easily reproducible. I would rather keep it open, unless you have reason to consider the OOPS resolved on current 2.6.32-21. I am stress-testing (by extension) kvm, and hopefully I can either state that it has not happened in the last few thousand tries, or reconfirm it.
Then let's set the linux task to Triaged. I only like to use 'new' for actual new bugs. :-)
~JFo
I see no indication that this is actually a qemu issue. If there's any evidence that it is, please reopen.
Just to confirm -- still present, 10.04 LTS up-to-date, UEC images also 10.04 up-to-date. A 2,000-run test creating KVM instances under Eucalyptus shows 6 occurrences of this OOPS:
WARNING:INSTANCE i-3EDE078A:[ 129.998256] BUG: unable to handle kernel NULL pointer dereference at 00000358
WARNING:INSTANCE i-406C06CE:[ 89.245841] BUG: unable to handle kernel NULL pointer dereference at 0000000000000358
WARNING:INSTANCE i-411D0851:[ 158.375444] BUG: unable to handle kernel NULL pointer dereference at 00000358
WARNING:INSTANCE i-4E1C08D4:[ 196.089623] BUG: unable to handle kernel NULL pointer dereference at 0000000000000358
WARNING:INSTANCE i-54800A8D:[ 67.825483] BUG: unable to handle kernel NULL pointer dereference at 0000000000000358
WARNING:INSTANCE i-5E970AA3:[ 87.610866] BUG: unable to handle kernel NULL pointer dereference at 00000358
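For anyone repeating this kind of stress run, the tally above can be scripted. The following is only a rough sketch: the `WARNING:INSTANCE <id>:` prefix is copied from the lines quoted in this comment rather than from any documented log format, and `summarize_oopses` is a hypothetical helper name.

```python
import re
from collections import Counter

# Pattern modelled on the six console lines quoted in this comment.
# The line layout is an assumption taken from those lines only.
OOPS_RE = re.compile(
    r"INSTANCE (?P<instance>i-[0-9A-F]+):\[\s*(?P<uptime>\d+\.\d+)\] "
    r"BUG: unable to handle kernel NULL pointer dereference at (?P<addr>[0-9a-f]+)"
)


def summarize_oopses(lines, total_runs):
    """Count NULL-dereference oopses and group them by faulting address."""
    hits = [m.groupdict() for m in map(OOPS_RE.search, lines) if m]
    # 00000358 and 0000000000000358 are the same address printed at
    # different widths, so compare them as integers.
    by_addr = Counter(int(h["addr"], 16) for h in hits)
    return {
        "count": len(hits),
        "rate": len(hits) / total_runs,
        "instances": [h["instance"] for h in hits],
        "by_addr": dict(by_addr),
    }
```

Fed the six lines above with `total_runs=2000`, this reports a failure rate of 0.3% and a single faulting address, 0x358, for every hit. The two print widths may correspond to 32-bit and 64-bit guests, but that is a guess and not something the log states.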
Per comment #7, marking this invalid for qemu-kvm.
Description of problem:
During a VNC installation on a virt guest, the installer appeared to hang while installing the bootloader. Closer inspection on the console shows a kernel NULL pointer dereference.
Version-Release number of selected component (if applicable):
* anaconda-13.23
* kernel-2.6.33-0.23.rc5.git1.fc13.x86_64
How reproducible:
* This is the first time out of 8 or more installs
Steps to Reproduce:
1. Perform an installation against http://alt.fedoraproject.org/pub/alt/stage/rawhide-testing/
Actual results:
Running anaconda 13.23, the Fedora system installer - please wait.
21:09:16 Starting VNC...
21:09:17 The VNC server is now running.
21:09:17
WARNING!!! VNC server running with NO PASSWORD!
You can use the vncpassword=<password> boot option
if you would like to secure the server.
21:09:17 Please manually connect your vnc client to test1200.test.redhat.com:1 (10.10.10.200) to begin the install.
Press <enter> for a shell
21:09:18 Starting graphical installation.
sh-4.1# BUG: unable to handle kernel NULL pointer dereference at 0000000000000358
IP: [<ffffffffa010846b>] sym_int_sir+0x646/0x1549 [sym53c8xx]
PGD 2b2ef067 PUD 3eea2067 PMD 0
Oops: 0000 [#1] SMP
last sysfs file: /sys/devices/pci0000:00/0000:00:04.0/host2/target2:0:0/2:0:0:0/block/sda/removable
CPU 0
Pid: 0, comm: swapper Not tainted 2.6.33-0.23.rc5.git1.fc13.x86_64 #1 /
RIP: 0010:[<ffffffffa010846b>] [<ffffffffa010846b>] sym_int_sir+0x646/0x1549 [sym53c8xx]
RSP: 0018:ffff880003c039b0 EFLAGS: 00010087
RAX: 000000000000000a RBX: ffff88003e41c000 RCX: 0000000000000070
RDX: 0000000000000000 RSI: ffffffffa0103c22 RDI: ffffc90000a5a006
RBP: ffff880003c03a30 R08: ffffffff81a4b830 R09: 0000000000000001
R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
R13: ffff88003e41c090 R14: ffff88003e6a800b R15: 0000000000000084
FS: 0000000000000000(0000) GS:ffff880003c00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000358 CR3: 000000002b2ec000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process swapper (pid: 0, threadinfo ffffffff81a00000, task ffffffff81a4b020)
Stack:
0000000000000000 0000000000000002 ffffffff81a4b020 0000000000000000
<0> 0000000000000000 0000000000000000 0000000000000000 0000000000000046
<0> 0000000000000000 0000000000000000 ffff880003c03a60 ffff88003e41c000
Call Trace:
<IRQ>
[<ffffffffa010980c>] sym_interrupt+0x49e/0x6d2 [sym53c8xx]
[<ffffffffa0103c2a>] sym53c8xx_intr+0x4d/0x7b [sym53c8xx]
[<ffffffff810abec8>] handle_IRQ_event+0x53/0x119
[<ffffffff810add95>] handle_fasteoi_irq+0x90/0xd0
[<ffffffff8100c437>] handle_irq+0x88/0x91
[<ffffffff8147ca74>] do_IRQ+0x5c/0xc3
[<ffffffff81477b13>] ret_from_intr+0x0/0x16
[<ffffffff814778d6>] ? _raw_spin_unlock_irqrestore+0x4c/0x56
[<ffffffff812fb08f>] ? spin_unlock_irqrestore+0xe/0x10
[<ffffffff812fbb16>] ? scsi_dispatch_cmd+0x1c1/0x234
[<ffffffff81301b81>] ? scsi_request_fn+0x476/0x4a3
[<ffffffff812152ed>] ? __blk_run_queue+0x45/0x74
[<ffffffff812153d1>] ? blk_run_queue+0x26/0x3a
[<ffffffff81300ff4>] ? scsi_run_queue+0x300/0x3ac
[<ffffffff812e8964>] ? put_device+0x17/0x19
[<ffffffff81301ddc>] ? scsi_next_command+0x3b/0x4b
[<ffffffff813027cb>] ? scsi_io_completion+0x1f7/0x448
[<ffffffff81300c43>] ? spin_unlock_irqrestore+0xe/0x10
[<ffffffff812fb834>] ? scsi_finish_command+0xf5/0xfe
[<ffffffff81302b42>] ? scsi_softirq_done+0x111/0x11a
[<ffffffff8121a69a>] ? blk_done_softirq+0x82/0x92
[<ffffffff810567e4>] ? __do_softirq+0xf8/0x1cd
[<ffffffff8100ab9c>] ? call_softirq+0x1c/0x30
[<ffffffff8100c357>] ? do_softirq+0x4b/0xa3
[<ffffffff810563cf>] ? irq_exit+0x4a/0x8c
[<ffffffff8147cac4>] ? do_IRQ+0xac/0xc3
[<ffffffff81477b13>] ? ret_from_intr+0x0/0x16
<EOI>
[<ffffffff81029289>] ? native_safe_halt+0xb/0xd
[<ffffffff8107d04f>] ? trace_hardirqs_on+0xd/0xf
[<ffffffff810115f5>] ? default_idle+0x3b/0x5d
[<ffffffff81008bfc>] ? cpu_idle+0xaf/0xe9
[<ffffffff8145ed6a>] ? rest_init+0x7e/0x80
[<ffffffff81d82e2f>] ? start_kernel+0x440/0x44b
[<ffffffff81d822bc>] ? x86_64_start_reservations+0xa7/0xab
[<ffffffff81d823b8>] ? x86_64_start_kernel+0xf8/0x107
Code: b2 d5 10 a0 48 89 da eb 65 48 8b 9f b0 01 00 00 48 81 c7 a0 01 00 00 e8 68 02 1e e1 48 c7 c7 ec d5 10 a0 48 89 c6 48 89 da eb 6f <49> 8b 84 24 58 03 00 00 48 8b 90 80 00 00 00 48 8b 38 4c 8b a2
RIP [<ffffffffa010846b>] sym_int_sir+0x646/0x1549 [sym53c8xx]
RSP <ffff880003c039b0>
CR2: 0000000000000358
---[ end trace f535af648735afc9 ]---
Kernel panic - not syncing: Fatal exception in interrupt
Pid: 0, comm: swapper Tainted: G D 2.6.33-0.23.rc5.git1.fc13.x86_64 #1
Call Trace:
<IRQ> [<ffffffff81474628>] panic+0x7a/0x142
[<ffffffff81478b03>] oops_end+0xb7/0xc7
[<ffffffff8102f869>] no_context+0x1fc/0x20b
[<ffffffff81029d1a>] ? pvclock_clocksource_read+0x47/0x83
[<ffffffff8102fa0a>] __bad_area_nosemaphore+0x192/0x1b5
[<ffffffff81029237>] ? kvm_clock_read+0x21/0x23
[<ffffffff8102fa40>] bad_area_nosemaphore+0x13/0x15
[<ffffffff8147a60b>] do_page_fault+0x16f/0x2df
[<ffffffff81477e75>] page_fault+0x25/0x30
[<ffffffffa0103c22>] ? sym53c8xx_intr+0x45/0x7b [sym53c8xx]
[<ffffffffa010846b>] ? sym_int_sir+0x646/0x1549 [sym53c8xx]
[<ffffffffa010980c>] sym_interrupt+0x49e/0x6d2 [sym53c8xx]
[<ffffffffa0103c2a>] sym53c8xx_intr+0x4d/0x7b [sym53c8xx]
[<ffffffff810abec8>] handle_IRQ_event+0x53/0x119
[<ffffffff810add95>] handle_fasteoi_irq+0x90/0xd0
[<ffffffff8100c437>] handle_irq+0x88/0x91
[<ffffffff8147ca74>] do_IRQ+0x5c/0xc3
[<ffffffff81477b13>] ret_from_intr+0x0/0x16
[<ffffffff814778d6>] ? _raw_spin_unlock_irqrestore+0x4c/0x56
[<ffffffff812fb08f>] ? spin_unlock_irqrestore+0xe/0x10
[<ffffffff812fbb16>] ? scsi_dispatch_cmd+0x1c1/0x234
[<ffffffff81301b81>] ? scsi_request_fn+0x476/0x4a3
[<ffffffff812152ed>] ? __blk_run_queue+0x45/0x74
[<ffffffff812153d1>] ? blk_run_queue+0x26/0x3a
[<ffffffff81300ff4>] ? scsi_run_queue+0x300/0x3ac
[<ffffffff812e8964>] ? put_device+0x17/0x19
[<ffffffff81301ddc>] ? scsi_next_command+0x3b/0x4b
[<ffffffff813027cb>] ? scsi_io_completion+0x1f7/0x448
[<ffffffff81300c43>] ? spin_unlock_irqrestore+0xe/0x10
[<ffffffff812fb834>] ? scsi_finish_command+0xf5/0xfe
[<ffffffff81302b42>] ? scsi_softirq_done+0x111/0x11a
[<ffffffff8121a69a>] ? blk_done_softirq+0x82/0x92
[<ffffffff810567e4>] ? __do_softirq+0xf8/0x1cd
[<ffffffff8100ab9c>] ? call_softirq+0x1c/0x30
[<ffffffff8100c357>] ? do_softirq+0x4b/0xa3
[<ffffffff810563cf>] ? irq_exit+0x4a/0x8c
[<ffffffff8147cac4>] ? do_IRQ+0xac/0xc3
[<ffffffff81477b13>] ? ret_from_intr+0x0/0x16
<EOI> [<ffffffff81029289>] ? native_safe_halt+0xb/0xd
[<ffffffff8107d04f>] ? trace_hardirqs_on+0xd/0xf
[<ffffffff810115f5>] ? default_idle+0x3b/0x5d
[<ffffffff81008bfc>] ? cpu_idle+0xaf/0xe9
[<ffffffff8145ed6a>] ? rest_init+0x7e/0x80
[<ffffffff81d82e2f>] ? start_kernel+0x440/0x44b
[<ffffffff81d822bc>] ? x86_64_start_reservations+0xa7/0xab
[<ffffffff81d823b8>] ? x86_64_start_kernel+0xf8/0x107
Expected results:
No kernel oops
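As a triage aid, the faulting instruction in the oops above can be decoded by hand from the `Code:` line, where the byte in angle brackets marks the instruction at RIP. The sketch below handles only the one x86-64 encoding that appears there (REX-prefixed `mov r64, [base + disp32]` with a SIB byte and no index register). `decode_faulting_mov` is a hypothetical helper, not a general disassembler.

```python
# Tail of the "Code:" line from the oops above; <49> marks the byte at RIP.
CODE = "48 89 da eb 6f <49> 8b 84 24 58 03 00 00 48 8b 90 80 00 00 00"

REGS = ["rax", "rcx", "rdx", "rbx", "rsp", "rbp", "rsi", "rdi",
        "r8", "r9", "r10", "r11", "r12", "r13", "r14", "r15"]


def decode_faulting_mov(code_line):
    """Return (dest, base, disp) for 'mov dest, [base + disp32]' at the marker."""
    tokens = code_line.split()
    start = next(i for i, t in enumerate(tokens) if t.startswith("<"))
    b = [int(t.strip("<>"), 16) for t in tokens[start:start + 8]]
    rex, opcode, modrm, sib = b[0], b[1], b[2], b[3]

    # REX.W prefix (0100 1xxx) followed by opcode 8B: mov r64, r/m64.
    if rex & 0xF8 != 0x48 or opcode != 0x8B:
        raise ValueError("not a REX.W mov r64, r/m64")
    # mod=10 selects a 32-bit displacement; rm=100 means a SIB byte follows.
    if modrm >> 6 != 0b10 or modrm & 7 != 0b100:
        raise ValueError("unsupported ModRM form")
    # index=100 with REX.X clear means no index register is used.
    if (sib >> 3) & 7 != 0b100 or (rex >> 1) & 1:
        raise ValueError("index register present, not handled")

    dest = ((modrm >> 3) & 7) | (((rex >> 2) & 1) << 3)  # REX.R extends reg
    base = (sib & 7) | ((rex & 1) << 3)                  # REX.B extends base
    disp = int.from_bytes(bytes(b[4:8]), "little", signed=True)
    return REGS[dest], REGS[base], disp
```

For the bytes above this yields `mov rax, [r12 + 0x358]`. The register dump shows R12 as 0, so the access lands at 0x358, matching CR2. That is consistent with the driver reading a field at offset 0x358 through a NULL structure pointer, though which structure that is cannot be told from the trace alone.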
This seems to be the same kvm scsi emulation issue reported and discussed on the LKML,
http://<email address hidden>/msg08927.html
and also Debian bug #511914
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=511914
Rgds, Ariel
Which qemu-kvm version are you using? Could you try the one from virt-preview?
qemu's SCSI support isn't the one that gets most developer attention; can you please try ide or virtio?
(In reply to comment #2)
> Which qemu-kvm version are you using? Could you try the one from virt-preview?
* qemu-kvm-0.11.0-12.fc12.x86_64
I'm running into other issues when trying to use the F-12 virt-preview repository. I'll take those to <email address hidden> for feedback.
> qemu's SCSI support isn't the one that gets most developer attention; can you
> please try ide or virtio?
I can work around this by using ide or virtio. This is certainly specific to SCSI KVM installs.
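For reference, a minimal sketch of how that workaround might look when qemu is launched directly. The image path, memory size and helper name are hypothetical, and setups driven through libvirt or another management layer would set the disk bus in that layer's own configuration instead.

```python
def qemu_disk_args(image_path, bus="virtio"):
    """Build a -drive argument that avoids the emulated SCSI controller."""
    # Only the two buses named as workarounds in this comment are allowed.
    if bus not in ("virtio", "ide"):
        raise ValueError(f"unsupported bus for this workaround: {bus}")
    return ["-drive", f"file={image_path},if={bus}"]


# Hypothetical invocation; "guest.img" and "-m 512" are placeholders.
cmd = ["qemu-kvm", "-m", "512", *qemu_disk_args("guest.img")]
```

With a virtio disk the guest needs the virtio block driver available at install and boot time; `if=ide` avoids that dependency at some cost in performance.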
(In reply to comment #3)
> (In reply to comment #2)
> > Which qemu-kvm version are you using? Could you try the one from virt-preview?
>
> * qemu-kvm-0.11.0-12.fc12.x86_64
>
> I'm running into other issues when trying to use the F-12 virt-preview
> repository. I'll take those to <email address hidden> for feedback.
I'm still not able to start a guest using the virt-preview repository. I've filed bug #566425 to address that issue.
This package has changed ownership in the Fedora Package Database. Reassigning to the new owner of this component.
This bug appears to have been reported against 'rawhide' during the Fedora 13 development cycle.
Changing version to '13'.
More information and reason for this action is here:
http://fedoraproject.org/wiki/BugZappers/HouseKeeping
The issue which blocks use of virt-preview/F-13 qemu has been resolved. Would it be possible to retest with qemu-0.12.x on the host?
This message is a reminder that Fedora 13 is nearing its end of life.
Approximately 30 (thirty) days from now Fedora will stop maintaining
and issuing updates for Fedora 13. It is Fedora's policy to close all
bug reports from releases that are no longer maintained. At that time
this bug will be closed as WONTFIX if it remains open with a Fedora
'version' of '13'.
Package Maintainer: If you wish for this bug to remain open because you
plan to fix it in a currently maintained version, simply change the 'version'
to a later Fedora version prior to Fedora 13's end of life.
Bug Reporter: Thank you for reporting this issue and we are sorry that
we may not be able to fix it before Fedora 13 is end of life. If you
would still like to see this bug fixed and are able to reproduce it
against a later version of Fedora please change the 'version' of this
bug to the applicable version. If you are unable to change the version,
please add a comment here and someone will do it for you.
Although we aim to fix as many bugs as possible during every release's
lifetime, sometimes those efforts are overtaken by events. Often a
more recent Fedora release includes newer upstream software that fixes
bugs or makes them obsolete.
The process we are following is described here:
http://fedoraproject.org/wiki/BugZappers/HouseKeeping
Fedora 13 changed to end-of-life (EOL) status on 2011-06-25. Fedora 13 is
no longer maintained, which means that it will not receive any further
security or bug fix updates. As a result we are closing this bug.
If you can reproduce this bug against a currently maintained version of
Fedora please feel free to reopen this bug against that version.
Thank you for reporting this bug and we are sorry it could not be fixed.
Closing this bug with Won't fix as this kernel / release is no longer supported.
Please feel free to open a new bug report if you're still experiencing this on a newer release (Bionic 18.04.3 / Disco 19.04)
Thanks!
|