1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
|
graphic: 0.861
semantic: 0.824
KVM: 0.804
device: 0.800
assembly: 0.751
other: 0.735
vnc: 0.733
instruction: 0.732
boot: 0.673
mistranslation: 0.633
socket: 0.539
network: 0.491
guest disk accesses lead to ATA errors + host vcpu0 unhandled wrmsr/rdmsr
Hi.
This is from http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=717724.
Using Debian sid with 1.5.0-5 my Linux VMs (also Debian sid) are broken. When they boot I get gazillions of ATA errors inside the guest, as well as:
[ 242.479951] kvm [7790]: vcpu0 unhandled rdmsr: 0x345
[ 242.483683] kvm [7790]: vcpu0 unhandled wrmsr: 0x680 data 0
[ 242.483687] kvm [7790]: vcpu0 unhandled wrmsr: 0x6c0 data 0
[ 242.483689] kvm [7790]: vcpu0 unhandled wrmsr: 0x681 data 0
[ 242.483691] kvm [7790]: vcpu0 unhandled wrmsr: 0x6c1 data 0
[ 242.483693] kvm [7790]: vcpu0 unhandled wrmsr: 0x682 data 0
[ 242.483696] kvm [7790]: vcpu0 unhandled wrmsr: 0x6c2 data 0
[ 242.483698] kvm [7790]: vcpu0 unhandled wrmsr: 0x683 data 0
[ 242.483700] kvm [7790]: vcpu0 unhandled wrmsr: 0x6c3 data 0
[ 242.483702] kvm [7790]: vcpu0 unhandled wrmsr: 0x684 data 0
[ 242.483704] kvm [7790]: vcpu0 unhandled wrmsr: 0x6c4 data 0
[ 242.988307] kvm [7790]: vcpu0 unhandled rdmsr: 0xe8
[ 242.988312] kvm [7790]: vcpu0 unhandled rdmsr: 0xe7
[ 242.988314] kvm [7790]: vcpu0 unhandled rdmsr: 0xce
[ 242.988316] kvm [7790]: vcpu0 unhandled rdmsr: 0xce
[ 242.988318] kvm [7790]: vcpu0 unhandled rdmsr: 0x1ad
[ 242.988320] kvm [7790]: vcpu0 unhandled rdmsr: 0xce
[ 242.988322] kvm [7790]: vcpu0 unhandled rdmsr: 0xe8
[ 242.988324] kvm [7790]: vcpu0 unhandled rdmsr: 0xe7
[ 242.988598] kvm [7790]: vcpu0 unhandled rdmsr: 0xce
in the host.
Please have a look at the Debian bug, for screenshots and more info.
The problem didn't occur in 1.5.0-4 and there were basically no changes inside the VM (i.e. no kernel upgrade or so).
Thanks,
Chris.
For the record, the difference between debian qemu 1.5.0-4 and 1.5.0-5 is the 1.5.1 patch - I applied it without actually changing the "upstream" version number.
Hi.
Finally I was able to get it running with IDE instead of SATA.
That works,... fine... no ATA errors.
As you already expected,.. I still get the:
[20424.793207] kvm [20947]: vcpu0 unhandled wrmsr: 0x6c1 data 0
[20424.793211] kvm [20947]: vcpu0 unhandled wrmsr: 0x682 data 0
[20424.793215] kvm [20947]: vcpu0 unhandled wrmsr: 0x6c2 data 0
[20424.793218] kvm [20947]: vcpu0 unhandled wrmsr: 0x683 data 0
[20424.793222] kvm [20947]: vcpu0 unhandled wrmsr: 0x6c3 data 0
[20424.793226] kvm [20947]: vcpu0 unhandled wrmsr: 0x684 data 0
[20424.793229] kvm [20947]: vcpu0 unhandled wrmsr: 0x6c4 data 0
[20425.946617] kvm [20947]: vcpu0 unhandled rdmsr: 0xe8
[20425.946623] kvm [20947]: vcpu0 unhandled rdmsr: 0xe7
[20425.946625] kvm [20947]: vcpu0 unhandled rdmsr: 0xce
[20425.946628] kvm [20947]: vcpu0 unhandled rdmsr: 0xce
So that seems to be unrelated... nevertheless.. I haven't seen these before with 1.5.0.
Anyway...I will try to get it running with virtio as well, as you suggested... but I have troubles to convince libvirt of using it.
btw: Might be unrealted... but with 1.5.1 and IDE now... when I do an update-initramfs -u inside the guest I gues these:
[ 276.130974] traps: ld-linux-x32.so[5642] general protection ip:f7788e8d sp:ffe12528 error:0 in ld-2.17.so[f7772000+21000]
[ 276.141129] traps: ld-linux-x32.so[5656] general protection ip:f77b8e8d sp:ff978458 error:0 in ld-2.17.so[f77a2000+21000]
[ 276.151444] traps: ld-linux-x32.so[5670] general protection ip:f7721e8d sp:ffd44cc8 error:0 in ld-2.17.so[f770b000+21000]
[ 276.207037] traps: ld-linux-x32.so[5714] general protection ip:f7770e8d sp:ff92cce8 error:0 in ld-2.17.so[f775a000+21000]
[ 276.278651] traps: ld-linux-x32.so[5775] general protection ip:f76fbe8d sp:ffe73048 error:0 in ld-2.17.so[f76e5000+21000]
[ 276.319277] traps: ld-linux-x32.so[5805] general protection ip:f76fee8d sp:fff486f8 error:0 in ld-2.17.so[f76e8000+21000]
[ 276.329552] traps: ld-linux-x32.so[5819] general protection ip:f77a8e8d sp:ffc9b448 error:0 in ld-2.17.so[f7792000+21000]
[ 276.338671] traps: ld-linux-x32.so[5833] general protection ip:f77bfe8d sp:ffe597b8 error:0 in ld-2.17.so[f77a9000+21000]
[ 276.347835] traps: ld-linux-x32.so[5847] general protection ip:f77b9e8d sp:ffcb1f08 error:0 in ld-2.17.so[f77a3000+21000]
[ 276.356548] traps: ld-linux-x32.so[5861] general protection ip:f77d1e8d sp:ffee16d8 error:0 in ld-2.17.so[f77bb000+21000]
[ 313.662052] do_general_protection: 24 callbacks suppressed
[ 313.662371] traps: ld-linux-x32.so[8926] general protection ip:f77e9e8d sp:ffa443a8 error:0 in ld-2.17.so[f77d3000+21000]
[ 313.670615] traps: ld-linux-x32.so[8940] general protection ip:f7796e8d sp:ff879cf8 error:0 in ld-2.17.so[f7780000+21000]
[ 313.677952] traps: ld-linux-x32.so[8954] general protection ip:f7791e8d sp:ffa25948 error:0 in ld-2.17.so[f777b000+21000]
[ 313.708061] traps: ld-linux-x32.so[8998] general protection ip:f7724e8d sp:ffe6b858 error:0 in ld-2.17.so[f770e000+21000]
[ 313.750866] traps: ld-linux-x32.so[9059] general protection ip:f77c7e8d sp:ff8a32a8 error:0 in ld-2.17.so[f77b1000+21000]
[ 313.771839] traps: ld-linux-x32.so[9089] general protection ip:f77b0e8d sp:ffdfa038 error:0 in ld-2.17.so[f779a000+21000]
[ 313.780747] traps: ld-linux-x32.so[9103] general protection ip:f777ae8d sp:ffc699a8 error:0 in ld-2.17.so[f7764000+21000]
[ 313.789250] traps: ld-linux-x32.so[9117] general protection ip:f778de8d sp:ffce6358 error:0 in ld-2.17.so[f7777000+21000]
[ 313.797782] traps: ld-linux-x32.so[9131] general protection ip:f7794e8d sp:ffcc8a38 error:0 in ld-2.17.so[f777e000+21000]
[ 313.806169] traps: ld-linux-x32.so[9145] general protection ip:f7703e8d sp:ff9e61a8 error:0 in ld-2.17.so[f76ed000+21000]
Seems also kinda weird.... never seen that before in that context.
Cheers,
Chris.
Also, when using SATA for the virtual disk,... it seems that it takes much longer till it get's to the point where it (the initrd) tries to mount root.
Till that point I see no errors... but then, after some time hanging at the root mount... it starts with the ATA errors.
So, what changed so you were able to boot from IDE without errors? As far as I understand, your initial prob was I/O errors on IDE (virtual) drive, but now you can boot it without errors. Or was it always SATA not IDE?
The rdmsr/wrmsr are unrelated, I already told you that, it is accessing optional CPU features which has nothing to do with disk access.
The segfaults during mkinitrd are unrelated too _unless_ you're running it from "failing" virtual drive. If you run it from failing virtual drive, it is remotely possible to have errors due to some file not read properly due to errors, but even this is a very remote possibility.
The time it takes to start mounting root is most likely also unrelated, it depends on the guest.
It _looks_ like the whole prob is due to FLUSH CASHE command not working correctly with SATA and IDE, but you confused me here - I don't understand where you were and are getting errors and where you don't.
Besides, you never answered to this: do you have enough free space in the host filesystem where your disk image(s) are located?
Sorry for the confusion.
The initial problem was _always_ with SATA... I couldn't try IDE at first, as the guest initrd haven't had and the IDE drivers.
Yesterday then, I tried IDE and it worked with that.
Sure, I just wanted to confirm that you were really right ;-)
Sorry I have missed the question about enough space before.. there is plenty of free host space left (133 GB).
Having tried now with virtio as well: works
So:
virtio, ide = OK
sata = broken
The fix is in a62eaa26c1d6d48fbdc3ac1d32bd1314f5fdc8c9 "ahci: Fix FLUSH command", should be in 1.6 and 1.5.3.
From a first test of the Debian package in experimental, I'd say that the issue has been fixed :)
Thanks a lot for all your help!
|