System Config is SuperMicro Motherboard X9DR3-F, 3ware 9750SE-8i, CentOS 6.3
Getting this crash/hardlockup when intel_iommu=on is passed on the kernel. I don't get this when crash when I set intel_iommu=off and pci=nomsi.
sd 6:0:0:0: WARNING: (0x06:0x002C): Command (0x2a) timed out, resetting card. Kernel panic - not syncing: Watchdog detected hard LOCKUP on cpu 6 Pid: 0, comm: swapper Not tainted 2.6.32-279.9.1.el6.x86_64 #1 Call Trace:[ ] ? panic+0xa0/0x168 [ ] ? watchdog_overflow_callback+0xcd/0xd0 [ ] ? __perf_event_overflow+0x9d/0x230 [ ] ? perf_event_overflow+0x14/0x20 [ ] ? intel_pmu_handle_irq+0x336/0x550 [ ] ? kprobe_exceptions_notify+0x16/0x430 [ ] ? perf_event_nmi_handler+0x39/0xb0 [ ] ? notifier_call_chain+0x55/0x80 [ ] ? atomic_notifier_call_chain+0x1a/0x20 [ ] ? notify_die+0x2e/0x30 [ ] ? do_nmi+0x173/0x2b0 [ ] ? nmi+0x20/0x30 [ ] ? intel_unmap_page+0x0/0x180 [ ] ? _spin_lock_irqsave+0x2f/0x40 < > [ ] ? add_unmap+0x2b/0xe0 [ ] ? intel_unmap_page+0xca/0x180 [ ] ? igb_poll+0x4d1/0xb00 [igb] [ ] ? native_smp_send_reschedule+0x49/0x60 [ ] ? net_rx_action+0x103/0x2f0 [ ] ? __do_softirq+0xc1/0x1e0 [ ] ? handle_IRQ_event+0x60/0x170 [ ] ? call_softirq+0x1c/0x30 [ ] ? do_softirq+0x65/0xa0 [ ] ? irq_exit+0x85/0x90 [ ] ? do_IRQ+0x75/0xf0 [ ] ? ret_from_intr+0x0/0x11 [ ] ? intel_idle+0xde/0x170 [ ] ? intel_idle+0xc1/0x170 [ ] ? cpuidle_idle_call+0xa7/0x140 [ ] ? cpu_idle+0xb6/0x110 [ ] ? start_secondary+0x22a/0x26d Initializing cgroup subsys cpuset Initializing cgroup subsys cpu Linux version 2.6.32-279.9.1.el6.x86_64 (mockbuild@c6b9.bsys.dev.centos.org) (gcc version 4.4.6 20120305 (Red Hat 4.4.6-4) (GCC) ) #1 SMP Tue Sep 25 21:43:11 UTC 2012 Command line: ro root=UUID=09b1f254-2cc0-4654-9875-6c91b18bab17 rd_NO_LUKS rd_NO_LVM LANG=en_US.UTF-8 rd_NO_MD SYSFONT=latarcyrheb-sun16 KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM intel_iommu=on console=tty0 console=ttyS1,115200n8 irqpoll nr_cpus=1 reset_devices cgroup_disable=memory mce=off memmap=exactmap memmap=553K@64K memmap=135619K@49705K elfcorehdr=185324K memmap=64K$0K memmap=23K$617K memmap=128K$896K memmap=1120K#2069328K memmap=12288K$2070448K memmap=496K#2082736K memmap=612K$2083232K memmap=28K#2083844K memmap=208K$2083872K memmap=32K#2084080K memmap=164K$2084112K memmap=4684K#2084276K memmap=262144K$2097152K memmap=144K$4174960K memmap=16384K$4177920K KERNEL supported cpus: Intel GenuineIntel AMD AuthenticAMD Centaur CentaurHauls BIOS-provided physical RAM map: BIOS-e820: 0000000000000100 - 000000000009a400 (usable) BIOS-e820: 000000000009a400 - 00000000000a0000 (reserved) BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved) BIOS-e820: 0000000000100000 - 000000007e4d4000 (usable) BIOS-e820: 000000007e4d4000 - 000000007e5ec000 (ACPI NVS) BIOS-e820: 000000007e5ec000 - 000000007f1ec000 (reserved) BIOS-e820: 000000007f1ec000 - 000000007f268000 (ACPI data) BIOS-e820: 000000007f268000 - 000000007f301000 (reserved) BIOS-e820: 000000007f301000 - 000000007f308000 (ACPI NVS) BIOS-e820: 000000007f308000 - 000000007f33c000 (reserved) BIOS-e820: 000000007f33c000 - 000000007f344000 (ACPI NVS) BIOS-e820: 000000007f344000 - 000000007f36d000 (reserved) BIOS-e820: 000000007f36d000 - 000000007f800000 (ACPI NVS) BIOS-e820: 0000000080000000 - 0000000090000000 (reserved) BIOS-e820: 00000000fed1c000 - 00000000fed40000 (reserved) BIOS-e820: 00000000ff000000 - 0000000100000000 (reserved) BIOS-e820: 0000000100000000 - 0000001080000000 (usable) last_pfn = 0x1080000 max_arch_pfn = 0x400000000 user-defined physical RAM map: user: 0000000000000000 - 0000000000010000 (reserved) user: 0000000000010000 - 000000000009a400 (usable) user: 000000000009a400 - 00000000000a0000 (reserved) user: 00000000000e0000 - 0000000000100000 (reserved) user: 000000000308a400 - 000000000b4fb000 (usable) user: 000000007e4d4000 - 000000007e5ec000 (ACPI data) user: 000000007e5ec000 - 000000007f1ec000 (reserved) user: 000000007f1ec000 - 000000007f268000 (ACPI data) user: 000000007f268000 - 000000007f301000 (reserved) user: 000000007f301000 - 000000007f308000 (ACPI data) user: 000000007f308000 - 000000007f33c000 (reserved) user: 000000007f33c000 - 000000007f344000 (ACPI data) user: 000000007f344000 - 000000007f36d000 (reserved) user: 000000007f36d000 - 000000007f800000 (ACPI data) user: 0000000080000000 - 0000000090000000 (reserved) user: 00000000fed1c000 - 00000000fed40000 (reserved) user: 00000000ff000000 - 0000000100000000 (reserved) DMI 2.7 present. SMBIOS version 2.7 @ 0xF0480 AMI BIOS detected: BIOS may corrupt low RAM, working around it. last_pfn = 0xb4fb max_arch_pfn = 0x400000000 x86 PAT enabled: cpu 0, old 0x7010600070106, new 0x7010600070106 x2apic enabled by BIOS, switching to x2apic ops Using GB pages for direct mapping init_memory_mapping: 0000000000000000-000000000b4fb000 RAMDISK: 0b10e000 - 0b4ee36e ACPI: RSDP 00000000000f0450 00024 (v02 ALASKA) ACPI: XSDT 000000007f1ec090 0009C (v01 ALASKA A M I 01072009 AMI 00010013) ACPI: FACP 000000007f1f6940 000F4 (v04 ALASKA A M I 01072009 AMI 00010013) ACPI: DSDT 000000007f1ec1b8 0A788 (v02 ALASKA A M I 00000015 INTL 20051117) ACPI: FACS 000000007f341f80 00040 ACPI: APIC 000000007f1f6a38 001B4 (v03 ALASKA A M I 01072009 AMI ACPI: SRAT 000000007f1f6bf0 00430 (v01 A M I AMI SRAT 00000001 AMI. 00000000) ACPI: SLIT 000000007f1f7020 00030 (v01 A M I AMI SLIT 00000000 AMI. 00000000) ACPI: HPET 000000007f1f7050 00038 (v01 ALASKA A M I 01072009 AMI. 00000005) ACPI: SPMI 000000007f1f7088 00040 (v05 A M I OEMSPMI 00000000 AMI. 00000000) ACPI: SSDT 000000007f1f70c8 70104 (v02 INTEL CpuPm 00004000 INTL 20051117) ACPI: SPCR 000000007f2671d0 00050 (v01 A M I APTIO4 01072009 AMI. 00000005) ACPI: BGRT 000000007f267220 00038 (v00 ALASKA A M I 01072009 AMI 00010013) ACPI: MCFG 000000007f267258 0003C (v01 ALASKA A M I 01072009 MSFT 00000097) ACPI: DMAR 000000007f267298 00170 (v01 A M I OEMDMAR 00000001 INTL 00000001) ACPI: EINJ 000000007f267408 00130 (v01 AMI AMI EINJ 00000000 00000000) ACPI: ERST 000000007f267538 00210 (v01 AMIER AMI ERST 00000000 00000000) ACPI: HEST 000000007f267748 000A8 (v01 AMI AMI HEST 00000000 00000000) ACPI: BERT 000000007f2677f0 00030 (v01 AMI AMI BERT 00000000 00000000) Setting APIC routing to cluster x2apic. SRAT: PXM 0 -> APIC 0 -> Node 0 SRAT: PXM 0 -> APIC 1 -> Node 0 SRAT: PXM 0 -> APIC 2 -> Node 0 SRAT: PXM 0 -> APIC 3 -> Node 0 SRAT: PXM 0 -> APIC 4 -> Node 0 SRAT: PXM 0 -> APIC 5 -> Node 0 SRAT: PXM 0 -> APIC 6 -> Node 0 SRAT: PXM 0 -> APIC 7 -> Node 0 SRAT: PXM 0 -> APIC 8 -> Node 0 SRAT: PXM 0 -> APIC 9 -> Node 0 SRAT: PXM 0 -> APIC 10 -> Node 0 SRAT: PXM 0 -> APIC 11 -> Node 0 SRAT: PXM 1 -> APIC 32 -> Node 1 SRAT: PXM 1 -> APIC 33 -> Node 1 SRAT: PXM 1 -> APIC 34 -> Node 1 SRAT: PXM 1 -> APIC 35 -> Node 1 SRAT: PXM 1 -> APIC 36 -> Node 1 SRAT: PXM 1 -> APIC 37 -> Node 1 SRAT: PXM 1 -> APIC 38 -> Node 1 SRAT: PXM 1 -> APIC 39 -> Node 1 SRAT: PXM 1 -> APIC 40 -> Node 1 SRAT: PXM 1 -> APIC 41 -> Node 1 SRAT: PXM 1 -> APIC 42 -> Node 1 SRAT: PXM 1 -> APIC 43 -> Node 1 SRAT: Node 0 PXM 0 0-80000000 SRAT: Node 0 PXM 0 100000000-880000000 SRAT: Node 1 PXM 1 880000000-1080000000 Bootmem setup node 0 0000000000000000-000000000b4fb000 NODE_DATA [0000000000011040 - 000000000004503f] bootmap [0000000000046000 - 000000000004769f] pages 2 (8 early reservations) ==> bootmem [0000000000 - 000b4fb000] #0 [0000000000 - 0000001000] BIOS data page ==> [0000000000 - 0000001000] #1 [0000006000 - 0000008000] TRAMPOLINE ==> [0000006000 - 0000008000] #2 [0004000000 - 0005012024] TEXT DATA BSS ==> [0004000000 - 0005012024] #3 [000b10e000 - 000b4ee36e] RAMDISK ==> [000b10e000 - 000b4ee36e] #4 [000009a400 - 0000100000] BIOS reserved ==> [000009a400 - 0000100000] #5 [0005013000 - 0005013368] BRK ==> [0005013000 - 0005013368] #6 [0000010000 - 0000011000] PGTABLE ==> [0000010000 - 0000011000] #7 [0000011000 - 0000011030] ACPI SLIT ==> [0000011000 - 0000011030] found SMP MP-table at [ffff8800000fcee0] fcee0 Zone PFN ranges: DMA 0x00000010 -> 0x00001000 DMA32 0x00001000 -> 0x00100000 Normal 0x00100000 -> 0x00100000 Movable zone start PFN for each node early_node_map[2] active PFN ranges 0: 0x00000010 -> 0x0000009a 0: 0x0000308b -> 0x0000b4fb ACPI: PM-Timer IO Port: 0x408 Setting APIC routing to cluster x2apic. ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 almost reached. Keeping one slot for boot cpu. Processor 0/0x0 ignored. ACPI: LAPIC (acpi_id[0x02] lapic_id[0x02] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 almost reached. Keeping one slot for boot cpu. Processor 1/0x2 ignored. ACPI: LAPIC (acpi_id[0x04] lapic_id[0x04] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 almost reached. Keeping one slot for boot cpu. Processor 2/0x4 ignored. ACPI: LAPIC (acpi_id[0x06] lapic_id[0x06] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 almost reached. Keeping one slot for boot cpu. Processor 3/0x6 ignored. ACPI: LAPIC (acpi_id[0x08] lapic_id[0x08] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 almost reached. Keeping one slot for boot cpu. Processor 4/0x8 ignored. ACPI: LAPIC (acpi_id[0x0a] lapic_id[0x0a] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 almost reached. Keeping one slot for boot cpu. Processor 5/0xa ignored. ACPI: LAPIC (acpi_id[0x0c] lapic_id[0x20] enabled) ACPI: LAPIC (acpi_id[0x0e] lapic_id[0x22] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 7/0x22 ignored. ACPI: LAPIC (acpi_id[0x10] lapic_id[0x24] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 8/0x24 ignored. ACPI: LAPIC (acpi_id[0x12] lapic_id[0x26] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 9/0x26 ignored. ACPI: LAPIC (acpi_id[0x14] lapic_id[0x28] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 10/0x28 ignored. ACPI: LAPIC (acpi_id[0x16] lapic_id[0x2a] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 11/0x2a ignored. ACPI: LAPIC (acpi_id[0x01] lapic_id[0x01] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 12/0x1 ignored. ACPI: LAPIC (acpi_id[0x03] lapic_id[0x03] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 13/0x3 ignored. ACPI: LAPIC (acpi_id[0x05] lapic_id[0x05] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 14/0x5 ignored. ACPI: LAPIC (acpi_id[0x07] lapic_id[0x07] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 15/0x7 ignored. ACPI: LAPIC (acpi_id[0x09] lapic_id[0x09] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 16/0x9 ignored. ACPI: LAPIC (acpi_id[0x0b] lapic_id[0x0b] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 17/0xb ignorePUS/possible_cpus limit of 1 reached. Processor 18/0x21 ignored. ACPI: LAPIC (acpi_id[0x0f] lapic_id[0x23] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 19/0x23 ignored. ACPI: LAPIC (acpi_id[0x11] lapic_id[0x25] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 20/0x25 ignored. ACPI: LAPIC (acpi_id[0x13] lapic_id[0x27] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 21/0x27 ignored. ACPI: LAPIC (acpi_id[0x15] lapic_id[0x29] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 22/0x29 ignored. ACPI: LAPIC (acpi_id[0x17] lapic_id[0x2b] enabled) ACPI: NR_CPUS/possible_cpus limit of 1 reached. Processor 23/0x2b ignored. ACPI: LAPIC_NMI (acpi_id[0x00] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x02] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x04] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x06] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x08] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x0a] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x0c] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x0e] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x10] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x12] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x14] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x16] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x01] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x03] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x05] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x07] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x09] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x0b] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x0d] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x0f] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x11] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x13] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x15] high edge lint[0x1]) ACPI: LAPIC_NMI (acpi_id[0x17] high edge lint[0x1]) ACPI: IOAPIC (id[0x00] address[0xfec00000] gsi_base[0]) IOAPIC[0]: apic_id 0, version 32, address 0xfec00000, GSI 0-23 ACPI: IOAPIC (id[0x02] address[0xfec01000] gsi_base[24]) IOAPIC[1]: apic_id 2, version 32, address 0xfec01000, GSI 24-47 ACPI: IOAPIC (id[0x03] address[0xfec40000] gsi_base[48]) IOAPIC[2]: apic_id 3, version 32, address 0xfec40000, GSI 48-71 ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl) ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level) Using ACPI (MADT) for SMP configuration information ACPI: HPET id: 0x8086a701 base: 0xfed00000 24 Processors exceeds NR_CPUS limit of 1 SMP: Allowing 1 CPUs, 0 hotplug CPUs PM: Registered nosave memory: 000000000009a000 - 000000000009b000 PM: Registered nosave memory: 000000000009b000 - 00000000000a0000 PM: Registered nosave memory: 00000000000a0000 - 00000000000e0000 PM: Registered nosave memory: 00000000000e0000 - 0000000000100000 PM: Registered nosave memory: 0000000000100000 - 000000000308b000 Allocating PCI resources starting at b4fb000 (gap: b4fb000:72fd9000) Booting paravirtualized kernel on bare hardware NR_CPUS:4096 nr_cpumask_bits:1 nr_cpu_ids:1 nr_node_ids:2 PERCPU: Embedded 31 pages/cpu @ffff880003200000 s94424 r8192 d24360 u2097152 pcpu-alloc: s94424 r8192 d24360 u2097152 alloc=1*2097152 pcpu-alloc: [0] 0 Built 1 zonelists in Node order, mobility grouping on. Total pages: 33408 Policy zone: DMA32 Kernel command line: ro root=UUID=09b1f254-2cc0-4654-9875-6c91b18bab17 rd_NO_LUKS rd_NO_LVM LANG=en_US.UTF-8 rd_NO_MD SYSFONT=latarcyrheb-sun16 KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM intel_iommu=on console=tty0 console=ttyS1,115200n8 irqpoll nr_cpus=1 reset_devices cgroup_disable=memory mce=off memmap=exactmap memmap=553K@64K memmap=135619K@49705K elfcorehdr=185324K memmap=64K$0K memmap=23K$617K memmap=128K$896K memmap=1120K#2069328K memmap=12288K$2070448K memmap=496K#2082736K memmap=612K$2083232K memmap=28K#2083844K memmap=208K$2083872K memmap=32K#2084080K memmap=164K$2084112K memmap=4684K#2084276K memmap=262144K$2097152K memmap=144K$4174960K memmap=16384K$4177920K Intel-IOMMU: enabled Misrouted IRQ fixup and polling support enabled This may significantly impact system performance Disabling memory control group subsystem PID hash table entries: 1024 (order: 1, 8192 bytes) xsave/xrstor: enabled xstate_bv 0x7, cntxt size 0x340 Checking aperture... No AGP bridge found Queued invalidation will be enabled to support x2apic and Intr-remapping. Memory: 111036k/185324k available (5153k kernel code, 49156k absent, 25132k reserved, 7165k data, 1260k init) Hierarchical RCU implementation. NR_IRQS:33024 nr_irqs:256 Extended CMOS year: 2000 Spurious LAPIC timer interrupt on cpu 0 do_IRQ: 0.230 No irq handler for vector (irq -1) do_IRQ: 0.197 No irq handler for vector (irq -1) Console: colour VGA+ 80x25 console [tty0] enabled console [ttyS1] enabled Fast TSC calibration using PIT Detected 2000.108 MHz processor. Calibrating delay loop (skipped), value calculated using timer frequency.. 4000.21 BogoMIPS (lpj=2000108) pid_max: default: 32768 minimum: 301 Security Framework initialized SELinux: Initializing. Dentry cache hash table entries: 32768 (order: 6, 262144 bytes) Inode-cache hash table entries: 16384 (order: 5, 131072 bytes) Mount-cache hash table entries: 256 Initializing cgroup subsys ns Initializing cgroup subsys cpuacct Initializing cgroup subsys memory Initializing cgroup subsys devices Initializing cgroup subsys freezer Initializing cgroup subsys net_cls Initializing cgroup subsys blkio Initializing cgroup subsys perf_event Initializing cgroup subsys net_prio CPU: Physical Processor ID: 1 CPU: Processor Core ID: 0 using mwait in idle threads. SMP alternatives: switching to UP code Freeing SMP alternatives: 34k freed ACPI: Core revision 20090903 ftrace: converting mcount calls to 0f 1f 44 00 00 ftrace: allocating 21019 entries in 83 pages DMAR: Host address width 46 DMAR: DRHD base: 0x000000fbffe000 flags: 0x0 IOMMU fbffe000: ver 1:0 cap d2078c106f0462 ecap f020fe DMAR: DRHD base: 0x000000dfffc000 flags: 0x1 IOMMU dfffc000: ver 1:0 cap d2078c106f0462 ecap f020fe DMAR: RMRR base: 0x0000007f285000 end: 0x0000007f2a1fff DMAR: ATSR flags: 0x0 DMAR: RHSA base: 0x000000fbffe000 proximity domain: 0x1 DMAR: RHSA base: 0x000000dfffc000 proximity domain: 0x0 IOAPIC id 3 under DRHD base 0xfbffe000 IOAPIC id 0 under DRHD base 0xdfffc000 IOAPIC id 2 under DRHD base 0xdfffc000 HPET id 0 under DRHD base 0xdfffc000 Your BIOS has requested that x2apic be disabled. This will leave your machine vulnerable to irq-injection attacks. Use 'intremap=no_x2apic_optout' to override BIOS request.
2 comments:
Buy a mac. :-)
Unfortunately, Apple doesn't build servers anymore. At least not rack mountable ones. They'd be too expensive and pretty to put in a rack.
Post a Comment