HPE Superdome Flex systems may log errors related to the ioatdma driver

This document (7024063) is provided subject to the disclaimer at the end of this document.

Environment

SUSE Linux Enterprise Server 12 Service Pack 2 (SLES 12 SP2)
SUSE Linux Enterprise Server 12 Service Pack 3 (SLES 12 SP3)

@namespace url(http://www.w3.org/1999/xhtml); @-moz-document domain("bugzilla.suse.com") { .bz_comment { margin: 0 0 20px; width: 100% !important; } .bz_comment_text { width: 100% !important; } }SUSE Linux Enterprise Server 12 Service Pack 4 (SLES 12 SP4)

Situation

During boot of a HPE Superdome flex system the following errors may be observed.

[  130.150189] ioatdma 0006:00:04.2: failed to start channel chanerr: 0x2000
[  130.150215] Modules linked in: lpc_ich ipmi_devintf(+) ioatdma(+)
i2c_i801(+) mfd_core shpchp dca wmi ipmi_msghandler fjes processor(+) button
xfs libcrc32c mgag200 i2c_algo_bit uas drm_kms_helper hid_generic raid1
syscopyarea usb_storage usbhid md_mod sr_mod sysfillrect xhci_pci cdrom
sysimgblt sd_mod fb_sys_fops xhci_hcd ahci ttm crc32c_intel libahci usbcore
drm libata usb_common dm_mirror dm_region_hash dm_log hwperf(OEX) sg
dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua scsi_mod efivarfs
autofs4
[  130.150216] Supported: Yes, External
[  130.150218] CPU: 132 PID: 1576 Comm: kworker/132:1 Tainted: G        W  OE    
X 4.4.175-94.79.1.17192.0.PTF.1128565-default #1
[  130.150219] Hardware name: HPE Superdome Flex/Superdome Flex, BIOS Bundle:
2.5.290 SFW:IP147.006.002.074.000.1810231610 10/23/2018
[  130.150223] Workqueue: events work_for_cpu_fn
[  130.150226]  0000000000000000 ffffffff8132dc30 ffff8c80f8ee3c70
ffffffffa1ff84d8
[  130.150229]  ffffffff810829a1 ffff8c80f8315818 ffff8c80f8ee3cc0
0000000000002000
[  130.150231]  ffff8c80f8315880 ffff8c80f8315980 ffffffff81082a1c
ffffffffa1ff8a40
[  130.150232] Call Trace:
[  130.150242]  [<ffffffff81019b09>] dump_trace+0x59/0x340
[  130.150248]  [<ffffffff81019eda>] show_stack_log_lvl+0xea/0x170
[  130.150252]  [<ffffffff8101acb1>] show_stack+0x21/0x40
[  130.150257]  [<ffffffff8132dc30>] dump_stack+0x5c/0x7c
[  130.150262]  [<ffffffff810829a1>] warn_slowpath_common+0x81/0xb0
[  130.150268]  [<ffffffff81082a1c>] warn_slowpath_fmt+0x4c/0x50
[  130.150275]  [<ffffffffa1ff175f>] ioat_alloc_chan_resources+0x1df/0x210
[ioatdma]
[  130.150284]  [<ffffffffa1ff1823>] ioat_dma_self_test+0x93/0x4f0 [ioatdma]
[  130.150292]  [<ffffffffa1ff3343>] ioat_pci_probe+0x773/0x910 [ioatdma]
[  130.150299]  [<ffffffff813789f4>] local_pci_probe+0x44/0xa0
[  130.150303]  [<ffffffff81098046>] work_for_cpu_fn+0x16/0x20
[  130.150307]  [<ffffffff8109af74>] process_one_work+0x154/0x420
[  130.150312]  [<ffffffff8109bc99>] worker_thread+0x249/0x4a0
[  130.150318]  [<ffffffff810a1089>] kthread+0xc9/0xe0
[  130.150323]  [<ffffffff816210b5>] ret_from_fork+0x55/0x80
[  130.155088] DWARF2 unwinder stuck at ret_from_fork+0x55/0x80
[  130.155088]
[  130.155089] Leftover inexact backtrace:
[  130.155089]
[  130.155094]  [<ffffffff810a0fc0>] ? kthread_park+0x50/0x50
[  130.155109] ---[ end trace 8684e418ca67248f ]---
[  130.155144] ioatdma 0006:00:04.2: Freeing 1 in use descriptors!
[  130.155146] ioatdma 0006:00:04.2: selftest cannot allocate chan resource
[  130.155156] ioatdma 0006:00:04.2: Intel(R) I/OAT DMA Engine init failed
[  130.155385] ioatdma 0006:00:04.3: enabling device (0000 -> 0002)
[  130.155633] ioatdma 0006:00:04.3: can't derive routing for PCI INT D
[  130.155635] ioatdma 0006:00:04.3: PCI INT D: no GSI

Resolution

According to HPE the kernel modules skx_edac and ioatdma should be blacklisted as they are known to cause problems on this kind of systems.


Cause

Normally installing the HPE Foundation software will blacklist those drivers. If they haven't been blacklisted yet please consider installing the software again.

Additional Information


Disclaimer

This Support Knowledgebase provides a valuable tool for SUSE customers and parties interested in our products and solutions to acquire information, ideas and learn from one another. Materials are provided for informational, personal or non-commercial use within your organization and are presented "AS IS" WITHOUT WARRANTY OF ANY KIND.

  • Document ID:7024063
  • Creation Date: 14-Aug-2019
  • Modified Date:03-Mar-2020
    • SUSE Linux Enterprise Server

< Back to Support Search

For questions or concerns with the SUSE Knowledgebase please contact: tidfeedback[at]suse.com

SUSE Support Forums

Get your questions answered by experienced Sys Ops or interact with other SUSE community experts.

Join Our Community

Support Resources

Learn how to get the most from the technical support you receive with your SUSE Subscription, Premium Support, Academic Program, or Partner Program.


SUSE Customer Support Quick Reference Guide SUSE Technical Support Handbook Update Advisories
Support FAQ

Open an Incident

Open an incident with SUSE Technical Support, manage your subscriptions, download patches, or manage user access.

Go to Customer Center