[next-20211019][PPC] kernel panics with lspci -vvnn command

Abdul Haleem abdhalee at linux.vnet.ibm.com
Thu Oct 21 17:44:39 AEDT 2021


Greeting's

Today's next kernel panics when lspci -vvnn commands is executed on my 
powerpc machine

# lspci -vvnn
0012:01:00.0 Fibre Channel [0c04]: QLogic Corp. ISP2722-based 16/32Gb 
Fibre Channel to PCIe Adapter [1077:2261] (rev 01)
     Subsystem: IBM Device [1014:0650]
     Physical Slot: U78D8.ND0.FGD004S-P0-C2-C0
     Device tree node: 
/sys/firmware/devicetree/base/pci at 800000020000012/fibre-channel at 0
     Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ 
Stepping- SERR+ FastB2B- DisINTx-
     Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- INTx-
     Latency: 0, Cache Line Size: 128 bytes
     Interrupt: pin A routed to IRQ 48
     NUMA node: 2
     IOMMU group: 0
     Region 0: Memory at 40000285000 (64-bit, prefetchable) [size=4K]
     Region 2: Memory at 40000282000 (64-bit, prefetchable) [size=8K]
     Region 4: Memory at 40000100000 (64-bit, prefetchable) [size=1M]
     Expansion ROM at 40000240000 [disabled] [size=256K]
     Capabilities: [44] Power Management version 3
         Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA 
PME(D0-,D1-,D2-,D3hot-,D3cold-)
         Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=0 PME-
     Capabilities: [4c] Express (v2) Endpoint, MSI 00
         DevCap:    MaxPayload 2048 bytes, PhantFunc 0, Latency L0s 
<4us, L1 <1us
             ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset+ 
SlotPowerLimit 0.000W
         DevCtl:    CorrErr+ NonFatalErr+ FatalErr+ UnsupReq+
             RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ FLReset-
             MaxPayload 512 bytes, MaxReadReq 4096 bytes
         DevSta:    CorrErr- NonFatalErr- FatalErr- UnsupReq- AuxPwr- 
TransPend-
         LnkCap:    Port #0, Speed 8GT/s, Width x8, ASPM L0s L1, Exit 
Latency L0s <2us, L1 <2us
             ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
         LnkCtl:    ASPM Disabled; RCB 64 bytes, Disabled- CommClk-
             ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
         LnkSta:    Speed 8GT/s (ok), Width x8 (ok)
             TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
         DevCap2: Completion Timeout: Range B, TimeoutDis+ NROPrPrP- LTR-
              10BitTagComp- 10BitTagReq- OBFF Not Supported, ExtFmt- 
EETLPPrefix-
              EmergencyPowerReduction Not Supported, 
EmergencyPowerReductionInit-
              FRS- TPHComp- ExtTPHComp-
              AtomicOpsCap: 32bit- 64bit- 128bitCAS-
         DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis+ LTR- 
OBFF Disabled,
              AtomicOpsCtl: ReqEn-
         LnkCap2: Supported Link Speeds: 2.5-8GT/s, Crosslink- Retimer- 
2Retimers- DRS-
         LnkCtl2: Target Link Speed: 8GT/s, EnterCompliance- SpeedDis-
              Transmit Margin: Normal Operating Range, 
EnterModifiedCompliance- ComplianceSOS-
              Compliance De-emphasis: -6dB
         LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete+ 
EqualizationPhase1+
              EqualizationPhase2+ EqualizationPhase3+ 
LinkEqualizationRequest-
              Retimer- 2Retimers- CrosslinkRes: unsupported
     Capabilities: [88] Vital Product Data
BUG: Kernel NULL pointer dereference on read at 0x000080a0
BUG: Unable to handle kernel data access on read at 0x3949ffff40920078
BUG: Unable to handle kernel data access on read at 0x694a0002e94d00f0
Faulting instruction address: 0xc0000000006f4498
Faulting instruction address: 0xc0000000001d3680
Oops: Kernel access of bad area, sig: 11 [#1]
LE PAGE_SIZE=64K MMU=Radix SMP NR_CPUS=2048 NUMA pSeries
Faulting instruction address: 0xc0000000001abcf0
Modules linked in:
Thread overran stack, or stack corrupted
  rpadlpar_io rpaphp nfnetlink tcp_diag udp_diag inet_diag unix_diag 
af_packet_diag netlink_diag bonding rfkill sunrpc raid456 async_raid6_recov
async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c 
pseries_rng xts vmx_crypto gf128mul binfmt_misc sch_fq_codel ip_tables ext4
mbcache jbd2 dm_service_time sd_mod sg qla2xxx ibmvfc ibmveth nvme_fc 
nvme_fabrics nvme_core t10_pi scsi_transport_fc dm_multipath dm_mirror
dm_region_hash dm_log dm_mod fuse
CPU: 24 PID: 0 Comm: swapper/24 Kdump: loaded Not tainted 
5.15.0-rc5-next-20211012-autotest #1
NIP:  c0000000006f4498 LR: c0000000006f9c18 CTR: c000000000026e60
REGS: c000000006797560 TRAP: 0380   Not tainted 
(5.15.0-rc5-next-20211012-autotest)
MSR:  8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 42000824  XER: 00000000
CFAR: c0000000006f440c IRQMASK: 1
GPR00: c00000000022434c c000000006797800 c0000000019b2500 c00000117db0ac28
GPR04: c00000117db0a520 0000000000000000 3949ffff40920078 0000000000000001
GPR08: c000000063bd3cf0 c00000000073a7a8 892100602e3f0000 7265677368657265
GPR12: c000000000026e60 c00000117fb4be80 0000000000000000 000000001eef2b00
GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
GPR20: 0000000000000000 0000000000000000 0000000000000003 0000000000000001
GPR24: 0000638695346493 0000000000000002 0000000000000003 c00000117db0a480
GPR28: c00000117db0a480 0000000000000000 c00000117db0a520 c00000117db0ac28
NIP [c0000000006f4498] rb_erase+0x158/0x440
LR [c0000000006f9c18] timerqueue_del+0x58/0xa0
Call Trace:
[c000000006797800] [0000000000000003] 0x3 (unreliable)
[c000000006797830] [c00000000022434c] __remove_hrtimer+0x5c/0xf0
[c000000006797870] [c000000000224b84] __hrtimer_run_queues+0x144/0x360
[c0000000067978f0] [c000000000225b90] hrtimer_interrupt+0x120/0x2f0
[c0000000067979a0] [c0000000000272d4] timer_interrupt+0x144/0x3e0
[c000000006797a00] [c000000000009a60] decrementer_common_virt+0x210/0x220
--- interrupt: 900 at plpar_hcall_norets_notrace+0x18/0x2c
NIP:  c0000000000ef100 LR: c0000000009bca88 CTR: 0000000000000000
REGS: c000000006797a70 TRAP: 0900   Not tainted 
(5.15.0-rc5-next-20211012-autotest)
MSR:  800000000280b033 <SF,VEC,VSX,EE,FP,ME,IR,DR,RI,LE>  CR: 24000224  
XER: 00000000
CFAR: 0000000000000c00 IRQMASK: 0
GPR00: 0000000000000000 c000000006797d10 c0000000019b2500 0000000000000000
GPR04: 8004001f54ca7c10 0000000000000000 0000000000000004 0000000000000018
GPR08: 0000000000000000 0000000000000090 0000000000000001 8004001f54ca7c00
GPR12: 0000000000000000 c00000117fb4be80 0000000000000000 000000001eef2b00
GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
GPR20: 0000000000000000 0000000000000000 0000000000000000 c0000000018f1e28
GPR24: 0000000000000001 0000638691f200fe 0000000000000000 0000000000000000
GPR28: 0000000000000001 0000000000000000 c000000001192090 c000000001192098
NIP [c0000000000ef100] plpar_hcall_norets_notrace+0x18/0x2c
LR [c0000000009bca88] check_and_cede_processor+0x48/0x60
--- interrupt: 900
[c000000006797d10] [c00000117db0ac28] 0xc00000117db0ac28 (unreliable)
[c000000006797d70] [c0000000009bcec4] dedicated_cede_loop+0x94/0x1a0
[c000000006797dc0] [c0000000009b97f4] cpuidle_enter_state+0x2d4/0x4e0
[c000000006797e20] [c0000000009b9a98] cpuidle_enter+0x48/0x70
[c000000006797e60] [c0000000001a0264] call_cpuidle+0x44/0x80
[c000000006797e80] [c0000000001a0840] do_idle+0x340/0x390
[c000000006797f00] [c0000000001a0ac4] cpu_startup_entry+0x34/0x50
[c000000006797f30] [c00000000006144c] start_secondary+0x27c/0x290
[c000000006797f90] [c00000000000d254] start_secondary_prolog+0x10/0x14
Instruction dump:
7d284b78 e9280008 7fa95040 409eff9c e9280010 e8e90000 e9490008 70e70001
41820168 e8c90010 2fa60000 419e0010 <e8e60000> 70e70001 41820044 2faa0000
---[ end trace 3024a0b7ce3c6f83 ]---

# lspci -nn
0012:01:00.0 Fibre Channel [0c04]: QLogic Corp. ISP2722-based 16/32Gb 
Fibre Channel to PCIe Adapter [1077:2261] (rev 01)
0012:01:00.1 Fibre Channel [0c04]: QLogic Corp. ISP2722-based 16/32Gb 
Fibre Channel to PCIe Adapter [1077:2261] (rev 01)

Problem is not seen with 5.15.0-rc4-next-20211005

-- 
Regard's

Abdul Haleem
IBM Linux Technology Center



More information about the Linuxppc-dev mailing list