IPIC_SERMR vs IPIC_SERMR
christophe leroy
christophe.leroy at c-s.fr
Thu May 9 03:17:51 AEST 2019
Hi Pablo,
Le 07/05/2019 à 23:50, Rodriguez Quesada, Pablo a écrit :
> Hi, Christophe and everyone from the LinuxPPC community,
>
> Hope you are doing well!
>
> I was working with Daniel, and we were reviewing the ipic source code
> and came to these questions:
>
> 1.The first one came up while reviewing the document “Understanding
> IPIC” from NXP (*). The document states that:
>
> /System Error Status Register (SERSR) – 0x40: Each bit in the SERSR
> register corresponds to a non-maskable error source that generates a
> machine check exception (MCP). These bits are cleared by writing the bit
> location with a logic 1./
>
> /The interrupting source must be cleared before clearing the interrupts
> error status bit./
>
> /System Error Mask Register (SERMR) – 0x44 When a machine check
> exception is signaled from one of the peripheral modules, its
> corresponding flag bit in this register is set. Any of the interrupting
> sources can be individually masked./
I guess the above description is misleading.
In the MPC8323 Reference Manual, I find it more explicit:
8.5.14 System Error Mask Register (SERMR)
Each implemented bit in SERMR, shown in Figure 8-16, corresponds to an
external and an internal mcp source (MCP). The user masks an MCP by
clearing and enables an interrupt by setting the corresponding SERMR
bit. When a masked MCP occurs, the corresponding SERSR bit is set,
regardless of the setting of the corresponding SERMR bit although no MCP
request is passed to the core in this case. The SERMR can be read by the
user at any time.
>
> //
>
> We noticed that you changed the ipic_clear_mcp_status function by
> swapping SERMR to SERSR, but we don’t fully understand why. Could you
> please elaborate on why this change is made? This because we had a code
> that was working before the patch and we would fully understand why it
> stopped working. From our perspective, both SERSR and SERMR are related
> to MCEs and honestly don’t know what’s the difference at the hardware
> level, and the role of every register on the machine check flow. It is
> also weird for us that before your patch the register SERSR wasn’t used
> in any part of the Linux kernel.
I suppose you are referring to the patch
https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/arch/powerpc/sysdev/ipic.c?h=v5.1&id=6b148a7ce72a7f87c81cbcde48af014abc0516a9
For me the role of a function called ipic_clear_mcp_status() is ... to
clear the status, not to manipulate a mask. So it has to deal with the
status register (SERSR) and has nothing to do with the mask register
(SERMR).
>
> 2.In commit 8acb88682cc00a41a677c2455a7c992d you removed
> ipic_set_highest_priority(), ipic_enable_mcp() and
> ipic_disable_mcp()because it wasn’t used but here we are confused
> because ipic_clear_mcp_status() sets all bits with the mask and in the
> old disable function the bits from the mask are cleared with a bitwise
> AND,changing the other bits from the SERMR register but the mcp_irq.
Probably the parameter of ipic_clear_mcp_status() should not be called
'mask'. This parameter identifies the sources to clear. The bits set to
1 corresponds to the sources to clear. The bits set to 0 corresponds to
the sources we don't want to touch.
ipic_enable_mcp() and ipic_disable_mcp() had a different role, their
role was to activate or deactivate interrupts generated by MCP sources
iaw the description of SERMR.
>
> ipic_clear:
>
> ipic_write
> <https://elixir.bootlin.com/linux/v4.1.13/ident/ipic_write>(primary_ipic
> <https://elixir.bootlin.com/linux/v4.1.13/ident/primary_ipic>->regs,
> IPIC_SERMR <https://elixir.bootlin.com/linux/v4.1.13/ident/IPIC_SERMR>,
> mask);
>
> ipic_disable:
>
> temp <https://elixir.bootlin.com/linux/v4.1.13/ident/temp> = ipic_read
> <https://elixir.bootlin.com/linux/v4.1.13/ident/ipic_read>(ipic
> <https://elixir.bootlin.com/linux/v4.1.13/ident/ipic>->regs, IPIC_SERMR
> <https://elixir.bootlin.com/linux/v4.1.13/ident/IPIC_SERMR>);
>
> temp <https://elixir.bootlin.com/linux/v4.1.13/ident/temp> &= (1 << (31
> - mcp_irq));
>
> ipic_write
> <https://elixir.bootlin.com/linux/v4.1.13/ident/ipic_write>(ipic
> <https://elixir.bootlin.com/linux/v4.1.13/ident/ipic>->regs, IPIC_SERMR
> <https://elixir.bootlin.com/linux/v4.1.13/ident/IPIC_SERMR>, temp
> <https://elixir.bootlin.com/linux/v4.1.13/ident/temp>);
>
> Isn’t ipic_clear a set function instead of a clear
> function? This bring us confusion
No, ipic_clear() role is to clear the status in accordance with the
description of SERSR. In the description you copied, it is explicitely
said: "These bits are cleared by writing the bit location with a logic 1".
In the MPC8323 Reference Manual, it is even more explicit:
Each implemented bit in the SERSR, listed in Table 8-22, corresponds to
an external and an internal error source (mcp). When an error interrupt
signal is received, the interrupt controller sets the corresponding
SERSR bit. SERSR bits are cleared by writing ones to them. Unmasked
event register bits should be cleared before clearing SERSR bits.
Because the user can only clear bits in this register, writing zeros to
this register has no effect.
>
> Another question is why these functions were created in the first place?
I guess someone wanted to create a full API to the IPIC, but it was
never used, or it was used by out-of-tree drivers or was used before the
kernel was it GIT.
>
> 3.In the NXP document it states that: /These bits are cleared by writing
> the bit location with a logic 1. The interrupting source must be cleared
> before clearing the interrupts error status bit./
>
> Does this mean that these registers work with negative logic? How is
> this managed in the kernel?
>
> The interrupting source is the SERMR register, the SERSR or any other?
>
> 4.What is the real difference between MCP and MCE? What are their uses?
MCE means Machine Check Exception
A machine check exception can be generated for several reasons, not only
MCP. For instead a TEA will also generate a MCE.
Iaw MPC8323 Reference Manual:
MCP = Machine-check interrupt
In table 7.7:
Machine check 00200 Caused by the assertion of the tea signal during a
data bus transaction, assertion of mcp, an address or data parity error,
or an instruction or data cache parity error. Note that the e300 has
SRR1 register values that are different from the G2/G2_LE cores’ when a
machine check occurs
In §8.1:
The machine check exception is caused by the internal mcp signal
generated by the IPIC, informing the host processor of error conditions,
assertion of the external IRQ0 machine-check request (enabled when
SEMSR[SIRQ0] = 1), and other conditions.
>
> I know it is a lot of information but we are very confused about the use
> of these registers and we noticed that you are very involved on the PPC
> development. So we would like to hear from an expert.
>
> Thank you very much for your time,
You are welcome
Christophe
>
> Pablo
>
> (*) https://www.nxp.com/docs/en/application-note/AN3797.pdf
>
---
L'absence de virus dans ce courrier électronique a été vérifiée par le logiciel antivirus Avast.
https://www.avast.com/antivirus
More information about the Linuxppc-dev
mailing list