[PATCH] powerpc: fix VMX fix for memcpy case
Vaidyanathan Srinivasan
svaidy at linux.vnet.ibm.com
Sun Oct 7 01:19:14 EST 2012
* Nishanth Aravamudan <nacc at linux.vnet.ibm.com> [2012-10-01 17:59:13]:
> [urgh, sorry Anton, Ben & Paul, inadvertently hit send before adding
> linuxppc-dev to the cc!]
>
> Hi Anton,
>
> In 2fae7cdb60240e2e2d9b378afbf6d9fcce8a3890 ("powerpc: Fix VMX in
> interrupt check in POWER7 copy loops"), I think you inadvertently
> introduced a regression for memcpy on POWER7 machines. copyuer and
> memcpy diverge slightly in their use of cr1 (copyuser doesn't use it,
> but memcpy does) and you end up clobbering that register with your fix.
> That results in (taken from an FC18 kernel):
>
> [ 18.824604] Unrecoverable VMX/Altivec Unavailable Exception f20 at c000000000052f40
> [ 18.824618] Oops: Unrecoverable VMX/Altivec Unavailable Exception, sig: 6 [#1]
> [ 18.824623] SMP NR_CPUS=1024 NUMA pSeries
> [ 18.824633] Modules linked in: tg3(+) be2net(+) cxgb4(+) ipr(+) sunrpc xts lrw gf128mul dm_crypt dm_round_robin dm_multipath linear raid10 raid456 async_raid6_recov async_memcpy async_pq raid6_pq async_xor xor async_tx raid1 raid0 scsi_dh_rdac scsi_dh_hp_sw scsi_dh_emc scsi_dh_alua squashfs cramfs
> [ 18.824705] NIP: c000000000052f40 LR: c00000000020b874 CTR: 0000000000000512
> [ 18.824709] REGS: c000001f1fef7790 TRAP: 0f20 Not tainted (3.6.0-0.rc6.git0.2.fc18.ppc64)
> [ 18.824713] MSR: 8000000000009032 <SF,EE,ME,IR,DR,RI> CR: 4802802e XER: 20000010
> [ 18.824726] SOFTE: 0
> [ 18.824728] CFAR: 0000000000000f20
> [ 18.824731] TASK = c000000fa7128400[0] 'swapper/24' THREAD: c000000fa7480000 CPU: 24
> GPR00: 00000000ffffffc0 c000001f1fef7a10 c00000000164edc0 c000000f9b9a8120
> GPR04: c000000f9b9a8124 0000000000001438 0000000000000060 03ffffff064657ee
> GPR08: 0000000080000000 0000000000000010 0000000000000020 0000000000000030
> GPR12: 0000000028028022 c00000000ff25400 0000000000000001 0000000000000000
> GPR16: 0000000000000000 7fffffffffffffff c0000000016b2180 c00000000156a500
> GPR20: c000000f968c7a90 c0000000131c31d8 c000001f1fef4000 c000000001561d00
> GPR24: 000000000000000a 0000000000000000 0000000000000001 0000000000000012
> GPR28: c000000fa5c04f80 00000000000008bc c0000000015c0a28 000000000000022e
> [ 18.824792] NIP [c000000000052f40] .memcpy_power7+0x5a0/0x7c4
> [ 18.824797] LR [c00000000020b874] .pcpu_free_area+0x174/0x2d0
> [ 18.824800] Call Trace:
> [ 18.824803] [c000001f1fef7a10] [c000000000052c14] .memcpy_power7+0x274/0x7c4 (unreliable)
> [ 18.824809] [c000001f1fef7b10] [c00000000020b874] .pcpu_free_area+0x174/0x2d0
> [ 18.824813] [c000001f1fef7bb0] [c00000000020ba88] .free_percpu+0xb8/0x1b0
> [ 18.824819] [c000001f1fef7c50] [c00000000043d144] .throtl_pd_exit+0x94/0xd0
> [ 18.824824] [c000001f1fef7cf0] [c00000000043acf8] .blkg_free+0x88/0xe0
> [ 18.824829] [c000001f1fef7d90] [c00000000018c048] .rcu_process_callbacks+0x2e8/0x8a0
> [ 18.824835] [c000001f1fef7e90] [c0000000000a8ce8] .__do_softirq+0x158/0x4d0
> [ 18.824840] [c000001f1fef7f90] [c000000000025ecc] .call_do_softirq+0x14/0x24
> [ 18.824845] [c000000fa7483650] [c000000000010e80] .do_softirq+0x160/0x1a0
> [ 18.824850] [c000000fa74836f0] [c0000000000a94a4] .irq_exit+0xf4/0x120
> [ 18.824854] [c000000fa7483780] [c000000000020c44] .timer_interrupt+0x154/0x4d0
> [ 18.824859] [c000000fa7483830] [c000000000003be0] decrementer_common+0x160/0x180
> [ 18.824866] --- Exception: 901 at .plpar_hcall_norets+0x84/0xd4
> [ 18.824866] LR = .check_and_cede_processor+0x48/0x80
> [ 18.824871] [c000000fa7483b20] [c00000000007f018] .check_and_cede_processor+0x18/0x80 (unreliable)
> [ 18.824877] [c000000fa7483b90] [c00000000007f104] .dedicated_cede_loop+0x84/0x150
> [ 18.824883] [c000000fa7483c50] [c0000000006bc030] .cpuidle_enter+0x30/0x50
> [ 18.824887] [c000000fa7483cc0] [c0000000006bc9f4] .cpuidle_idle_call+0x104/0x720
> [ 18.824892] [c000000fa7483d80] [c000000000070af8] .pSeries_idle+0x18/0x40
> [ 18.824897] [c000000fa7483df0] [c000000000019084] .cpu_idle+0x1a4/0x380
> [ 18.824902] [c000000fa7483ec0] [c0000000008a4c18] .start_secondary+0x520/0x528
> [ 18.824907] [c000000fa7483f90] [c0000000000093f0] .start_secondary_prolog+0x10/0x14
> [ 18.824911] Instruction dump:
> [ 18.824914] 38840008 90030000 90e30004 38630008 7ca62850 7cc300d0 78c7e102 7cf01120
> [ 18.824923] 78c60660 39200010 39400020 39600030 <7e00200c> 7c0020ce 38840010 409f001c
> [ 18.824935] ---[ end trace 0bb95124affaaa45 ]---
> [ 18.825046] Unrecoverable VMX/Altivec Unavailable Exception f20 at c000000000052d08
>
> I believe the right fix is to make memcpy match usercopy and not use
> cr1.
>
> Signed-off-by: Nishanth Aravamudan <nacc at us.ibm.com>
Tested-by: Vaidyanathan Srinivasan <svaidy at linux.vnet.ibm.com>
> ---
> I've not tested this fix yet, but I think it's logically correct.
> Probably needs to go to 3.6-stable as well.
>
> diff --git a/arch/powerpc/lib/memcpy_power7.S b/arch/powerpc/lib/memcpy_power7.S
> index 7ba6c96..0663630 100644
> --- a/arch/powerpc/lib/memcpy_power7.S
> +++ b/arch/powerpc/lib/memcpy_power7.S
> @@ -239,8 +239,8 @@ _GLOBAL(memcpy_power7)
> ori r9,r9,1 /* stream=1 */
>
> srdi r7,r5,7 /* length in cachelines, capped at 0x3FF */
> - cmpldi cr1,r7,0x3FF
> - ble cr1,1f
> + cmpldi r7,0x3FF
> + ble 1f
> li r7,0x3FF
> 1: lis r0,0x0E00 /* depth=7 */
> sldi r7,r7,7
This change on v3.6 mainline tree allows kernel to boot without exception.
--Vaidy
More information about the Linuxppc-dev
mailing list