[Skiboot] [PATCH 1/2] SLW: Remove stop1_lite and stop0 stop states

Nicholas Piggin npiggin at gmail.com
Thu May 10 20:02:45 AEST 2018


On Thu, 10 May 2018 14:29:44 +0530
Akshay Adiga <akshay.adiga at linux.vnet.ibm.com> wrote:

> On Thu, May 03, 2018 at 08:15:59PM +1000, Nicholas Piggin wrote:
> > On Thu, 03 May 2018 20:03:55 +1000
> > Stewart Smith <stewart at linux.vnet.ibm.com> wrote:
> >   
> > > Nicholas Piggin <npiggin at gmail.com> writes:  
> > > > On Thu, 3 May 2018 14:36:47 +0530
> > > > Akshay Adiga <akshay.adiga at linux.vnet.ibm.com> wrote:
> > > >    
> > > >> On Tue, May 01, 2018 at 01:47:23PM +1000, Nicholas Piggin wrote:    
> > > >> > On Mon, 30 Apr 2018 14:42:08 +0530
> > > >> > Akshay Adiga <akshay.adiga at linux.vnet.ibm.com> wrote:
> > > >> >       
> > > >> > > Powersaving for stop0_lite and stop1_lite is observed to be quite similar
> > > >> > > and both states resume without state loss. Using context_switch test [1]
> > > >> > > we observe that stop0_lite has slightly lower latency, hence removing
> > > >> > > stop1_lite.
> > > >> > > 
> > > >> > > [1] linux/tools/testing/selftests/powerpc/benchmarks/context_switch.c
> > > >> > > 
> > > >> > > Signed-off-by: Akshay Adiga <akshay.adiga at linux.vnet.ibm.com>      
> > > >> > 
> > > >> > I'm okay for removing stop1_lite and stop2_lite -- SMT switching
> > > >> > is very latency critical. If we decide to actually start saving
> > > >> > real power then SMT should already have been switched.
> > > >> > 
> > > >> > So I would put stop1_lite and stop2_lite removal in the same patch.      
> > > >> 
> > > >> I can do this.
> > > >>     
> > > >> > 
> > > >> > Then what do we have? stop0_lite, stop0, stop1 for our fast idle
> > > >> > states.      
> > > >> 
> > > >> Currently we were looking at  stop0_lite , stop1 as the fast idle states
> > > >> because stop0 and stop1 have similar latency and powersaving.
> > > >> Having so many low latency states does not make sense.
> > > >>     
> > > >> > 
> > > >> > I would be against removing stop0 if that is our fastest way to
> > > >> > release SMT resources, even if there is only a small advantage. Why
> > > >> > not remove stop1 instead?
> > > >> >      
> > > >> SMT-folding comes into picture only when we have at least one thread
> > > >> running in the core. stop0 and stop1 has exactly same power-saving and
> > > >> both will release SMT resources if at least one thread in the core is
> > > >> running.    
> > > >
> > > > Right, but you don't know that other threads are running or will remain
> > > > running when you enter stop. If not, then latency is higher for stop1,
> > > > no? So we need to be using stop0.
> > > >    
> > > >> 
> > > >> As soon as all threads are idle core enters stop0/stop1, where stop1
> > > >> does a bit more powersaving than stop0.
> > > >>     
> > > >> > We also need to better evaluate stop0_lite. How much advantage does
> > > >> > that have over snooze?      
> > > >> 
> > > >> I evaluated snooze and stop0_lite, there is an additional ipi latency of
> > > >> a few microseconds in case of stop0_lite. So snooze cannot still be
> > > >> replaced by stop0_lite.    
> > > >
> > > > I meant the other way around. Replace stop0_lite with snooze.
> > > >
> > > > So we would have snooze, stop0, stop2, and stop4 and/or 5.    
> > > 
> > > Slightly stupid question: should we be disabling these here or should
> > > Linux be better and deciding what states to use?  
> > 
> > Yeah not a bad question, I don't have a good answer. I don't know how
> > smart Linux is at deciding what to use and when.
> > 
> > I am pretty sure the way we set our _lite states wrong -- we don't
> > want to go into stop2_lite as a deeper sleep state than stop0 for
> > example, because that then prevents SMT folding.  
> 
> I think we should keep both stop0 and stop1, i was not able to get
> a good enough reason to remove stop0.
> 
> I a diffrent patch we need to tweak residencies so that we can bias
> to more useful stop states.
> 

Well I would say stop1_lite and stop2_lite should not be used if we
have stop0_lite, particularly they should not be used after stop0 in
the case that we have SMT enabled. If SMT is disabled then possibly
we could use the _lite states instead.

So we need some way to advertise and respond to that in Linux.

After that I guess it's a matter of measuring and tuning the others.
I think between snooze and stop0, it is an open question whether it
is worth using stop0_lite.

Thanks,
Nick


More information about the Skiboot mailing list