[Skiboot] [PATCH] occ: Filter out entries from Pmin to Pmax in pstate table

Balbir Singh bsingharora at gmail.com
Mon Apr 18 18:24:36 AEST 2016


>>
>> I was wondering if we just expose all the pstates including p_min and p_max and
>> nr_pstates, the OS might be better at filtering states and implementing the
>> right thing in the future without requiring a reboot (may be in the future).
>> We might be able to show turbo p-states and show them as disabled in the OS
> 
> Hi Balbir,
> 
> There are two scenarios of turbo mode control:
> (a) OS driven policy:
> 
> This is the case you are talking about where we would like Linux OS
> cpufreq driver to control the limits of cpu frequency scaling in order
> to drive responsiveness and efficiency for workloads.
> 
> We can do this today by setting the max scaling frequency limit in
> ondemand and performance governor and we can have tuned profiles to
> make this simple for end users.
> 
> We have the full cpu frequency range including exposed to cpu-freq
> driver and also the min/nominal/max frequency is available in sysfs to
> set the required policy.
> 
> We can disable turbo mode today by setting max scaling frequency to
> the current nominal frequency.
> 
> (b) Firmware driven policy:
> 
> This is the case Shilpa's fix addresses.  The goal here is to have an
> equivalent of BIOS based settings that is consistent across different
> OS instances and limits the policies that OS can use.
> 
> Turbo mode disable in scenario (a) can be done by setting cpu-freq
> driver limits.  However for case (b) we need to filter out the allowed
> range and tell Linux only the PStates that Linux will be allowed to
> set.
> 
> OCC will limit the range and today if the PState requested by Linux is
> not granted by OCC then we assume a power cap or throttle situation
> and do not expect to have a cpu freq policy conflict between OCC and
> Linux.
> 

Hi, Vaidy

I understand (a) and (b) and the patch looks good, however, I was just wondering
if keeping the future in mind lets consider the following p-states

p_min, p_1, p_2, ..., p_max

p_max will change based on settings by OCC

p_min, p_1, p_2, ..., current p_max, ..., tp_max

If OCC in the future were to ever make tp_max dynamic (change without reboot), the
current code would require a reboot, we could support it.
But I realize that this is beyond the scope of this patch.

The patch overall makes sense

Balbir Singh


More information about the Skiboot mailing list