[PATCH/RFC] sysfs cache code rewrite
Nathan Lynch
ntl at pobox.com
Wed Jan 7 08:25:33 EST 2009
Please unsubscribe me from... wait, that's not what I meant to say.
Any thoughts or questions on this one? I posted this right before the
holidays so my feelings aren't hurt by the lack of response (yet!)
To summarize why I think this should go in 2.6.29:
- Userspace will now be able to determine the true cache topology
(the current code doesn't tell you which caches are shared between
CPUs).
- More complete information will be presented on systems that use
[di]-cache-block-size properties instead of [di]-cache-line-size.
- While overall LOC has increased, the documentation is better, and the
functions are much shorter and less complex.
Nathan Lynch wrote:
> The current code for providing processor cache information in sysfs
> has the following deficiencies:
> - several complex functions that are hard to understand
> - implicit recursion (cache_desc_release -> kobject_put -> cache_desc_release)
> - explicit recursion (create_cache_index_info)
> - use of two per-cpu arrays when one would suffice
> - duplication of work on systems where CPUs share cache
>
> Also, when I looked at implementing support for a shared_cpu_map
> attribute, it was pretty much impossible to handle hotplug without
> checking every single online CPU's cache_desc list and fixing things
> up... not that this is a hot path, but it would have introduced
> O(n^2)-ish behavior during boot. Addressing this involved rethinking
> the core data structures used, which didn't lend itself to an
> incremental approach.
>
> This implementation maintains a "forest" (potentially more than one
> tree) of cache objects which reflects the system's cache topology.
> Cache objects are instantiated as needed as CPUs come online. A
> per-cpu array is used mainly for sysfs-related bookkeeping; the
> objects in the array just point to the appropriate points in the
> forest.
>
> This maintains compatibility with the existing code and includes some
> enhancements:
> - Implement the shared_cpu_map attribute, which is essential for
> enabling userspace to discover the system's overall cache topology.
> - Use cache-block-size properties if cache-line-size is not available.
>
> I chose to place this implementation in a new file since it would have
> roughly doubled the size of sysfs.c, which is already kind of messy.
>
> Signed-off-by: Nathan Lynch <ntl at pobox.com>
> ---
> arch/powerpc/kernel/Makefile | 2 +-
> arch/powerpc/kernel/cacheinfo.c | 837 +++++++++++++++++++++++++++++++++++++++
> arch/powerpc/kernel/cacheinfo.h | 8 +
> arch/powerpc/kernel/sysfs.c | 300 +--------------
> 4 files changed, 850 insertions(+), 297 deletions(-)
> create mode 100644 arch/powerpc/kernel/cacheinfo.c
> create mode 100644 arch/powerpc/kernel/cacheinfo.h
>
> I've tested this on various ppc64 systems, making sure that the cache
> attributes' output remain unchanged, as well as running cpu hotplug
> stress tests while concurrently accessing the cpu sysfs hierarchy.
>
More information about the Linuxppc-dev
mailing list