[PATCH 2/2] tools/perf: Fix printing field separator in CSV metrics output

Athira Rajeev atrajeev at linux.vnet.ibm.com
Mon Nov 14 20:30:47 AEDT 2022



> On 09-Nov-2022, at 3:53 PM, Athira Rajeev <atrajeev at linux.vnet.ibm.com> wrote:
> 
> 
> 
>> On 09-Nov-2022, at 2:27 AM, Arnaldo Carvalho de Melo <acme at kernel.org> wrote:
>> 
>> Em Wed, Nov 02, 2022 at 02:07:06PM +0530, Athira Rajeev escreveu:
>>> 
>>> 
>>>> On 18-Oct-2022, at 2:26 PM, Athira Rajeev <atrajeev at linux.vnet.ibm.com> wrote:
>>>> 
>>>> In perf stat with the CSV output option, the number of fields
>>>> in the metrics output does not match the number of fields
>>>> in the other event output lines.
>>>> 
>>>> Sample output below, after applying the patch that fixes
>>>> printing of os->prefix:
>>>> 
>>>> 	# ./perf stat -x, --per-socket -a -C 1 ls
>>>> 	S0,1,1.89,msec,cpu-clock,1887692,100.00,1.013,CPUs utilized
>>>> 	S0,1,2,,context-switches,1885842,100.00,1.060,K/sec
>>>> 	S0,1,0,,cpu-migrations,1885374,100.00,0.000,/sec
>>>> 	S0,1,2,,page-faults,1884880,100.00,1.060,K/sec
>>>> 	S0,1,189544,,cycles,1263158,67.00,0.100,GHz
>>>> 	S0,1,64602,,stalled-cycles-frontend,1876146,100.00,34.08,frontend cycles idle
>>>> 	S0,1,128241,,stalled-cycles-backend,1875512,100.00,67.66,backend cycles idle
>>>> 	S0,1,95578,,instructions,1874676,100.00,0.50,insn per cycle
>>>> ===>	S0,1,,,,,,,1.34,stalled cycles per insn
>>>> 
>>>> The above command line sets the field separator to ","
>>>> via the "-x," option, and the per-socket option displays
>>>> the socket value as the first field. But here the last line,
>>>> for "stalled cycles per insn", has more separators.
>>>> Each CSV output line is expected to have 8 field
>>>> separators (for the 9 fields), whereas the last line
>>>> has 9 "," in the result. The patch fixes this issue.
>>>> 
>>>> The counter stats are displayed by function
>>>> "perf_stat__print_shadow_stats" in code
>>>> "util/stat-shadow.c". While printing the stats info
>>>> for "stalled cycles per insn", function "new_line_csv"
>>>> is used as new_line callback.
>>>> 
>>>> The fields printed in each line are:
>>>> "Socket_id,aggr nr,Avg,unit,event_name,run,enable_percent,ratio,unit"
>>>> 
>>>> The metric output prints Socket_id, aggr nr, ratio
>>>> and unit. It has to skip the remaining five fields,
>>>> i.e. Avg,unit,event_name,run,enable_percent. The CSV
>>>> line callback uses "os->nfields" to know the number of
>>>> fields to skip so that it matches the other lines.
>>>> Currently it is set as:
>>>> 	os.nfields = 3 + aggr_fields[config->aggr_mode] + (counter->cgrp ? 1 : 0);
>>>> 
>>>> But in the aggregation modes, csv_sep already
>>>> gets printed along with each field (function "aggr_printout"
>>>> in util/stat-display.c), so aggr_fields can be
>>>> removed from nfields. The fixed number of separators to
>>>> print has to be "4". This skips the five fields:
>>>> "avg, unit, event name, run, enable_percent"
>>>> Example from the line for cpu-clock:
>>>> "1.89,msec,cpu-clock,1887692,100.00"
>>>> 
>>>> This needs 4 CSV separators. The patch removes aggr_fields
>>>> and uses 4 as the fixed number of os->nfields to skip.
>>>> 
>>>> After the patch:
>>>> 
>>>> 	# ./perf stat -x, --per-socket -a -C 1 ls
>>>> 	S0,1,1.92,msec,cpu-clock,1917648,100.00,1.010,CPUs utilized
>>>> 	S0,1,54,,context-switches,1916762,100.00,28.176,K/sec
>>>> 	-------
>>>> 	S0,1,528693,,instructions,1908854,100.00,0.36,insn per cycle
>>>> 	S0,1,,,,,,1.81,stalled cycles per insn
>>>> 
>>>> Fixes: 92a61f6412d3 ("perf stat: Implement CSV metrics output")
>>>> Reported-by: Disha Goel <disgoel at linux.vnet.ibm.com>
>>>> Signed-off-by: Athira Rajeev <atrajeev at linux.vnet.ibm.com>
>>> 
>>> Hi All,
>>> 
>>> Looking for review comments for this change.
>> 
>> This clashed with a patch from Namhyung that I just applied:
>> 
>> http://lore.kernel.org/lkml/20221107213314.3239159-2-namhyung@kernel.org
>> 
>> Can you please check? I just applied the other patch in this series.
>> 
>> Thanks,
>> 
>> - Arnaldo
> 
> Hi Arnaldo,
> 
> Thanks for checking the patch series.
> Please find the updated patch below which is created on top of perf/urgent.

Hi Arnaldo,

I posted this as a separate patch with version V2 here: 

https://lore.kernel.org/linux-perf-users/20221114085523.86570-1-atrajeev@linux.vnet.ibm.com/T/#m1ba8c773b53f198923101684c39b13da686c211d

Thanks
Athira

> 
> From dde8f830ad318c9111c3fea5415fd8170b4c51bd Mon Sep 17 00:00:00 2001
> From: Athira Rajeev <atrajeev at linux.vnet.ibm.com>
> Date: Tue, 18 Oct 2022 14:26:05 +0530
> Subject: [PATCH] tools/perf: Fix printing field separator in CSV metrics
> output
> 
> In perf stat with the CSV output option, the number of fields
> in the metrics output does not match the number of fields
> in the other event output lines.
> 
> Sample output below, after applying the patch that fixes
> printing of os->prefix:
> 
> 	# ./perf stat -x, --per-socket -a -C 1 ls
> 	S0,1,1.89,msec,cpu-clock,1887692,100.00,1.013,CPUs utilized
> 	S0,1,2,,context-switches,1885842,100.00,1.060,K/sec
> 	S0,1,0,,cpu-migrations,1885374,100.00,0.000,/sec
> 	S0,1,2,,page-faults,1884880,100.00,1.060,K/sec
> 	S0,1,189544,,cycles,1263158,67.00,0.100,GHz
> 	S0,1,64602,,stalled-cycles-frontend,1876146,100.00,34.08,frontend cycles idle
> 	S0,1,128241,,stalled-cycles-backend,1875512,100.00,67.66,backend cycles idle
> 	S0,1,95578,,instructions,1874676,100.00,0.50,insn per cycle
> ===>	S0,1,,,,,,,1.34,stalled cycles per insn
> 
> The above command line sets the field separator to ","
> via the "-x," option, and the per-socket option displays
> the socket value as the first field. But here the last line,
> for "stalled cycles per insn", has more separators.
> Each CSV output line is expected to have 8 field
> separators (for the 9 fields), whereas the last line
> has 9 "," in the result. The patch fixes this issue.
> 
> The counter stats are displayed by function
> "perf_stat__print_shadow_stats" in code
> "util/stat-shadow.c". While printing the stats info
> for "stalled cycles per insn", function "new_line_csv"
> is used as new_line callback.
> 
> The fields printed in each line are:
> "Socket_id,aggr nr,Avg,unit,event_name,run,enable_percent,ratio,unit"
> 
> The metric output prints Socket_id, aggr nr, ratio
> and unit. It has to skip the remaining five fields,
> i.e. Avg,unit,event_name,run,enable_percent. The CSV
> line callback uses "os->nfields" to know the number of
> fields to skip so that it matches the other lines.
> Currently it is set as:
> 	os.nfields = 3 + aggr_fields[config->aggr_mode] + (counter->cgrp ? 1 : 0);
> 
> But in the aggregation modes, csv_sep already
> gets printed along with each field (function "aggr_printout"
> in util/stat-display.c), so aggr_fields can be
> removed from nfields. The fixed number of separators to
> print has to be "4". This skips the five fields:
> "avg, unit, event name, run, enable_percent"
> Example from the line for cpu-clock:
> "1.89,msec,cpu-clock,1887692,100.00"
> 
> This needs 4 CSV separators. The patch removes aggr_fields
> and uses 4 as the fixed number of os->nfields to skip.
> 
> After the patch:
> 
> 	# ./perf stat -x, --per-socket -a -C 1 ls
> 	S0,1,1.92,msec,cpu-clock,1917648,100.00,1.010,CPUs utilized
> 	S0,1,54,,context-switches,1916762,100.00,28.176,K/sec
> 	-------
> 	S0,1,528693,,instructions,1908854,100.00,0.36,insn per cycle
> 	S0,1,,,,,,1.81,stalled cycles per insn
> 
> Fixes: 92a61f6412d3 ("perf stat: Implement CSV metrics output")
> Reported-by: Disha Goel <disgoel at linux.vnet.ibm.com>
> Signed-off-by: Athira Rajeev <atrajeev at linux.vnet.ibm.com>
> ---
> tools/perf/util/stat-display.c | 13 +------------
> 1 file changed, 1 insertion(+), 12 deletions(-)
> 
> diff --git a/tools/perf/util/stat-display.c b/tools/perf/util/stat-display.c
> index ba66bb7fc1ca..5ce14bf18055 100644
> --- a/tools/perf/util/stat-display.c
> +++ b/tools/perf/util/stat-display.c
> @@ -551,20 +551,9 @@ static void printout(struct perf_stat_config *config, struct aggr_cpu_id id, int
> 	new_line_t nl;
> 
> 	if (config->csv_output) {
> -		static const int aggr_fields[AGGR_MAX] = {
> -			[AGGR_NONE] = 1,
> -			[AGGR_GLOBAL] = 0,
> -			[AGGR_SOCKET] = 2,
> -			[AGGR_DIE] = 2,
> -			[AGGR_CORE] = 2,
> -			[AGGR_THREAD] = 1,
> -			[AGGR_UNSET] = 0,
> -			[AGGR_NODE] = 1,
> -		};
> -
> 		pm = config->metric_only ? print_metric_only_csv : print_metric_csv;
> 		nl = config->metric_only ? new_line_metric : new_line_csv;
> -		os.nfields = 3 + aggr_fields[config->aggr_mode] + (counter->cgrp ? 1 : 0);
> +		os.nfields = 4 + (counter->cgrp ? 1 : 0);
> 	} else if (config->json_output) {
> 		pm = config->metric_only ? print_metric_only_json : print_metric_json;
> 		nl = config->metric_only ? new_line_metric : new_line_json;
> -- 
> 2.31.1
> 
> 
> Thanks
> Athira
> 
>> 
>>> Thanks
>>> Athira
>>> 
>>>> ---
>>>> tools/perf/util/stat-display.c | 13 +------------
>>>> 1 file changed, 1 insertion(+), 12 deletions(-)
>>>> 
>>>> diff --git a/tools/perf/util/stat-display.c b/tools/perf/util/stat-display.c
>>>> index 879874a4bc07..5ca151adf826 100644
>>>> --- a/tools/perf/util/stat-display.c
>>>> +++ b/tools/perf/util/stat-display.c
>>>> @@ -551,20 +551,9 @@ static void printout(struct perf_stat_config *config, struct aggr_cpu_id id, int
>>>> 	new_line_t nl;
>>>> 
>>>> 	if (config->csv_output) {
>>>> -		static const int aggr_fields[AGGR_MAX] = {
>>>> -			[AGGR_NONE] = 1,
>>>> -			[AGGR_GLOBAL] = 0,
>>>> -			[AGGR_SOCKET] = 2,
>>>> -			[AGGR_DIE] = 2,
>>>> -			[AGGR_CORE] = 2,
>>>> -			[AGGR_THREAD] = 1,
>>>> -			[AGGR_UNSET] = 0,
>>>> -			[AGGR_NODE] = 0,
>>>> -		};
>>>> -
>>>> 		pm = config->metric_only ? print_metric_only_csv : print_metric_csv;
>>>> 		nl = config->metric_only ? new_line_metric : new_line_csv;
>>>> -		os.nfields = 3 + aggr_fields[config->aggr_mode] + (counter->cgrp ? 1 : 0);
>>>> +		os.nfields = 4 + (counter->cgrp ? 1 : 0);
>>>> 	} else if (config->json_output) {
>>>> 		pm = config->metric_only ? print_metric_only_json : print_metric_json;
>>>> 		nl = config->metric_only ? new_line_metric : new_line_json;
>>>> -- 
>>>> 2.31.1
>>>> 
>> 
>> -- 
>> 
>> - Arnaldo


