[PATCH v4 0/5] perf report: Show branch type

Jiri Olsa jolsa at redhat.com
Wed Apr 12 20:58:39 AEST 2017


On Wed, Apr 12, 2017 at 06:21:01AM +0800, Jin Yao wrote:

SNIP

> 
> 3. Use 2 bits in perf_branch_entry for a "cross" metric that checks
>    whether a branch crosses a 4K or 2M area. This is an approximate
>    check of whether the branch crosses a 4K page or a 2MB page.
> 
> For example:
> 
> perf record -g --branch-filter any,save_type <command>
> 
> perf report --stdio
> 
>      JCC forward:  27.7%
>     JCC backward:   9.8%
>              JMP:   0.0%
>          IND_JMP:   6.5%
>             CALL:  26.6%
>         IND_CALL:   0.0%
>              RET:  29.3%
>             IRET:   0.0%
>         CROSS_4K:   0.0%
>         CROSS_2M:  14.3%
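
The "cross" check quoted above can be illustrated with a rough sketch. It
assumes the check compares which size-aligned area the branch source and
target addresses fall into; the patch itself may compute this differently.
The names crosses() and cross_type() and the sample addresses are
hypothetical and do not come from the patch series.

```python
# Hypothetical illustration only; not code from the patch series.
PAGE_4K = 4 * 1024
PAGE_2M = 2 * 1024 * 1024


def crosses(from_addr: int, to_addr: int, size: int) -> bool:
    """Return True if the two addresses lie in different size-aligned areas."""
    return (from_addr // size) != (to_addr // size)


def cross_type(from_addr: int, to_addr: int) -> str:
    """Classify a branch by the largest area boundary it crosses (assumed scheme)."""
    if crosses(from_addr, to_addr, PAGE_2M):
        return "CROSS_2M"
    if crosses(from_addr, to_addr, PAGE_4K):
        return "CROSS_4K"
    return "NONE"


if __name__ == "__main__":
    # Example addresses are made up for illustration.
    print(cross_type(0x1000, 0x1FF8))      # same 4K page
    print(cross_type(0x1FF8, 0x2000))      # adjacent 4K pages
    print(cross_type(0x1FF000, 0x200000))  # adjacent 2M areas
```

In this sketch a branch that crosses a 2M boundary is reported only as
CROSS_2M, although it necessarily crosses a 4K boundary as well; whether the
patch counts such a branch in both buckets is not stated in the quoted text.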

I got mangled perf report --stdio output for:


[root@ibm-x3650m4-02 perf]# ./perf record -j any,save_type kill
kill: not enough arguments
[ perf record: Woken up 1 times to write data ]
[ perf record: Captured and wrote 0.013 MB perf.data (18 samples) ]

[root@ibm-x3650m4-02 perf]# ./perf report --stdio -f | head -30
# To display the perf.data header info, please use --header/--header-only options.
#
#
# Total Lost Samples: 0
#
# Samples: 253  of event 'cycles'
# Event count (approx.): 253
#
# Overhead  Command  Source Shared Object  Source Symbol                            Target Symbol                            Basic Block Cycles
# ........  .......  ....................  .......................................  .......................................  ..................
#
     8.30%  perf
Um  [kernel.vmlinux]      [k] __intel_pmu_enable_all.constprop.17  [k] native_write_msr                     -                 
     7.91%  perf
Um  [kernel.vmlinux]      [k] intel_pmu_lbr_enable_all             [k] __intel_pmu_enable_all.constprop.17  -                 
     7.91%  perf
Um  [kernel.vmlinux]      [k] native_write_msr                     [k] intel_pmu_lbr_enable_all             -                 
     6.32%  kill     libc-2.24.so          [.] _dl_addr                             [.] _dl_addr                             -                 
     5.93%  perf
Um  [kernel.vmlinux]      [k] perf_iterate_ctx                     [k] perf_iterate_ctx                     -                 
     2.77%  kill     libc-2.24.so          [.] malloc                               [.] malloc                               -                 
     1.98%  kill     libc-2.24.so          [.] _int_malloc                          [.] _int_malloc                          -                 
     1.58%  kill     [kernel.vmlinux]      [k] __rb_insert_augmented                [k] __rb_insert_augmented                -                 
     1.58%  perf
Um  [kernel.vmlinux]      [k] perf_event_exec                      [k] perf_event_exec                      -                 
     1.19%  kill     [kernel.vmlinux]      [k] anon_vma_interval_tree_insert        [k] anon_vma_interval_tree_insert        -                 
     1.19%  kill     [kernel.vmlinux]      [k] free_pgd_range                       [k] free_pgd_range                       -                 
     1.19%  kill     [kernel.vmlinux]      [k] n_tty_write                          [k] n_tty_write                          -                 
     1.19%  perf
Um  [kernel.vmlinux]      [k] native_sched_clock                   [k] sched_clock                          -                 
...
SNIP


jirka

