[PATCH 1/1] powerpc/perf: Adjust callchain based on DWARF debug
Jiri Olsa
jolsa at redhat.com
Thu May 22 20:00:45 EST 2014
On Tue, May 20, 2014 at 06:26:44PM -0700, Sukadev Bhattiprolu wrote:
SNIP
> + *
> + * The value in LR is only needed when it holds a return address. If the
> + * return address is on the stack, we should ignore the LR value.
> + *
> + * Further, when the return address is in the LR, if a new frame was just
> + * allocated but the LR was not saved into it, then the LR contains the
> + * caller, slot 4: contains the caller's caller and the contents of slot 3:
> + * (chain->ips[3]) is undefined and must be ignored.
> + *
> + * Use DWARF debug information to determine if any entries need to be skipped.
> + *
> + * Return:
> + * index: of callchain entry that needs to be ignored (if any)
> + * -1 if no entry needs to be ignored or in case of errors
> + *
> + * TODO:
> + * Rather than returning an index into the callchain and have the
> + * caller skip that entry, we could modify the callchain in-place
> + * by putting a PERF_CONTEXT_IGNORE marker in the affected entry.
> + *
> + * But @chain points to read-only mmap, so the caller needs to
> + * duplicate the callchain to modify in-place - something like:
> + *
> + * new_callchain = arch_duplicate_callchain();
> + * arch_adjust_callchain(new_callchain);
> + * ...
> + * arch_free_callchain(new_callchain);
> + *
> + * Since we only expect to adjust <= 1 entry for now, just return
> + * the index.
yep, that sounds more clear to me.. something like below?
calling callchain_dup from within arch_adjust_callchain in case
you want to change it and returning != 0 in this case, so
we could free the new callchain
but it might be to much overhead in case you have the support to skip
just one entry now right? is there a plan for more cleaning? ;)
thanks,
jirka
---
diff --git a/tools/perf/util/machine.c b/tools/perf/util/machine.c
index 7409ac8..ec18310 100644
--- a/tools/perf/util/machine.c
+++ b/tools/perf/util/machine.c
@@ -1279,6 +1279,11 @@ struct branch_info *sample__resolve_bstack(struct perf_sample *sample,
return bi;
}
+struct ip_callchain *callchain_dup(struct ip_callchain *chain)
+{
+ return memdup(chain, sizeof(*chain) + chain->nr * sizeof(u64));
+}
+
static int machine__resolve_callchain_sample(struct machine *machine,
struct thread *thread,
struct ip_callchain *chain,
@@ -1298,6 +1303,8 @@ static int machine__resolve_callchain_sample(struct machine *machine,
return 0;
}
+ adjusted = arch_adjust_callchain(machine, thread, &chain);
+
for (i = 0; i < chain_nr; i++) {
u64 ip;
struct addr_location al;
@@ -1326,7 +1333,7 @@ static int machine__resolve_callchain_sample(struct machine *machine,
* Discard all.
*/
callchain_cursor_reset(&callchain_cursor);
- return 0;
+ goto out;
}
continue;
}
@@ -1350,10 +1357,14 @@ static int machine__resolve_callchain_sample(struct machine *machine,
err = callchain_cursor_append(&callchain_cursor,
ip, al.map, al.sym);
if (err)
- return err;
+ goto out;
}
- return 0;
+ out:
+ if (adjusted)
+ free(chain);
+
+ return err;
}
static int unwind_entry(struct unwind_entry *entry, void *arg)
More information about the Linuxppc-dev
mailing list