Also, what's the reasoning for --cumulate not being an option under
perf record -g ..,<order>?
Sorry, I don't understand. 'perf record' just saves sample
data (and callchains) from the ring buffer. All the processing happens
in 'perf report', so I can't see what you would expect from 'perf record
--cumulate'. Am I missing something?
Given that there are clear use cases in production involving complex
callgraphs, I'm for getting this support in first and then reconciling
the differences with perf record -b later.
I think what Frederic was talking about is code de-duplication on the
'perf report' side. The branch stack and --cumulate are different: the
branch stack concentrates on the branch itself, but --cumulate uses
callchains to find parents and gives them some credit as side information.
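
To make the "credit to parents" idea concrete, here is a minimal sketch in
Python. It is not the perf code. The function name, the leaf-first chain
layout and the sample data are all made up for illustration. Each sample
adds one hit to the "self" count of the function it landed in, and one hit
to the cumulative count of every function on its callchain.

```python
from collections import defaultdict


def accumulate(samples):
    """Return (self_hits, cumulative_hits) for a list of callchains.

    Each callchain is a list of function names ordered leaf first,
    i.e. [sampled function, its caller, the caller's caller, ...].
    This layout is an assumption made for the sketch.
    """
    self_hits = defaultdict(int)
    cumulative_hits = defaultdict(int)
    for chain in samples:
        if not chain:
            continue
        # The sampled function itself gets the "self" hit.
        self_hits[chain[0]] += 1
        # Every function on the chain is credited once per sample;
        # set() keeps a recursive function from being counted twice.
        for func in set(chain):
            cumulative_hits[func] += 1
    return dict(self_hits), dict(cumulative_hits)


# Hypothetical samples: main -> a -> b -> c, and main -> d.
samples = [
    ["c", "b", "a", "main"],
    ["c", "b", "a", "main"],
    ["b", "a", "main"],
    ["d", "main"],
]

self_hits, cumulative_hits = accumulate(samples)
print(self_hits)
print(cumulative_hits)
```

In this made-up data 'main' and 'a' never appear as the sampled function, so
they have no self hits, yet they still show up in the cumulative counts
because they are parents of the functions that were sampled.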