
Tweaks for datadog compatibility #74

Draft · wants to merge 8 commits into main
Conversation

@vilterp (Collaborator) commented Oct 21, 2022

Without these, the flame graph shows up empty in their viewer.

https://www.datadoghq.com/product/code-profiling/

(Accidentally closed #73; reopening)

@codecov-commenter commented Oct 21, 2022

Codecov Report

Merging #74 (75cf270) into master (ac78823) will increase coverage by 0.01%.
The diff coverage is 100.00%.

```
@@            Coverage Diff             @@
##           master      #74      +/-   ##
==========================================
+ Coverage   96.61%   96.62%   +0.01%
==========================================
  Files           3        3
  Lines         295      296       +1
==========================================
+ Hits          285      286       +1
  Misses         10       10
```

| Impacted Files | Coverage Δ |
| --- | --- |
| src/Allocs.jl | 92.00% <100.00%> (ø) |
| src/PProf.jl | 99.20% <100.00%> (+<0.01%) ⬆️ |


```diff
@@ -43,7 +43,7 @@ function pprof(alloc_profile::Profile.Allocs.AllocResults = Profile.Allocs.fetch
     # Allocs-specific arguments:
     frame_for_type::Bool = true,
 )
-    period = UInt64(0x1)
+    period = sum(alloc.size for alloc in alloc_profile.allocs, init=0)
```
Member:

🤔 this seems weird… isn't the period supposed to represent how much each sample stands for?

So for a CPU profile, this would be the amount of time between samples? And for an allocation profile, it should be 1 allocation per sample, right?

Collaborator (author):

Hm yeah I was misunderstanding this.

Looking at the proto, it says "The number of events between sampled occurrences": https://github.com/google/pprof/blob/131d412537eacd9973045c73835c3ac6ed696765/proto/profile.proto#L85-L86

So I guess it's basically supposed to be the sample interval? E.g. "every 1000th alloc" would be 1000?
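To make that interpretation concrete, here is a hedged sketch (illustrative Python, not PProf.jl code; field names follow profile.proto, and the `alloc_objects`/`count` pairing is an assumption echoing the discussion below):

```python
# Illustrative sketch of the pprof `period` / `period_type` pairing.
# Per profile.proto, `period` is "the number of events between sampled
# occurrences", in the units described by `period_type`.

def make_period_fields(events_between_samples: int) -> dict:
    # "every 1000th alloc" => events_between_samples = 1000;
    # sampling every single allocation => 1.
    return {
        "period_type": {"type": "alloc_objects", "unit": "count"},
        "period": events_between_samples,
    }

print(make_period_fields(1000)["period"])  # 1000
```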

```diff
 ]
-period_type = ValueType!("heap", "bytes")
+period_type = ValueType!("alloc_space", "bytes")
```
Member:

oh maybe the period_type should actually be ("alloc_objects", "count")? then maybe you could keep the period = 0x01?

src/PProf.jl (outdated)
```julia
@@ -163,7 +163,7 @@ function pprof(data::Union{Nothing, Vector{UInt}} = nothing,
        # End of sample
        value = [
            1,                   # events
            length(location_id), # stack_depth
            60000000             # CPU ns
```
Member:

what does this mean?

Collaborator (author):

Hm yeah, I think I found I needed to put some number here to get it to show up in DD, but this is just a bogus value. Not sure what it's supposed to be… Maybe it's supposed to correspond to the sample period? i.e. if you're taking a stack sample every 100ms, this is supposed to be 100ms (in ns)?

Member:

ah, yes, that's right. i think that is exactly what it's supposed to be. it's supposed to be the value matching the value type:

        ValueType!("cpu", "nanoseconds")

👍

🤔 i think you should be able to ask Profile what it's current sample rate is, so that this can be an accurate number.

```julia
help?> Profile.init()
  init(; n::Integer, delay::Real)

  Configure the delay between backtraces (measured in seconds), and the number n of instruction pointers that may be
  stored per thread. Each instruction pointer corresponds to a single line of code; backtraces generally consist of a
  long list of instruction pointers. Note that 6 spaces for instruction pointers per backtrace are used to store
  metadata and two NULL end markers. Current settings can be obtained by calling this function with no arguments, and
  each can be set independently using keywords or in the order (n, delay).

  │ Julia 1.8
  │
  │  As of Julia 1.8, this function allocates space for n instruction pointers per thread being profiled.
  │  Previously this was n total.

julia> Profile.init()
(10000000, 0.001)

julia> Profile.init()[2]
0.001
```
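Following that suggestion, the per-sample value could be derived from the configured delay instead of hardcoding it (a hedged Python sketch; in PProf.jl this would presumably read the delay via `Profile.init()[2]`):

```python
def per_sample_cpu_ns(delay_seconds: float) -> int:
    """Convert the profiler's sampling delay (in seconds) into the
    per-sample value matching ValueType("cpu", "nanoseconds")."""
    return int(round(delay_seconds * 1e9))

# With Julia's default delay of 0.001 s, each sample represents
# 1_000_000 ns rather than the bogus hardcoded 60000000.
print(per_sample_cpu_ns(0.001))  # 1000000
```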

Comment on lines -143 to +144:

```diff
 ValueType!("events", "count"), # Mandatory
-ValueType!("stack_depth", "count")
+ValueType!("cpu", "nanoseconds")
```
Member:

can we still emit stack-depth values, or does even their presence break the datadog viewer?

I'm fine to drop them though; i don't think they really added much, and they're just there because Valentin and i were trying to understand how the sample_types worked, i think

Collaborator (author):

I'm not sure if they break the DD viewer… I'm not sure why you'd need them, though, since each sample is already a full stack, so the depth information is there implicitly

@NHDaly (Member) left a comment:

Cool yeah, these mostly all seem fine 👍 couple questions tho

```diff
-full_name_with_args = _escape_name_for_pprof(String(take!(io)))
+call_str = String(take!(io))
+# add module name as well
+full_name_with_args = _escape_name_for_pprof("$(meth.module).$call_str")
```
Collaborator (author):

Adding the module name here should be the only user-visible change in this PR. It's pretty essential for getting frames to show up correctly in DD, since DD essentially does:

```julia
package, func = split(name, "."; limit=2)
```

e.g. for this one:

[screenshot from the Datadog viewer omitted]

Without the module name it would parse as

  • package: DatadogProfileUploader.profile_and_upload(DatadogProfileUploader
  • func: DDConfig, typeof(myfunc))

If people don't want the behavior to change though, I could put this behind an option.
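The split-on-first-dot behavior described above can be mimicked like this (hypothetical Python; the signature strings are illustrative, not actual Datadog code):

```python
def dd_parse(name: str) -> tuple[str, str]:
    """Mimic Datadog's frame-name parsing: split on the first '.'
    into (package, func)."""
    package, func = name.split(".", 1)
    return package, func

# Without the module prefix, the first '.' falls inside the argument
# list, so the name parses nonsensically:
print(dd_parse("profile_and_upload(DatadogProfileUploader.DDConfig, typeof(myfunc))"))

# With the module name prepended, package and function come out right:
print(dd_parse(
    "DatadogProfileUploader.profile_and_upload(DatadogProfileUploader.DDConfig, typeof(myfunc))"
))
```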
