feat: OTel context sharing by morrisonlevi · Pull Request #3970 · DataDog/dd-trace-php

morrisonlevi · 2026-06-10T00:59:43Z

Description

This adds OTel context sharing using libdatadog. It adds it to both tracing (producer) and profiling (consumer).

This is mostly AI-written with a little human review as I went. It is not ready for merge, but it did pass some local end-to-end experimentation.

Reviewer checklist

Test coverage seems ok.
Appropriate labels assigned.

datadog-datadog-prod-us1 · 2026-06-10T01:01:33Z

Tests

✨ Fix all issues with BitsAI

⚠️ Warnings

🚦 11 Pipeline jobs failed

DataDog/apm-reliability/dd-trace-php | ASAN test_c with multiple observers: [8.5]

DataDog/apm-reliability/dd-trace-php | ASAN test_c: [8.1, arm64]

DataDog/apm-reliability/dd-trace-php | check-big-regressions

View all 11 failed jobs.

ℹ️ Info

No other issues found (see more)

🧪 All tests passed
❄️ No new flaky tests detected

🔄 Datadog auto-retried 1 job - 1 passed on retry

🎯 Code Coverage (details)
• Patch Coverage: 0.00%
• Overall Coverage: 54.08% (-0.00%)

Useful? React with 👍 / 👎

_{This comment will be updated automatically if new data arrives.

🔗 Commit SHA: 99bb029 | Docs | Datadog PR Page | Give us feedback!}

pr-commenter · 2026-06-10T01:13:45Z

Benchmarks [ profiler ]

Benchmark execution time: 2026-06-24 21:06:32

Comparing candidate commit 99bb029 in PR branch levi/otel-thread-context with baseline commit 81891ec in branch master.

Found 0 performance improvements and 3 performance regressions! Performance is the same for 25 metrics, 8 unstable metrics.

Explanation

This is an A/B test comparing a candidate commit's performance against that of a baseline commit. Performance changes are noted in the tables below as:

🟩 = significantly better candidate vs. baseline
🟥 = significantly worse candidate vs. baseline

We compute a confidence interval (CI) over the relative difference of means between metrics from the candidate and baseline commits, considering the baseline as the reference.

If the CI is entirely outside the configured SIGNIFICANT_IMPACT_THRESHOLD (or the deprecated UNCONFIDENCE_THRESHOLD), the change is considered significant.

Feel free to reach out to #apm-benchmarking-platform on Slack if you have any questions.

More details about the CI and significant changes

You can imagine this CI as a range of values that is likely to contain the true difference of means between the candidate and baseline commits.

CIs of the difference of means are often centered around 0%, because often changes are not that big:

---------------------------------(------|---^--------)-------------------------------->
                              -0.6%    0%  0.3%     +1.2%
                                 |          |        |
         lower bound of the CI --'          |        |
sample mean (center of the CI) -------------'        |
         upper bound of the CI ----------------------'

As described above, a change is considered significant if the CI is entirely outside the configured SIGNIFICANT_IMPACT_THRESHOLD (or the deprecated UNCONFIDENCE_THRESHOLD).

For instance, for an execution time metric, this confidence interval indicates a significantly worse performance:

----------------------------------------|---------|---(---------^---------)---------->
                                       0%        1%  1.3%      2.2%      3.1%
                                                  |   |         |         |
       significant impact threshold --------------'   |         |         |
                      lower bound of CI --------------'         |         |
       sample mean (center of the CI) --------------------------'         |
                      upper bound of CI ----------------------------------'

scenario:php-profiler-timeline-memory-control

🟥 cpu_user_time [+30.797ms; +38.178ms] or [+5.143%; +6.376%]
🟥 execution_time [+35.679ms; +38.622ms] or [+5.701%; +6.171%]

scenario:php-profiler-timeline-memory-with-profiler

🟥 execution_time [+31.618ms; +55.656ms] or [+3.028%; +5.329%]

pr-commenter · 2026-06-10T02:16:21Z

Benchmarks [ tracer ]

Benchmark execution time: 2026-06-24 22:07:28

Comparing candidate commit 99bb029 in PR branch levi/otel-thread-context with baseline commit 81891ec in branch master.

Found 3 performance improvements and 8 performance regressions! Performance is the same for 183 metrics, 0 unstable metrics.

Explanation

This is an A/B test comparing a candidate commit's performance against that of a baseline commit. Performance changes are noted in the tables below as:

🟩 = significantly better candidate vs. baseline
🟥 = significantly worse candidate vs. baseline

We compute a confidence interval (CI) over the relative difference of means between metrics from the candidate and baseline commits, considering the baseline as the reference.

If the CI is entirely outside the configured SIGNIFICANT_IMPACT_THRESHOLD (or the deprecated UNCONFIDENCE_THRESHOLD), the change is considered significant.

Feel free to reach out to #apm-benchmarking-platform on Slack if you have any questions.

More details about the CI and significant changes

You can imagine this CI as a range of values that is likely to contain the true difference of means between the candidate and baseline commits.

CIs of the difference of means are often centered around 0%, because often changes are not that big:

---------------------------------(------|---^--------)-------------------------------->
                              -0.6%    0%  0.3%     +1.2%
                                 |          |        |
         lower bound of the CI --'          |        |
sample mean (center of the CI) -------------'        |
         upper bound of the CI ----------------------'

As described above, a change is considered significant if the CI is entirely outside the configured SIGNIFICANT_IMPACT_THRESHOLD (or the deprecated UNCONFIDENCE_THRESHOLD).

For instance, for an execution time metric, this confidence interval indicates a significantly worse performance:

----------------------------------------|---------|---(---------^---------)---------->
                                       0%        1%  1.3%      2.2%      3.1%
                                                  |   |         |         |
       significant impact threshold --------------'   |         |         |
                      lower bound of CI --------------'         |         |
       sample mean (center of the CI) --------------------------'         |
                      upper bound of CI ----------------------------------'

scenario:ContextPropagationBench/benchExtractHeaders64Bit

🟥 execution_time [+108.858ns; +197.142ns] or [+9.041%; +16.374%]

scenario:ContextPropagationBench/benchExtractHeaders64Bit-opcache

🟥 execution_time [+60.976ns; +199.024ns] or [+3.053%; +9.966%]

scenario:ContextPropagationBench/benchExtractTraceContext128Bit

🟥 execution_time [+120.378ns; +199.622ns] or [+6.618%; +10.974%]

scenario:ContextPropagationBench/benchExtractTraceContext128Bit-opcache

🟥 execution_time [+78.666ns; +167.334ns] or [+4.209%; +8.953%]

scenario:MessagePackSerializationBench/benchMessagePackSerialization

🟩 execution_time [-8.590µs; -7.670µs] or [-7.697%; -6.873%]

scenario:MessagePackSerializationBench/benchMessagePackSerialization-opcache

🟩 execution_time [-5.701µs; -4.099µs] or [-5.102%; -3.668%]

scenario:SpanBench/benchOpenTelemetryAPI

🟥 mem_peak [+4.487MB; +4.487MB] or [+9.412%; +9.412%]

scenario:SpanBench/benchOpenTelemetryAPI-opcache

🟥 mem_peak [+4.485MB; +4.485MB] or [+10.030%; +10.030%]

scenario:SpanBench/benchOpenTelemetryInteroperability

🟥 mem_peak [+643.602KB; +643.609KB] or [+2.233%; +2.233%]

scenario:SpanBench/benchOpenTelemetryInteroperability-opcache

🟥 mem_peak [+641.570KB; +641.582KB] or [+2.478%; +2.478%]

scenario:TraceSerializationBench/benchSerializeTrace-opcache

🟩 execution_time [-77.025µs; -46.975µs] or [-4.948%; -3.017%]

…ntext # Conflicts: # Cargo.lock # Makefile

…trace-php into levi/otel-thread-context

…ntext

pr-commenter · 2026-06-23T16:06:31Z

Benchmarks [ appsec ]

Benchmark execution time: 2026-06-24 21:31:00

Comparing candidate commit 99bb029 in PR branch levi/otel-thread-context with baseline commit 81891ec in branch master.

Found 0 performance improvements and 0 performance regressions! Performance is the same for 12 metrics, 0 unstable metrics.

Explanation

This is an A/B test comparing a candidate commit's performance against that of a baseline commit. Performance changes are noted in the tables below as:

🟩 = significantly better candidate vs. baseline
🟥 = significantly worse candidate vs. baseline

We compute a confidence interval (CI) over the relative difference of means between metrics from the candidate and baseline commits, considering the baseline as the reference.

If the CI is entirely outside the configured SIGNIFICANT_IMPACT_THRESHOLD (or the deprecated UNCONFIDENCE_THRESHOLD), the change is considered significant.

Feel free to reach out to #apm-benchmarking-platform on Slack if you have any questions.

More details about the CI and significant changes

You can imagine this CI as a range of values that is likely to contain the true difference of means between the candidate and baseline commits.

CIs of the difference of means are often centered around 0%, because often changes are not that big:

---------------------------------(------|---^--------)-------------------------------->
                              -0.6%    0%  0.3%     +1.2%
                                 |          |        |
         lower bound of the CI --'          |        |
sample mean (center of the CI) -------------'        |
         upper bound of the CI ----------------------'

As described above, a change is considered significant if the CI is entirely outside the configured SIGNIFICANT_IMPACT_THRESHOLD (or the deprecated UNCONFIDENCE_THRESHOLD).

For instance, for an execution time metric, this confidence interval indicates a significantly worse performance:

----------------------------------------|---------|---(---------^---------)---------->
                                       0%        1%  1.3%      2.2%      3.1%
                                                  |   |         |         |
       significant impact threshold --------------'   |         |         |
                      lower bound of CI --------------'         |         |
       sample mean (center of the CI) --------------------------'         |
                      upper bound of CI ----------------------------------'

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 259658d91b

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-24T13:02:51Z

    span->root = DDTRACE_G(active_stack)->root_span;

    ddtrace_set_global_span_properties(span);
+    ddtrace_update_otel_thread_context();


Refresh OTel context after trace ID changes

Updating the TLS record only on span open/close/switch leaves it stale when an existing root span's trace ID is replaced later, e.g. by DDTrace\consume_distributed_tracing_headers() or DDTrace\set_distributed_tracing_context(), which mutate root_span->trace_id and only call ddtrace_update_root_id_properties(). In requests that consume distributed headers after the autoroot span already exists and do not open another span, OTel readers keep seeing the originally generated trace ID until the next span transition.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-06-24T13:02:51Z

        listener->finish_user_req(listener, &span->std);
    }
+#ifdef __linux__
+    ddtrace_clear_otel_thread_context_root_span();


Only clear the matching user-request override

This clears the global OTel root override regardless of which user-request span is finishing. If a second notify_start() begins before the first span is closed, the AppSec listener finishes/replaces the previous request but the first span still has notify_user_req_end; closing that older span later calls this path and detaches the override for the newer active user request, so subsequent OTel/profiling context falls back to the outer stack.

Useful? React with 👍 / 👎.

bwoebi

I think the approach proposed by this PR is flawed / more expensive than needed.

The PR tries to synchronize state between otel context and root span.
The otel context is, at it's core, a pointer to some struct.

All we need is making the otel context part of the root_span_data struct.
Any updates to the trace_id / the roots span id -> update the local root span id / trace id part of the context, any updates to active -> update the span_id of the context.
And if there's no root span data, we just write NULL to the TLS variable.

I'm very not fond of "synchronizing" with otel context, any updates to the otel context should in my opinion simply happen in-place. There's also no reason for a "root span override".
It feels also way heavier than it should be. Updating a span id due to open/close/drop should be one 64 bit write. That's it.
Switching the span stack should cause one pointer write of the span stacks root span otel context to the TLS pointer. That's it, too.

Further, this PR at the very least forgot to handle ddtrace_update_root_id_properties().

bwoebi · 2026-06-24T14:04:02Z

I also don't understand why this functionality depends on libdatadog. The code should be IMHO simpler and less LoC if we just manage this manually.

feat: otel context sharing

9d6028e

morrisonlevi added ☠️ do-not-merge/WIP profiling Relates to the Continuous Profiler tracing labels Jun 10, 2026

morrisonlevi added 2 commits June 15, 2026 11:58

Merge remote-tracking branch 'origin/master' into levi/otel-thread-co…

96f4f78

…ntext # Conflicts: # Cargo.lock # Makefile

Read OTEL thread context directly in profiler

a449010

cataphract mentioned this pull request Jun 16, 2026

refactor(otel-thread-ctx): inline thread-local resolution DataDog/libdatadog#2129

Merged

cataphract and others added 7 commits June 19, 2026 10:18

update libdatadog

bc311c3

Also publish otel context

f891d5f

demo of otel thread context reader

79ff015

Support otel thread local context

d9708a1

Merge branch 'glopes/otel-thr-ctx-metadata' of github.com:DataDog/dd-…

0037ab7

…trace-php into levi/otel-thread-context

Add tests for otel ctx/thread local

d86b4a3

Merge remote-tracking branch 'origin/master' into levi/otel-thread-co…

21f8b27

…ntext

cataphract added 3 commits June 23, 2026 17:34

build fixes

877680d

misc fixes/improvements

a75c8cc

Improve logging during appsec tel int tests

259658d

cataphract marked this pull request as ready for review June 24, 2026 12:56

cataphract requested review from a team as code owners June 24, 2026 12:56

chatgpt-codex-connector Bot reviewed Jun 24, 2026

View reviewed changes

bwoebi requested changes Jun 24, 2026

View reviewed changes

Merge branch 'master' into levi/otel-thread-context

9af82d8

refactor based on discussion

1a4dfbd

morrisonlevi requested a review from a team as a code owner June 24, 2026 18:27

morrisonlevi requested review from leoromanovsky and typotter and removed request for a team June 24, 2026 18:27

refactor: implement discussion from slack

99bb029

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: OTel context sharing#3970

feat: OTel context sharing#3970
morrisonlevi wants to merge 16 commits into
masterfrom
levi/otel-thread-context

morrisonlevi commented Jun 10, 2026 •

edited

Loading

Uh oh!

datadog-datadog-prod-us1 Bot commented Jun 10, 2026 •

edited by datadog-official Bot

Loading

Uh oh!

pr-commenter Bot commented Jun 10, 2026 •

edited

Loading

Explanation

More details about the CI and significant changes

Uh oh!

pr-commenter Bot commented Jun 10, 2026 •

edited

Loading

Explanation

More details about the CI and significant changes

Uh oh!

pr-commenter Bot commented Jun 23, 2026 •

edited

Loading

Explanation

More details about the CI and significant changes

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jun 24, 2026

Uh oh!

chatgpt-codex-connector Bot Jun 24, 2026

Uh oh!

bwoebi left a comment

Uh oh!

bwoebi commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

morrisonlevi commented Jun 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Reviewer checklist

Uh oh!

datadog-datadog-prod-us1 Bot commented Jun 10, 2026 • edited by datadog-official Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ Warnings

ℹ️ Info

Uh oh!

pr-commenter Bot commented Jun 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks [ profiler ]

Explanation

More details about the CI and significant changes

scenario:php-profiler-timeline-memory-control

scenario:php-profiler-timeline-memory-with-profiler

Uh oh!

pr-commenter Bot commented Jun 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks [ tracer ]

Explanation

More details about the CI and significant changes

scenario:ContextPropagationBench/benchExtractHeaders64Bit

scenario:ContextPropagationBench/benchExtractHeaders64Bit-opcache

scenario:ContextPropagationBench/benchExtractTraceContext128Bit

scenario:ContextPropagationBench/benchExtractTraceContext128Bit-opcache

scenario:MessagePackSerializationBench/benchMessagePackSerialization

scenario:MessagePackSerializationBench/benchMessagePackSerialization-opcache

scenario:SpanBench/benchOpenTelemetryAPI

scenario:SpanBench/benchOpenTelemetryAPI-opcache

scenario:SpanBench/benchOpenTelemetryInteroperability

scenario:SpanBench/benchOpenTelemetryInteroperability-opcache

scenario:TraceSerializationBench/benchSerializeTrace-opcache

Uh oh!

pr-commenter Bot commented Jun 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks [ appsec ]

Explanation

More details about the CI and significant changes

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

bwoebi left a comment

Choose a reason for hiding this comment

Uh oh!

bwoebi commented Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

morrisonlevi commented Jun 10, 2026 •

edited

Loading

datadog-datadog-prod-us1 Bot commented Jun 10, 2026 •

edited by datadog-official Bot

Loading

pr-commenter Bot commented Jun 10, 2026 •

edited

Loading

pr-commenter Bot commented Jun 10, 2026 •

edited

Loading

pr-commenter Bot commented Jun 23, 2026 •

edited

Loading