exec-server: log structured timing events by bolinfest · Pull Request #30901 · openai/codex

bolinfest · 2026-07-02T05:41:52Z

Why

An exec-server request and the process it starts have different lifetimes. A request-level measurement shows JSON-RPC handling latency, while a process-level measurement captures the actual execution lifetime and may continue after the initiating request has completed. Structured INFO logs let deployments consume both measurements through their existing logging pipeline.

This PR is stacked on #30334, which adds the corresponding core tool and inference events.

What changed

Added a codex.exec_server_request event when request handling completes.
Added a codex.exec_server_process event when the underlying process reaches a terminal state.
Reused the existing exec-server telemetry lifecycle so the log events and metrics observe the same boundaries.
Captured event-only identifiers only when the codex_exec_server::telemetry INFO target is enabled.
Added focused tests for event fields and disabled-target allocation behavior.

Suggested logging filter

RUST_LOG='warn,codex_exec_server::telemetry=info' codex-exec-server

Event examples

Identifier and timing values below are illustrative.

`codex.exec_server_request`

{
  "timestamp": "2026-06-27T03:45:20.447Z",
  "level": "INFO",
  "fields": {
    "message": "exec-server request completed",
    "event.name": "codex.exec_server_request",
    "request_id": "request-42",
    "method": "process/start",
    "result": "success",
    "duration_ms": 18,
    "traceparent": "00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01"
  },
  "target": "codex_exec_server::telemetry"
}

`codex.exec_server_process`

{
  "timestamp": "2026-06-27T03:45:20.861Z",
  "level": "INFO",
  "fields": {
    "message": "exec-server process completed",
    "event.name": "codex.exec_server_process",
    "process_id": "process-018f27",
    "trace_id": "4bf92f3577b34da6a3ce929d0e0e4736",
    "result": "success",
    "duration_ms": 414
  },
  "target": "codex_exec_server::telemetry"
}

Test plan

just test -p codex-exec-server exec_server_timing_events_are_structured_info_logs
just test -p codex-exec-server disabled_info_events_do_not_capture_process_log_values

Stack created with Sapling. Best reviewed with ReviewStack.

## Why App-server deployments can consume structured JSON logs for operational measurements without requiring an OTEL exporter. Existing tool-result telemetry reports the handler outcome, but it does not separate time spent waiting to dispatch from time spent executing the handler. A compact completion event for the outer, direct tool call lets consumers measure those phases and correlate them with a conversation and turn. Code-mode calls are intentionally excluded so nested runtime calls do not create overlapping events that are easy to double-count. ## What changed - Added a [`ToolCallTimingGuard`](https://github.com/openai/codex/blob/141110a73cff041c42fd51f47e363e83df1dde51/codex-rs/core/src/tools/parallel.rs#L32) around direct tool calls. Event-only strings and timing state are captured only when the `codex_core::tools::parallel` `INFO` target is enabled. - Added a [`codex.tool_call`](https://github.com/openai/codex/blob/141110a73cff041c42fd51f47e363e83df1dde51/codex-rs/core/src/tools/parallel.rs#L313) completion event with conversation, turn, tool, call, trace, dispatch, handler, and total timing fields. - Recorded the execution-start marker after the dispatch lock is acquired. Event emission snapshots that marker once so a concurrently starting dispatch cannot produce internally inconsistent fields. - Limited the event to `ToolCallSource::Direct`; [unit coverage](https://github.com/openai/codex/blob/141110a73cff041c42fd51f47e363e83df1dde51/codex-rs/core/src/tools/parallel.rs#L365) verifies code-mode calls are ignored. - Added [cancellation coverage](https://github.com/openai/codex/blob/141110a73cff041c42fd51f47e363e83df1dde51/codex-rs/core/src/tools/parallel.rs#L408) that holds the execution gate and verifies a call cancelled before admission emits exactly one dispatch-only timing event. - Added reusable [`JsonLogCapture`](https://github.com/openai/codex/blob/141110a73cff041c42fd51f47e363e83df1dde51/codex-rs/app-server/tests/common/json_logging.rs#L15) support, a [JSON-logging-specific `TestAppServer` constructor](https://github.com/openai/codex/blob/141110a73cff041c42fd51f47e363e83df1dde51/codex-rs/app-server/tests/common/test_app_server.rs#L172), and an [end-to-end app-server test](https://github.com/openai/codex/blob/141110a73cff041c42fd51f47e363e83df1dde51/codex-rs/app-server/tests/suite/logging.rs#L52) that drives a direct `exec_command` through the public v2 JSON-RPC API and validates the emitted JSON event. Exec-server-specific request and process timing remains in the stacked PR #30901. ## Suggested logging filter ```bash LOG_FORMAT=json \ RUST_LOG='warn,codex_core::tools::parallel=info' \ codex app-server ``` ## Event example Identifier and timing values are illustrative. ### `codex.tool_call` ```json { "timestamp": "2026-06-27T03:45:20.443Z", "level": "INFO", "fields": { "message": "tool call completed", "event.name": "codex.tool_call", "trace_id": "4bf92f3577b34da6a3ce929d0e0e4736", "conversation.id": "67e55044-10b1-426f-9247-bb680e5fe0c8", "turn_id": "019f04f8-6ac2-78f1-8625-f04a6d35af18", "tool_name": "exec_command", "call_id": "call_7b8483", "tool_source": "direct", "execution_started": true, "dispatch_duration_ms": 12, "handler_duration_ms": 431, "total_duration_ms": 443 }, "target": "codex_core::tools::parallel" } ``` If execution never starts, `execution_started` is `false`, `handler_duration_ms` is `0`, and `dispatch_duration_ms` covers the full observed lifetime. If a duration cannot be represented as an unsigned 64-bit millisecond value, all three duration fields are omitted rather than populated with a sentinel that could corrupt downstream calculations. ## Test plan - `just test -p codex-core tool_call_timing_guard_ignores_code_mode_source` - `just test -p codex-core cancellation_before_dispatch_admission_logs_dispatch_only_timing` - `just test -p codex-app-server app_server_emits_structured_tool_call_timing_event` --- [//]: # (BEGIN SAPLING FOOTER) Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/openai/codex/pull/30334). * __->__ #30334

bolinfest requested a review from a team as a code owner July 2, 2026 05:41

bolinfest changed the base branch from main to pr30334 July 2, 2026 05:41

bolinfest mentioned this pull request Jul 2, 2026

telemetry: log structured direct tool-call timing #30334

Merged

bolinfest added 2 commits July 1, 2026 22:56

telemetry: log structured tool and inference timing events

c899466

exec-server: log structured timing events

8b6f707

bolinfest force-pushed the pr30334 branch from 39af213 to c899466 Compare July 2, 2026 05:58

bolinfest force-pushed the pr30901 branch from 441a453 to 8b6f707 Compare July 2, 2026 05:58

bolinfest force-pushed the pr30334 branch 8 times, most recently from 38d7926 to 141110a Compare July 2, 2026 20:22

Base automatically changed from pr30334 to main July 2, 2026 20:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

exec-server: log structured timing events#30901

exec-server: log structured timing events#30901
bolinfest wants to merge 2 commits into
mainfrom
pr30901

bolinfest commented Jul 2, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

bolinfest commented Jul 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why

What changed

Suggested logging filter

Event examples

codex.exec_server_request

codex.exec_server_process

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

bolinfest commented Jul 2, 2026 •

edited

Loading

`codex.exec_server_request`

`codex.exec_server_process`