Bugfix for Declarative Workflow by peibekwe · Pull Request #6530 · microsoft/agent-framework

peibekwe · 2026-06-15T23:54:16Z

Description & Review Guide

Bugfix for declarative workflow InvokeFunctionTool and InvokeMcpTool

Contribution Checklist

The code builds clean without any errors or warnings
All unit tests pass, and I have added new tests where possible
The PR follows the Contribution Guidelines
This PR is linked to an issue and there is no other open PR for this issue (see Related Issue above).
This is not a breaking change. If it is a breaking change, add the breaking change label (or add "[BREAKING]" to the title prefix, before or after any language prefix) — a workflow keeps the label and title prefix in sync automatically.

github-actions

Automated Code Review

Reviewers: 5 | Confidence: 86%

✓ Correctness

No actionable issues found in this dimension.

✓ Security Reliability

No actionable issues found in this dimension.

✓ Test Coverage

The PR adds comprehensive tests for the new per-invocation request ID mechanism (concurrent approvals, checkpoint/restore, legacy migration, duplicate delivery, etc.) for both InvokeFunctionToolExecutor and InvokeMcpToolExecutor. However, several pre-existing MCP approval tests now silently exercise the error/rejection path instead of the success path because they construct approval responses with action.Id but never call HandleAsync to register a snapshot. Additionally, there is no test verifying _pendingNonApprovalCallIds survive checkpoint/restore cycles. The MCP executor test suite adds comprehensive coverage for the new per-invocation request ID scheme, including unique IDs, concurrent approvals, checkpoint/reset/restore cycles, and legacy migration. However, it is missing two test scenarios that the parallel InvokeFunctionToolExecutorTest covers: duplicate approval delivery idempotency and stale+valid approvals in a single response message. These are important edge cases for a concurrent system.

✓ Failure Modes

No actionable issues found in this dimension.

✓ Design Approach

The new per-invocation snapshot design closes the cross-request mix-up cases, but it still leaves one checkpoint/replay hole: once an approval response is consumed, the executor only removes that snapshot from memory. If the process resets before the next checkpoint, restore will reload the stale snapshot and republish the old pending request, so the same approval can drive the MCP tool a second time.

Suggestions

Add an MCP-equivalent of InvokeFunctionToolDuplicateApprovalDeliveryInvokesFunctionOnceAsync — delivering the same approval response twice should invoke the MCP tool exactly once and surface the not-approved error on the second delivery.
Add an MCP-equivalent of InvokeFunctionToolAprovalMatchPrefersPendingSnapshotAsync — when a response contains multiple ToolApprovalResponseContent items (a stale/unrelated approval plus the valid one), the executor should select the one matching a pending snapshot. The MCP CaptureResponseAsync already handles this via FirstOrDefault with a predicate, but an explicit test would guard against regressions.

Automated review by peibekwe's agents

Copilot

Pull request overview

This PR fixes correctness gaps in declarative workflow tool invocation by introducing per-invocation identifiers and storing approval-time snapshots per request, ensuring approvals resume with the exact parameters the user reviewed even under concurrent/interleaved execution and across checkpoint/restore cycles.

Changes:

Switch InvokeMcpToolExecutor and InvokeFunctionToolExecutor from a single approval snapshot to per-invocation snapshot maps keyed by unique request ids, including legacy checkpoint migration.
Add tracking for non-approval InvokeFunctionTool call ids so FunctionResultContent can be matched to the originating invocation rather than this.Id.
Expand unit tests to cover: unique request ids, concurrent pending approvals, missing snapshot behavior, legacy checkpoint migration, and checkpoint/reset/restore survivability.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
dotnet/src/Microsoft.Agents.AI.Workflows.Declarative/ObjectModel/InvokeMcpToolExecutor.cs	Implements per-invocation approval snapshots + checkpoint persistence/migration for MCP tool approvals.
dotnet/src/Microsoft.Agents.AI.Workflows.Declarative/ObjectModel/InvokeFunctionToolExecutor.cs	Implements per-invocation call ids + approval snapshots + persisted pending state for function-tool invocations.
dotnet/src/Microsoft.Agents.AI.Workflows.Declarative/Interpreter/DeclarativeActionExecutor.cs	Makes `ResetAsync()` virtual so action executors can actually override/reset per-run state.
dotnet/tests/Microsoft.Agents.AI.Workflows.Declarative.UnitTests/ObjectModel/InvokeMcpToolExecutorTest.cs	Updates and adds tests for per-invocation MCP approval ids, migration, and checkpoint/reset/restore behavior.
dotnet/tests/Microsoft.Agents.AI.Workflows.Declarative.UnitTests/ObjectModel/InvokeFunctionToolExecutorTest.cs	Adds extensive coverage for per-invocation call ids, concurrent pending approvals, migration, and replay/reset scenarios.

github-actions

Automated Code Review

Reviewers: 5 | Confidence: 83%

✓ Correctness

No actionable issues found in this dimension.

✓ Security Reliability

The PR correctly migrates from a single approval snapshot to a per-invocation concurrent dictionary, fixing a significant limitation where a single executor instance could only track one pending approval. The security model is maintained: approval snapshots are consumed atomically via TryRemove preventing replay, per-invocation GUIDs prevent ID prediction, and legacy migration is handled correctly. No high-severity security or reliability issues found. One medium-severity observation about silent error swallowing in InvokeFunctionToolExecutor when a response contains neither a matching FunctionResultContent nor any ToolApprovalResponseContent.

✓ Test Coverage

The MCP test coverage is thorough for the new per-invocation approval snapshot design, covering unique IDs, concurrent pending approvals, missing snapshots, legacy migration, and checkpoint/reset/restore cycles. However, there is an asymmetric coverage gap: InvokeFunctionToolExecutorTest includes InvokeFunctionToolDuplicateApprovalDeliveryInvokesFunctionOnceAsync (verifying idempotency when the same approval is delivered twice), but InvokeMcpToolExecutorTest has no equivalent test for the same atomic TryRemove-based idempotency contract.

✓ Failure Modes

No actionable issues found in this dimension.

✗ Design Approach

Both tool executors now key pending work by per-invocation IDs, but they still only clear those IDs from in-memory dictionaries on the response path. Because checkpointing persists the dictionaries and restore reloads them, a crash after approval/result handling but before the next checkpoint can resurrect an already-consumed invocation and let the same response be processed again after restore. The new MCP tests cover more of the approval-snapshot behavior, but one important failure mode is still untested: successful approval removes the pending snapshot only in memory, while checkpoint/restore persists _approvalSnapshots separately. I also noticed one smaller coverage gap where the unique-request-id test claims to validate both wrapper and inner MCP content ids but only checks the wrapper.

Flagged Issues

Both InvokeFunctionToolExecutor (CaptureResponseAsync at line 208) and InvokeMcpToolExecutor (CaptureResponseAsync at line 167) remove consumed pending IDs/approval snapshots only from in-memory dictionaries, while OnCheckpointingAsync/OnCheckpointRestoredAsync persist and reload those maps. A crash after approval handling but before the next checkpoint will resurrect an already-consumed invocation and allow the same response to be replayed. A durable state update is needed on the response path, not just an in-memory removal.
The test at InvokeMcpToolExecutorTest.cs:1426-1428 only asserts the live in-memory dictionary is empty after approval. Since CaptureResponseAsync never persists the updated map back to workflow state while checkpoint/restore serializes and reloads it, this test mises the crash-before-checkpoint replay path. Please assert the persisted stateStore["_approvalSnapshots"] is cleared/updated after CaptureResponseAsync.

Suggestions

The unique-request-id test at InvokeMcpToolExecutorTest.cs:1126-1134 claims to validate uniqueness on both McpServerToolCallContent and the wrapping ToolApprovalRequestContent, but only checks the wrapper request ids. The analogous function-tool test (InvokeFunctionToolExecutorTest.cs:476-513) also verifies the inner call id. Adding the MCP inner-content assertion would make the test match its stated contract and serve as a more complete regression guard.

Automated review by peibekwe's agents

github-actions · 2026-06-16T01:05:26Z

Flagged issue

Both InvokeFunctionToolExecutor (CaptureResponseAsync at line 208) and InvokeMcpToolExecutor (CaptureResponseAsync at line 167) remove consumed pending IDs/approval snapshots only from in-memory dictionaries, while OnCheckpointingAsync/OnCheckpointRestoredAsync persist and reload those maps. A crash after approval handling but before the next checkpoint will resurrect an already-consumed invocation and allow the same response to be replayed. A durable state update is needed on the response path, not just an in-memory removal.

Source: automated DevFlow PR review

github-actions · 2026-06-16T01:05:26Z

Flagged issue

The test at InvokeMcpToolExecutorTest.cs:1426-1428 only asserts the live in-memory dictionary is empty after approval. Since CaptureResponseAsync never persists the updated map back to workflow state while checkpoint/restore serializes and reloads it, this test mises the crash-before-checkpoint replay path. Please assert the persisted stateStore["_approvalSnapshots"] is cleared/updated after CaptureResponseAsync.

Source: automated DevFlow PR review

peibekwe added 2 commits June 15, 2026 16:43

Declarative workflow bugfix

61378ee

Merge branch 'main' into peibekwe/declarative-bugfix

48f137c

peibekwe self-assigned this Jun 15, 2026

Copilot AI review requested due to automatic review settings June 15, 2026 23:54

moonbox3 added .NET workflows Related to Workflows in agent-framework labels Jun 15, 2026

peibekwe temporarily deployed to integration June 15, 2026 23:54 — with GitHub Actions Inactive

peibekwe changed the title ~~Peibekwe/declarative bugfix~~ Bugfixes for Declarative Workflow Jun 15, 2026

github-actions Bot changed the title ~~Bugfixes for Declarative Workflow~~ .NET: Peibekwe/declarative bugfix Jun 15, 2026

Copilot started reviewing on behalf of peibekwe June 15, 2026 23:55 View session

peibekwe changed the title ~~.NET: Peibekwe/declarative bugfix~~ Bugfix for Declarative Workflow Jun 15, 2026

github-actions Bot reviewed Jun 16, 2026

View reviewed changes

Comment thread ...Microsoft.Agents.AI.Workflows.Declarative.UnitTests/ObjectModel/InvokeMcpToolExecutorTest.cs

Copilot AI reviewed Jun 16, 2026

View reviewed changes

Fix PR comments

2c6c170

peibekwe temporarily deployed to integration June 16, 2026 00:42 — with GitHub Actions Inactive

peibekwe marked this pull request as ready for review June 16, 2026 00:56

github-actions Bot reviewed Jun 16, 2026

View reviewed changes

SergeyMenshykh approved these changes Jun 16, 2026

View reviewed changes

rogerbarreto approved these changes Jun 16, 2026

View reviewed changes

alliscode approved these changes Jun 16, 2026

View reviewed changes

peibekwe added this pull request to the merge queue Jun 16, 2026

Any commits made after this event will not be merged.

github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Jun 16, 2026

peibekwe added this pull request to the merge queue Jun 16, 2026

Any commits made after this event will not be merged.

github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Jun 16, 2026

peibekwe added this pull request to the merge queue Jun 16, 2026

Any commits made after this event will not be merged.

github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Jun 16, 2026

peibekwe added this pull request to the merge queue Jun 16, 2026

Any commits made after this event will not be merged.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bugfix for Declarative Workflow#6530

Bugfix for Declarative Workflow#6530
peibekwe wants to merge 3 commits into
mainfrom
peibekwe/declarative-bugfix

peibekwe commented Jun 15, 2026

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Uh oh!

github-actions Bot commented Jun 16, 2026

Uh oh!

github-actions Bot commented Jun 16, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

peibekwe commented Jun 15, 2026

Description & Review Guide

Contribution Checklist

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Automated Code Review

✓ Correctness

✓ Security Reliability

✓ Test Coverage

✓ Failure Modes

✓ Design Approach

Suggestions

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Automated Code Review

✓ Correctness

✓ Security Reliability

✓ Test Coverage

✓ Failure Modes

✗ Design Approach

Flagged Issues

Suggestions

Uh oh!

github-actions Bot commented Jun 16, 2026

Uh oh!

github-actions Bot commented Jun 16, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants