pyro-mcp/docs/use-cases/review-eval-workflows.md
Thales Maciel dc86d84e96 Add workspace review summaries
Add workspace summary across the CLI, SDK, and MCP, and include it in the workspace-core profile so chat hosts can review one concise view of the current session.

Persist lightweight review events for syncs, file edits, patch applies, exports, service lifecycle, and snapshot activity, then synthesize them with command history, current services, snapshot state, and current diff data since the last reset.

Update the walkthroughs, use-case docs, public contract, changelog, and roadmap for 4.3.0, and make dist-check invoke the CLI module directly so local package reinstall quirks do not break the packaging gate.

Validation: uv lock; ./.venv/bin/pytest --no-cov tests/test_vm_manager.py tests/test_cli.py tests/test_api.py tests/test_server.py tests/test_public_contract.py tests/test_workspace_use_case_smokes.py; UV_OFFLINE=1 UV_CACHE_DIR=.uv-cache make check; UV_OFFLINE=1 UV_CACHE_DIR=.uv-cache make dist-check; real guest-backed workspace create -> patch apply -> workspace summary --json -> delete smoke.
2026-03-13 19:21:11 -03:00

872 B

Review And Evaluation Workflows

Recommended profile: workspace-full

Smoke target:

make smoke-review-eval

Use this flow when an agent needs to read a checklist interactively, run an evaluation script, checkpoint or reset its changes, and export the final report.

Chat-host recipe:

  1. Create a named snapshot before the review starts.
  2. Open a readable PTY shell and inspect the checklist interactively.
  3. Run the review or evaluation script in the same workspace.
  4. Capture workspace summary to review what changed and what to export.
  5. Export the final report.
  6. Reset back to the snapshot if the review branch goes sideways.
  7. Delete the workspace when the evaluation is done.

This is the stable shell-facing story: readable PTY output for chat loops, checkpointed evaluation, explicit export, and reset when a review branch goes sideways.