977 B
AgentRun Manual Test Plan
These tests are placeholders until the CLI and services exist. They define how future manual validation should be written.
T1 Minimal Run Lifecycle
Read AGENTS.md, then use the AgentRun CLI to manually test creating a run, starting a runner for that run, polling events, and observing terminal status. Verify every CLI command returns JSON, includes ids and follow-up commands, and never waits for a full model turn in one request.
T2 Command And Event Polling
Read AGENTS.md, then create a run command and poll command status plus run events. Verify command state is visible, event pagination uses afterSeq, and repeated polling does not duplicate events.
T3 Logs And Failure Visibility
Read AGENTS.md, then start the local service or runner with an intentionally invalid backend profile. Verify the CLI returns a structured failure, the log path is printed, and the log file contains enough detail to diagnose the failure.