What challenges do AI agents face?

You built a chatbot that can call tools. Now product asks it to autonomously plan multi-step tasks (book travel, update tickets, triage incidents).

Suddenly:

  • The agent loops forever on ambiguous goals
  • External APIs get spammed by repeated calls
  • One bad plan causes high cost and data leaks

Discussion: How do you make an autonomous agent that is useful, safe, and predictable under load?

What agent types should you use and when?

Agent Type    | Behavior                              | Use Case
Reactive      | Single-step decisions, no planning    | Chat replies, simple tool calls
Deliberative  | Plan, then execute multiple steps     | Complex workflows, multi-call orchestration
Hybrid        | Mix of planning and reactive fallback | Most production agents

Rule: Start with reactive or guided agents. Add autonomy only when you can observe and control every action.

How does the agent's execution loop work?

An agent is an execution loop: observe → plan → act → observe.

sequenceDiagram
    participant User
    participant Agent
    participant Planner
    participant Executor
    participant Tool
    
    User->>Agent: Goal
    Agent->>Planner: Create plan
    Planner-->>Agent: Plan steps
    Agent->>Executor: Execute step 1
    Executor->>Tool: Call external API
    Tool-->>Executor: Result
    Executor-->>Agent: Step result
    Agent->>Planner: Feedback for replanning
    Agent-->>User: Final result

Key decisions: step granularity, synchronous vs asynchronous execution, and whether to checkpoint state after each step.
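
The loop above can be sketched in a few lines of Python. This is a minimal illustration, not a specific framework: the `next_step` and `execute` interfaces, the stub classes, and the step cap of 10 are all assumptions made for the example.

```python
def run_agent(goal, planner, executor, max_steps=10):
    """Observe -> plan -> act -> observe, bounded by a hard step cap."""
    observations = []
    for _ in range(max_steps):
        step = planner.next_step(goal, observations)  # replan from the latest observations
        if step is None:  # planner signals the goal is met
            return {"status": "done", "observations": observations}
        result = executor.execute(step)
        observations.append((step, result))
    return {"status": "aborted", "reason": "step budget exceeded"}


# Tiny stubs to make the loop shape concrete (hypothetical, for illustration only)
class CountdownPlanner:
    def next_step(self, goal, observations):
        return "work" if len(observations) < 3 else None

class EchoExecutor:
    def execute(self, step):
        return f"did {step}"
```

Note that the planner is consulted before every step, which is what makes mid-run replanning (the feedback edge in the diagram) possible, and the step cap guarantees the loop terminates even when the planner never signals completion.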

3. Planner vs executor separation

Keep planning and execution separate.

flowchart TD
    A[Input Goal] --> B[Planner]
    B --> C[Plan Store]
    C --> D[Executor]
    D --> E[Tool Calls]
    E --> F[Results Store]
    F --> G[Replanner]

Why:

  • Observability: trace planning decisions separately
  • Safety: validate plans before execution
  • Retry and resume: checkpoint plans so restarts are bounded
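
One way to realize the plan store in the middle of this flow is a small persistence layer with an explicit cursor, so the executor only ever asks "what's next" and acknowledges completed steps. A minimal in-memory sketch (the class and method names are illustrative, not a real library):

```python
import uuid

class PlanStore:
    """Sits between planner and executor so runs are auditable and resumable."""
    def __init__(self):
        self._plans = {}

    def save(self, steps):
        plan_id = str(uuid.uuid4())
        self._plans[plan_id] = {"steps": list(steps), "cursor": 0}
        return plan_id

    def next_step(self, plan_id):
        plan = self._plans[plan_id]
        if plan["cursor"] >= len(plan["steps"]):
            return None  # plan complete
        return plan["steps"][plan["cursor"]]

    def checkpoint(self, plan_id):
        self._plans[plan_id]["cursor"] += 1  # advance only after a step succeeds
```

Because the plan lives outside the executor, a restart can reload the same plan_id and resume at the cursor rather than replanning from scratch.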

4. Tool interface design (define contracts)

Treat each external tool like a strict API you control.

Tool contract fields:

  • name
  • arguments schema
  • idempotency key requirement
  • required auth scope
  • cost estimate per call

Example tool spec (conceptual):

tool: sendEmail
args: { to: string, subject: string, body: string }
idempotency_required: true
auth_scope: email_send
estimated_cost: small

Enforce contracts with runtime validation and dry-run mode from the planner.
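
A runtime-validation sketch of the sendEmail contract above, in Python. The `ToolContract` dataclass and its field names are assumptions for illustration; in practice you would likely use a schema library (e.g. JSON Schema or Pydantic) rather than hand-rolled type checks.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ToolContract:
    name: str
    arg_schema: dict  # field name -> expected Python type
    requires_idempotency: bool
    auth_scope: str
    estimated_cost: str

    def validate(self, args, idempotency_key=None):
        """Reject malformed or unsafe calls before they reach the tool."""
        errors = []
        for field_name, expected in self.arg_schema.items():
            if field_name not in args:
                errors.append(f"missing arg: {field_name}")
            elif not isinstance(args[field_name], expected):
                errors.append(f"bad type for {field_name}")
        if self.requires_idempotency and idempotency_key is None:
            errors.append("idempotency key required")
        return errors

send_email = ToolContract(
    name="sendEmail",
    arg_schema={"to": str, "subject": str, "body": str},
    requires_idempotency=True,
    auth_scope="email_send",
    estimated_cost="small",
)
```

A dry-run mode falls out naturally: the planner can call `validate` on every step of a candidate plan without executing anything.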

5. Safety patterns for agents

Dry run / plan approval

Always present a plan summary for human approval on high-risk tasks.

Idempotency keys

Require idempotency for any effectful tool call.

Rate limits and quotas

Enforce per-agent and per-user quotas to stop runaway costs.

Policy engine

Pre-validate plans against guardrails (data exfiltration, PII, forbidden actions).

Sandboxing tools

Run dangerous tools in limited environments and require extra verification.
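
To make the rate-limit pattern concrete, here is a sliding-window quota guard per agent. This is a minimal in-process sketch (class name and window logic are illustrative); production systems typically enforce quotas in a shared store such as Redis so they hold across processes.

```python
import time
from collections import defaultdict

class QuotaGuard:
    """Per-agent call quotas to stop runaway loops from spamming tools."""
    def __init__(self, max_calls, window_seconds):
        self.max_calls = max_calls
        self.window = window_seconds
        self.calls = defaultdict(list)

    def allow(self, agent_id, now=None):
        now = time.monotonic() if now is None else now
        # drop timestamps that fell out of the sliding window
        recent = [t for t in self.calls[agent_id] if now - t < self.window]
        self.calls[agent_id] = recent
        if len(recent) >= self.max_calls:
            return False  # quota exhausted: deny the tool call
        self.calls[agent_id].append(now)
        return True
```

The executor checks `allow` before every tool call and aborts the plan (with an explainable reason) when it returns False.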

6. State, checkpointing, and resume

Design state so agents can resume after failure without redoing external effects.

Patterns:

  • Checkpoint after each step: persist step index, intermediate artifacts, and partial outputs
  • Two-phase commit for multi-step transactions: prepare → commit (use sparingly and only when all tools support rollback)
  • Compensating actions: for steps that cannot be rolled back, register compensating tasks that undo their effects

flowchart TD
    A[Plan] --> B[Step 1]
    B --> C[Checkpoint 1]
    C --> D[Step 2]
    D --> E[Checkpoint 2]
    E --> F[Complete]
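
The checkpoint-after-each-step pattern in the diagram can be sketched as a resumable runner that persists a step cursor and intermediate artifacts to disk. The function and file format here are assumptions for illustration; the point is that a restart skips already-completed steps instead of re-firing their external effects.

```python
import json
import os

def run_with_checkpoints(steps, execute, state_path):
    """Resume after a crash without redoing external effects that already ran."""
    state = {"done": 0, "artifacts": []}
    if os.path.exists(state_path):
        with open(state_path) as f:
            state = json.load(f)  # resume from the last checkpoint
    for i in range(state["done"], len(steps)):
        state["artifacts"].append(execute(steps[i]))
        state["done"] = i + 1
        with open(state_path, "w") as f:
            json.dump(state, f)  # checkpoint after every step
    return state["artifacts"]
```

Running the same workflow twice against the same state file executes each step exactly once, which is the resume guarantee the section asks for.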

7. Observability and explainability for agents

Trace and expose:

  • Plan version and rationale for each step
  • Tool call arguments and responses (masked for PII)
  • Decision provenance (which prompt and context produced each plan)
  • Resource usage per step (tokens, time, cost)

flowchart LR
    Agent --> Trace
    Trace --> Dashboard
    Trace --> Alerting

Explainability reduces debugging time and supports audit requirements.
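
A sketch of the structured trace record described above, with naive PII masking applied before anything is logged. The field names and the email-only masking regex are illustrative assumptions; real deployments need a proper PII scrubber covering more identifier types.

```python
import re
import time

def trace_event(plan_version, step, args, response, cost_tokens):
    """Structured trace record; email-like strings are masked before logging."""
    def mask(value):
        if isinstance(value, str):
            return re.sub(r"[\w.+-]+@[\w-]+\.[\w.]+", "<masked-email>", value)
        return value
    return {
        "ts": time.time(),
        "plan_version": plan_version,
        "step": step,
        "args": {k: mask(v) for k, v in args.items()},
        "response": mask(response),
        "cost_tokens": cost_tokens,
    }
```

Emitting one such record per tool call gives the dashboard and alerting paths in the diagram a single, consistent event shape to consume.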

8. Testing agents: unit, integration, chaos

Test tiers:

  • Unit: planner heuristics and prompt outputs using deterministic model settings
  • Integration: execution with stubbed tools
  • Contract tests: tool schemas and idempotency keys
  • Chaos tests: simulate tool failures, network partitions, and partial responses
  • Cost tests: estimate tokens and calls for representative workloads

Tip: Use deterministic model settings for repeatable tests (temperature 0 and fixed seeds).
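
For the integration tier, a recording stub is often all you need to run executor logic without touching real APIs. A minimal sketch (the `StubTool` name and the booking scenario are made up for this example):

```python
class StubTool:
    """Records calls and returns canned responses, so tests never hit real APIs."""
    def __init__(self, responses):
        self.responses = list(responses)
        self.calls = []

    def __call__(self, **kwargs):
        self.calls.append(kwargs)       # capture arguments for later assertions
        return self.responses.pop(0)    # replay canned responses in order
```

Contract tests then assert on `calls`, e.g. that every effectful invocation carried an idempotency key, without any network traffic in the test run.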

9. Cost and throughput controls

Agents can blow budgets quickly. Control knobs:

  • Max steps per plan (hard cap)
  • Cost budget per request (reject or degrade when exceeded)
  • Dynamic model selection per step (large model only for planning; smaller for templated text)
  • Caching of tool results and intermediate artifacts

flowchart LR
    A[Request] --> B[Budget Check]
    B -->|ok| C[Planner]
    B -->|exceed| D[Reject]
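
The budget gate in the flowchart reduces to a small pure function. The three-way outcome and the 80% degrade threshold are illustrative choices, not fixed rules: "degrade" here means falling back to a smaller model or a cheaper plan rather than rejecting outright.

```python
def check_budget(estimated_cost, spent, budget, degrade_threshold=0.8):
    """Budget gate: 'ok', 'degrade' (switch to a cheaper path), or 'reject'."""
    projected = spent + estimated_cost
    if projected > budget:
        return "reject"
    if projected > degrade_threshold * budget:
        return "degrade"
    return "ok"
```

Because the check runs on the projected cost before the planner is invoked, a request that would blow the budget never consumes any model tokens at all.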

10. Example: task automation agent (booking workflow)

Scenario: the agent must book travel by checking flights, reserving a seat, charging the card, and sending a confirmation.

flowchart TD
    UserGoal[User Goal] --> Planner
    Planner --> PlanStore
    PlanStore --> Executor
    Executor --> CheckFlights
    Executor --> ReserveSeat
    Executor --> ChargeCard
    Executor --> SendConfirmation

Safety enforcement:

  • Require plan approval before ChargeCard
  • Use idempotency for ReserveSeat and ChargeCard
  • Log decisions and attach trace id to payment call
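
The idempotency requirement for ReserveSeat and ChargeCard can be enforced with a wrapper that caches results by key, so a retry returns the original result instead of repeating the side effect. A minimal in-memory sketch (real systems persist the key-to-result map server-side, ideally in the payment provider itself):

```python
class IdempotentCaller:
    """Caches results by idempotency key so retries never repeat side effects."""
    def __init__(self, tool):
        self.tool = tool
        self.seen = {}

    def call(self, key, **args):
        if key in self.seen:
            return self.seen[key]  # retry: return the cached result, no second charge
        result = self.tool(**args)
        self.seen[key] = result
        return result
```

If the agent replans after a network timeout and re-issues ChargeCard with the same key, the card is still charged exactly once.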

11. Dealing with ambiguity and infinite loops

Agents can loop when goals are underspecified.

Defenses:

  • Clarification step: require confirmation when plan confidence is below threshold
  • Step budget: cap maximum steps and abort with explainable reason
  • Divergence detection: monitor for repeated states and abort when the same state is seen X times
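
Divergence detection can be as simple as counting how often a (hashed) state recurs. A sketch, assuming the caller can serialize agent state into a hashable fingerprint and picks the repeat threshold X (3 in this example):

```python
from collections import Counter

class DivergenceDetector:
    """Abort when the agent revisits the same state too many times."""
    def __init__(self, max_repeats=3):
        self.max_repeats = max_repeats
        self.seen = Counter()

    def record(self, state_fingerprint):
        self.seen[state_fingerprint] += 1
        # True means the caller should abort with an explainable reason
        return self.seen[state_fingerprint] >= self.max_repeats
```

Calling `record` once per loop iteration combines naturally with the step budget: either guard can end the run, and both produce a concrete reason to surface to the user.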

12. Metrics that matter for agents

  • Plan success rate
  • Average steps per task
  • Tool failure rate per tool
  • Cost per successful task
  • Human approval rate (if used)
  • Time to completion and first token latency

Capture these per agent version to measure regressions.

Discussion prompts for engineers

  • Where do you draw the line between autonomous action and human approval?
  • How do you design idempotency for tools that cannot be rolled back?
  • What's your preferred checkpoint frequency balancing cost and resume granularity?
  • How would you simulate a malicious planner that tries to exfiltrate data?

TL;DR: Agent design cheat sheet

  • Separate planner from executor
  • Treat tools as strict contracted services with idempotency
  • Checkpoint after steps and make resume explicit
  • Enforce safety via dry-run, policy checks, and human approval for risky actions
  • Instrument every plan and step for observability and cost tracing
  • Test with deterministic settings, integrate chaos testing, and cap budgets

Takeaway

  • Autonomy is powerful, and dangerous when unchecked
  • Production agents are safe only when they are transparent, auditable, and bounded
  • Build slow, observe fast, and never deploy a fully autonomous agent without cost controls, human-in-the-loop options, and a thorough testing and monitoring pipeline

For more on building production AI systems, check out our AI Bootcamp for Software Engineers.


Key takeaways

  1. The pattern described above addresses a specific production failure mode that naive implementations miss.
  2. Mechanical guardrails beat heroic debugging. Ship the fix that prevents the bug class, not the bug instance.
  3. Measure before and after. If the change is not visible in metrics, it was not worth the complexity.
  4. To see this pattern wired into a full production agent stack, walk through the Build your own coding agent course, or start with the AI Agents Fundamentals primer.

Frequently asked questions

What's the difference between reactive and deliberative agents?

Reactive agents make single-step decisions without planning, ideal for simple tool calls or chat replies. Deliberative agents plan multiple steps before executing, suited for complex workflows. Start with reactive agents and add planning only when you can observe and control each step. Most production agents blend both: deliberative planning for structured goals with reactive fallbacks for ambiguous situations.

How do I prevent my AI agent from looping forever?

Agents loop when goals are underspecified. Use divergence detection to abort when the agent revisits the same state repeatedly. Hard-cap steps per plan and add a clarification step when plan confidence falls below threshold. Monitor plan success rates and execution time to catch runaway loops early. These guardrails prevent API spam and cost blowouts.

Do I need to make my agent tools idempotent?

Yes. Idempotency guarantees repeated calls produce identical results without duplicating side effects. Agents retry on network failures or plan adjustments, triggering duplicate requests. Without idempotent tools, a payment retry charges the card twice or a reservation retry books twice. Require idempotency keys in every tool contract for any state-modifying API call.

For the full reference, see the Anthropic agents guide.
