Agents Overview
The Swarms.ai Agent API provides powerful endpoints for running individual AI agents and batch agent operations.
Agent API
This documentation explains how to use these endpoints for effective agent-based task execution.
Getting Started
To use the Agent API, you'll need a Swarms.ai API key:
1. Generate a new API key
2. Store your API key securely - it won't be shown again
import os
import requests
from dotenv import load_dotenv
# Load API key from environment
load_dotenv()
API_KEY = os.getenv("SWARMS_API_KEY")
BASE_URL = "https://api.swarms.world"
# Configure headers with your API key
headers = {
    "x-api-key": API_KEY,
    "Content-Type": "application/json"
}
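Before making any requests, it can help to fail fast when the key was not loaded. A minimal sanity check, assuming the SWARMS_API_KEY environment variable used above:

# Stop early if the API key was not loaded from the environment
if not API_KEY:
    raise RuntimeError(
        "SWARMS_API_KEY is not set. Add it to your environment or .env file."
    )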
Individual Agent API
The Individual Agent API allows you to run a single agent with a specific configuration and task.
Agent Configuration (AgentSpec)
The AgentSpec class defines the configuration for an individual agent.
agent_name (string, required): Unique name identifying the agent and its functionality
description (string, default None): Detailed explanation of the agent's purpose and capabilities
system_prompt (string, default None): Initial instructions guiding the agent's behavior and responses
model_name (string, default "gpt-4o-mini"): The AI model used by the agent (e.g., gpt-4o, gpt-4o-mini, openai/o3-mini)
auto_generate_prompt (boolean, default false): Whether the agent should automatically create prompts based on task requirements
max_tokens (integer, default 8192): Maximum number of tokens the agent can generate in its responses
temperature (float, default 0.5): Controls output randomness (lower values = more deterministic responses)
role (string, default "worker"): The agent's role within a swarm, influencing its behavior and interactions
max_loops (integer, default 1): Maximum number of times the agent can repeat its task for iterative processing
tools_dictionary (array, default None): Dictionary of tools the agent can use to complete its task
mcp_url (string, default None): URL for the MCP server that the agent can connect to
Agent Completion
The AgentCompletion class combines an agent configuration with a specific task.
agent_config (AgentSpec): Configuration of the agent that will perform the task
task (string): The task to be completed by the agent
history (Optional[Union[Dict[Any, Any], List[Dict[str, str]]]]): The history of the agent's previous tasks and responses. Can be either a dictionary or a list of message objects.
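For example, a completion payload that carries prior conversation turns might look like the following. This is a sketch: the {"role": ..., "content": ...} message shape is an assumption based on the list-of-message-objects description above and the outputs structure shown later.

# Hypothetical AgentCompletion payload that includes prior conversation turns.
# The {"role": ..., "content": ...} message shape is an assumption.
payload = {
    "agent_config": {
        "agent_name": "Research Analyst",
        "model_name": "gpt-4o-mini",
    },
    "task": "Summarize the key risks from our previous discussion.",
    "history": [
        {"role": "user", "content": "What are the main AI adoption barriers in hospitals?"},
        {"role": "assistant", "content": "Cost, data privacy, and clinician trust."},
    ],
}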
Single Agent Endpoint
Endpoint: POST /v1/agent/completions
Run a single agent with a specific configuration and task.
Request
def run_single_agent(agent_config, task):
    """
    Run a single agent with the AgentCompletion format.

    Args:
        agent_config: Dictionary containing agent configuration
        task: String describing the task for the agent

    Returns:
        Dictionary containing the agent's response
    """
    payload = {
        "agent_config": agent_config,
        "task": task
    }

    try:
        response = requests.post(
            f"{BASE_URL}/v1/agent/completions",
            headers=headers,
            json=payload
        )
        response.raise_for_status()
        return response.json()
    except requests.exceptions.RequestException as e:
        print(f"Error making request: {e}")
        return None
Example Usage
agent_config = {
    "agent_name": "Research Analyst",
    "description": "An expert in analyzing and synthesizing research data",
    "system_prompt": (
        "You are a Research Analyst with expertise in data analysis and synthesis. "
        "Your role is to analyze provided information, identify key insights, "
        "and present findings in a clear, structured format. "
        "Focus on accuracy, clarity, and actionable recommendations."
    ),
    "model_name": "gpt-4o",
    "role": "worker",
    "max_loops": 2,
    "max_tokens": 8192,
    "temperature": 0.5,
    "auto_generate_prompt": False,
}

task = "Analyze the impact of artificial intelligence on healthcare delivery and provide a comprehensive report with key findings and recommendations."

result = run_single_agent(agent_config, task)
print(result)
Response Structure
{
  "id": "agent-6a8b9c0d1e2f3g4h5i6j7k8l9m0n",
  "success": true,
  "name": "Research Analyst",
  "description": "An expert in analyzing and synthesizing research data",
  "temperature": 0.5,
  "outputs": {
    "content": "# Impact of Artificial Intelligence on Healthcare Delivery\n\n## Executive Summary\n...",
    "role": "assistant"
  },
  "usage": {
    "input_tokens": 1250,
    "output_tokens": 3822,
    "total_tokens": 5072
  },
  "timestamp": "2025-05-10T18:35:29.421Z"
}
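The generated text lives under outputs.content and token counts under usage. A minimal way to pull those fields out of the result returned in the example above, assuming the structure shown here:

# Extract the generated text and token usage from `result` (from the example above)
if result and result.get("success"):
    report = result["outputs"]["content"]          # generated report text
    tokens_used = result["usage"]["total_tokens"]  # token accounting for cost tracking
    print(f"Used {tokens_used} tokens")
    print(report)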
Batch Agent API
The Batch Agent API allows you to run multiple agents in parallel, each with different configurations and tasks.
Batch Agent Endpoint
Endpoint: POST /v1/agent/batch/completions
Run multiple agents with different configurations and tasks in a single API call.
Request
def run_batch_agents(agent_completions):
    """
    Run multiple agents in batch.

    Args:
        agent_completions: List of dictionaries, each containing agent_config and task

    Returns:
        List of agent responses
    """
    try:
        response = requests.post(
            f"{BASE_URL}/v1/agent/batch/completions",
            headers=headers,
            json=agent_completions
        )
        response.raise_for_status()
        return response.json()
    except requests.exceptions.RequestException as e:
        print(f"Error making batch request: {e}")
        return None
Example Usage
batch_completions = [
    {
        "agent_config": {
            "agent_name": "Research Analyst",
            "description": "An expert in analyzing research data",
            "system_prompt": "You are a Research Analyst...",
            "model_name": "gpt-4o",
            "max_loops": 2
        },
        "task": "Analyze the impact of AI on healthcare delivery."
    },
    {
        "agent_config": {
            "agent_name": "Market Analyst",
            "description": "An expert in market analysis",
            "system_prompt": "You are a Market Analyst...",
            "model_name": "gpt-4o",
            "max_loops": 1
        },
        "task": "Analyze the AI startup landscape in 2025."
    }
]

batch_results = run_batch_agents(batch_completions)
print(batch_results)
Response Structure
[
  {
    "id": "agent-1a2b3c4d5e6f7g8h9i0j",
    "success": true,
    "name": "Research Analyst",
    "description": "An expert in analyzing research data",
    "temperature": 0.5,
    "outputs": {
      "content": "# Impact of AI on Healthcare Delivery\n...",
      "role": "assistant"
    },
    "usage": {
      "input_tokens": 1250,
      "output_tokens": 3822,
      "total_tokens": 5072
    },
    "timestamp": "2025-05-10T18:35:29.421Z"
  },
  {
    "id": "agent-9i8h7g6f5e4d3c2b1a0",
    "success": true,
    "name": "Market Analyst",
    "description": "An expert in market analysis",
    "temperature": 0.5,
    "outputs": {
      "content": "# AI Startup Landscape 2025\n...",
      "role": "assistant"
    },
    "usage": {
      "input_tokens": 980,
      "output_tokens": 4120,
      "total_tokens": 5100
    },
    "timestamp": "2025-05-10T18:35:31.842Z"
  }
]
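Because the batch endpoint returns one result object per agent, you can iterate the list and key each output by agent name. A short sketch based on the response structure above and the batch_results variable from the earlier example:

# Collect each agent's output, keyed by agent name
reports = {}
for item in batch_results or []:
    if item.get("success"):
        reports[item["name"]] = item["outputs"]["content"]
    else:
        print(f"Agent {item.get('name')} did not complete successfully")

print(list(reports.keys()))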
Error Handling
The API uses standard HTTP status codes to indicate success or failure:
200: Success
400: Bad Request - Check your request parameters
401: Unauthorized - Invalid or missing API key
403: Forbidden - Insufficient permissions
429: Too Many Requests - Rate limit exceeded
500: Server Error - Something went wrong on the server
When an error occurs, the response body will contain additional information:
{
  "detail": "Error message explaining what went wrong"
}
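In practice you can surface that detail field directly when a request fails. A small sketch, assuming a payload built as in the earlier examples and that the error body is JSON as shown above:

# Print the API's error detail when a request fails
payload = {"agent_config": agent_config, "task": task}
try:
    response = requests.post(
        f"{BASE_URL}/v1/agent/completions", headers=headers, json=payload
    )
    response.raise_for_status()
except requests.exceptions.HTTPError as e:
    # Assumes the error body is JSON with a "detail" field, as shown above
    detail = e.response.json().get("detail", e.response.text)
    print(f"Request failed ({e.response.status_code}): {detail}")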
Common Errors and Solutions
"Invalid API Key"
Verify your API key is correct and properly included in the request headers
"Rate limit exceeded"
Reduce the number of requests or contact support to increase your rate limit
"Invalid agent configuration"
Check your agent_config parameters for any missing or invalid values
"Failed to create agent"
Ensure your system_prompt and model_name are valid
"Insufficient credits"
Add credits to your account at https://swarms.world/platform/account
Advanced Usage
Setting Dynamic Temperature
The agent can dynamically adjust its temperature for optimal outputs:
agent_config = {
    # Other config options...
    "temperature": 0.7,
    "dynamic_temperature_enabled": True
}
Using Agent Tools
Agents can utilize various tools to enhance their capabilities:
agent_config = {
    # Other config options...
    "tools_dictionary": [
        {
            "name": "web_search",
            "description": "Search the web for information",
            "parameters": {
                "query": "string"
            }
        },
        {
            "name": "calculator",
            "description": "Perform mathematical calculations",
            "parameters": {
                "expression": "string"
            }
        }
    ]
}
Best Practices
API Key Security
Important: Store API keys in environment variables or secure vaults, never in code repositories.
# DON'T do this
api_key = "sk-123456789abcdef"
# DO this instead
import os
from dotenv import load_dotenv
load_dotenv()
api_key = os.getenv("SWARMS_API_KEY")
Agent Naming Conventions
Use a consistent naming pattern for your agents to make your code more maintainable.
# Good naming convention
agent_configs = {
    "market_analyst": {...},
    "research_specialist": {...},
    "code_reviewer": {...}
}
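With a dictionary like this, selecting and running a particular agent becomes a simple lookup. A sketch reusing run_single_agent from above; run_named_agent is an illustrative helper and the configs in the dictionary are placeholders:

# Look up a named configuration (placeholder configs above) and run it against a task
def run_named_agent(name, task):
    config = agent_configs[name]  # raises KeyError for unknown agent names
    return run_single_agent(config, task)

result = run_named_agent("market_analyst", "Summarize this week's biotech funding rounds.")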
Crafting Effective System Prompts
A well-crafted system prompt acts as your agent's personality and instruction set.
Basic Prompt Example
You are a research analyst. Analyze the data and provide insights.
Enhanced Prompt Example
You are a Research Analyst with 15+ years of experience in biotech market analysis.
Your task is to:
1. Analyze the provided market data methodically
2. Identify key trends and emerging patterns
3. Highlight potential investment opportunities
4. Assess risks and regulatory considerations
5. Provide actionable recommendations supported by the data
Format your response as a professional report with clear sections,
focusing on data-driven insights rather than generalities.
Token Management
Important: Manage your token usage carefully to control costs.
Higher token limits provide more complete responses but increase costs
Consider using different models based on task complexity
For gpt-4o models, typical settings:
Simple tasks: 2048 tokens (lower cost)
Medium complexity: 4096 tokens (balanced)
Complex analysis: 8192+ tokens (higher cost, more detail)
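One way to apply these tiers is to pick max_tokens from a small lookup when building the agent configuration. A sketch only; build_agent_config is an illustrative helper and the tier values simply mirror the guidance above:

# Map task complexity to a max_tokens budget (values mirror the tiers above)
TOKEN_TIERS = {
    "simple": 2048,
    "medium": 4096,
    "complex": 8192,
}

def build_agent_config(base_config, complexity="medium"):
    """Return a copy of base_config with max_tokens set for the given tier."""
    config = dict(base_config)
    config["max_tokens"] = TOKEN_TIERS.get(complexity, 4096)
    return config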
Error Handling
Critical: Implement comprehensive error handling to make your application resilient.
import time
import logging

logger = logging.getLogger(__name__)

def run_agent(payload):
    """Run an agent request with comprehensive error handling."""
    try:
        response = requests.post(
            f"{BASE_URL}/v1/agent/completions",
            headers=headers,
            json=payload,
            timeout=30  # Add timeout to prevent hanging requests
        )
        response.raise_for_status()
        return response.json()
    except requests.exceptions.HTTPError as e:
        if e.response.status_code == 429:
            # Implement exponential backoff for rate limiting
            retry_after = int(e.response.headers.get('Retry-After', 5))
            time.sleep(retry_after)
            return run_agent(payload)  # Retry the request
        elif e.response.status_code == 401:
            logger.error("Authentication failed. Check your API key.")
        else:
            logger.error(f"HTTP Error: {e.response.status_code} - {e.response.text}")
        return {"error": e.response.text}
    except requests.exceptions.Timeout:
        logger.error("Request timed out. The server might be busy.")
        return {"error": "Request timed out"}
    except requests.exceptions.RequestException as e:
        logger.error(f"Request Error: {e}")
        return {"error": str(e)}
Implementing Caching
Cache identical requests to improve performance and reduce costs.
import hashlib
import json
from functools import lru_cache

def generate_cache_key(agent_config, task):
    """Generate a unique cache key for an agent request."""
    cache_data = json.dumps({"agent_config": agent_config, "task": task}, sort_keys=True)
    return hashlib.md5(cache_data.encode()).hexdigest()

@lru_cache(maxsize=100)
def cached_agent_run(cache_key, agent_config, task):
    """Run agent with caching based on config and task."""
    # Convert agent_config back to a dictionary if it's a string representation
    if isinstance(agent_config, str):
        agent_config = json.loads(agent_config)

    payload = {
        "agent_config": agent_config,
        "task": task
    }

    try:
        response = requests.post(
            f"{BASE_URL}/v1/agent/completions",
            headers=headers,
            json=payload
        )
        response.raise_for_status()
        return response.json()
    except Exception as e:
        return {"error": str(e)}

def run_agent_with_cache(agent_config, task):
    """Wrapper function to run agent with caching."""
    # Generate a cache key
    cache_key = generate_cache_key(agent_config, task)

    # Convert agent_config to a hashable type for lru_cache
    hashable_config = json.dumps(agent_config, sort_keys=True)

    # Call the cached function
    return cached_agent_run(cache_key, hashable_config, task)
Usage & Cost Monitoring
Set up a monitoring system to track your API usage and costs.
from datetime import datetime

def log_api_usage(api_call_type, tokens_used, cost_estimate):
    """Log API usage for monitoring."""
    with open("api_usage_log.csv", "a") as f:
        timestamp = datetime.now().isoformat()
        f.write(f"{timestamp},{api_call_type},{tokens_used},{cost_estimate}\n")

def estimate_cost(tokens):
    """Estimate cost based on token usage."""
    # Example pricing: $0.002 per 1K tokens (adjust according to current pricing)
    return (tokens / 1000) * 0.002

def run_agent_with_logging(agent_config, task):
    """Run agent and log usage."""
    result = run_single_agent(agent_config, task)

    if result and "usage" in result:
        total_tokens = result["usage"]["total_tokens"]
        cost = estimate_cost(total_tokens)
        log_api_usage("single_agent", total_tokens, cost)

    return result
FAQ
What's the difference between Single Agent and Batch Agent APIs?
The Single Agent API (/v1/agent/completions) runs one agent with one task, while the Batch Agent API (/v1/agent/batch/completions) allows running multiple agents with different configurations and tasks in parallel. Use Batch Agent when you need to process multiple independent tasks efficiently.
How do I choose the right model for my agent?
Model selection depends on your task complexity, performance requirements, and budget:
gpt-4o: Complex analysis, creative tasks. Highest quality, most expensive.
gpt-4o-mini: General purpose tasks. Good balance of quality and cost.
openai/o3-mini: Simple, factual tasks. Fast, economical.
For exploratory work, start with gpt-4o-mini and adjust based on results.
What should I include in my system prompt?
A good system prompt should include:
Role definition: Who the agent is and their expertise
Task instructions: Specific, clear directions on what to do
Output format: How results should be structured
Constraints: Any limitations or requirements
Examples: Sample inputs and outputs when helpful
Keep prompts focused and avoid contradictory instructions.
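Those components can also be assembled programmatically, which keeps prompts consistent across agents. A sketch only; build_system_prompt and its argument names are illustrative, not part of the API:

# Assemble a system prompt from the components listed above (illustrative helper)
def build_system_prompt(role, instructions, output_format, constraints=None):
    parts = [
        f"You are {role}.",
        "Your task is to:",
        *[f"{i}. {step}" for i, step in enumerate(instructions, start=1)],
        f"Format your response as {output_format}.",
    ]
    if constraints:
        parts.append(f"Constraints: {constraints}")
    return "\n".join(parts)

prompt = build_system_prompt(
    role="a Research Analyst with biotech expertise",
    instructions=["Analyze the provided market data", "Highlight key trends"],
    output_format="a professional report with clear sections",
)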
How can I optimize costs when using the Agent API?
Cost optimization strategies include:
Use the appropriate model for your task complexity
Set reasonable token limits based on expected output length
Implement caching for repeated or similar requests
Batch related requests together
Use max_loops: 1 unless you specifically need iterative refinement
Monitor usage patterns and adjust configurations accordingly
What's the maximum number of agents I can run in a batch?
While there's no hard limit specified, we recommend keeping batch sizes under 20 agents for optimal performance. For very large batches, consider splitting them into multiple calls or contacting support for guidance on handling high-volume processing.
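If you do need to process more agents than that, one approach is to split the workload into smaller batches and submit them sequentially. A sketch reusing run_batch_agents from above; run_in_chunks is an illustrative helper and the chunk size of 20 simply follows the recommendation here:

# Split a large list of agent completions into batches of at most 20
def run_in_chunks(all_completions, chunk_size=20):
    results = []
    for start in range(0, len(all_completions), chunk_size):
        chunk = all_completions[start:start + chunk_size]
        chunk_results = run_batch_agents(chunk)
        if chunk_results:
            results.extend(chunk_results)
    return results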
How do I handle rate limiting?
Implement exponential backoff in your error handling:
import time

def run_with_backoff(func, max_retries=5, initial_delay=1):
    """Run a function with exponential backoff retry logic."""
    retries = 0
    delay = initial_delay

    while retries < max_retries:
        try:
            return func()
        except requests.exceptions.HTTPError as e:
            if e.response.status_code == 429:  # Too Many Requests
                retry_after = int(e.response.headers.get('Retry-After', delay))
                print(f"Rate limited. Retrying after {retry_after} seconds...")
                time.sleep(retry_after)
                retries += 1
                delay *= 2  # Exponential backoff
            else:
                raise
        except Exception:
            raise

    raise Exception(f"Failed after {max_retries} retries")
Can I use tools with my agents?
Yes, you can enable tools through the tools_dictionary parameter in your agent configuration. This allows agents to access external functionality like web searches, calculations, or custom tools.
agent_config = {
    # Other configuration...
    "tools_dictionary": [
        {
            "name": "web_search",
            "description": "Search the web for current information",
            "parameters": {
                "query": {
                    "type": "string",
                    "description": "The search query"
                }
            }
        }
    ]
}
How do I debug agent performance issues?
Debugging steps for agent performance issues:
Check system prompts: Ensure they're clear and not overly restrictive
Review model selection: Try a more capable model if output quality is poor
Adjust token limits: Increase max_tokens if outputs are getting truncated
Examine temperature: Lower for more deterministic outputs, higher for creativity
Test with simpler tasks: Isolate whether the issue is with the task complexity
Enable verbose logging: Add detailed logging to track request/response cycles
Contact support: For persistent issues, reach out with example payloads and responses
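For the verbose-logging step, a thin wrapper around the request helper can record each payload and response for later inspection. A sketch using Python's standard logging module; run_single_agent_verbose is an illustrative helper built on run_single_agent from above:

import json
import logging

logging.basicConfig(level=logging.DEBUG)
logger = logging.getLogger("swarms_agent")

def run_single_agent_verbose(agent_config, task):
    """Run an agent while logging the request payload and the raw response."""
    payload = {"agent_config": agent_config, "task": task}
    logger.debug("Request payload: %s", json.dumps(payload, indent=2))
    result = run_single_agent(agent_config, task)
    logger.debug("Response: %s", json.dumps(result, indent=2, default=str))
    return result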
What's the pricing model for the Agent API?
The Agent API uses a token-based pricing model:
Input tokens: Text sent to the API (task, system prompts)
Output tokens: Text generated by the agent
Pricing varies by model and is calculated per 1,000 tokens. Check the pricing page for current rates.
The API also offers a "flex" tier for lower-priority, cost-effective processing.