What is the Swarms API?
- Agent Orchestration: Create and manage AI agents that can work independently or collaborate
- Multi-Agent Communication: Enable agents to communicate and share information seamlessly
- Flexible Workflows: Support for sequential, concurrent, hierarchical, and other workflow patterns
- Enterprise Scale: Handle 10,000+ agents working together
- Multi-Model Support: Integration with OpenAI, Anthropic, and Groq models
Why should I use the Swarms API?
- Complete Agent Ecosystem: Full ecosystem for building, deploying, and scaling intelligent AI systems
- Advanced Multi-Agent Architectures: Build complex hierarchical, sequential, and parallel agent collaboration systems
- Agent-to-Agent Communication: Advanced communication protocols enable seamless agent interaction
- Ultra-Optimized Runtime: High-performance runtime with built-in optimization for concurrent operations
- Enterprise-Ready: Built-in security, governance, and compliance features
How does agent orchestration work?
- Create Individual Agents: Deploy single agents for specific tasks
- Coordinate Multiple Agents: Build swarms where agents work together
- Define Workflow Patterns: Choose from sequential, concurrent, hierarchical, and other patterns
- Enable Communication: Agents can share information and collaborate
- Scale Dynamically: Add or remove agents based on workload
Example use cases:
- Research teams with multiple agents researching different aspects
- Medical analysis with lab analyzer and clinical specialist working together
- Financial analysis with market analyst and economic forecaster collaborating
- Content creation with writer, editor, and fact-checker in sequence
What types of workflows are supported?
| Workflow Type | Description | Best For |
|---|---|---|
| Sequential | Agents work in order, each building on the previous output | Step-by-step processes, analysis pipelines |
| Concurrent | Agents work simultaneously on the same task | Parallel processing, multiple perspectives |
| Hierarchical | Structured multi-level approach with clear authority | Complex decision-making, management structures |
| Multi-Agent Router | Intelligent task distribution based on capabilities | Load balancing, specialized task routing |
| Mixture of Agents | Diverse teams with specialized skills | Complex problems requiring multiple expertise |
| Majority Voting | Consensus-based decision making | Verification, quality assurance |
| Agent Rearrange | Dynamic agent reconfiguration | Adaptive systems, optimization |
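To make the first two rows and the majority-voting row of the table concrete, here is a minimal local sketch of those patterns. It does not call the Swarms API: the `Agent` type, the helper functions, and the toy `writer` and `editor` agents are all hypothetical stand-ins, and a hosted swarm may behave differently.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

# Hypothetical stand-in for an agent: any function from text to text.
Agent = Callable[[str], str]


def run_sequential(agents: List[Agent], task: str) -> str:
    """Sequential: each agent builds on the previous agent's output."""
    output = task
    for agent in agents:
        output = agent(output)
    return output


def run_concurrent(agents: List[Agent], task: str) -> List[str]:
    """Concurrent: every agent works on the same task at the same time."""
    with ThreadPoolExecutor(max_workers=len(agents)) as pool:
        # pool.map() returns results in the same order as `agents`.
        return list(pool.map(lambda agent: agent(task), agents))


def run_majority_vote(agents: List[Agent], task: str) -> str:
    """Majority voting: return the most common answer across agents."""
    answers = run_concurrent(agents, task)
    return Counter(answers).most_common(1)[0][0]


# Toy agents used only for illustration.
def writer(text: str) -> str:
    return f"draft({text})"


def editor(text: str) -> str:
    return f"edited({text})"
```

Calling `run_sequential([writer, editor], "topic")` passes the writer's draft to the editor, while `run_concurrent` gives both agents the original task and collects one answer per agent.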
What models are supported?
- gpt-4: High-quality reasoning and complex task handling
- gpt-4o: Optimized version with improved performance
- gpt-4o-mini: Lightweight version for faster responses
- claude-sonnet-4-20250514-20240620: Balanced performance and reasoning
- claude-3-7-sonnet-latest: Latest Claude model with enhanced capabilities
- groq/llama3-70b-8192: High-performance open-source model
- groq/deepseek-r1-distill-llama-70b: Specialized reasoning model
How much does it cost?
- Input Tokens: $6.50 per 1 million tokens (all endpoints)
- Output Tokens: $18.50 per 1 million tokens (all endpoints)
| Item | Cost | Notes |
|---|---|---|
| Base cost per agent | $0.01 per agent | Charged for each agent in swarms and workflows |
| Input tokens | $6.50 per 1M tokens | Unified pricing for all endpoints |
| Output tokens | $18.50 per 1M tokens | Unified pricing for all endpoints |
| MCP cost | $0.10 per call | Charged if an agent uses an MCP URL |
| Image cost | $0.25 per image | Charged for each image processed |
| Exa Search tool | $0.04 per search | Charged per search execution |
| Web Scraper tool | $0.15 per scrape | Charged per scrape execution |
| Night-time discount | 50% off token costs | 8 PM - 6 AM PT (Swarm Completions only) |
- Swarm Completions (with agent cost)
- Agent Completions
- Advanced Research
- Auto Swarm Builder
- Graph Workflow (with agent cost)
- Batched Grid Workflow (with agent cost)
- Night Time Discount: 50% off token costs for Swarm Completions during 8 PM - 6 AM Pacific Time
- Frenzy Mode: All requests are free during Black Friday (24 hours)
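The pricing table can be turned into a rough cost estimate. The sketch below only restates the arithmetic from the listed prices. The function name and parameters are illustrative assumptions, the search and scraper tool charges are left out for brevity, and actual billing may differ.

```python
# Prices copied from the pricing table (USD).
COST_PER_AGENT = 0.01
INPUT_COST_PER_MILLION = 6.50
OUTPUT_COST_PER_MILLION = 18.50
MCP_COST_PER_CALL = 0.10
IMAGE_COST = 0.25
NIGHT_DISCOUNT = 0.50  # 50% off token costs only


def estimate_cost(agents, input_tokens, output_tokens,
                  mcp_calls=0, images=0, night_time=False):
    """Estimate the cost of one request in USD (hypothetical helper).

    Pass agents=0 for endpoints that are not listed "with agent cost".
    Set night_time=True only for Swarm Completions sent between
    8 PM and 6 AM Pacific Time.
    """
    token_cost = (
        input_tokens / 1_000_000 * INPUT_COST_PER_MILLION
        + output_tokens / 1_000_000 * OUTPUT_COST_PER_MILLION
    )
    if night_time:
        token_cost *= 1 - NIGHT_DISCOUNT  # discount applies to tokens only
    fixed_cost = (
        agents * COST_PER_AGENT
        + mcp_calls * MCP_COST_PER_CALL
        + images * IMAGE_COST
    )
    return round(fixed_cost + token_cost, 4)
```

For example, a three-agent swarm that uses 1,000,000 input tokens and 500,000 output tokens costs 0.03 + 6.50 + 9.25 = $15.78 at the daytime rate, and 0.03 + 7.875 = $7.905 with the night-time discount.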
What are the rate limits?
| Rate Limit Type | Free Tier | Premium Tier | Time Window |
|---|---|---|---|
| Requests per Minute | 100 | 2,000 | 1 minute |
| Requests per Hour | 50 | 10,000 | 1 hour |
| Requests per Day | 1,200 | 100,000 | 24 hours |
| Tokens per Agent | 200,000 | 2,000,000 | Per request |
| Prompt Length | 200,000 | 200,000 | Per request |
| Batch Size | 10 | 10 | Per request |
- 20x more requests per minute (2,000 vs 100)
- 200x more requests per hour (10,000 vs 50)
- 83x more requests per day (100,000 vs 1,200)
- 10x more tokens per agent (2M vs 200K)
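Because the batch size limit is 10 per request on both tiers, a larger workload has to be split before it is submitted. A minimal sketch, assuming the tasks are held in a plain Python sequence (the helper name is hypothetical):

```python
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")

MAX_BATCH_SIZE = 10  # "Batch Size" limit from the rate limit table


def chunk_batches(items: Sequence[T],
                  size: int = MAX_BATCH_SIZE) -> Iterator[List[T]]:
    """Yield consecutive slices of `items` with at most `size` elements."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield list(items[start:start + size])
```

Splitting 25 tasks this way yields three batches of 10, 10, and 5 items, each small enough to send as one request.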
How do I get started?
What tools and integrations are available?
- Search Capabilities: Web search integration for research tasks
- MCP Integration: Model Context Protocol for enhanced interactions
- Custom Tools: Define your own function tools for specific needs
- Connect to external data sources
- Integrate with databases and APIs
- Enable real-time data access for agents
How do I handle errors and troubleshooting?
- Verify your API key is correct and active
- Check that the `x-api-key` header is properly set
- Ensure your API key has the necessary permissions
- Implement exponential backoff for failed requests
- Monitor your API usage to stay within limits
- Consider upgrading to Premium for higher limits
- Increase request timeout values
- Consider breaking complex tasks into smaller chunks
- Use batch processing for multiple operations
- Check if your requested model is currently available
- Have fallback models configured
- Monitor model status through the health endpoint
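The exponential backoff advice above can be sketched as a small retry helper. This is a generic illustration, not part of any Swarms SDK: `retry_with_backoff` and its parameters are assumptions, and `call` stands for whatever function sends your request.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(call: Callable[[], T],
                       max_attempts: int = 5,
                       base_delay: float = 1.0,
                       max_delay: float = 30.0,
                       sleep: Callable[[float], None] = time.sleep) -> T:
    """Run `call`, retrying with exponentially growing delays on failure."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return call()
        except Exception:
            # In real code, catch only retryable errors, such as
            # rate-limit or timeout failures.
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            # Delay doubles each attempt (1s, 2s, 4s, ...) up to max_delay.
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Up to 10% random jitter keeps clients from retrying in lockstep.
            sleep(delay + random.uniform(0, delay * 0.1))
    raise RuntimeError("unreachable")  # the loop always returns or raises
```

The `sleep` parameter exists so the helper can be tested without waiting. In production the default `time.sleep` is used.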
What security features are available?
- API Key Authentication: Secure authentication for all requests
- Rate Limiting: Prevents abuse and ensures fair usage
- Data Encryption: All data is encrypted in transit and at rest
- Access Controls: Granular permissions and access management
- Never commit API keys to version control
- Use environment variables for all sensitive configuration
- Implement proper access controls in production environments
- Regularly rotate API keys
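One way to follow the first two practices is to load the key from the environment when the request headers are built. In this sketch the `x-api-key` header name comes from the troubleshooting section, while the `SWARMS_API_KEY` variable name and the `build_headers` helper are assumptions chosen for illustration.

```python
import os

API_KEY_ENV_VAR = "SWARMS_API_KEY"  # assumed name; use whatever your deployment defines


def build_headers(env=os.environ):
    """Build request headers from an environment mapping (hypothetical helper)."""
    api_key = env.get(API_KEY_ENV_VAR)
    if not api_key:
        # Fail early instead of sending an unauthenticated request.
        raise RuntimeError(f"Set the {API_KEY_ENV_VAR} environment variable")
    return {"x-api-key": api_key, "Content-Type": "application/json"}
```

Because the key is never written into source code, it stays out of version control and can be rotated by changing the environment alone.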
How can I get support?
- Documentation: docs.swarms.ai
- Discord Community: Join Discord
- Technical Blog: Medium
- Technical Support: Book Support Session
- Enterprise Support: Contact through the platform
- Onboarding Sessions: Book with Kye Gomez
- Twitter: @kyegomez
- LinkedIn: The Swarm Corporation
- YouTube: Swarms Channel
Can I use Swarms for production applications?
- High Availability: 99.9% uptime SLA
- Scalability: Handle thousands of concurrent agents
- Enterprise Features: Security, compliance, and governance
- Monitoring: Comprehensive logging and analytics
- Support: Professional support for production deployments
- Implement proper error handling and retry logic
- Monitor API usage and costs
- Use appropriate rate limiting strategies
- Test thoroughly before deployment
- Keep API keys secure and rotate regularly
What's the difference between single agents and swarms?
Single agents
- Purpose: Focused tasks that don’t require collaboration
- Use Cases: Simple Q&A, content generation, data analysis
- Benefits: Fast, simple, cost-effective
- Limitations: Limited to single perspective, no collaboration
Swarms
- Purpose: Complex tasks requiring multiple perspectives or specialized skills
- Use Cases: Research teams, medical analysis, financial forecasting
- Benefits: Multiple perspectives, specialized expertise, parallel processing
- Complexity: Higher setup and coordination requirements
How do I choose the right model for my use case?
- Use GPT-4 or Claude-3.5-Sonnet for tasks requiring deep reasoning
- Higher token limits for comprehensive analysis
- Choose GPT-4o-mini for quick, straightforward tasks
- Lower costs for high-volume applications
- For creative content, higher temperature settings (0.7-0.9) work better
- GPT-4 or Claude models suit creative content
- Lower temperature settings (0.1-0.3) provide more consistent responses
- Any model can work well with proper prompting
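The guidance above can be captured as a small lookup of presets. This is only a sketch: the task-type labels, the `model` and `temperature` keys, and the choice of `gpt-4o` for consistency-focused work are assumptions. The model names come from the supported models list, and the temperatures sit inside the ranges given above.

```python
# Task-type labels are hypothetical; temperatures sit inside the
# ranges quoted in the guidance (0.7-0.9 creative, 0.1-0.3 consistent).
PRESETS = {
    "deep_reasoning": {"model": "gpt-4"},
    "quick": {"model": "gpt-4o-mini"},
    "creative": {"model": "gpt-4", "temperature": 0.8},
    # Any model can work here; gpt-4o is an arbitrary choice.
    "consistent": {"model": "gpt-4o", "temperature": 0.2},
}


def pick_settings(task_type: str) -> dict:
    """Return a copy of the preset for `task_type` (hypothetical helper)."""
    try:
        return dict(PRESETS[task_type])
    except KeyError:
        known = ", ".join(sorted(PRESETS))
        raise ValueError(
            f"Unknown task type {task_type!r}; expected one of: {known}"
        ) from None
```

Presets that do not list a temperature leave it to whatever default the caller or the API applies.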
Can I integrate Swarms with my existing systems?
- REST API: Standard HTTP endpoints for any language
- Client Libraries: Official SDKs for Python, TypeScript, Go, Java
- Webhooks: Real-time notifications for long-running tasks
- Batch Processing: Process multiple requests efficiently
- Microservices: Deploy as independent services
- API Gateway: Integrate with existing API infrastructure
- Event-Driven: Trigger agents based on system events
- Scheduled Tasks: Run agents on regular schedules
Still Have Questions?
If you don’t see your question answered here, we’re here to help:
- Join our Discord: discord.gg/EamjgSaEQf
- Book Technical Support: cal.com/swarms/swarms-technical-support