Deployment Options
aiXplain offers flexible deployment options for hosting the full Agentic OS stack, including orchestration, runtime, model execution, and the integrated marketplace. You can deploy on aiXplain's cloud, on your own infrastructure, or at the edge. These options serve both cloud-first enterprises and highly regulated industries that require complete data sovereignty.
Deployment Models
aiXplain SaaS
Fully managed cloud platform hosted by aiXplain.
Best for: Fast deployment, minimal infrastructure overhead, development and testing
You get:
- Instant deployment with zero infrastructure setup
- Automatic updates and maintenance
- Global availability and scalability
- Pay-as-you-go pricing with usage-based credits
aiXplain OnPrem
Complete on-premises deployment with full data sovereignty.
Best for: Government agencies, regulated industries, enterprises requiring complete data residency
You get:
- Complete data residency—data never leaves your environment
- Air-gapped operation with no internet dependency
- Full platform functionality without SaaS limitations
- Integration with your identity provider and security infrastructure
aiXplain OnEdge
Hybrid edge computing solution combining local processing with cloud capabilities.
Best for: Distributed operations, low-latency requirements, hybrid cloud strategies
You get:
- Local model execution at edge locations
- Reduced latency for time-sensitive operations
- Flexible connectivity options (air-gapped, VPN, restricted internet)
- Centralized management across edge and cloud deployments
aiXplain OnPrem: Enterprise-Grade Deployment
aiXplain OnPrem delivers the complete Agentic OS with maximum security and control. It has been deployed at government scale in a fully air-gapped environment.
Core Capabilities
Complete Data Sovereignty:
- All data remains in your environment
- No customer data sent to aiXplain
- Local data flow with encrypted storage
- Zero data retention policy
Full Platform Functionality:
- Access to LLMs through unified APIs
- Multi-agent orchestration and workflows
- Custom RAG with vector databases and knowledge graphs
- Built-in guardrails and governance
- Custom Python code execution
- MCP (Model Context Protocol) integration
- Proprietary tool and API integration
Centralized Management:
- Single dashboard for all assets, API keys, and teams
- Rate limits and usage quotas
- Monitoring and telemetry for all agents and solutions
- Role-based access control (RBAC)
- Complete audit logging and compliance tracking
Infrastructure Architecture
aiXplain OnPrem consists of three core infrastructure components:
| Component | Full Name | Key Functions |
|---|---|---|
| aiXPU | AI Processing Unit | Backend services, orchestration engine, frontend UI, agent workflow execution |
| aiXMU | AI Memory Unit | Data storage, knowledge bases, vector databases, secure encryption at rest |
| aiXCU | AI Compute Unit | Model inference, workload execution, high-performance AI processing |
Deployment Flexibility
Infrastructure Options:
- Kubernetes clusters (recommended for scalability)
- Virtual machines (traditional enterprise environments)
- Bare metal servers (maximum performance)
- Private cloud services (AWS, Azure, GCP private regions)
Network Configurations:
- Air-gapped – Complete network isolation with no external connectivity
- Hybrid with secure VPN – Controlled external connectivity through encrypted tunnels
- Restricted internet – Limited, monitored external access with allowlisting
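The restricted-internet configuration relies on allowlisting outbound destinations. As a rough illustration of that idea only, the sketch below checks a URL against a hostname allowlist. The `ALLOWED_HOSTS` entries and the `is_egress_allowed` helper are hypothetical and are not part of aiXplain's product. In a real deployment this policy would normally be enforced at the firewall or egress proxy rather than in application code.

```python
from urllib.parse import urlsplit

# Hypothetical allowlist; real entries would come from your network policy.
ALLOWED_HOSTS = {"updates.example.com", "models.example.com"}


def is_egress_allowed(url: str, allowed_hosts: set[str] = ALLOWED_HOSTS) -> bool:
    """Return True only if the URL uses HTTPS and its host is allowlisted."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    return parts.scheme == "https" and host in allowed_hosts


if __name__ == "__main__":
    for candidate in (
        "https://updates.example.com/patch.tar.gz",
        "https://unknown.example.org/",
    ):
        print(candidate, "->", is_egress_allowed(candidate))
```

An exact-match set keeps the check easy to audit. Wildcard or suffix matching would be more flexible but also easier to misconfigure.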
Security & Compliance
Data Protection:
- All data encrypted in transit (TLS) and at rest
- Zero data retention by aiXplain: your data stays in your environment
- Complete audit logging for compliance tracking
- Integration with your identity provider via OAuth2, OIDC, or custom authentication
Enterprise Controls:
- Role-based access control (RBAC)
- Team collaboration with permission management
- API key management per team or user
- Usage quotas and rate limiting
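To make "usage quotas and rate limiting" concrete, here is a minimal token-bucket sketch. The source does not say how aiXplain implements rate limiting. Token bucket is simply one common technique, and the `TokenBucket` class and its parameter names are assumptions made for illustration.

```python
import time
from typing import Callable


class TokenBucket:
    """Token-bucket limiter: bursts up to `capacity`, refilled continuously."""

    def __init__(
        self,
        capacity: float,
        refill_per_second: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.capacity = capacity
        self.refill_per_second = refill_per_second
        self._clock = clock  # injectable so the limiter can be tested deterministically
        self._tokens = capacity
        self._last = clock()

    def allow(self, cost: float = 1.0) -> bool:
        """Consume `cost` tokens if available; return whether the call may proceed."""
        now = self._clock()
        elapsed = now - self._last
        self._last = now
        # Refill in proportion to elapsed time, never exceeding capacity.
        self._tokens = min(self.capacity, self._tokens + elapsed * self.refill_per_second)
        if self._tokens >= cost:
            self._tokens -= cost
            return True
        return False


if __name__ == "__main__":
    # Hypothetical policy: one bucket per team, 5-request burst, 1 request/second sustained.
    buckets = {"team-a": TokenBucket(capacity=5, refill_per_second=1)}
    print([buckets["team-a"].allow() for _ in range(7)])
```

Keeping one bucket per team or per API key mirrors the per-team and per-user key management listed above. A production limiter would also need shared state across service instances.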
Monitoring & Observability:
- Integration with your monitoring stack (Grafana, Prometheus, ELK)
- Standard metrics, logs, and health probes
- Real-time performance monitoring
- Error tracking and debugging tools
Proven at Scale
Strategic Reference: Ministry of Finance, Kingdom of Saudi Arabia
- Full-stack, air-gapped government deployment
- Demonstrates enterprise readiness at national scale
- Modular architecture reusable across deployments
- 24/7 support for mission-critical operations
Choosing Your Deployment Option
| Consideration | SaaS | OnPrem | OnEdge |
|---|---|---|---|
| Setup Time | Minutes | Weeks | Days to weeks |
| Infrastructure Management | aiXplain | You | Shared |
| Data Residency | aiXplain cloud | Your environment | Your edge + cloud |
| Air-Gapped Capability | No | Yes | Optional |
| Customization | Standard | Full | Hybrid |
| Compliance | SOC 2 certified | Your standards | Your standards |
| Cost Model | Pay-as-you-go | License + infrastructure | License + infrastructure |
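The comparison table can be read as a simple decision rule. The sketch below encodes one rough interpretation of it. The `Requirements` fields and the ordering of the checks are assumptions made for illustration, not an official aiXplain selection procedure, and real decisions will weigh cost, compliance, and setup time as well.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Requirements:
    """Hypothetical requirement flags derived from the comparison table."""
    needs_air_gap: bool = False
    data_must_stay_on_site: bool = False
    needs_edge_low_latency: bool = False


def recommend_deployment(req: Requirements) -> str:
    """Return "SaaS", "OnPrem", or "OnEdge" based on a rough reading of the table."""
    # OnPrem is the only row whose data residency is entirely "Your environment".
    if req.data_must_stay_on_site:
        return "OnPrem"
    # OnEdge targets low-latency, distributed operations; air-gapping is optional there.
    if req.needs_edge_low_latency:
        return "OnEdge"
    # SaaS lists no air-gapped capability, so an air-gap requirement rules it out.
    if req.needs_air_gap:
        return "OnPrem"
    return "SaaS"


if __name__ == "__main__":
    print(recommend_deployment(Requirements()))
    print(recommend_deployment(Requirements(needs_air_gap=True)))
    print(recommend_deployment(Requirements(needs_edge_low_latency=True)))
```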
Migration & Portability
Agent Portability: All agents are containerized and portable across deployment models. Develop on SaaS, deploy OnPrem, or run hybrid across multiple environments without code changes.
Data Migration: Tools and support are provided for migrating agents, configurations, and knowledge bases between deployment options.
Support & Services
OnPrem Support Includes:
- Tailored deployment training
- 24/7 support for mission-critical deployments
- Regular security patches and updates
- Strategic partnership approach
- Custom SLAs for enterprise clients
Next Steps
- Contact Sales – Discuss OnPrem or OnEdge deployment
- Try SaaS – Start building agents today
- Security & Compliance – Review our security practices
- Pricing – Understand cost models