APIpie AI
APIpie.ai is an AI super aggregator providing a unified API to access a vast array of AI models from leading providers, enabling cost-effective and latency-optimized solutions.
Verdict
Common use cases
- Compare model costs before choosing a provider
- Audit API spend by reviewing query history
- Check country restrictions for compliance
- Clean up unused vector collections
- Track token usage across team projects
Integration
- Vendor
- APIpie AI
- Category
- other
- Auth
- API_KEY
- Tools
- 4
- Composio slug
apipie_ai
Tools
- Delete Vectorsdestructive
Tool to delete an entire vector collection or specific vectors. use when you need to remove vector data after identifying your target collection or vectors.
- Fetch model restrictions
Tool to retrieve a list of country restrictions for models. use when you need to verify allowed deployment countries before final model selection.
- Get query history
Tool to retrieve historic api usage logs including latency, token counts, costs, and source ip. use after authenticating to analyze past queries for cost management, performance monitoring, or auditing.
- List AI Models
Tool to fetch a list of available ai models. use when you need up-to-date model listings and want to filter by. model type, subtype, provider, or retrieve specialized lists like voices or restrictions.
Setup
Setup guide
- 11. Open your Switchy workspace and navigate to Settings > Integrations > MCP Servers. 2. Click 'Add MCP Server' and select APIpie AI from the catalog. 3. Log into your APIpie account at apipie.ai and generate an API key from the dashboard under API Keys or Account Settings. 4. Copy the key and paste it into the API Key field in Switchy, then click Connect. 5. Switchy will verify the key by pinging APIpie's authentication endpoint — you'll see a green checkmark if successful. 6. Open any Space in your workspace and type '@APIpie AI list models with provider openai' to confirm the connection works. 7. If the MCP returns a list of OpenAI models, you're ready to use all four tools in any conversation.
What teammates see: by default, memories from APIpie AI are scoped to the Space (PROJECT visibility) - you can mark any memory PRIVATE or share it ORG-wide.
Works well with
Top models
Compatibility data appears once enough Spaces have used this MCP together with a given model.
How Switchy teams use it
Starter prompts
List GPT Models
@APIpie AI list all models from OpenAI with 'gpt' in the name. Show me the model ID, context window, and cost per million tokens.Open in a Space →
Review Last Week's Spend
@APIpie AI get my query history for the last 7 days. Group results by model and show total tokens used plus total cost.Open in a Space →
Check Deployment Restrictions
@APIpie AI fetch country restrictions for model 'gpt-4'. Tell me if it's available in the EU and Canada.Open in a Space →
Delete Test Vectors
@APIpie AI delete the vector collection named 'prototype-embeddings-v2'. Confirm when it's gone.Open in a Space →
Find Voice Models
@APIpie AI list all voice models. Filter by provider and show me sample rates and supported languages.Open in a Space →
Example outputs
Illustrative - representative of the model's voice and quality, not literal recordings.
@APIpie list all available vision models from Anthropic and OpenAI, showing their context windows and pricing
Found 6 vision-capable models matching your criteria. Anthropic Claude 3.5 Sonnet supports 200K context at $3/$15 per million tokens (input/output). Claude 3 Opus offers 200K context at $15/$75. OpenAI GPT-4o provides 128K context at $2.50/$10, while GPT-4o mini offers 128K at $0.15/$0.60. GPT-4 Turbo supports 128K context at $10/$30. All models accept image inputs alongside text. This is an illustrative summary—actual availability and pricing may vary by region and account tier.
This example demonstrates the List AI Models tool's filtering capabilities. Useful for comparing options before committing to a provider. Note that model availability changes frequently, and some models may have country restrictions you'll need to verify separately using the fetch restrictions tool.
@APIpie show me the last 20 API calls from my account, breaking down total cost and average latency by model
Retrieved 20 queries from the past 48 hours. Total cost: $2.47. GPT-4o accounted for 12 calls ($1.89, avg 847ms latency). Claude 3.5 Sonnet had 6 calls ($0.52, avg 1,203ms). Gemini 1.5 Pro had 2 calls ($0.06, avg 612ms). Highest single query cost was $0.34 (GPT-4o, 18.2K output tokens). All requests originated from IP 203.0.113.45. This represents a sample query history view—your actual logs will reflect your usage patterns and may include additional metadata.
The Get query history tool enables cost tracking and performance auditing across providers. Requires valid API key authentication. Particularly valuable for teams managing multi-model workflows or optimizing spend. Latency data helps identify which models best fit your response-time requirements.
@APIpie delete all vectors in the 'customer-feedback-q1' collection since we've migrated to a new schema
Confirmed deletion request for collection 'customer-feedback-q1'. This action will permanently remove all vector embeddings and associated metadata from this collection. The operation is irreversible. If you intended to delete specific vectors rather than the entire collection, you can target individual vector IDs instead. Proceed with caution—this example illustrates the deletion workflow, but in production you should verify collection names and backup critical data before executing.
The Delete Vectors tool handles both full collection removal and selective vector deletion. This is a destructive operation with no undo mechanism. Best practice: confirm collection names carefully and maintain backups of vector data if recovery might be needed. Useful when rotating embeddings or cleaning up deprecated datasets.
Use-case deep-dives
When APIpie wins for client billing transparency
A 6-person agency runs client projects across GPT-4, Claude, and Gemini, billing model usage back to each account. APIpie's query history tool surfaces token counts, costs, and source IP per request, so the ops lead can reconcile invoices without spreadsheet archaeology. The model restrictions tool confirms which clients can deploy in EU-only regions before quoting. This works if you're routing through APIpie's gateway already—if you're calling OpenAI direct, you're adding a proxy layer just for logs, which only pays off above $2k/month in model spend. For teams under that threshold, a simple usage dashboard in your existing provider is faster. If you're multi-provider and need per-request attribution, APIpie closes the gap between raw API logs and actual client billing.
When this MCP speeds up voice feature prototyping
A 3-person startup is building a voice assistant demo for Series A pitch meetings and needs to test ElevenLabs, Play.ht, and Azure voices in one afternoon. The List AI Models tool filters by voice subtype and returns the current catalog without hunting through three vendor docs. The fetch restrictions tool confirms which voices work in the founder's target markets before the demo script is finalized. This is a win if you're genuinely comparing providers—if you've already picked ElevenLabs, you don't need a meta-catalog. The vector deletion tool is irrelevant here unless you're also embedding user transcripts, which most early demos skip. For teams that prototype across providers weekly, APIpie cuts model research from an hour to five minutes. If you build on one stack, it's overkill.
When query logs matter for SOC 2 evidence
A 12-person SaaS company is prepping for SOC 2 and needs to prove that AI model calls respect data residency rules and log access patterns. The get query history tool exports latency, token counts, and source IPs for the auditor's sampling period. The model restrictions tool documents which models were available in compliant regions at the time of each call. This works if APIpie is your gateway and you're already logging through it—retrofitting this MCP onto direct provider calls won't generate the history. The vector tools are a distraction unless your product embeds user data, which adds a separate compliance surface. If you're under 10k API calls per month, your existing provider's logs are probably sufficient. Above that, APIpie's structured history saves the engineering team from writing custom log parsers for audit season.
Frequently asked
What does the APIpie AI MCP do in Switchy?
It connects your team to APIpie's multi-model AI gateway. You can list available models across providers, check deployment restrictions by country, retrieve query logs with token counts and costs, and manage vector collections. Think of it as a control panel for APIpie's infrastructure — useful if you're routing requests through their service and need visibility into usage or model availability.
Do I need an APIpie account to use this MCP?
Yes. You'll need an API key from APIpie AI, which means you must have an active account with them. Paste that key into Switchy's connection settings. The MCP authenticates every tool call with your key, so query history and vector operations reflect your APIpie account's data. No OAuth dance — just copy the key from APIpie's dashboard.
Can this MCP actually call AI models or just list them?
It lists models and retrieves metadata — it doesn't send inference requests. If you want to generate text or embeddings, you'd call APIpie's inference endpoints directly or use a separate MCP for the underlying provider. This integration is for discovery and ops: finding which models are available, checking restrictions, auditing past queries, and cleaning up vector data.
Why use this instead of APIpie's dashboard or API docs?
Speed and context. Instead of switching tabs to check model availability or dig through logs, you ask Switchy in natural language. The MCP parses APIpie's responses and surfaces the answer inline. It's faster for one-off questions during a conversation, but the dashboard still wins for bulk operations or visual charts. Use whichever fits the moment.
Who on the team should connect the APIpie MCP?
Whoever manages your APIpie account or needs to audit usage. The API key grants read access to query logs and model lists, plus write access to vector collections. If multiple people need this data, consider a shared Switchy workspace where one person connects it. APIpie usage and costs still roll up under the account that owns the key.