otherapi_key

Deepgram

Deepgram provides AI-powered speech recognition and understanding services, offering APIs for real-time and pre-recorded audio transcription, text-to-speech, and audio intelligence.

Verdict

The Deepgram MCP gives your team speech-to-text and text-to-speech tools inside Switchy. @mention Deepgram to transcribe meeting recordings, generate audio from text, or pull usage stats from your Deepgram account. It's useful for teams that record customer calls, produce audio content, or need quick summaries of long recordings. You'll need a Deepgram API key with the right project scopes — the MCP won't work with audio files behind authentication or on private networks.

Common use cases

  • Transcribe customer support calls for review
  • Generate audio clips for product demos
  • Summarize long podcast episodes quickly
  • Detect topics in recorded interviews
  • Check Deepgram usage before month-end

Integration

Vendor
Deepgram
Category
other
Auth
API_KEY
Tools
9
Composio slug
deepgram

Tools

  • Get Project Usage Summary

    Tool to retrieve a summary of usage data for a specified deepgram project. use when you need high-level metrics (submitted, processed, billable durations, etc.) optionally filtered by time window, model, accessor, or tag.

  • Get Public Models

    Tool to retrieve metadata on all the latest public deepgram speech-to-text models. use when you need to list available models; set include outdated to true to include deprecated versions.

  • Get Public TTS Models

    Tool to fetch metadata about all latest public tts voice models. use when you need to list available deepgram tts voices.

  • List Deepgram Projects

    Tool to list all deepgram projects. use after authenticating with your api key.

  • List Project Scopes

    Tool to list all scopes for a specified deepgram project. use when you need to retrieve all permission scopes for a project.

  • Summarize Audio

    Tool to generate concise summaries from audio content. use when you need a quick summary of audio after confirming the file url is publicly accessible.

  • Text-to-Speech (REST)

    Tool to convert text into natural-sounding speech. use when you need tts audio from text inputs.

  • Topic Detection

    Tool to transcribe audio and detect topics. use when you need to identify the main topics discussed in an audio recording after confirming the file url is publicly accessible.

  • Transcribe Pre-recorded Audio

    Tool to transcribe pre-recorded audio files into text. use when you need to convert an audio file at a url to text after confirming the file url is publicly accessible.

Setup

Setup guide

  1. 11. In Switchy, open Settings and navigate to Integrations. 2. Find Deepgram in the MCP catalog and click Connect. 3. You'll be prompted for a Deepgram API key — log into your Deepgram console, create a new key under API Keys, and copy it. 4. Paste the key into Switchy and click Authorize. 5. Switchy will confirm the connection by listing your Deepgram projects. 6. Open any Space and type '@Deepgram list my projects' to verify the MCP responds. 7. To transcribe audio, make sure your file is publicly accessible via URL — Deepgram can't fetch files from authenticated endpoints or local storage. 8. Test with '@Deepgram transcribe this audio: [URL]' using a sample file. 9. If you see usage errors, check your Deepgram account balance and API key scopes in the console.

What teammates see: by default, memories from Deepgram are scoped to the Space (PROJECT visibility) - you can mark any memory PRIVATE or share it ORG-wide.

Works well with

Top models

Compatibility data appears once enough Spaces have used this MCP together with a given model.

How Switchy teams use it

Not enough Spaces yet to publish anonymised usage stats (we require ≥ 50 Spaces per week).

Starter prompts

Transcribe Meeting Recording

@Deepgram transcribe this meeting recording and include speaker diarization: [paste public audio URL]
Open in a Space →

Summarize Call in 3 Sentences

@Deepgram summarize this support call in three sentences: [paste public audio URL]
Open in a Space →

Generate TTS Audio Clip

@Deepgram convert this script to audio using a professional voice: [paste text]
Open in a Space →

Identify Interview Topics

@Deepgram detect the main topics in this interview recording: [paste public audio URL]
Open in a Space →

Check This Month's Usage

@Deepgram show me usage summary for this month across all projects
Open in a Space →

Example outputs

Illustrative - representative of the model's voice and quality, not literal recordings.

Prompt

@Deepgram transcribe this sales call recording and identify the main topics discussed: https://example.com/call-2024-01-15.mp3

Output

I've transcribed the 18-minute call and detected four primary topics: pricing negotiation (6m 12s), product feature requests (4m 38s), implementation timeline (3m 45s), and contract terms (3m 25s). The prospect raised concerns about annual vs. monthly billing and requested custom SSO integration. Full transcript shows the conversation shifted from technical requirements to commercial terms around the 9-minute mark.

Notes

This example uses Topic Detection, which combines transcription with AI-powered topic extraction. The audio file must be publicly accessible via URL—Deepgram fetches it directly. Processing time scales with audio length; a 20-minute file typically takes 30-60 seconds. Your API key must have transcription scope enabled.

Prompt

@Deepgram generate a 30-second voice message for our support team using the Aura model: 'Your ticket has been escalated to engineering. We'll follow up within 24 hours with a detailed timeline.'

Output

I've generated the TTS audio file using Deepgram's Aura voice model. The output is a 12-second MP3 (the text was shorter than your 30-second estimate) with natural prosody and pacing suitable for customer-facing communications. The file is available at the returned URL for 24 hours. You can specify different voices (asteria, luna, stella, etc.) or adjust speaking rate if needed.

Notes

This demonstrates text-to-speech via the REST API. The response includes a temporary audio URL that expires after 24 hours—download and store it if you need persistent access. TTS requests count against your character quota. You can preview all available Aura voices by asking Deepgram to list public TTS models first.

Prompt

@Deepgram show me our transcription usage for the last 30 days—I need to forecast next month's bill.

Output

For the past 30 days across your project, you've submitted 847 hours of audio for transcription. Of that, 831 hours were successfully processed, and 824 hours are billable (some files were duplicates or below minimum duration). Your daily average is 27.5 billable hours. At your current tier rate, that projects to approximately $2,472 for a full month. Peak usage occurred on January 8th with 64 hours processed.

Notes

This uses Get Project Usage Summary to pull billing metrics. The tool returns aggregate data—you won't see per-file breakdowns here, but you get the high-level view needed for budget planning. If you manage multiple Deepgram projects, specify which project ID to query, or list all projects first to identify the right one. Usage data updates hourly, not real-time.

Use-case deep-dives

Customer support call analysis

When Deepgram wins for support QA at scale

A 6-person support team handling 200+ customer calls weekly needs to spot escalation patterns without listening to every recording. Deepgram's topic detection and summarization tools let you batch-process call audio overnight and surface recurring issues in morning standup. The MCP exposes both transcription and topic tagging in one call, so you can filter by keyword or theme without building a pipeline. This works best when your audio is already hosted (S3, CDN) and publicly accessible—if files live behind auth, you'll spend more time wrangling URLs than analyzing calls. If your team reviews fewer than 50 calls a month, manual spot-checks are faster than setting up API keys and managing usage quotas. For teams running structured QA on hundreds of calls, Deepgram's MCP turns audio into searchable, taggable data in under an hour of setup.

Podcast production workflow

Why this MCP matters for solo creators with guests

A solo podcaster recording 2-4 interviews a month wants to generate show notes and pull quotes without hiring a VA. Deepgram's summarization and text-to-speech tools let you transcribe the raw audio, extract a 3-paragraph summary, and even generate promo clips with synthetic voiceovers for social. The MCP's public models endpoint shows you which voices and transcription models are current, so you're not guessing at deprecated options. This setup shines when your episodes are under 90 minutes and you're comfortable hosting audio files on a public URL for processing. If you're editing 10+ episodes a week or need speaker diarization with custom vocabulary, you'll hit the limits of the basic toolset and want Deepgram's full API with webhooks. For the 1-4 episode-per-month creator, this MCP delivers production-ready transcripts and summaries in a single Switchy workspace without learning their entire platform.

Sales team call coaching

When to use Deepgram for rep feedback loops

A 3-person sales team records discovery calls and wants the manager to review talk-time balance and objection handling without replaying 45-minute recordings. Deepgram's topic detection flags when pricing or competitor names come up, and the summarization tool gives a 4-sentence recap of each call for weekly 1-on-1s. The MCP's usage summary tool also tracks your monthly transcription spend, so you know when you're approaching your plan limit before the bill surprises you. This works when calls are stored in a shared drive with public links and your team reviews 10-30 calls a month. If you need real-time transcription during live calls or want to score sentiment on every objection, the MCP's batch-oriented tools won't keep up—you'd need Deepgram's streaming API or a dedicated conversation intelligence platform. For post-call coaching at small team scale, this MCP turns audio into coachable insights in under 5 minutes per call.

Frequently asked

What does the Deepgram MCP do in Switchy?

It connects Switchy to Deepgram's speech-to-text and text-to-speech APIs. Your team can transcribe audio files, generate summaries from recordings, detect topics in conversations, and convert text to spoken audio — all without leaving the workspace. It also pulls usage metrics and lists available voice models.

Do I need a Deepgram account to use this MCP?

Yes. You need a Deepgram account and an API key with project-level permissions. The MCP authenticates using that key, so whoever connects it must have access to the Deepgram project you want to query. Free-tier keys work, but usage limits apply on Deepgram's side.

Can it transcribe audio from Zoom or Slack calls?

Only if you have a publicly accessible URL for the audio file. The MCP can't pull recordings directly from Zoom or Slack — you need to export the file first, host it somewhere Deepgram can reach, then pass the URL to the transcription tool. For live calls, use Deepgram's streaming API separately.

Why use this instead of Deepgram's web console?

The MCP lets your team transcribe, summarise, and synthesise speech inside Switchy's shared workspace, where the output lives alongside your other work. You skip the copy-paste loop between Deepgram's dashboard and your docs. It's faster for ad-hoc tasks; for bulk processing, Deepgram's API or console is still better.

Who on the team should connect the Deepgram MCP?

Whoever owns your Deepgram account or has an API key with read/write project access. Once connected, any Switchy user in the workspace can invoke the tools — transcription usage counts against your Deepgram plan, not Switchy's. If you're on a free Deepgram tier, watch your monthly minutes.

Data last verified 607 hours ago.Sources aggregated hourly to weekly. See docs/architecture/model-directory.md.