docsapi_key

LLMWhisperer

LLMWhisperer is a technology that presents data from complex documents to LLMs in a way that they can best understand.

Verdict

LLMWhisperer converts PDFs and scanned documents into text that LLMs can actually parse. When you @mention it in a Space, your team can extract tables, forms, and multi-column layouts from contracts, invoices, or research papers without the garbled formatting that breaks most OCR. It handles async jobs — you submit a file, poll for status, then pull the cleaned text. Best for teams dealing with legacy documents or client uploads that need to feed into prompts. You'll need an API key from LLMWhisperer's dashboard, and each conversion counts against your account's page quota.

Common use cases

  • Extract tables from scanned invoices for analysis
  • Convert contract PDFs into prompt-ready text
  • Pull data from multi-column research papers
  • Process client uploads before summarization
  • Monitor document conversion quotas across projects

Integration

Vendor
LLMWhisperer
Category
docs
Auth
API_KEY
Tools
10
Composio slug
llmwhisperer

Tools

  • Check Whisper Status

    Tool to check the status of a text extraction process in LLMWhisperer. Use when the conversion is done in async mode to poll for completion status.

  • Convert Document to Text

    Tool to convert PDFs and scanned documents into LLM-optimized text format asynchronously. Use when you need to extract text from documents for LLM processing. After submission, use the returned whisper_hash to poll status and retrieve conve

  • Delete Webhook
    destructive

    Tool to delete a registered webhook from LLMWhisperer system. Use when you need to remove a webhook that is no longer needed.

  • Get Usage Information

    Tool to check usage metrics of your LLMWhisperer account. Use when you need to monitor API consumption, verify quotas, or check remaining page limits.

  • Get Usage Statistics

    Tool to retrieve usage statistics for your LLMWhisperer account based on a specific tag. Use when you need to check consumption metrics for a given tag and optional date range. Returns usage data for the preceding 30 days when date paramete

  • Get Webhook Details

    Tool to retrieve registered webhook details for LLMWhisperer. Use when you need to get the configuration of a specific webhook including its URL and authentication token.

  • Get Whisper Detail

    Tool to retrieve comprehensive details about ongoing or completed text extraction process. Use when you need to monitor the status and progress metrics of a text extraction job.

  • Register Webhook

    Tool to register a new webhook endpoint for LLMWhisperer async notifications. Use when you need to set up a callback URL to receive processing results. During registration, a test payload is sent to verify the webhook endpoint is functionin

  • Retrieve Whisper Text

    Tool to retrieve extracted text from asynchronous whisper processing. Use when the conversion process was initiated in async mode and you need to retrieve the results using the whisper_hash identifier. Note that retrieval is single-use for

  • Update Webhook Configuration

    Tool to update an existing webhook configuration for document conversion callbacks. Use when you need to modify the callback URL, authentication token, or webhook identifier. The system validates the webhook by sending a test payload and re

Setup

Setup guide

  1. 11. Open your Switchy workspace settings and navigate to the MCP Integrations panel. 2. Click 'Add Integration' and select LLMWhisperer from the docs category. 3. Visit the LLMWhisperer dashboard at unstract.com, sign in, and generate an API key from the account settings. 4. Paste the API key into Switchy's connection form and click 'Connect'. 5. Switchy will verify the key by fetching your usage information. 6. Once connected, open any Space and type '@LLMWhisperer convert this PDF' with a file attachment to test the integration. 7. The tool will return a whisper ID — use '@LLMWhisperer check status for [whisper_id]' to poll until extraction completes. 8. When status shows 'processed', retrieve the cleaned text and paste it into your next prompt. 9. Check '@LLMWhisperer get usage' periodically to monitor your remaining page quota.

What teammates see: by default, memories from LLMWhisperer are scoped to the Space (PROJECT visibility) - you can mark any memory PRIVATE or share it ORG-wide.

Works well with

Top models

Compatibility data appears once enough Spaces have used this MCP together with a given model.

How Switchy teams use it

Not enough Spaces yet to publish anonymised usage stats (we require ≥ 50 Spaces per week).

Starter prompts

Extract Invoice Data

@LLMWhisperer convert this invoice PDF to text. Once extraction completes, parse out vendor name, line items, subtotal, tax, and total amount into a JSON object.
Open in a Space →

Summarize Contract Terms

@LLMWhisperer extract text from this contract PDF. After processing, summarize the payment terms, deliverable deadlines, and any termination provisions in bullet points.
Open in a Space →

Check Conversion Status

@LLMWhisperer check the status of whisper ID [paste_id_here]. If it's done, retrieve the full text so I can analyze it.
Open in a Space →

Monitor Page Quota

@LLMWhisperer get my account usage information. Show me how many pages I've processed this month and how many remain in my quota.
Open in a Space →

Parse Research Paper

@LLMWhisperer convert this research paper to text. Once ready, pull out the abstract, methodology, and key findings into separate sections.
Open in a Space →

Example outputs

Illustrative - representative of the model's voice and quality, not literal recordings.

Prompt

@LLMWhisperer extract text from this 47-page contract PDF and format it so Claude can analyze the liability clauses

Output

I've submitted your PDF for async extraction (whisper_id: wh_abc123). LLMWhisperer is processing 47 pages with layout-preserving mode enabled to maintain clause structure. Estimated completion: 90 seconds. I'll poll the status and retrieve the optimized text once ready, then analyze the liability sections you mentioned. This example assumes the conversion completes successfully; actual processing time varies by document complexity and current queue depth.

Notes

This showcases LLMWhisperer's core capability: converting complex PDFs into LLM-friendly text while preserving semantic structure. The async workflow requires polling with Check Whisper Status until completion. API key must have document conversion permissions. Large documents consume more page quota—check usage limits before batch processing.

Prompt

@LLMWhisperer how many pages have I processed this month, and do I have enough quota left for 200 more scanned invoices?

Output

Your LLMWhisperer account shows 3,847 pages processed in the current billing cycle (started March 1). Your plan includes 10,000 pages/month, leaving 6,153 pages available. Processing 200 scanned invoices at an average of 2 pages each would consume ~400 pages, well within your remaining quota. This example reflects typical usage data returned by the Get Usage Information tool; actual metrics depend on your account tier and processing history.

Notes

Demonstrates usage monitoring—critical for teams processing documents at scale. The Get Usage Information tool returns real-time quota data, helping you avoid mid-workflow failures due to limit exhaustion. No special permissions needed beyond API key access. Useful for capacity planning before bulk operations.

Prompt

@LLMWhisperer set up a webhook at https://our-app.com/llm-results so we get notified when document conversions finish instead of polling

Output

I've registered your webhook endpoint with LLMWhisperer. During registration, they sent a test payload to https://our-app.com/llm-results to verify connectivity—your server must respond with 200 OK to confirm. Once active, you'll receive POST notifications with whisper_id and status whenever async conversions complete, eliminating the need for status polling. This example assumes your endpoint is publicly accessible and configured to handle LLMWhisperer's payload format.

Notes

Shows webhook setup for event-driven workflows—more efficient than polling for teams processing many documents. Your endpoint must be reachable by LLMWhisperer's servers and return proper HTTP responses during registration. The auth token returned should be validated on incoming webhook calls. Use Delete Webhook to clean up unused registrations.

Use-case deep-dives

Contract review for legal ops

When LLMWhisperer makes sense for high-volume PDF intake

A 3-person legal ops team processes 40-60 vendor contracts per week, mostly scanned PDFs with mixed formatting. LLMWhisperer is the right call here because it handles the OCR and layout normalization before the LLM sees the text—no more garbled tables or missing clauses in the AI summary. The async mode with webhook registration means you drop the PDF in Slack, the extraction runs in the background, and the team gets a clean text block ready for clause extraction or risk flagging. The trade-off: if your contracts are already digital-native PDFs with clean text layers, you're paying for OCR you don't need. Check usage stats by tag to confirm you're not burning pages on documents a simpler tool could handle. For teams doing 200+ pages a week of scanned intake, this is the move.

Customer support knowledge extraction

Why LLMWhisperer works for one-off document questions

A 5-person support team gets ad-hoc questions about product specs buried in 80-page user manuals—PDFs that change every quarter. LLMWhisperer fits because you're not building a persistent knowledge base; you're converting a specific manual on-demand when a ticket references it, then feeding the text to the LLM for a targeted answer. The status-check tool lets you poll for completion without blocking the support agent, and the usage info tool keeps you honest about whether you're over-indexing on document conversion versus just maintaining a FAQ. The boundary: if the same 10 manuals get referenced daily, you should convert them once and store the text in a vector DB, not re-whisper them every time. For infrequent, high-page-count lookups where the source changes often, this is faster than manual copy-paste.

Grant application research for nonprofits

When async PDF processing beats real-time for research workflows

A 2-person nonprofit team reviews 15-20 foundation RFPs per month, each a 30-50 page PDF with eligibility criteria and reporting requirements. LLMWhersperer's async mode is the right fit because the team batches the research—drop all the RFPs into a folder on Monday, register a webhook to ping when extraction finishes, then spend Tuesday comparing requirements across the converted text. The tag-based usage stats let you track cost per grant cycle, which matters when you're on a tight budget. The limit: if you need instant answers during a live call with a program officer, the polling delay (even at a few seconds) breaks the flow. For batch research where you can wait 30 seconds per document and need clean text from scanned foundation PDFs, this is the cleanest path to LLM-ready input.

Frequently asked

What does the LLMWhisperer MCP do in Switchy?

It converts PDFs and scanned documents into text optimised for AI processing. Your team can upload documents through Switchy, extract the text asynchronously, and feed it directly into Claude or other LLMs without manual copy-paste. Useful for contract review, research summarisation, or any workflow where locked-up document text needs to become AI-readable input.

Do I need a paid LLMWhisperer account to use this MCP?

Yes. You need an active LLMWhisperer subscription and an API key. The MCP authenticates with that key, and all document conversions count against your account's page quota. Free trials exist, but production use requires a paid plan. One team member with API access can connect it for everyone in your Switchy workspace.

Can it extract tables and images from PDFs?

It extracts text in a layout-preserving format optimised for LLMs, which handles tables reasonably well. Images are not extracted as files — OCR text from scanned images is included, but charts or diagrams won't come through as visual assets. If you need pixel-perfect table extraction or image files, export those separately before feeding text to the AI.

Why use this instead of uploading PDFs directly to Claude?

Claude's native PDF support is limited to smaller files and doesn't always preserve complex layouts. LLMWhisperer pre-processes documents into clean, structured text that works reliably across large files, scanned images, and multi-column layouts. You also get usage tracking and webhook callbacks for async jobs, which Claude's file upload doesn't provide.

Does document conversion count against my Switchy plan limits?

No. LLMWhisperer usage counts against your LLMWhisperer account quota, not Switchy's. However, the extracted text you feed into Claude does consume Switchy tokens. Connect one LLMWhisperer account per workspace — multiple team members can trigger conversions through the same MCP connection without needing separate API keys.

Data last verified 607 hours ago.Sources aggregated hourly to weekly. See docs/architecture/model-directory.md.