docsapi_key

Nanonets

Nanonets provides an AI-driven Intelligent Document Processing API that transforms unstructured documents into structured data, enabling efficient data extraction and workflow automation.

Verdict

Nanonets exposes OCR and document workflow tools inside Switchy. @mention it to create custom OCR models, upload training images, run inference on PDFs or scans, and retrieve extracted text or structured data. Teams that process invoices, receipts, contracts, or forms get the most value — you can train a model on your document format, then query it from chat instead of logging into a web portal. The MCP handles model lifecycle (create, train, delete) and prediction retrieval, but you'll need a Nanonets API key and existing workflows configured in their dashboard before you can process documents at scale.

Common use cases

  • Extract invoice line items from PDF uploads
  • Train custom OCR on proprietary form layouts
  • Retrieve processed documents from workflow queue
  • Audit model training images before retraining
  • Delete obsolete OCR models after project ends

Integration

Vendor
Nanonets
Category
docs
Auth
API_KEY
Tools
11
Composio slug
nano_nets

Tools

  • Create OCR Model

    Tool to create a new ocr model. use when you need to initialize an ocr model before training.

  • Delete OCR Model
    destructive

    Tool to delete an ocr model. use when you need to permanently remove a trained model by its id.

  • Get all OCR models

    Tool to retrieve a paginated list of all ocr models. use when you need to inspect or manage existing ocr models. call after authentication to fetch your account's models.

  • Get All Prediction Files

    Tool to fetch all prediction files associated with a specific model. use when you need to list all inference requests after model processing is complete.

  • Get OCR Model Details

    Tool to retrieve details of an ocr model. use when you need full metadata of a model by its id.

  • Get OCR Training Images

    Tool to retrieve training images for an ocr model. use when you need to page through images associated with a model before training or analysis.

  • Get Workflows

    Tool to retrieve a list of all workflows in your nanonets account. use when you need to inventory or inspect all configured workflows.

  • List Workflow Documents

    Tool to retrieve a paginated list of documents processed by a workflow. use when you need to view documents after processing.

  • Update OCR Model

    Tool to update an ocr model's details. use after reviewing the model's current configuration to modify its name, categories, notes, or classification settings.

  • Upload Training Images by File

    Tool to upload training images by file to a specified ocr model. use when adding files to a model for training purposes.

  • Upload Training Images by URL

    Tool to upload training images by url to a specified ocr model. use when adding urls of images to a model for training purposes.

Setup

Setup guide

  1. 11. In Switchy, open your workspace settings and navigate to the MCP integrations panel. 2. Select 'Add MCP' and choose Nanonets from the catalog. 3. You'll be prompted to paste a Nanonets API key — generate one by logging into your Nanonets account, clicking your profile icon, selecting 'API Keys', and copying the key. 4. Paste the key into Switchy and click 'Connect'. 5. Switchy will verify the key by fetching your OCR models; if successful, you'll see a green checkmark. 6. Open any Space, type '@Nanonets get all ocr models' and send the message to confirm the connection works. 7. If you see a list of models (or an empty array if you haven't created any yet), the integration is live and ready to use.

What teammates see: by default, memories from Nanonets are scoped to the Space (PROJECT visibility) - you can mark any memory PRIVATE or share it ORG-wide.

Works well with

Top models

Compatibility data appears once enough Spaces have used this MCP together with a given model.

How Switchy teams use it

Not enough Spaces yet to publish anonymised usage stats (we require ≥ 50 Spaces per week).

Starter prompts

List My OCR Models

@Nanonets get all ocr models and show me their names, IDs, and creation dates in a table
Open in a Space →

Create Invoice OCR Model

@Nanonets create a new ocr model named 'Invoice Extractor Q1 2025' and return the model ID
Open in a Space →

Fetch Workflow Documents

@Nanonets list all documents from workflow ID wf_abc123 and summarize the file names and processing status
Open in a Space →

Inspect Training Images

@Nanonets get training images for model ID mod_xyz789 and tell me how many images are uploaded
Open in a Space →

Clean Up Old Models

@Nanonets delete ocr model with ID mod_old456 and confirm it's been removed
Open in a Space →

Example outputs

Illustrative - representative of the model's voice and quality, not literal recordings.

Prompt

@Nanonets list all my OCR models and show me which ones have been trained in the last 30 days

Output

Here are your Nanonets OCR models. You have 4 models total: 'Invoice Parser v2' (trained 12 days ago, 847 images), 'Receipt Extractor' (trained 8 days ago, 1,203 images), 'Contract Analyzer' (trained 22 days ago, 456 images), and 'Legacy Invoice v1' (trained 94 days ago, 312 images). The first three models fall within your 30-day window. Invoice Parser v2 and Receipt Extractor are your most active models based on recent training activity.

Notes

This example demonstrates the read-heavy 'Get all OCR models' tool. The response synthesizes metadata from multiple models. Note that you'll need a valid API key with read access to your Nanonets account. Model counts and training dates will vary by your actual account state.

Prompt

@Nanonets create a new OCR model called 'Purchase Order Extractor' for processing vendor POs from our procurement inbox

Output

I've created a new OCR model named 'Purchase Order Extractor' in your Nanonets account. The model ID is `mdl_8k3j9x2p`. This model is now initialized but not yet trained—you'll need to upload training images (typically 50-100 sample purchase orders with labeled fields) before it can process documents. Once trained, you can use this model to extract structured data from PO PDFs or images.

Notes

This showcases the 'Create OCR Model' tool, which initializes a new model but does not train it. After creation, you must upload labeled training images separately. The model won't be usable for predictions until training completes, which can take 15-45 minutes depending on dataset size.

Prompt

@Nanonets pull all documents processed by my 'Invoice Parser v2' workflow in the last 7 days and summarize common extraction errors

Output

I retrieved 143 documents from your 'Invoice Parser v2' workflow (last 7 days). Common extraction issues: 18 documents had low-confidence vendor name extraction (likely due to handwritten headers), 12 failed to parse line-item tables with merged cells, and 7 flagged mismatched totals between extracted subtotal and tax fields. 106 documents processed cleanly with >95% confidence across all fields. The handwritten-header issue appears most frequently on documents from vendor 'Acme Supplies'.

Notes

This combines 'List Workflow Documents' with AI reasoning to surface patterns. The MCP returns raw document metadata and confidence scores; the AI synthesizes trends. Useful for quality audits, but note that document counts and error types depend entirely on your workflow's recent activity and model accuracy.

Use-case deep-dives

Invoice processing for finance teams

When Nanonets handles recurring invoice layouts at scale

A 6-person finance team at a SaaS company processes 200+ vendor invoices monthly, each following one of twelve standard layouts. Nanonets wins here because you train one OCR model per vendor template, then route incoming PDFs through the matching workflow. The Create OCR Model and Get OCR Training Images tools let your team label 20-30 sample invoices per vendor in an afternoon, then the prediction endpoint extracts line items with 95%+ accuracy. If your invoices are one-off formats or handwritten, accuracy drops fast and you'll spend more time correcting than you save. For recurring B2B invoice formats, Nanonets cuts manual data entry from 40 hours to 4 hours per month.

Onboarding document verification for HR

Why Nanonets fits compliance-heavy document intake

A 3-person HR team at a 50-employee startup onboards 8-12 new hires per quarter, each submitting tax forms, ID scans, and signed offer letters. Nanonets is the right call when you need to extract structured fields (SSN, address, signature date) from semi-standardized government forms and route them into your HRIS. You create one workflow per document type using Get Workflows, then the List Workflow Documents tool shows which submissions passed validation. The threshold: if your forms vary wildly by state or country, you'll train 20+ models and the setup cost outweighs the benefit. For US-only onboarding with 3-5 repeating form types, Nanonets automates 80% of the data entry and flags edge cases for human review.

Support ticket attachment triage

When Nanonets speeds up screenshot analysis in support queues

A 10-person customer support team receives 400 tickets weekly, 60% including screenshots of error messages or account dashboards. Nanonets helps when you need to extract error codes or account IDs from images before routing tickets to the right specialist. You train a single OCR model on your app's UI using Get OCR Training Images, then the prediction tools pull structured data from each screenshot attachment. The model works if your UI is consistent; if customers submit photos of monitors or heavily cropped images, accuracy falls below 70% and manual review takes longer than the automation saves. For SaaS products with stable UI and clear error states, Nanonets cuts ticket routing time from 15 minutes to 2 minutes per ticket.

Frequently asked

What does the Nanonets MCP do in Switchy?

It connects Switchy to your Nanonets account so AI agents can create, train, and query OCR models without leaving the workspace. Agents can upload training images, run document extraction workflows, and retrieve prediction results — useful when your team needs to automate invoice parsing, receipt scanning, or custom document understanding tasks at scale.

Do I need a Nanonets API key to connect this MCP?

Yes. The MCP authenticates with a Nanonets API key, which you generate from your Nanonets dashboard under account settings. You'll paste that key into Switchy once during setup. Anyone on your team who connects the MCP must have their own key, or you can share one service account key if your Nanonets plan allows it.

Can the MCP train new OCR models or only query existing ones?

Both. Agents can create fresh OCR models, upload training images, and delete models when they're obsolete. They can also list all models in your account, fetch training data, and pull prediction files from completed workflows. If you only want read access, keep the API key scoped to inference endpoints in Nanonets.

Why use this MCP instead of calling Nanonets APIs directly?

The MCP wraps Nanonets' REST API in natural-language tools, so your team asks "extract line items from this invoice" instead of writing curl commands. It's faster for ad-hoc document tasks and lets non-technical users trigger OCR workflows. For high-volume batch jobs, calling the API from your own scripts is still more efficient.

Who on the team should connect the Nanonets MCP?

Whoever manages your document automation workflows or needs to inspect OCR results. Typically ops, finance, or data teams who already use Nanonets for invoice processing or form extraction. If multiple people need access, decide whether to share one API key or issue separate keys per user, depending on your Nanonets plan and audit requirements.

Data last verified 607 hours ago.Sources aggregated hourly to weekly. See docs/architecture/model-directory.md.