B
LLMbytedance

ByteDance: UI-TARS 7B

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

Specifications

Provider
bytedance
Category
llm
Context length
128,000 tokens
Max output
2,048 tokens
Modalities
image, text
License
proprietary
Released
2025-07-22

Pricing

Input
$0.10/Mtok
Output
$0.20/Mtok
Model ID
bytedance/ui-tars-1.5-7b

Team cost calculator

Estimated monthly spend
$2.29
17.6M tokens / month
5 seats · 80 msgs/day

Providers

ProviderContextInputOutputP50 latencyThroughput30d uptime
bytedance128k$0.10/Mtok$0.20/Mtok

Performance

Performance snapshots are collected daily. Check back after the next ingestion run.

Benchmarks

Public benchmark scores are not available yet for this model. Check back after the next ingestion run.

Works well with

Top MCPs

Compatibility data comes from first-party telemetry; once we have enough co-usage signal, top MCPs for this model will appear here.

How Switchy teams use it

Not enough Spaces have used this model yet to share anonymised team stats. We wait for at least 50 distinct Spaces per week before publishing any aggregate.

Starter prompts

Starter prompts for this model will land here soon.
Data last verified just now.Sources aggregated hourly to weekly. See docs/architecture/model-directory.md.