Directory
Every AI model, in one place.
Pricing, benchmarks, provider latency, and how teams actually use each one.
4 matches
- BbytedanceByteDance: UI-TARS 7B
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
Language128k ctx$0.10/M - BbytedanceSeedance 1.5 Pro
Simultaneous video and audio generation with multi-language lip-sync and cinematic camera control.
Video0$0.00/M - BbytedanceSeedance 2.0
First unified audio-video joint generation model with phoneme-level lip-sync in 8+ languages.
Video0$0.00/M - BbytedanceSeedance 2.0 Fast
Speed-optimized Seedance variant prioritizing fast generation and lower cost.
Video0$0.00/M