End-to-end AI platform (formerly MLflow + Mosaic ML) for training, fine-tuning, deploying, and monitoring foundation models and custom ML models on the Lakehouse

Jurisdictional exposure

Provider HQ
USSan Francisco, USA

Subject to CLOUD Act, FISA-702, DPF

Region locations
APACEUUKUSOther26 regions across 5 jurisdictions
Sovereign option
Yes — 1 sovereign-flagged region available

Sub-services (3)

Model Training

Distributed training on GPU clusters with MLflow experiment tracking

Model Serving

Real-time and batch inference endpoints with autoscaling

AI Gateway

Unified proxy for external LLM APIs with rate limiting and cost tracking

Compliance & Certifications

This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.

Where this runs

26 regions
11 countries
1sovereign
Sovereign regions (1)
  • AWS GovCloud (US-West) · San Francisco Bay AreaDatabricks on AWS GovCloud (FedRAMP High)
Commercial regions (25)

Europe (7)

  • EU (Frankfurt)
  • Europe West 3 (Frankfurt)
  • EU (Ireland)
  • North Europe (Ireland)
  • West Europe (Netherlands)
  • EU (London)
  • UK South (London)

North America (11)

  • Canada (Central)
  • Canada Central
  • US East (N. Virginia)
  • US East 4 (Virginia)
  • East US (Virginia)
  • East US 2 (Virginia)
  • US East (Ohio)
  • US Central 1 (Iowa)
  • Central US (Iowa)
  • US West (Oregon)
  • West US 2 (Washington)

South America (1)

  • South America (São Paulo)

Asia (4)

  • Asia Pacific (Mumbai)
  • Asia Pacific (Tokyo)
  • Asia Pacific (Singapore)
  • Southeast Asia (Singapore)

Oceania (2)

  • Asia Pacific (Sydney)
  • Australia East (Sydney)

Tags

Equivalent services on other platforms

Alibaba Qwen (Tongyi Qianwen)Alibaba

Alibaba's flagship open-source foundation model family covering Qwen (text), Qwen-VL (vision-language), Qwen-Audio, and Qwen-Coder — accessible via the DashScope API with chat, completion, embeddings, and function-calling endpoints

Amazon SageMakerAWS

Next-generation SageMaker (rebranded SageMaker AI) unifying data, analytics, and AI in one workspace — Studio notebooks, HyperPod for foundation-model training at scale, Lakehouse with QuickSight + S3 Tables integration, AutoPilot AutoML, managed training jobs, hosted inference endpoints, and Feature Store, with re:Invent 2024 introducing the unified SageMaker AI workspace and 2025 Summit additions extending it with lakehouse auto-onboarding

Amazon BedrockAWS

Build generative AI applications with foundation models from Anthropic (Claude Opus 4.7 from April 2026), Cohere, Meta, Mistral, Stability AI, TwelveLabs (video understanding), and Amazon's own Nova family — accessed via a single API with fine-tuning, knowledge bases, agents, and a model marketplace for discovery and easy onboarding

Amazon NovaAWS

AWS-built foundation model family covering text (Micro, Lite, Pro, Premier), image generation (Canvas), and video generation (Reel) — accessed through the Bedrock runtime with tight pricing and low-latency streaming, launched at re:Invent 2024

Azure OpenAI ServiceAzure

Enterprise access to OpenAI models including GPT-4, GPT-3.5, and DALL-E with Azure security, private networking, regional deployments, and pay-as-you-go or provisioned throughput

Azure Machine LearningAzure

End-to-end platform for building and deploying ML models with automated ML, designer (drag-and-drop), managed compute clusters, MLflow tracking, and responsible AI dashboards

Cloudflare Workers AICloudflare

Serverless GPU-backed AI inference at the edge running a catalogue of open-source text, image, speech, and embedding models (Llama, Mistral, Stable Diffusion, Whisper, BGE) with pay-per-neurone pricing and direct binding from Workers code

Gcore Inference at the EdgeGcore

AI inference runtime that deploys models to Gcore's edge POPs and routes requests to the nearest GPU-backed endpoint, with support for open-source LLMs and custom model containers

Vertex AIGCP

Unified platform to build, deploy, and scale ML models with AutoML, custom training on TPUs and GPUs, model registry, pipelines, feature store, and generative AI studio

Gemini APIGCP

Direct API access to Google's most capable multimodal AI models with text, image, audio, and video understanding, long context windows, and function calling support

IBM watsonx.aiIBM

Enterprise AI studio for training, validating, tuning, and deploying foundation models and traditional ML models, with IBM's Granite model family, Hugging Face integration, prompt lab, synthetic data generation, and governance via watsonx.governance

Kanana AIKakao

Kakao's Korean-first foundation-model family (Kanana Flash / Essence / Nano) for chat, code, and embeddings — multilingual but tuned for Korean conversational performance

CLOVA StudioNaver

Naver's HyperCLOVA X foundation-model platform for Korean-language LLM workloads — chat completion, embeddings, function calling, RAG over Korean text with strong native-language performance

Red Hat OpenShift AIOpenShift

Managed MLOps platform (formerly Open Data Hub) for training, serving, and monitoring ML models on OpenShift with JupyterHub, KServe, Kubeflow, and PyTorch operators

OCI Generative AIOracle

Fully managed service offering Cohere and Llama large language models

Generative APIsScaleway

Managed inference for open-source LLMs (Llama, Mistral, DeepSeek) hosted in EU datacentres

Snowflake CortexSnowflake

Fully managed AI and ML service offering hosted LLMs, vector search, and ML functions inside Snowflake SQL

Tencent HunyuanTencent

Tencent's in-house family of large language models (Hunyuan-Pro, Standard, Lite, plus multimodal Hunyuan-Vision) accessible via the Hunyuan API, with enterprise-grade context windows up to 256K, function calling, embeddings, and tuning

Pricing

Pricing model:dbu-consumption