AWS Glue

AWSAnalyticsFree tier available

Serverless data integration platform with visual ETL authoring via Glue Studio, a Hive-compatible Data Catalog, automatic crawlers, DataBrew visual prep, and Glue Data Quality for declarative rule-based validation across Spark and Python jobs

Jurisdictional exposure

Provider HQ
USSeattle, USA

Subject to CLOUD Act, FISA-702, DPF

Region locations
APACCNEEAEUUKUSOther40 regions across 7 jurisdictions
Sovereign option
Yes — 6 sovereign-flagged regions available

Attributes

SLA Uptime
99.9%
Serverless
Yes

Sub-services (5)

Glue Data Catalog

Hive-compatible metastore used by Athena, Redshift Spectrum, and EMR

Glue ETL Jobs

Serverless Spark and Python ETL with Glue Studio visual authoring

Crawlers

Automatic schema discovery and Data Catalog population from S3, JDBC, and more

Glue DataBrew

Visual point-and-click data preparation for analysts

Glue Data Quality

Declarative rule-based validation and anomaly detection on Glue tables

Compliance & Certifications

This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.

Where this runs

40 regions
28 countries
6sovereign
Sovereign regions (6)
  • AWS European Sovereign Cloud (Brandenburg) · BrandenburgAWS European Sovereign Cloud
  • AWS GovCloud (US-East) · AshburnAWS GovCloud (US)
  • AWS GovCloud (US-West) · HillsboroAWS GovCloud (US)
  • AWS European Sovereign Cloud (Brandenburg) · BrandenburgAWS European Sovereign Cloud
  • China (Beijing) · BeijingAWS China (Sinnet)
  • China (Ningxia) · YinchuanAWS China (NWCD)
Commercial regions (34)

Europe (8)

  • Europe (Paris)
  • Europe (Frankfurt)
  • Europe (Ireland)
  • Europe (Milan)
  • Europe (Spain)
  • Europe (Stockholm)
  • Europe (Zurich)
  • Europe (London)

North America (7)

  • Canada West (Calgary)
  • Canada (Central)
  • Mexico (Central)
  • US East (N. Virginia)
  • US West (Oregon)
  • US East (Ohio)
  • US West (N. California)

South America (1)

  • South America (São Paulo)

Asia (11)

  • Asia Pacific (Hong Kong)
  • Asia Pacific (Hyderabad)
  • Asia Pacific (Mumbai)
  • Asia Pacific (Jakarta)
  • Asia Pacific (Osaka)
  • Asia Pacific (Tokyo)
  • Asia Pacific (Malaysia)
  • Asia Pacific (Singapore)
  • Asia Pacific (Seoul)
  • Asia Pacific (Taipei)
  • Asia Pacific (Thailand)

Oceania (3)

  • Asia Pacific (Melbourne)
  • Asia Pacific (Sydney)
  • Asia Pacific (New Zealand)

Middle East (3)

  • Middle East (Bahrain)
  • Israel (Tel Aviv)
  • Middle East (UAE)

Africa (1)

  • Africa (Cape Town)

Tags

Equivalent services on other platforms

Alibaba DataWorksAlibaba

End-to-end data development platform over MaxCompute, EMR, and Hologres with visual and SQL-based task authoring, scheduled pipelines, data quality rules, data lineage, and a built-in business-glossary catalog

Azure Data FactoryAzure

Managed ETL and ELT service for data integration at scale with 100+ connectors, visual pipeline designer, mapping data flows, and triggers for event-driven orchestration

Microsoft PurviewAzure

Unified data governance, discovery, and risk management platform with automatic classification, lineage across Azure, on-prem, SaaS, and multi-cloud sources, plus information protection, insider risk, and communication compliance modules spun out of Microsoft 365 Compliance

Databricks Delta Live TablesDatabricks

Declarative ETL framework for streaming and batch pipelines on the lakehouse — define tables in SQL or Python, DLT handles dependency graph, retries, data quality, and observability

DataflowGCP

Unified stream and batch data processing service running Apache Beam pipelines with autoscaling, exactly-once semantics, and native sinks to BigQuery and Cloud Storage

Dataplex Universal CatalogGCP

Intelligent data fabric and catalog that unifies distributed data across Cloud Storage, BigQuery, and third-party lakes with automatic discovery, quality scoring, data lineage, attribute-based access control, and generative AI-powered metadata enrichment

DataformGCP

Serverless workflow orchestration service for SQL transformations in BigQuery with Git-based version control, declarative SQLX models, dependency graph, assertions (data tests), and scheduled execution — the BigQuery-native alternative to dbt

DatastreamGCP

Serverless change-data-capture (CDC) and replication service that continuously streams change events from MySQL, PostgreSQL, Oracle, and SQL Server sources into BigQuery, Cloud Storage, Spanner, and Cloud SQL with minimal source-side impact

Cloud Data FusionGCP

Fully managed enterprise data integration service built on open-source CDAP, with a visual drag-and-drop pipeline builder, 150+ pre-built connectors and transformations, and GitOps-friendly pipeline export for hybrid ETL across Google Cloud and third-party data sources

OCI Data CatalogOracle

Centralised metadata repository for OCI data assets — Object Storage buckets, Autonomous Database schemas, on-prem databases. Discovery, tagging, and lineage tracking for analytics governance.

Pricing

Pricing model:pay-as-you-go