AWS Glue

AWS AnalyticsFree tier available

Serverless data integration platform with visual ETL authoring via Glue Studio, a Hive-compatible Data Catalog, automatic crawlers, DataBrew visual prep, and Glue Data Quality for declarative rule-based validation across Spark and Python jobs

FluffyStack tools

Add to Service Builder Add to Compare Compare with equivalents Explore AWS in Treemap Explore analytics in Honeycomb See AWS regions on the World Map See analytics as a network Score jurisdiction exposure

Documentation Pricing AWS website

Jurisdictional exposure

Provider HQ

USSeattle, USA

Subject to CLOUD Act, FISA-702, DPF

Region locations

APACCNEEAEUUKUSOther40 regions across 7 jurisdictions

Sovereign option

Yes — 6 sovereign-flagged regions available

Full scorecard for this service →US lens detail →Sovereign cloud coverage map →

Attributes

SLA Uptime: 99.9%
Serverless: Yes

Sub-services (5)

Glue Data Catalog

Hive-compatible metastore used by Athena, Redshift Spectrum, and EMR

Glue ETL Jobs

Serverless Spark and Python ETL with Glue Studio visual authoring

Crawlers

Automatic schema discovery and Data Catalog population from S3, JDBC, and more

Glue DataBrew

Visual point-and-click data preparation for analysts

Glue Data Quality

Declarative rule-based validation and anomaly detection on Glue tables

Compliance & Certifications

This service is attested for the following frameworks. Always verify with the provider before relying on a specific compliance posture.

GDPR SOC 2 ISO 27001 HIPAA PCI DSS FedRAMP C5 TISAX IRAP ENS High CCCS Medium ISMAP MTCS L3 K-ISMS

Where this runs

40 regions

28 countries

6sovereign

Sovereign regions (6)

AWS European Sovereign Cloud (Brandenburg) · BrandenburgAWS European Sovereign Cloud
AWS GovCloud (US-East) · AshburnAWS GovCloud (US)
AWS GovCloud (US-West) · HillsboroAWS GovCloud (US)
AWS European Sovereign Cloud (Brandenburg) · BrandenburgAWS European Sovereign Cloud
China (Beijing) · BeijingAWS China (Sinnet)
China (Ningxia) · YinchuanAWS China (NWCD)

Commercial regions (34)

Europe (8)

Europe (Paris)
Europe (Frankfurt)
Europe (Ireland)
Europe (Milan)
Europe (Spain)
Europe (Stockholm)
Europe (Zurich)
Europe (London)

North America (7)

Canada West (Calgary)
Canada (Central)
Mexico (Central)
US East (N. Virginia)
US West (Oregon)
US East (Ohio)
US West (N. California)

South America (1)

South America (São Paulo)

Asia (11)

Asia Pacific (Hong Kong)
Asia Pacific (Hyderabad)
Asia Pacific (Mumbai)
Asia Pacific (Jakarta)
Asia Pacific (Osaka)
Asia Pacific (Tokyo)
Asia Pacific (Malaysia)
Asia Pacific (Singapore)
Asia Pacific (Seoul)
Asia Pacific (Taipei)
Asia Pacific (Thailand)

Oceania (3)

Asia Pacific (Melbourne)
Asia Pacific (Sydney)
Asia Pacific (New Zealand)

Middle East (3)

Middle East (Bahrain)
Israel (Tel Aviv)
Middle East (UAE)

Africa (1)

Africa (Cape Town)

Equivalent services on other platforms

Alibaba DataWorksAlibaba

End-to-end data development platform over MaxCompute, EMR, and Hologres with visual and SQL-based task authoring, scheduled pipelines, data quality rules, data lineage, and a built-in business-glossary catalog

Azure Data FactoryAzure

Managed ETL and ELT service for data integration at scale with 100+ connectors, visual pipeline designer, mapping data flows, and triggers for event-driven orchestration

Microsoft PurviewAzure

Unified data governance, discovery, and risk management platform with automatic classification, lineage across Azure, on-prem, SaaS, and multi-cloud sources, plus information protection, insider risk, and communication compliance modules spun out of Microsoft 365 Compliance

Databricks Delta Live TablesDatabricks

Declarative ETL framework for streaming and batch pipelines on the lakehouse — define tables in SQL or Python, DLT handles dependency graph, retries, data quality, and observability

DataflowGCP

Unified stream and batch data processing service running Apache Beam pipelines with autoscaling, exactly-once semantics, and native sinks to BigQuery and Cloud Storage

Dataplex Universal CatalogGCP

Intelligent data fabric and catalog that unifies distributed data across Cloud Storage, BigQuery, and third-party lakes with automatic discovery, quality scoring, data lineage, attribute-based access control, and generative AI-powered metadata enrichment

DataformGCP

Serverless workflow orchestration service for SQL transformations in BigQuery with Git-based version control, declarative SQLX models, dependency graph, assertions (data tests), and scheduled execution — the BigQuery-native alternative to dbt

DatastreamGCP

Serverless change-data-capture (CDC) and replication service that continuously streams change events from MySQL, PostgreSQL, Oracle, and SQL Server sources into BigQuery, Cloud Storage, Spanner, and Cloud SQL with minimal source-side impact

Cloud Data FusionGCP

Fully managed enterprise data integration service built on open-source CDAP, with a visual drag-and-drop pipeline builder, 150+ pre-built connectors and transformations, and GitOps-friendly pipeline export for hybrid ETL across Google Cloud and third-party data sources

OCI Data CatalogOracle

Centralised metadata repository for OCI data assets — Object Storage buckets, Autonomous Database schemas, on-prem databases. Discovery, tagging, and lineage tracking for analytics governance.

OCI Data IntegrationOracle

Managed ETL / ELT service for building visual pipelines that move and transform data between OCI Object Storage, Autonomous Database, ADW, and external sources — with Spark-based execution, incremental loads, and DataOps CI/CD via version-controlled workspaces

Pricing

Pricing model:pay-as-you-go