SageMaker Data Processing helps you explore data, build data-transformation jobs, orchestrate, and deploy data pipelines at scale. It improves performance, driving faster insights than traditional open source systems with cost-effective and open source API-compatible versions of Apache Spark, Apache Airflow, Apache Flink, Trino, and more. SageMaker Data Processing provides access to your data sources in Amazon SageMaker Lakehouse through zero-ETL integrations, federated querying capabilities, and connectors.