Amazon EMR
Easily run and scale Apache Spark, Trino, and other big data workloads
Why Amazon EMR?
Amazon EMR is a big data processing service that accelerates analytics workloads with unmatched flexibility and scale. EMR features performance-optimized runtimes for Apache Spark, Trino, Apache Flink, and Hive, drastically cutting costs and processing times. The service integrates seamlessly with AWS, simplifying data lake workflows and enterprise-scale architectures. With built-in auto-scaling, intelligent monitoring, and managed infrastructure, EMR lets you focus on extracting insights—not managing clusters—delivering petabyte-scale analytics efficiently without the operational overhead of traditional solutions.

Flexible deployment options
Why EMR Serverless?
Amazon EMR Serverless makes it easy for data analysts and engineers to run open-source big data analytics frameworks like Apache Spark without configuring, managing, and scaling clusters or servers. EMR Serverless is the fastest way to get started with all the features and benefits of Amazon EMR without the need for experts to plan and manage clusters.
Why Amazon EMR on Amazon EC2?
Amazon EMR on Amazon EC2 provides control over cluster configuration and supports long-running clusters, making it perfect for continuous data processing tasks that require specific hardware setups. You can install custom applications alongside popular frameworks like Apache Spark and Trino, while offering a wide range of EC2 instance types to optimize for both cost and performance. Integration with other AWS services and the ability to use Spot Instances makes it a cost-effective solution for organizations requiring granular control over their big data operations.
Why Amazon EMR on Amazon EKS?
Amazon EMR on Amazon Elastic Kubernetes Service (EKS) enables you to submit Apache Spark jobs on demand on EKS without provisioning EMR clusters. With EMR on EKS, you can run your analytical workloads on the same Amazon EKS cluster as your other Kubernetes-based applications to improve resource utilization and simplify infrastructure management.
Process your data with Amazon EMR in the next generation of Amazon SageMaker
Amazon EMR is available in the next generation of Amazon SageMaker, allowing you to effortlessly run Apache Spark, Trino, and other open-source analytics frameworks in a unified data and AI development environment.
