aws.amazon.com

AWS Big Data Blog

Get the latest updates from AWS Big Data Blog directly as they happen.

Latest posts

Last updated 11 days ago

Run Apache Spark and Iceberg 4.5x faster than open source Spark with Amazon EMR

11 days ago

This post shows how Amazon EMR 7.12 can make your Apache Spark...

Apache Spark encryption performance improvement with Amazon EMR 7.9

11 days ago

The Amazon EMR runtime for Apache Spark is a performance-optimized runtime for...

Run Apache Spark and Apache Iceberg write jobs 2x faster with Amazon EMR

11 days ago

Amazon EMR runtime for Apache Spark offers a high-performance runtime environment while...

Medidata’s journey to a modern lakehouse architecture on AWS

11 days ago

This post was co-authored by Mike Araujo, Principal Engineer at Medidata Solutions...

Achieve 2x faster data lake query performance with Apache Iceberg on Amazon Redshift

11 days ago

With the growing adoption of open table formats like Apache Iceberg, Amazon...

Introducing catalog federation for Apache Iceberg tables in the AWS Glue Data Catalog

11 days ago

Apache Iceberg has become the standard choice of open table format for...

Accelerate data lake operations with Apache Iceberg V3 deletion vectors and row lineage

11 days ago

Organizations building petabyte-scale data lakes face increasing challenges as their data grows...

How Octus achieved 85% infrastructure cost reduction with zero downtime migration to Amazon OpenSearch Service

11 days ago

As data volumes continue to grow exponentially, there is increasing pressure to...

Getting started with Apache Iceberg write support in Amazon Redshift

11 days ago

Many companies store structured data in warehouses for analytics while keeping diverse...

Orchestrating data processing tasks with a serverless visual workflow in Amazon SageMaker Unified Studio

12 days ago

Automation of data processing and data integration tasks is essential for data...

Save up to 24% on Amazon Redshift Serverless compute costs with Reservations

13 days ago

Amazon Redshift Serverless makes it convenient to run and scale analytics without...

Introducing Cluster insights: Unified monitoring dashboard for Amazon OpenSearch Service clusters

16 days ago

Amazon OpenSearch Service clusters offer a wealth of operational metrics accessible through...