AWS-Blog: Making the TPC-H dataset available in Athena using Airflow

The TPC-H dataset is commonly used to benchmark data warehouses or, more generally, decision support systems. It describes a typical e-commerce workload and includes benchmark queries to enable performance comparison between different data warehouses. I think the dataset is also useful for teaching how to build different kinds of ETL or analytics workflows, so I decided to explore ways of making it available in Amazon Athena.

2024-08-29 · 7 min · Maurice Borgmeier

AWS-Blog: Enabling Apache Airflow to copy large S3 objects

If you’re trying to use Apache Airflow to copy large objects in S3, you might have encountered issues where S3 rejects the request with an InvalidRequest error. We will fix that in this post by writing a custom operator to handle the underlying problem.

2024-08-27 · 3 min · Maurice Borgmeier

AWS-Blog: You can't Opt-Out of Performance Tracking in the AWS Console

Even though I had opted out of performance measurement cookies, I noticed a lot of web requests that look like performance measurement in the AWS console. In this article, I investigate what’s being sent and what we can do about it.

2024-08-22 · 7 min · Maurice Borgmeier

AWS-Blog: Improving Accessibility by Generating Image-alt texts using GenAI

In this article, we’ll be using GenAI to generate alternative texts for images in Markdown documents, which will help people relying on screen readers to access your content.

2024-08-21 · 7 min · Maurice Borgmeier

AWS-Blog: Going on an Industry Quest: Manufacturing and Auto

Using Industry Quest: Manufacturing and Auto, you can learn about building IoT and factory management solutions in AWS. It’s a game that teaches you about real-time monitoring, predictive maintenance, machine learning, and data analytics. This blog post gives an introduction to the game and covers my thoughts about its usefulness.

2024-06-28 · 8 min · Maurice Borgmeier

AWS-Blog: Switching Identity Providers in the IAM Identity Center

Switching the Identity Provider in the IAM Identity Center while keeping all permissions intact and minimizing disruptions can be a daunting task. In this post, I explain how we solved this for one of our customers.

2024-05-16 · 7 min · Maurice Borgmeier

AWS-Blog: Introduction to SSO with the IAM Identity Center and Entra ID

IAM Identity Center, formerly AWS SSO, is often used as an access management solution in front of one or more AWS accounts. More often than not, its purpose is to grant access to AWS accounts within an AWS organization. Today we’ll shed some light on the basic concepts and explain how this solution can be integrated with Azure AD, which has recently been renamed to Entra ID, to provide Single Sign-On to your AWS environment.

2024-04-23 · 5 min · Maurice Borgmeier