Open in app

Sign In

Write

Sign In

Anders Elton
Anders Elton

90 Followers

Home

About

Published in Compendium

·Nov 30, 2022

GCP Workload Identity Federation on Gitlab Passing Authentication between Jobs

Gitlab (late 2022) is relatively new to workload identity federation, and there are not many good templates or guides out there. The official guides explain how to set up the federation pool and authenticate with it, but not really how to use this in an enterprise pipeline. …

Gitlab

5 min read

Gitlab

5 min read


Published in Compendium

·Aug 31, 2021

Using dataform to improve data quality in BigQuery

There is always one thing that everyone will tell you if you start the path of a data driven project: You will spend most of your time dealing with data quality issues. This is especially true when dealing with data collected from legacy or ad hook systems. You will suffer…

Dataform

5 min read

Using dataform to improve data quality in BigQuery
Using dataform to improve data quality in BigQuery
Dataform

5 min read


Published in Compendium

·May 7, 2021

Building a serverless datawarehouse pipeline using GCP

Traditionally, building a data warehouse requires massive capital investments in infrastructure, tools and licenses to get insight to your data. As they live and grow, these solutions often have a tendency to become time-consuming to maintain and complex or slow to change in response to new business needs. BigQuery is…

Google Cloud Run

4 min read

Building a serverless datawarehouse pipeline using GCP
Building a serverless datawarehouse pipeline using GCP
Google Cloud Run

4 min read


Apr 23, 2021

Copy SQL Server data to BigQuery without CDC

Sometimes you just want data from your source into your analytical tool and start doing experiments. I have created a tool that can help you in this kind of prototyping. A common way to integrate SQL server and BigQuery the lazy way is to: Export table to disk upload CSV…

Bigquery

2 min read

Copy SQL Server data to BigQuery without CDC
Copy SQL Server data to BigQuery without CDC
Bigquery

2 min read


Published in Compendium

·Oct 16, 2020

Best practises for KubernetesPodOperator in Cloud Composer

In this post I will go through best practises on using the KubernetesPodOperator with examples. I will share dags and terraform scripts so it should be easy to test it out for yourself. Quite a few of the questions I get when talking about Cloud Composer is how to use…

Gcp

6 min read

Best practises for KubernetesPodOperator in Cloud Composer
Best practises for KubernetesPodOperator in Cloud Composer
Gcp

6 min read


Published in Compendium

·May 5, 2020

Argo workflows as alternative to Cloud Composer

Background In previous posts (scheduling jobs #1, scheduling jobs #2)I have been writing about how to do workflow scheduling using GCPs Cloud Composer (airflow). Something that has been bugging me about Cloud Composer is the steep price (380$ / month minimum!). …

Airflow

5 min read

Argo workflows as alternative to Cloud Composer
Argo workflows as alternative to Cloud Composer
Airflow

5 min read


Apr 28, 2020

Troubleshooting cloud composer

Today I had an interesting case from one of our customers. They are running a decent sized composer cluster with 4 n2-highmem-2 machines, with an additional node pool to run data-science jobs spawned with Pod Operator (with even beefier machines). …

Google Cloud Composer

3 min read

Troubleshooting cloud composer
Troubleshooting cloud composer
Google Cloud Composer

3 min read


Published in Compendium

·Dec 9, 2019

Debugging a Python Workload Gone Silent inside Kubernetes

Today I was at a customer helping them to optimize their Cloud Composer setup. Cloud Composer is a managed Airflow installation, a job orchestration tool that runs on Kubernetes, made by Airbnb. I had previously advised the customer to use Cloud Composer to only run Docker containers (KubernetesPodOperator), as that…

Docker

5 min read

Debug a python workload that has gone silent (hung) inside kubernetes
Debug a python workload that has gone silent (hung) inside kubernetes
Docker

5 min read


Published in Compendium

·Jun 3, 2019

Copy data from cloud SQL to BigQuery using apache airflow/cloud composer part 2

In my previous post I explained how to load data from cloud SQL into bigquery using command line tools like gcloud and bq. In this post I will go though an example on how to load data using apache airflow operators instead of command line tools. Doing it this way…

Kubernetes

5 min read

Copy data from cloud SQL to BigQuery using apache airflow/ cloud composer part 2
Copy data from cloud SQL to BigQuery using apache airflow/ cloud composer part 2
Kubernetes

5 min read


Published in Compendium

·Mar 22, 2019

Mounting a GCP bucket as NFS in kubernetes

Creating a fileshare of unlimited size as NFS mounted on a bucket inside a kubernetes cluster? Disregarding if this is a good idea or not, here is a little description of the problem we faced and how we solved it. Background Why did I want this as a NFS server in…

Kubernetes

4 min read

Mounting a GCP bucket as NFS in kubernetes
Mounting a GCP bucket as NFS in kubernetes
Kubernetes

4 min read

Anders Elton

Anders Elton

90 Followers

Software developer and cloud enthusiast

Following
  • Jostein Leira

    Jostein Leira

  • Simon Hawe

    Simon Hawe

  • salmaan rashid

    salmaan rashid

  • Per Axel Aamot

    Per Axel Aamot

  • Thomas Gariel

    Thomas Gariel

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech