Open in app

Sign in

Write

Sign in

Anders Elton
Anders Elton

94 Followers

Home

About

Nov 8

BigQuery and holidays

TL; DR — BigQuery doesnt have a list of holidays natively, just follow this link https://github.com/ael-computas/bigquery-holidays or use the public dataset “ael-cx.holidays.holidays” While writing code to analyse working days in the company where I work, it is important to factor in holidays to get the complete picture. For example, May…

Bigquery

6 min read

BigQuery and holidays
BigQuery and holidays
Bigquery

6 min read


Published in

Compendium

·Mar 29

Understanding BigQuery Costs

In this article you will learn what drives BigQuery costs and I will show a SaaS solution that can help you analyze the costs instantly! Working as a consultant I have had the opportunity to work with several clients that are using Google BigQuery. What all of the clients have…

Bigquery

4 min read

Understanding BigQuery Costs
Understanding BigQuery Costs
Bigquery

4 min read


Published in

Compendium

·Nov 30, 2022

GCP Workload Identity Federation on Gitlab Passing Authentication between Jobs

Gitlab (late 2022) is relatively new to workload identity federation, and there are not many good templates or guides out there. The official guides explain how to set up the federation pool and authenticate with it, but not really how to use this in an enterprise pipeline. …

Gitlab

5 min read

Gitlab

5 min read


Published in

Compendium

·Aug 31, 2021

Using dataform to improve data quality in BigQuery

There is always one thing that everyone will tell you if you start the path of a data driven project: You will spend most of your time dealing with data quality issues. This is especially true when dealing with data collected from legacy or ad hook systems. You will suffer…

Dataform

5 min read

Using dataform to improve data quality in BigQuery
Using dataform to improve data quality in BigQuery
Dataform

5 min read


Published in

Compendium

·May 7, 2021

Building a serverless datawarehouse pipeline using GCP

Traditionally, building a data warehouse requires massive capital investments in infrastructure, tools and licenses to get insight to your data. As they live and grow, these solutions often have a tendency to become time-consuming to maintain and complex or slow to change in response to new business needs. BigQuery is…

Google Cloud Run

4 min read

Building a serverless datawarehouse pipeline using GCP
Building a serverless datawarehouse pipeline using GCP
Google Cloud Run

4 min read


Apr 23, 2021

Copy SQL Server data to BigQuery without CDC

Sometimes you just want data from your source into your analytical tool and start doing experiments. I have created a tool that can help you in this kind of prototyping. A common way to integrate SQL server and BigQuery the lazy way is to: Export table to disk upload CSV…

Bigquery

2 min read

Copy SQL Server data to BigQuery without CDC
Copy SQL Server data to BigQuery without CDC
Bigquery

2 min read


Published in

Compendium

·Oct 16, 2020

Best practises for KubernetesPodOperator in Cloud Composer

In this post I will go through best practises on using the KubernetesPodOperator with examples. I will share dags and terraform scripts so it should be easy to test it out for yourself. Quite a few of the questions I get when talking about Cloud Composer is how to use…

Gcp

6 min read

Best practises for KubernetesPodOperator in Cloud Composer
Best practises for KubernetesPodOperator in Cloud Composer
Gcp

6 min read


Published in

Compendium

·May 5, 2020

Argo workflows as alternative to Cloud Composer

Background In previous posts (scheduling jobs #1, scheduling jobs #2)I have been writing about how to do workflow scheduling using GCPs Cloud Composer (airflow). Something that has been bugging me about Cloud Composer is the steep price (380$ / month minimum!). …

Airflow

5 min read

Argo workflows as alternative to Cloud Composer
Argo workflows as alternative to Cloud Composer
Airflow

5 min read


Apr 28, 2020

Troubleshooting cloud composer

Today I had an interesting case from one of our customers. They are running a decent sized composer cluster with 4 n2-highmem-2 machines, with an additional node pool to run data-science jobs spawned with Pod Operator (with even beefier machines). …

Google Cloud Composer

3 min read

Troubleshooting cloud composer
Troubleshooting cloud composer
Google Cloud Composer

3 min read


Published in

Compendium

·Dec 9, 2019

Debugging a Python Workload Gone Silent inside Kubernetes

Today I was at a customer helping them to optimize their Cloud Composer setup. Cloud Composer is a managed Airflow installation, a job orchestration tool that runs on Kubernetes, made by Airbnb. I had previously advised the customer to use Cloud Composer to only run Docker containers (KubernetesPodOperator), as that…

Docker

5 min read

Debug a python workload that has gone silent (hung) inside kubernetes
Debug a python workload that has gone silent (hung) inside kubernetes
Docker

5 min read

Anders Elton

Anders Elton

94 Followers

Software developer and cloud enthusiast

Following
  • Jostein Leira

    Jostein Leira

  • Simon Hawe

    Simon Hawe

  • salmaan rashid

    salmaan rashid

  • Per Axel Aamot

    Per Axel Aamot

  • Thomas Gariel

    Thomas Gariel

See all (8)

Help

Status

About

Careers

Blog

Privacy

Terms

Text to speech

Teams