Guides & Resources

Want to learn more about Databricks? We can help. We've written some handy guides and listed resources that we found helpful when we started our Databricks journey. We keep them all up to date, so we hope you find them useful.

Data Engineering

Navigating the data lake using Rust - Part Two

This post explains how to create and write to Delta Lake tables using Rust on AWS S3 while addressing the challenge of eventual consistency. It demonstrates what can happen if multiple concurrent writers are not synchronised when writing data to a Delta table and how this challenge can be overcome. By the end of this post, you will also gain a better understanding of how the Delta Transaction Protocol works and how cloud storage services differ.

Yousry Mohamed

March 22, 2023

20 mins

Data Engineering

Navigating the data lake using Rust - Part One

Most data engineers associate the Delta Lake format only with Spark and Databricks, but it is not tied to either. Delta can be used by many other tools, and most cloud providers have added Delta support to their analytics tools. In this post we will see how to use Delta Lake from a Rust client.

Yousry Mohamed

January 16, 2023

15 mins

Data Engineering

Databricks productivity tips and tricks

Do you use Databricks? Do you think it is just a hosted, optimised flavour of Apache Spark? If so, you are missing many features and ideas that can boost your productivity. This post collects tips and tricks to take you to the next level on Databricks.

Yousry Mohamed

December 6, 2022

15 mins

General

The Databricks Resources Page

There is a tonne of information out there about Databricks, and it can be a bit overwhelming. Where does somebody even start? We have done you a favour and curated a list of learning materials that we found useful when we started our Databricks journey and that we share with new employees.

Cuusoo Team

July 29, 2022

5 mins

General

What is Databricks and what’s it used for?

Forget the marketing jargon: here’s a clear answer, finally!

Cuusoo Team

July 12, 2022

10 mins

Data Engineering

How to run a hello world program in Databricks - Part 2

A guide on how to run a hello world program in Databricks and Azure.

Yousry Mohamed

June 18, 2022

15 mins

Data Engineering

How to run a hello world program in Databricks - Part 1

A guide on how to run a hello world program in Databricks and Azure.

Yousry Mohamed

May 11, 2022

7 mins

Data Engineering

What do you mean immovable object?

In this blog post we generate our first large dummy dataset and migrate it to AWS S3 using multipart upload.

Cuusoo Team

April 8, 2022

20 mins
