When you enroll through our links, we may earn a small commission—at no extra cost to you. This helps keep our platform free and inspires us to add more value.

pluralsight logo

Delta Lake with Azure Databricks: Deep Dive

In this course, you'll learn about Delta Lake using Azure Databricks and its ecosystem – Delta Lake Storage, Delta Engine, Delta Architecture, Delta Live Tables, etc. – and how it provides warehouse-like features that you can use to build a Lakehouse.

     
  • 5
  •  |
  • Reviews ( 50 )
Free

This Course Includes

  • iconpluralsight
  • icon5 (50 reviews )
  • icon3 hour 35 minutes
  • iconenglish
  • iconOnline - Self Paced
  • iconExpanded Courses
  • iconpluralsight

About Delta Lake with Azure Databricks: Deep Dive

Delta Lake is an open-source storage layer that brings reliability to Data Lakes, by providing data warehouse-like features, on top of Data Lake.

It has a big ecosystem, and there are various tools and architectures based on that - Delta Lake Storage, Delta Engine, Delta Architecture, Delta Live Tables, Delta Sharing, etc. It can also handle Batch and Streaming data seamlessly. And these components and features can you help build an optimized, and well-integrated Lakehouse architecture.

In this course, Delta Lake with Azure Databricks: Deep Dive, you’ll learn how Delta Lake and various components in its ecosystem, allows us to build a Lakehouse architecture. And to do that, we will be using Azure Databricks.

First, you’ll learn what Delta Lake is, and how it works. You’ll also see the different components in its ecosystem.

Then, you’ll discover how to work with Delta Lake storage and its various features.

Next, you’ll see how to handle streaming data on Delta Lake.

After, you’ll explore Delta Engine in Databricks to optimize storage and queries.

Followed by this, you’ll see how to build a Lakehouse architecture.

And you’ll also see how to build reliable ETL pipelines with Delta Live Tables.

Finally, you’ll end with some common use cases, and how to implement them.

By the end of this course, you’ll have the knowledge and skills to work with Delta Lake and use its ecosystem components to build an optimized, well-integrated Lakehouse solution.

What You Will Learn?

  • Course Overview : 1min.
  • Getting Started with Delta Lake : 36mins.
  • Working with Delta Lake Storage : 45mins.
  • Handling Streaming Data on Delta Lake : 34mins.
  • Optimizing with Delta Engine in Databricks : 34mins.
  • Building a Lakehouse Architecture : 24mins.
  • Building ETL Pipelines with Delta Live Tables : 27mins.
  • Implementing Common Use Cases : 11mins.