Distributed Computing with Spark SQL
This course is part of the Learn SQL Basics for Data Science Specialization
Free

This Course Includes
Coursera
4.5 (660 reviews)
13 hours (approximately)
English
Online, self-paced
Course
University of California, Davis
About Distributed Computing with Spark SQL
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
What You Will Learn
- Use the collaborative Databricks workspace to write scalable Spark SQL code that executes against a cluster of machines.
- Inspect the Spark UI to analyze query performance and identify bottlenecks.
- Create an end-to-end pipeline that reads data, transforms it, and saves the result.
- Build a medallion (bronze, silver, gold) lakehouse architecture with Delta Lake to ensure the reliability, scalability, and performance of your data.