When you enroll through our links, we may earn a small commission—at no extra cost to you. This helps keep our platform free and inspires us to add more value.

University of California Davis logo

Distributed Computing with Spark SQL

This course is part of Learn SQL Basics for Data Science Specialization

     
  • 4.5
  •  |
  • Reviews ( 660 )
Free

This Course Includes

  • iconcoursera
  • icon4.5 (660 reviews )
  • icon13 hours (approximately)
  • iconenglish
  • iconOnline - Self Paced
  • iconcourse
  • iconUniversity of California Davis

About Distributed Computing with Spark SQL

Learn new concepts from industry experts

Gain a foundational understanding of a subject or tool

Develop job-relevant skills with hands-on projects

Earn a shareable career certificate

What You Will Learn?

  • Use the collaborative Databricks workspace to write scalable Spark SQL code that executes against a cluster of machines.
  • Inspect the Spark UI to analyze query performance and identify bottlenecks.
  • Create an end-to-end pipeline that reads data, transforms it, and saves the result.
  • Build a medallion (bronze, silver, gold) lakehouse architecture with Delta Lake to ensure the reliability, scalability, and performance of your data.