When you enroll through our links, we may earn a small commission—at no extra cost to you. This helps keep our platform free and inspires us to add more value.

pluralsight logo

Data Engineering with AWS Machine Learning

The whole field of machine learning revolves around data. This course will teach you how to properly choose between the various AWS data repositories, ingestion services, and transformation services in a cost-effective, best-practice manner.

     
  • 3
  •  |
  • Reviews ( 33 )
Free

This Course Includes

  • iconpluralsight
  • icon3 (33 reviews )
  • icon2 hour 55 minutes
  • iconenglish
  • iconOnline - Self Paced
  • iconcore courses
  • iconpluralsight

About Data Engineering with AWS Machine Learning

Storing data for machine learning is challenging due to the varying formats and characteristics of data. Raw ingested data must first be transformed into the format necessary for downstream machine learning consumption, and once the data is ready to be used, it must be ingested from storage to the machine learning service. In this course, Data Engineering with AWS Machine Learning, you'll learn to choose the right AWS service for each of these data-related machine learning ML tasks for any given scenario. First, you'll explore the wide variety of data storage solutions available on AWS and what each type of storage is used for. Next, you'll discover the differing AWS services used to ingest data into ML-specific services and when to use each one. Finally, you'll learn how to transform your raw data into the proper formats used by the various AWS ML services. When you're finished with this course, you'll have the skills and knowledge of how to properly provide data solutions for storing, preparing, and ingesting data needed to architect data engineering solutions on AWS for Machine Learning, and be prepared to take the AWS Machine Learning Certification exam.

What You Will Learn?

  • Course Overview : 1min.
  • Important Data Characteristics to Consider in a Machine Learning Solution : 8mins.
  • Typical Data Flow for Machine Learning on AWS : 8mins.
  • Data Storage Options for Machine Learning on AWS : 22mins.
  • Database Options for Machine Learning on AWS : 13mins.
  • Using a Data Warehouse or a Data Lake as a Machine Learning Repository : 18mins.
  • Streaming Data Ingestion Solutions on AWS for Machine Learning : 23mins.
  • Batch Data Ingestion Solutions on AWS for Machine Learning : 10mins.
  • Data Transformation Overview on AWS for Machine Learning : 10mins.
  • Data-driven Workflows: The AWS Data Pipeline : 6mins.
  • Data Transformation Using Apache Spark on Amazon EMR : 10mins.
  • Data Transformation Using Serverless AWS Glue and Serverless Amazon Athena : 40mins.