When you enroll through our links, we may earn a small commission—at no extra cost to you. This helps keep our platform free and inspires us to add more value.

SkillShare logo

Data Science and Machine Learning with Python - Hands On!

Unlock creativity with Skillshare! Learn acrylic painting, AI writing, graphic design, and photography.

     0 |
  • Reviews ( 0 )
Free

This Course Includes

  • iconskillshare
  • icon0 (0 reviews )
  • icon10 hours 37 minutes
  • iconenglish
  • iconOnline - Self Paced
  • iconcourse
  • iconSkillShare

About Data Science and Machine Learning with Python - Hands On!

Introduction

Windows Setup Instructions

Mac Setup Instructions

Linux Setup Instructions

Python Basics, Part 1

Python Basics, Part 2

Python Basics, Part 3

Python Basics, Part 4

Intro to Pandas

Types of Data

Mean, Median, Mode

Using mean, media, and mode in Python

Variation and Standard Deviation

Probability Density Function; Probability Mass Function

Common Data Distributions

Percentiles and Moments

A Crash Course in matplotlib

Data Visualization with Seaborn

Covariance and Correlation

Exercise: Conditional Probability

Exercise Solution: Conditional Probability

Bayes' Theorem

Linear Regression

Polynomial Regression

Multiple Regression

Multi-Level Models

Supervised vs. Unsupervised Learning, Train / Test

Using Train/Test to Prevent Overfitting

Bayesian Methods: Concepts

Implementing a Spam Classifier with Naive Bayes

K-Means Clustering

Clustering People by Income and Age

Measuring Entropy

Windows: Installing Graphviz

Mac: Installing Graphviz

Linux: Installing Graphviz

Decision Trees: Concepts

Decision Trees: Predicting Hiring Decisions

Ensemble Learning

[Activity] XGBoost

Support Vector Machines (SVM) Overview

Using SVM to Cluster People

User-Based Collaborative Filtering

Item-Based Collaborative Filtering

Finding Movie Similarities

Improving the Results of Movie Similarities

Making Movie Recommendations to People

Improving the Recommender's Results

K-Nearest-Neighbors: Concepts

Using KNN to Predict a Rating for a Movie

Dimensionality Reduction; Principal Component Analysis

PCA Example with the Iris Data Set

Data Warehousing; ETL and ELT

Reinforcement Learning

Hands-On with Q-Learning

Understanding a Confusion Matrix

Measuring Classifiers (Precision, Recall, F1, ROC, AUC)

Bias / Variance Tradeoff

K-Fold Cross Validation

Data Cleaning and Normalization

Cleaning Web Log Data

Normalizing Numerical Data

Detecting Outliers

Feature Engineering and the Curse of Dimensionality

Imputation Techniques for Missing Data

Handling Unbalanced Data: Oversampling, Undersampling, and SMOTE

Binning, Transforming, Encoding, Scaling, and Shuffling

Important Spark Installation Notes

Installing Spark - Part 1

Installing Spark - Part 2

Spark Introduction

Spark and the Resilient Distributed Dataset (RDD)

Introducing MLLib

Decision Trees in Spark

K-Means Clustering in Spark

TF / IDF

Searching Wikipedia with Spark

Using the Spark 2 DataFrame API for MLLib

Deploying Models to Production

A/B Testing Concepts

T-Tests and P-Values

Hands-On with T-Tests

Determining How Long to Run an Experiment

A/B Test Gotchas

What You Will Learn?

  • Data Scientists enjoy one of the top-paying jobs, with an average salary of $120,000 according to Glassdoor and Indeed. That's just the average! And it's not just about money - it's interesting work too!.
  • If you've got some programming or scripting experience, this course will teach you the techniques used by real data scientists in the tech industry - and prepare you for a move into this hot career path. This comprehensive course includes 68 lectures spanning almost 9 hours of video, and most topics include hands-on Python code examples you can use for reference and for practice. I’ll draw on my 9 years of experience at Amazon and IMDb to guide you through what matters, and what doesn’t..
  • Each concept is introduced in plain English, avoiding confusing mathematical notation and jargon. It’s then demonstrated using Python code you can experiment with and build upon, along with notes you can keep for future reference. You won't find academic, deeply mathematical coverage of these algorithms in this course - the focus is on practical understanding and application of them..
  • The topics in this course come from an analysis of real requirements in data scientist job listings from the biggest tech employers. We'll cover the machine learning and data mining techniques real employers are looking for, including:.
  • ...and much more! There's also an entire section on machine learning with Apache Spark, which lets you scale up these techniques to "big data" analyzed on a computing cluster..
  • If you're new to Python, don't worry - the course starts with a crash course. If you've done some programming before, you should pick it up quickly. This course shows you how to get set up on Microsoft Windows-based PC's; the sample code will also run on MacOS or Linux desktop systems, but I can't provide OS-specific support for them..
  • If you’re a programmer looking to switch into an exciting new career track, or a data analyst looking to make the transition into the tech industry – this course will teach you the basic techniques used by real-world industry data scientists. I think you'll enjoy it!.