

Preparing Data for Modeling with scikit-learn

This course covers key steps in data pre-processing: standardization, normalization, novelty and outlier detection, pre-processing of image and text data, and explicit kernel approximations such as the RBF and Nystroem methods.

     
Rating: 4 (17 reviews)
Free

    This Course Includes

    • Pluralsight
    • Rating: 4 (17 reviews)
    • 3 hours 40 minutes
    • English
    • Online, self-paced
    • Core courses

    About Preparing Data for Modeling with scikit-learn

    Even as the number of machine learning frameworks and libraries keeps growing, scikit-learn remains popular. Scikit-learn makes the common use cases in machine learning (clustering, classification, dimensionality reduction, and regression) easy to implement. In this course, Preparing Data for Modeling with scikit-learn, you will gain the ability to pre-process data appropriately, identify outliers, and apply kernel approximations. First, you will learn how pre-processing techniques such as standardization and scaling help improve the efficacy of ML algorithms. Next, you will discover how novelty and outlier detection are implemented in scikit-learn. Then, you will understand the typical steps needed to work with both text and image data in scikit-learn. Finally, you will round out your knowledge by applying implicit and explicit kernel transformations to map data into higher dimensions. When you're finished with this course, you will have the skills and knowledge to identify the correct data pre-processing technique for your use case and to detect outliers using theoretically robust techniques.
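    As a rough illustration of the pre-processing and detection techniques named above, here is a minimal sketch using scikit-learn. It is not taken from the course: the toy arrays, the variable names, and the choice of LocalOutlierFactor as the detector are assumptions made for this example.

```python
import numpy as np
from sklearn.neighbors import LocalOutlierFactor
from sklearn.preprocessing import MinMaxScaler, Normalizer, StandardScaler

# Toy data, made up for illustration: two features on very different scales.
X = np.array([[1.0, 100.0], [2.0, 200.0], [3.0, 300.0], [4.0, 400.0]])

# Standardization: each column is shifted and scaled to zero mean, unit variance.
X_std = StandardScaler().fit_transform(X)

# Min-max scaling: each column is mapped onto the range [0, 1].
X_minmax = MinMaxScaler().fit_transform(X)

# Normalization: each row (sample) is rescaled to unit L2 norm.
X_norm = Normalizer(norm="l2").fit_transform(X)

# Synthetic points for the detection examples (assumed data, fixed seed).
rng = np.random.RandomState(0)
inliers = rng.normal(loc=0.0, scale=1.0, size=(100, 2))
far_point = np.array([[10.0, 10.0]])

# Outlier detection: the fitted data itself may contain outliers.
# fit_predict labels each row, with -1 for outliers and 1 for inliers.
contaminated = np.vstack([inliers, far_point])
outlier_labels = LocalOutlierFactor(n_neighbors=20).fit_predict(contaminated)

# Novelty detection: fit on data assumed clean, then score unseen points.
novelty_model = LocalOutlierFactor(n_neighbors=20, novelty=True).fit(inliers)
novelty_labels = novelty_model.predict(np.array([[0.0, 0.0], [10.0, 10.0]]))

print("standardized column means:", X_std.mean(axis=0))
print("label of the far point (outlier detection):", outlier_labels[-1])
print("labels of the new points (novelty detection):", novelty_labels)
```

    The distinction the sketch tries to show is that outlier detection labels the same data it was fitted on, whereas novelty detection is fitted on data assumed to be clean and then applied to new observations.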

    What You Will Learn

    • Course Overview: 1 min
    • Preparing Numeric Data for Machine Learning: 46 mins
    • Understanding and Implementing Novelty and Outlier Detection: 47 mins
    • Preparing Text Data for Machine Learning: 30 mins
    • Preparing Image Data for Machine Learning: 34 mins
    • Working with Specialized Datasets: 27 mins
    • Performing Kernel Approximations: 32 mins
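
    For the kernel approximation topic that closes the module list, the following is a hedged sketch rather than the course's own material. It assumes that the "RBF and Nystroem methods" correspond to scikit-learn's RBFSampler and Nystroem transformers; the random data, the gamma value, and the component counts are arbitrary choices for illustration.

```python
import numpy as np
from sklearn.kernel_approximation import Nystroem, RBFSampler
from sklearn.metrics.pairwise import rbf_kernel

# Random data, made up for illustration (fixed seed for repeatability).
rng = np.random.RandomState(0)
X = rng.normal(size=(50, 3))
gamma = 0.5  # arbitrary RBF kernel width for this example

# Exact RBF kernel matrix, used only as a reference for comparison.
exact = rbf_kernel(X, gamma=gamma)

# RBFSampler: random Fourier features whose inner products
# approximate the RBF kernel.
rbf_features = RBFSampler(
    gamma=gamma, n_components=2000, random_state=0
).fit_transform(X)
approx_rbf = rbf_features @ rbf_features.T

# Nystroem: builds an explicit feature map from a subset of the samples.
nys_features = Nystroem(
    kernel="rbf", gamma=gamma, n_components=25, random_state=0
).fit_transform(X)
approx_nys = nys_features @ nys_features.T

print("RBFSampler feature shape:", rbf_features.shape)
print("Nystroem feature shape:", nys_features.shape)
print("RBFSampler mean abs error:", np.abs(exact - approx_rbf).mean())
print("Nystroem mean abs error:", np.abs(exact - approx_nys).mean())
```

    Both transformers return explicit feature matrices, so a linear model trained on them can stand in for a kernelized model; how many components are needed depends on the data and is not something this sketch settles.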