When you enroll through our links, we may earn a small commission—at no extra cost to you. This helps keep our platform free and inspires us to add more value.

Udemy logo

Process Big Data using Apache PIG

Learn analyzing and processing big data Using Apache Pig

     
  • 2.8
  •  |
  • Reviews ( 29 )
₹799

This Course Includes

  • iconudemy
  • icon2.8 (29 reviews )
  • icon5.5 total hours
  • iconenglish
  • iconOnline - Self Paced
  • iconcourse
  • iconUdemy

About Process Big Data using Apache PIG

Pig is a high-level platform for creating MapReduce programs used with Hadoop. The language for this platform is called Pig Latin. In this course we will go through the PIG data flow platform and the language used by PIG tool. The concepts which are covered in this course are:

Writing complex MapReduce transformations using a simple scripting language.

Basics of Big Data, Hadoop and MapReduce Framework.

PIG Data Model and Different type of operators to operate on datasets.

Built-in Functions as well as User Defined Functions for performing a specific task.

Running PIG Script, Unit Testing and Compression.

Many more advance topics such as Embedding PIG in Java, PIG Macros etc.

All the books and PDFs are included, allowing you to follow along with the author throughout the modules in this course.

What You Will Learn?

  • Overview of Big Data and Hadoop Framework.
  • Anatomy of a MapReduce Framework.
  • Basics of Apache Pig tool and Where we should use it or not.
  • Run Pig in different Modes.
  • Use Pig Latin Queries.
  • Different types of PIG Operators for analysing the data.
  • Understand the architecture of PIG tool.
  • Work with PIG data model.
  • Different kinds of built-in functions.
  • Advanced PIG concepts such as PIG Streaming, PIG scripts and User Defined Functions(UDFs).
  • Compress the input files, final output files and intermediate output files.
  • Pig Unit Testing, PIG Macros and Parameter Substitution.
  • How to embed PIG in Java.