Getting Started with Stream Processing Using Apache Flink

Flink is a stateful, fault-tolerant, large-scale system with excellent latency and throughput characteristics. It works with bounded and unbounded datasets using the same underlying stream-first architecture, with a focus on streaming (unbounded) data.

What you'll learn

Apache Flink is a distributed computing engine used to process large-scale data. Flink is built on the concept of a stream-first architecture, where the stream is the source of truth. This course, Getting Started with Stream Processing Using Apache Flink, walks you through exploratory data analysis and data munging with Flink. You'll start off by learning about simple data transformations on streams, such as map(), filter(), flatMap(), reduce(), sum(), min(), and max(), on simple DataStreams and KeyedStreams. You'll then learn about window transformations in detail using tumbling, sliding, count, and session windows. You'll wrap up the course by exploring operations on multiple streams, such as union and joins. All of this is covered with hands-on demos using Flink's Java API, along with a real-world project using Twitter's streaming API. After you've watched this course, you'll have a strong foundation in stream processing concepts using Apache Flink.
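
To give a feel for the transformations named above, here is a minimal sketch in plain Java. It deliberately does not use Flink's API: it runs on small in-memory lists with `java.util.stream`, so it only illustrates what filter, map, flatMap, a per-key reduce, and a tumbling count window compute. The class name `StreamOpsSketch` and all method names are hypothetical and chosen for illustration. Dropping a trailing partial window is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

public class StreamOpsSketch {

    // filter + map: keep the even numbers, then square each one.
    static List<Integer> squaresOfEvens(List<Integer> events) {
        return events.stream()
                .filter(n -> n % 2 == 0)
                .map(n -> n * n)
                .collect(Collectors.toList());
    }

    // flatMap: each input line produces zero or more output words.
    static List<String> splitIntoWords(List<String> lines) {
        return lines.stream()
                .flatMap(line -> Arrays.stream(line.trim().split("\\s+")))
                .filter(word -> !word.isEmpty())
                .collect(Collectors.toList());
    }

    // Per-key reduce: group by the word itself and sum a count of 1 per event.
    static Map<String, Integer> countPerKey(List<String> words) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String word : words) {
            counts.merge(word, 1, Integer::sum);
        }
        return counts;
    }

    // Tumbling count window: non-overlapping groups of `size` events,
    // each reduced to a sum. A trailing partial window is dropped
    // (an assumption made by this sketch).
    static List<Integer> tumblingCountWindowSums(List<Integer> events, int size) {
        List<Integer> sums = new ArrayList<>();
        for (int start = 0; start + size <= events.size(); start += size) {
            int sum = 0;
            for (int i = start; i < start + size; i++) {
                sum += events.get(i);
            }
            sums.add(sum);
        }
        return sums;
    }

    public static void main(String[] args) {
        System.out.println(squaresOfEvens(Arrays.asList(1, 2, 3, 4)));
        List<String> words = splitIntoWords(Arrays.asList("to be", "or not to be"));
        System.out.println(words);
        System.out.println(countPerKey(words));
        System.out.println(tumblingCountWindowSums(Arrays.asList(1, 2, 3, 4, 5), 2));
    }
}
```

In a real streaming job the input is unbounded, so results are emitted incrementally rather than collected at the end. The lists here stand in for a short, finite slice of a stream.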

Course Overview

2mins

Course Overview 2m

Understanding Streaming Data and Stream Processing

33mins

Implementing Basic Operations on Streaming Data

40mins

Data Representation and Transformations on a Stream 4m
The Filter Transformation 9m
The Map Transformation 4m
The FlatMap Transformation 6m
Stateless and Stateful Transformations 2m
Keyed Streams 2m
Transformations on Keyed Streams 6m
The Reduce Operation 7m

Windowing Operations on Streams

44mins

Introduction to Window Transformations 4m
Tumbling Windows 3m
Sliding Windows 2m
Count, Session, and Global Windows 5m
Event Time, Ingestion Time, and Processing Time 7m
Implementing Tumbling and Sliding Windows 5m
Implementing the Count Window 5m
Implementing the Session Window 3m
Getting the Twitter Consumer Keys and Access Tokens 3m
Connecting to the Twitter Streaming API 7m

Fault Tolerance with State and Checkpoints

32mins

Categories of State 5m
Rich Functions to Store State 3m
Making Transformations Stateful: ValueState<T> 6m
Making Transformations Stateful: ListState<T> 4m
Making Transformations Stateful: ReducingState<T> 4m
Fault Tolerance with Checkpoint 6m
Restart Strategies 4m

Working with Multiple Stream Sources

11mins

The Union Operation 4m
The Join Operation 7m

About the author

Janani Ravi

Janani has a master's degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework. After spending years working in tech in the Bay Area, New York, and Singapore at companies such as Microsoft, Google, and Flipkart, Janani finally decided to combine her love for technology with her passion for teaching. She is now the co-founder of Loonycorn, a content studio focused on providing ...
