Apache Spark is a user-friendly, open-source platform for large-scale data processing, analytics, and parallel computing. This workshop uses Apache Spark and Python (PySpark) to analyze data sets that are too large to be handled and processed by a single computer.
Through hands-on guided examples, the workshop covers the basics of Spark and the high-level architecture of Resilient Distributed Datasets (RDDs). The examples are written mainly in Python, so the APIs covered are those available in PySpark: the Spark Core API (RDD API), Spark SQL, and Pandas on Spark.
Participants will learn how to import data, use functions to transform, reduce, and compile the data, and write parallel algorithms that can run on Alliance clusters.
Prerequisites: