Program Requirements and Expectations
Participants are required to have both a Prakerja and an Alibaba Cloud account.
Participants need to complete the e-modules and pass the quizzes.
Participants will be eligible to attend the "Beyond the Basic Series" if:
- They have passed the ACA Big Data Computing exam.
- They have attended a minimum of one instructor-led class session.
- They have completed the e-learning modules and passed all the quizzes.
Self-paced Learning and Self Test
Module |
Course Name |
Objective |
|
Big Data Basics |
Big Data Basic Concepts |
In this video, we introduce the core components of Alibaba Cloud’s Big Data ecosystem. Learn how tools such as DataWorks, MaxCompute, QuickBI, and DataV fit together, and see how each tool can be used to manage and process large datasets more effectively. |
|
Big Data Basics |
Alibaba Cloud Big Data Products Overview |
In this video, we introduce the core components of Alibaba Cloud’s Big Data ecosystem. Learn how tools such as DataWorks, MaxCompute, QuickBI, and DataV fit together, and see how each tool can be used to manage and process large datasets more effectively. |
|
Big Data Basics |
Scenario in Which Products Would be Used |
This this video, we examine some real data processing scenarios, and look at how Ailbaba Cloud services can be tied together to address a large number of different storage and analysis requirements. |
|
DataWorks |
DataWorks Overview |
This short video introduces the topics which will be covered in the following several video lectures, such as Data Acquisition, Processing, and Quality Monitoring with DataWorks. |
|
DataWorks |
DataWorks Introduction |
In this video lecture, you'll learn what DataWorks is, and how it fits into the Alibaba Cloud Big Data ecosystem. Develop a fuller understanding of the capabilities of DataWorks with analyses of common use cases. |
|
DataWorks |
Demo of DataWorks - Data Acquisition |
Learn how to create a new DataWorks Workspace, and import log data into MaxCompute using DataWorks Data Integration. |
|
DataWorks |
Demo of DataWorks - Data Processing |
Learn how DataWorks can be used to create "Business Flows" (Workflows) for automatic, scheduled data processing. Create a multi-stage data processing workflow and run it from the DataWorks console. See how User Defined Functions (UDFs) can be used to add custom features to MaxCompute SQL. |
|
DataWorks |
Demo of DataWorks - Data Quality Monitoring |
See how DataWorks Data Quality monitoring can be used to ensure consistent data quality each time data import and processing tasks are run. See how Data Quality metrics are selected and applied to MaxCompute tables. |
|
DataWorks |
DataWorks Introduction Summary and Review |
In this video, we quickly summarize the DataWorks concepts covered in the previous few sections. |
|
DataWorks |
More Details Of DataWorks |
Learn more about DataWorks by visiting the DataWorks documentation. Here, you can learn about the specific features offered by different DataWorks editions, as well as watch useful video walkthroughs, access the DataWorks FAQ page, and see specific step-by-step guides to using core features like Operation Center or DataService Studio. |
|
Big Data Blog |
Setting up a MySQL data source |
Learn how to create, configure, and load sample e-commerce data into an RDS MySQL database. This database is used in the next section as a data source for a DataWorks project. |
|
Big Data Blog |
Importing and Processing MySQL data using DataWorks |
Learn how to import data from MySQL into MaxCompute using DataWorks, how to process that data by creating a "Business Flow" (Workflow), and learn how to export your processed data from MaxCompute back into your MySQL database. |
Module |
Course Name |
Objectives |
Course Video |
Python Structured Data Processing |
Introduction to Python Pandas and business scenarios |
Learn how to install the Anaconda Python distribution and open a new Jupyter notebook, which can be used for processing data with Pandas. |
|
Python Structured Data Processing |
Loading data from different data types |
In this video lecture, you'll learn how to import CSV or JSON data into Pandas. The video also demonstrates how to access all or part of the data once it is loaded into Pandas. |
|
Python Structured Data Processing |
Problems of the raw data |
Learn how to identify common issues with raw datasets such as missing (null) values. |
|
Python Structured Data Processing |
Data scrubbing |
Learn how Pandas can be used to clean your raw data by replacing NaNs and NULL values, and by standardizing data types. |
|
Python Structured Data Processing |
Data analysis and virtualization |
Learn how to filter, sort, and group data in Pandas, as well as generate plots which allow you to gain insights by visualizing data instribution, trends, and correlations. |
|
Python Structured Data Processing |
Deal with more than one data set |
In this video lecture, you'll learn how to work with multiple datasets in Pandas at the same time, as well as how to establish connections between datasets (merge and join) and plot data relationships. |
|
SQL for Beginners |
SQL For Beginners Course Objectives |
This video briefly introduces the SQL content discussed in the following sections. |
|
SQL for Beginners |
Select Statement Basic |
Learn how to query datasets with the SQL SELECT statement. See how SELECT can be used to filter, group, and sort data. This section also discusses more advanced usage, such as using the DISTINCT keyword on tables which contain NULL values. |
|
SQL for Beginners |
SELECT statement with WHERE |
Learn advanced filtering techniques with SQL SELECT using the WHERE keyword. Learn how WHERE can be used with logical keywords (AND, OR, XOR, NOT) as well as arithmetic operators and comparison operators such as IN, IS, NULL, and LIKE. |
|
SQL for Beginners |
SELECT with ORDER BY and Tips |
Learn advanced filtering techniques with SQL SELECT using the ORDER BY keyword. See how ORDER BY can be used to sort results in ascending or descending order. |
|
SQL for Beginners |
Table Join |
Learn how SQL JOIN statements work. See how inner, outer, left, and right joins are used to combine multiple tables in SQL. |
|
SQL for Beginners |
Troubleshooting |
In this section, learn some of the basic skills needed to read and understand SQL errors and warnings |
|
SQL for Beginners |
SQL Models for Syntax Checking |
In this video lecture, you'll learn how MySQL databases allow you to change SQL modes, allowing you to take advantage of different SQL features for different use-cases. |
|
MaxCompute Basic |
MaxCompute Course Content Briefing |
In this video, we provide a brief outline of the following sections, which focus on Alibaba Cloud's data warehousing tool, MaxCompute. |
|
MaxCompute Basic |
Introduction of MaxCompute |
In this video lecture, you'll learn about MaxCompute, Alibaba Cloud's distributed data storage and processing tool. Learn how MaxCompute works, what it can do, and how it is used at Alibaba Group to store and process petabytes of data. |
|
MaxCompute Basic |
MaxCompute Architecture |
In this video, we provide a brief outline of the following sections, which focus on Alibaba Cloud's data warehousing tool, MaxCompute. |
|
MaxCompute Basic |
Basic Concepts of MaxCompute |
MaxCompute In this video, we explore key concepts in MaxCompute, including Projects, Tables, Partitions, and Resources. |
|
MaxCompute Basic |
How to Use MaxCompute |
In this section, we review the interfaces that allow you to interact with MaxCompute, including the MaxCompute CLI, the MaxCompute Studio IDE plugin, and DataWorks. |
|
MaxCompute Basic |
Quick Start Guide of MaxCompute |
Learn how to import data from OSS into MaxCompute using DataWorks, and see how the data can be manipulated and modified using the DataWorks console. |
|
MaxCompute Basic |
MaxCompute Users And Roles |
Learn how MaxCompute users and roles can be used to control data access privilages within MaxCompute projects. |
Module |
Course Name |
Objective |
|
MaxCompute SQL Development |
MaxCompute SQL Overview |
Gain a basic understanding of MaxCompute's SQL dialect, and how it differs from common SQL dialects such as MySQL's SQL. |
|
MaxCompute SQL Development |
Data Definition Language (DDL) |
Develop an understanding of the basic MaxCompute SQL DDL operations (creating, deleting, and modifying tables). |
|
MaxCompute SQL Development |
Data Manipulation Language (DML) |
Develop an understanding MaxCompute's SQL DML operations (selecting and inserting records). |
|
MaxCompute SQL Development |
Built-In Function (Part 1) |
In this video, we review some of MaxCompute SQL's built-in functions, including mathematical functions like MAX, ABS, and RAND. |
|
MaxCompute SQL Development |
Built-In Function (Part 2) |
In this video, we review some of MaxCompute SQL's built-in functions, including time manipulation functions such as GETDATE, DATEPART, and WEEKDAY. |
|
MaxCompute SQL Development |
Built-In Function (Part 3) |
In this video, you will learn how SQL functions are used in general, along with common use-cases for common SQL functions in most SQL dialects. |
|
MaxCompute SQL Development |
MaxCompute SQL Development Summary And Review |
In this video lecture, we review the MaxCompute concepts discussed in the previous sections, with a focus on MaxCompute SQL development. |
|
MaxCompute User Define Function |
Introduction Of UDF |
Learn the basics of MaxCompute's User Defined Functions (UDFs), which allow you to add your own new SQL functionality to MaxCompute's SQL language. Learn the differences between the major UDF types: UDF, UDAF, and UDTF. |
|
MaxCompute User Define Function |
The Implement Logical Of UDF |
Learn how to create your own MaxCompute UDF functions in Java. |
|
MaxCompute User Define Function |
UDF Development Process |
In this video lecture, we explain the UDF development process in detail, including the steps needed to create and compile a Java UDF in a local IDE. |
|
MaxCompute User Define Function |
UDF Summary And Review |
In this section, we briefly review User Defined Functions, as covered in the previous three sections. |
Module |
Course Name |
Objectives |
Course Video |
QuickBI |
QuickBI Course Content Briefing |
This video outlines the topics covered in the following several video lectures, including basic concepts in Alibaba Cloud QuickBI, proper chart selection, and construction of QuickBI dashboards. |
|
QuickBI |
Background Introduction Of BI |
Learn how to choose the right type of charts to display your data, and how different types of charts are best suited to different roles (comparison, relationship, distribution, composition). |
|
QuickBI |
Features Highlights Of QuickBI |
This section introduces QuickBI and explains how QuickBI fits into the Alibaba Cloud Big Data ecosystem. |
|
QuickBI |
Commonly Used Charts Introduction Part One |
In this video lecture, we give an in-depth explanation of different chart types, and examine scenarios in which each chart type is a good (or bad) choice. |
|
QuickBI |
Demo Of Commonly Used Charts Part One |
See a hands-on demo of the QuickBI console, and learn how to construct a basic data dashboard. |
|
QuickBI |
Commonly Used Charts Introduction Part Two |
Learn about additional useful chart types including the funnel chart, tree chart, tree map, and conversion chart. |
|
QuickBI |
Demo Of Commonly Used Charts Part Two |
See a hands-on demo of more advanced charts in QuickBI, such as the conversion chart. |
|
QuickBI |
Commonly Used Charts Introduction Part Three |
In this section, we examine some additional dashboard elements in QuickBI such as the card, word cloud, and map (geo-chart) elements. |
|
QuickBI |
Demo Of Commonly Used Charts Part Three |
See a hands-on demo of some of QuickBI's additional dashboard elements such as the card and word cloud. |
|
QuickBI |
Summary Of All Charts In Categories |
In this video lecture, we summarize all the chart types discussed previously, tying each chart back to its best use-cases. |
|
QuickBI |
End To End Demo Introduction |
Here we introduce the scenario for the end-to-end QuickBI dashboard construction demo carried out in the next video. |
|
Machine Learning Platform for AI |
Introduction Of PAI |
Gain a basic understanding of Platform for AI (PAI), Alibaba Cloud's Machine Learning platform. See how a simple PAI environment can be quickly and easily created using the Alibaba Cloud web console. |
|
Machine Learning Platform for AI |
Quick Start And Architecture Overview |
See a demonstration of PAI's low-code Machine Learning tool, PAI Studio. Learn how this interface can be used to prepare data, train and test machine learning models, and evaluate model accuracy. |
|
Machine Learning Platform for AI |
Demo: Best Practice Of PAI |
Learn how PAI's low-code Studio tool can be used to train a machine learning model to detect financial fraud. This end-to-end demo explains how fraud rings are identified and shows the steps needed to train a working machine learning model on the provided sample data. |
|
Machine Learning Platform for AI |
User Cases |
In this section, we take a look at some of PAI's current users, and discuss the ways in which they are utilizing PAI to improve a variety of different business processes. |