Sl.No | Main Topic | Sub Topic | No of Days |
1 |
Basic Big data related knowledge | Hadoop ecosystem |
2 |
Distributed file system (HDFS) Computation framework (MapReduce), Resource Management Component (YARN) |
Hadoop ecosystem tool-Spark, Zookeeper, Kafka, Jindo Filesystem. |
Hadoop ecosystem tool-PIG, HIVE, HBase, Oozie |
Hadoop ecosystem tool-Flume, Sqoop, Solr, Lucene, Ambari |
2 |
MaxCompute | Max Compute components such as project, table, partition, resources, task, etc.
|
2 |
Max Computing Pricing Models
|
odpscmd, management console, Java SDK, tunnel command-line tools, Tunnel SDK |
user-defined functions, including UDF, UDAF, and UDTF |
Permission management of MaxCompute such as users, roles, authorization (ACL & Policy), project space protection, external and security level |
3 |
DataWorks | Data Integration and Data governance |
2 |
Data Development-workflow task and node task development and design, can configure appropriate dependencies and periodic scheduling |
Data Studio, data service, operation & maintenance center |
DataAnalysis. organization management and project management |
4
|
E-MapReduce: | E-MapReduce-Benefits and Architecture |
2 |
Cluster Management and knowledge on cluster type such as Hadoop cluster Kafka cluster ZooKeeper cluster Druid cluster Dataflow cluster |
Data Development Manage a workflow project Edit jobs Edit a workflow |
Ad hoc queries ,Scheduling center and Pricing |
5 |
Platform for AI(PAI) | Machine Learning Platform for AI, Architecture and Benefits |
1 |
Create a project Data preparation Data preprocessing |
Manage Projects and Data Sources |
6
|
Data V | Manage Widgets such as Line charts,Pie charts,Scatter charts and Basic flat map widgets |
2 |
| Manage Workspace,Projects,Editor and Data Sources |
7 |
Quick BI | Data Modelling and Data Analysis |
1 |
Create dataset, create a workbook, data modelling and create report tables |