BIG DATA ADVANCED

Data Science Program

In this training, you will learn what big data is, take a look at the latest open-source technologies in each area independently, and create a VM based on Rocky OS on your local machine with Python, Anaconda, and JupyterHub as your developing tools.

You learn how to use NumPy and Pandas as your data structure, and what operations you can perform on these data structures with code examples. You will learn how you can visualize your data for better understanding and to display your analysis results.

You learn how to download Apache Spark, configure it in your Jupyter notebooks, use PySpark library to create data frames and manipulate them with code examples, and do Spark tunning and optimization.

You will learn how to develop neural network models using PyTorch library with use cases for each network type.


Related Program(s)