Scaling Data Analysis with Python and Dask Certificate for Luc ...
Certificate ID:
743287
Authentication Code:
9dcd1
Certified Person Name:
Luc HERMITTE
Trainer Name:
Augustin Poelmans
Duration Days:
2
Duration Hours:
14
Course Name:
Scaling Data Analysis with Python and Dask
Course Date:
9 November 2023 09:00 to 10 November 2023 17:00
Course Outline:
Introduction
- Overview of Dask features and advantages
- Parallel computing in Python
Getting Started
- Installing Dask
- Dask libraries, components, and APIs
- Best practices and tips
Scaling NumPy, SciPy, and Pandas
- Dask arrays examples and use cases
- Chunks and blocked algorithms
- Overlapping computations
- SciPy stats and LinearOperator
- Numpy slicing and assignment
- DataFrames and Pandas
Dask Internals and Graphical UI
- Supported interfaces
- Scheduler and diagnostics
- Analyzing performance
- Graph computation
Optimizing and Deploying Dask
- Setting up adaptive deployments
- Connecting to remote data
- Debugging parallel programs
- Deploying Dask clusters
- Working with GPUs
- Deploying Dask on cloud environments
Troubleshooting
Summary and Next Steps