Parallel Processing and Distributed Data Structures Using the Dask Package for Python

April 16, 2019, 3:30pm to 6:00pm
Intended audience: Academic Support Staff, Faculty, Graduate Students

Dask allows you to set up parallel computations on one or more machines (or Savio nodes), including working with large datasets distributed across multiple Savio nodes. We'll cover the different ways to set up and run parallel computations using Dask.

Topics will include:

- Parallelizing loops using delayed evaluation
- Distributed data structures (including parallel I/O)
- Parallelization on one or more machines
- Using Dask in the context of SLURM job submissions
- Random number generation
- Nested parallelization, memory use, and load-balancing

After the training, we'll have an informal get-together with snacks and drinks.

Training: 3:40-5pm

Reception: 5-6pm

Please register here: https://docs.google.com/forms/d/e/1FAIpQLSdVZnop1Nl0DM6D-lxl3zL0ISAJZ_GHzh63rEbSXXazfi0Y0Q/viewform

