Skip to main content
Search
Join
Log in
Mentorship
Join CCMNet
CCMNet Guide
Mentorship Opportunities
Community
CCMNet Members
CCMNet Affinity Group
People
Affinity Groups
Blog
Jobs
Organizations
Community of Communities
Join the CSSN
Get Help
Ask a Question
Resources
Request a Consult
Projects
Knowledge Base
Mentorship Resources
KB Resources
Ask.CI Forum
Tags
About Us
About Us
User Guide
Become a Campus Champion
User Guide
Affinity Groups FAQ
Governance
Code of Conduct
News
About CCMNet
Annual Meeting
Tags
Using Dask on HPC Systems
Submission navigation links for Knowledge Base Resources
‹
Previous submission
Next submission
›
Submission information
Submission Number:
95
Submission ID:
3441
Submission UUID:
beb19e4c-3bb9-4635-acf3-6f805b8243d1
Submission URI:
/form/resource
Created:
Wed, 03/15/2023 - 13:56
Completed:
Wed, 03/15/2023 - 13:58
Changed:
Thu, 03/14/2024 - 11:50
Remote IP address:
73.229.137.18
Submitted by:
Daniel Howard
Language:
English
Is draft:
No
Webform:
Knowledge Base Resources
Approved
Yes
Title
Using Dask on HPC Systems
Category
Learning
Tags
training
,
jupyterhub
,
python
Skill Level
Beginner
,
Intermediate
Description
A tutorial on the effective use of Dask on HPC resources. The four-hour tutorial will be split into two sections, with early topics focused on novice Dask users and later topics focused on intermediate usage on HPC and associated best practices. The knowledge areas covered include (but are not limited to):
Beginner section
High-level collections including dask.array and dask.dataframe
Distributed Dask clusters using HPC job schedulers
Earth Science data analysis using Dask with Xarray
Using the Dask dashboard to understand your computation
Intermediate section
Optimizing the number of workers and memory allocation
Choosing appropriate chunk shapes and sizes for Dask collections
Querying resource usage and debugging errors
Link to Resource
Dask Tutorial Github Page
Video Recording of Tutorial - Part 1
Video Recording of Tutorial - Part 2