dask
0.12.0

Getting Started

  • Install Dask
  • Use Cases
  • Examples
    • Array
      • Creating Dask arrays from NumPy arrays
      • Creating Dask arrays from HDF5 Datasets
      • Creating random arrays
      • Build Custom Dask.Array Function
    • Bag
      • Read JSON records from disk
      • Word count
    • DataFrame
      • Dataframes from CSV files
      • Dataframes from HDF5 files
    • Delayed
      • Build Custom Arrays
      • Data Processing Pipelines
    • Distributed Concurrent.futures
    • Tutorial
  • Dask Cheat Sheet

Collections

  • Array
  • Bag
  • DataFrame
  • Delayed

Scheduling

  • Distributed Scheduling
  • Scheduler Overview
  • Choosing between Schedulers
  • Shared Memory
  • Scheduling in Depth

Diagnostics

  • Inspecting Dask objects
  • Diagnostics

Graphs

  • Overview
  • Specification
  • Custom Graphs
  • Optimization

Help & reference

  • Contact and Support
  • Changelog
  • Development Guidelines
  • Frequently Asked Questions
  • Comparison to PySpark
  • Opportunistic Caching
  • Internal Data Ingestion
  • Citations
dask
  • Docs »
  • Examples
  • View page source

Examples¶

Array¶

Array documentation

  • Creating Dask arrays from NumPy arrays
  • Creating Dask arrays from HDF5 Datasets
  • Creating random arrays
  • Build Custom Dask.Array Function
  • Use Dask.array to generate task graphs

Bag¶

Bag documentation

  • Read JSON records from disk
  • Word count

DataFrame¶

DataFrame documentation

  • Dataframes from CSV files
  • Dataframes from HDF5 files
  • Distributed DataFrames on NYCTaxi data
  • Build Parallel Algorithms for Pandas
  • Simple distributed joins
  • Build Dask.dataframes from custom format, feather

Delayed¶

Delayed documentation

  • Build Custom Arrays
  • Data Processing Pipelines
  • Basic Delayed example
  • Build Parallel Algorithms for Pandas
  • Build Dask.dataframes from custom format, feather

Distributed Concurrent.futures¶

Concurrent.futures documentation

  • Custom workflows
  • Ad Hoc Distributed Random Forests

Tutorial¶

A Dask tutorial from July 2015 (fairly old) is available here: https://github.com/dask/dask-tutorial

Next Previous

© Copyright 2016, Continuum Analytics.

Built with Sphinx using a theme provided by Read the Docs.