Dask: Scalable Analytics in Python

dask.org

Dask is a flexible library for parallel computing in Python. It runs on a single laptop or scales out to a cluster of hundreds of machines. It has two main parts: dynamic task scheduling optimized for computation, and “Big Data” collections such as parallel arrays, dataframes, and lists that extend common interfaces.
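
To make the two parts concrete, here is a minimal sketch, assuming Dask is installed with array support (for example via `pip install "dask[array]"`). The helper functions `inc` and `add` and the array sizes are made up for illustration. The first half builds a small task graph with `dask.delayed`, and the second half uses a chunked `dask.array` as an example of a parallel collection.

```python
import dask
import dask.array as da


# --- Part 1: dynamic task scheduling -------------------------------------
# Illustrative helpers (not part of Dask); decorating them makes calls lazy.
@dask.delayed
def inc(x):
    return x + 1


@dask.delayed
def add(x, y):
    return x + y


# Calling the functions only records tasks in a graph; nothing runs yet.
total = add(inc(1), inc(2))

# compute() hands the graph to a scheduler, which may run the two
# independent inc() tasks in parallel before running add().
result = total.compute()

# --- Part 2: a "Big Data" collection --------------------------------------
# A 1000 x 1000 array of ones split into 250 x 250 chunks; each chunk is an
# ordinary NumPy array, and operations are applied chunk by chunk.
x = da.ones((1000, 1000), chunks=(250, 250))
array_total = x.sum().compute()

if __name__ == "__main__":
    print(result)
    print(x.numblocks)
    print(array_total)
```

In both halves the work is described first and executed only when `compute()` is called, which is what lets the scheduler decide how to run the tasks.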
