Coiled
@CoiledHQ
Followers
3K
Following
2K
Media
339
Statuses
2K
Lightweight cloud compute platform for Python people.
Everywhere
Joined February 2020
@mrocklin @mkennedy @TalkPython @CoiledHQ @natt941 made a good point on the podcast. "This message of 'Things are supposed to be delightful' is important to us." And Matthew Rocklin agreed that the cloud "can be a delightful experience... Go play." https://t.co/1Qhod30KZ6
thenewstack.io
Data scientist Matthew Rocklin argues Kubernetes isn't the best way to run large-scale Python workloads in the cloud. His company, Coiled.io, offers an alternative.
0
2
5
Easily configure shared memory size for CLI jobs with `--docker-shm-size`. Training PyTorch models on a GPU and need more memory? Ever run into "Error: No space left on device"? Customize Docker shared memory size with `--docker-shm-size`. https://t.co/9pwx86miBv
0
0
3
π¨ Job setup option for Coiled Batch Use `--host-setup-script` to configure your VM before your batch job starts. Easily: β
Install dependencies β
Mount cloud storage β
Handle authentication or any other setup your jobs need. https://t.co/LOQaSbvXr4
docs.coiled.io
Run your jobs on the cloud in parallel Coiled Batch jobs are a lightweight API that make it easy to run your code on any cloud hardware and scale out parallel workflows. This is useful when you wan...
0
0
0
@CoiledHQ ~5 years ago I worked at a startup where we had multiple engineers screwing around for months with Terraform, Kubernetes, EKS, etc. just to get the same capabilities I got after an hour of playing around w/ Coiled. Pretty cool.
2
1
3
Coiled 2024 in Review https://t.co/jC16ztdgE3 Itβs the time when companies issue year-end summaries, acclaiming success (or not), and forecasting incredible growth for the next year (or not). I thought Iβd do something similar for Coiled. Itβs been quite a year for us ...
docs.coiled.io
Coiled's 2024 in review. Where we started, what we did, and where we're going.
0
5
27
Calculating quantiles, a common application in #geospatial workloads, used to be slow due to GIL contention in NumPy. The new implementation in @dask_dev + @xarray_dev is up to a hundred times faster and scales independently of the number of threads π₯³. https://t.co/UnJjPEF3Pd
docs.coiled.io
Dec 17, 2024 2 m read Patrick Hoefler There have been a number of engineering improvements to Dask Array like consistent chunksizes in Xarray rolling-constructs and improved efficiency in map_overl...
0
5
9
We're big fans of rich for a nice terminal experience, but have found sometimes folks log things even rich can't handle. In the latest coiled=1.67.0 release, coiled logs automatically falls back to non-rich printing in these situations. Release notes: https://t.co/jWOKtDlRz0
0
1
3
New Post: Cloud Computing is Broken https://t.co/Ode3eXkGFO Investor asks: "What's next for Data/Cloud Infrastructure?" My answer: "Boring stuff. People struggle with basics." Cloud feels like MP3 players before iPod. In theory everything is good. In practice adoption is low
matthewrocklin.com
Introduction: Recently I was chatting with an investor about the market and they asked: What do you see coming for the world of Data/Cloud/Compute Infrastructure? Is there some new dataframe librar...
1
3
22
We're now on Bluesky! Should be pretty easy to find us, since bluesky lets us use our https://t.co/FYMNE1Ch34 domain as our handle βοΈ
0
0
0
Read about the latest improvement to https://t.co/sNVU1DIXuJ with Dask: https://t.co/EAkjYHQxZe Thanks to Patrick Hoefler of @CoiledHQ for the great work here!
xarray.dev
Recent dask improvements make GroupBy.map a lot better!
1
7
32
New Post: SLURM-Style Job Arrays on the Cloud https://t.co/Fu7kVUSVAZ HPC Job scripts were the first form of parallelism I ever used as a graduate student. They're dead simple and accessible to almost anyone. We replicated the API with Coiled. It feels pretty slick to me π
docs.coiled.io
Nov 19, 2024 3 m read Matthew Rocklin Update Jan. 9th, 2025: Weβve added documentation for Coiled Batch. SLURM and other job schedulers have made it easy to run scripts on HPC systems for decades. ...
1
5
16
@CoiledHQ is amazing. If you want to have distributed compute and provisioned infrastructure from the code - its easy as that. Forget @ApacheSpark and @awscloud Sagemaker, EMR
0
1
4
New blog post: Scale AI-based Data Processing EASY The FineWeb-Edu dataset comes from processing 45TB (π€―) of FineWeb And it uses a Language Model to classify the educational level of the text ππ Still, we reproduced it in a few lines of code ! The key ? HF + Dask π
1
25
98
Implemented @CoiledHQ into our product to offload data syncing from BigQuery to Neo4j π€― Works like butter π§ Now I donβt have to worry about scaling VMs dynamically to handle variable loads.
0
1
3
We're to build a 100-TB scale geospatial benchmark suite https://t.co/vlzt3Szdmd We've seen an uptick in geospatial users and in challenges of the Xarray/Dask stack to scale beyond ~500-GiB. This post presents a call for benchmark workloads.
docs.coiled.io
Sep 9, 2024 3 m read James Bourbeau, Matt Rocklin TL;DR: We need your help creating a geospatial benchmark suite. Please propose your workload on this GitHub discussion. People love the Xarray/Dask...
2
12
44
Arraylake and @CoiledHQ work great together! You can use Coiled to manage your cloud computing infrastructure with @dask_dev, and store your data as @zarr_dev in Arraylake. We just added new a documentation page about our integration with Coiled. https://t.co/5070jBW2k1
docs.earthmover.io
Built on Dask, a parallel computing library, Coiled makes it easy to use Python on the cloud.
0
4
16
Run a Python script on a cloud GPU with one line of code. Training a @PyTorch model training takes ~10 minutes and cost ~$0.12 on the NVIDIA T4 GPU on AWS. Coiled handles provisioning hardware, setting up drivers, and installing CUDA-compiled PyTorch. https://t.co/JoUAOWUe9e
0
0
2
Dask DataFrame is now 20x faster. Some of most prominent changes include: - Apache Arrow support in @pandas_dev - Better shuffling algorithm for faster joins - Automatic query optimization Learn more: https://t.co/eVbgpWE7BZ
0
8
30
TPC-H Cloud Benchmarks: Spark, Dask, DuckDB, Polars Across scales: 10 GiB, 100 GiB, 1 TiB, 10 TiB Hardware: MBP and AWS It was a fun experiment. No project wins uniformly. DuckDB and Dask do pretty well. https://t.co/yHzHuksU3E
6
8
44
Recommendation of the day: `coiled notebook start` to run a remote Jupyter Lab from big machines in cloud but with file sync that feel "local". Demo from @CoiledHQ
https://t.co/18bP7NCHCI
#python #jupyter
0
1
4