Tobias Raabe Profile
Tobias Raabe

@tobraab

Followers
76
Following
378
Media
18
Statuses
86

he/him — econ and programming — check out my soundcloud: https://t.co/aJW0b4s8Gr

Joined July 2014
@tobraab
Tobias Raabe
4 years
Hi everyone, I want to show you pytask, a tool for managing workflows in your research or data analysis project! #datascience #econtwitter #python #reproducibility https://t.co/nURATMVLK4 🧵 1/16
github.com
pytask is a workflow management system that facilitates reproducible data analyses. - pytask-dev/pytask
1
6
18
@tobraab
Tobias Raabe
3 months
Observation since moving in with gf: washing clothes seems like an area where people heavily optimize for their personal local minima. If you bring two people together and they try to mix clothes and approaches, nothing works anymore.
0
0
0
@tobraab
Tobias Raabe
2 years
I realized GPT-4o knows about pytask but only proposes code that follows removed syntax. I guess I can be happy that I have a little userbase and I am not flooded with issues.
0
0
0
@tobraab
Tobias Raabe
2 years
Old: Deprecation cycles are as long as necessary for efficient communication and user adjustment. New: Deprecation cycles should be as long as needed for LLM APIs to incorporate the latest syntax in their training checkpoints, ensuring they don't suggest outdated code.
1
0
0
@tobraab
Tobias Raabe
2 years
✨ Bonus: Project Templates with pixi. cookiecutter-pytask-project v1.7.0 has been released. It now uses @prefix_dev's pixi, which offers a fantastic developer experience. 🙏 Thanks to everyone who contributed via code, feedback, and discussions! https://t.co/XaYtS0DInB
github.com
What's Changed Apart from some little updates, this release switches the package and environment manager from conda/mamba/micromamba to pixi. If you do not know about it, check it out. It is am...
0
4
7
@tobraab
Tobias Raabe
2 years
🤖 Task Generators: Following up on the previous example, we can use task generators to process each downloaded file individually, copy it to another location, and add the .txt extension. https://t.co/ho4uCJfAHw
1
0
0
@tobraab
Tobias Raabe
2 years
🤖 Provisional Nodes: Provisional nodes can be used when you do not know how many products a task creates. For example, downloading all files without a file extension from a GitHub repository. https://t.co/0BdYRlDqXD
1
0
0
@tobraab
Tobias Raabe
2 years
Caveats when working with remote backends: ❌ Limited cross-platform support: Windows users should use WSL2. ⚠ Local files: pytask automatically syncs local files to remote workers. Store files in S3, etc., to avoid time-consuming uploads. https://t.co/7rIriwoQdA
pytask-parallel.readthedocs.io
There are a couple of things you need to know when using backends that launch workers remotely, meaning not on your machine. Cross-platform support: Issue:#102. Currently, it is not possible to run...
1
0
1
@tobraab
Tobias Raabe
2 years
You can also run every task in a cloud VM using the coiled.Cluster. https://t.co/OlS9RJtxrk
1
0
0
@tobraab
Tobias Raabe
2 years
Using pytask-parallel with coiled, you can run single tasks in a cloud VM of your choice with the hardware you need! And it scales automatically with the load! https://t.co/e6Gu5VVqQk
1
0
0
@tobraab
Tobias Raabe
2 years
✨ pytask v0.5 release announcement ✨ This release comes with two big features: 🤖 Provisional Nodes and Task Generators; 🚀 Support for HPC and your favorite cloud provider (AWS, GCP, Azure) using dask / @CoiledHQ. ✨ Bonus at the end. Find out more ⬇ https://t.co/7dSRYEDXsm
github.com
Highlights ✨ pytask v0.5.0 is released and contains two big features. ✨ 🤖 Provisional Nodes and Task Generators pytask now has mechanisms to define tasks that produce an unknown number of products...
1
1
5
@tobraab
Tobias Raabe
2 years
I understand creating this one button at the end feels like a waste of time after so many iterations and checks. I would argue you should build this button right from the start to be more confident in your code and to find errors quicker. Maybe use my own https://t.co/ayTn2mZgVo 😄
pytask-dev.readthedocs.io
Features: pytask is a workflow management system that facilitates reproducible data analyses. Its f...
@toniwhited
Toni Whited
2 years
Texting with a colleague at another uni who is mired in the AEA replication process. Wasting > month getting all in order so that some RA can just push one button and get all the data merges right so that the results are perfect to the 3rd decimal place. 1/5
0
0
3
@tobraab
Tobias Raabe
3 years
👉 pytask.mark.parametrize is deprecated and will be removed in v0.4 (coming in ~1-2 months). Switch to the new approach! https://t.co/AkCMoNWUQ7 👉 pytask is even lazier (or more careful) and uses hashes to detect changes in task modules. Thank you to all contributors!
0
0
2
@JanosGabler
Janos Gabler
3 years
I wrote a blogpost about dags, a package I created with @tobraab. Dags provides tools to combine several interrelated Python functions into one function. If you want to know what that means and why it is useful, check out the post:
1
4
9
@tobraab
Tobias Raabe
3 years
I know it is an example of emotional data viz and I know that I am the joke here 🙃
0
0
0
@tobraab
Tobias Raabe
3 years
My climbing book casually introduced me to time travel.
1
0
2
@tobraab
Tobias Raabe
3 years
It’s not my profile of course. I cannot manage my time.
0
0
4
@tobraab
Tobias Raabe
3 years
Cherish your weekends!
1
0
3
@JanosGabler
Janos Gabler
3 years
Ever run an optimization with @scipy but none of the algorithms worked? To use other algorithms, you had to learn a new library and rewrite your code? Say hi to #estimagic, a modern #python package that wraps all algorithms you’ll ever need 🚀 Of course, there’s more: A 🧵
1
20
39
@tobraab
Tobias Raabe
3 years
Wrong column.
0
0
2