
Bruno Rodrigues
@brodriguesco
Followers
4K
Following
8K
Media
2K
Statuses
12K
Sworn in Data Janitor blog: https://t.co/id384GeKXV youtube: https://t.co/zKHU0YfLcA mastodon: https://t.co/Dooa7319vP bsky: https://t.co/LKIAD3RUnw "nix solves this"
Joined November 2014
My book is done. I need to write a conclusion to the last chapter and will likely rewrite some paragraphs, but I won't make major changes anymore. So if you're interested in building reproducible analytical pipelines with #RStats, take a look it's free:
11
284
1K
if you're a data 'anything' and wonder "why is R still a thing?" maybe this answer I wrote on Reddit will help #RStats #pydata #datascience
18
86
577
And another chapter done! Learn about build automation using {targets}. If you’re interested in upping your data science game and build reproducible analytical pipelines, give my book a read, it’s free:. Comments more than welcome! #RStats #DataScience
7
112
543
#RStats is used by 69 billion people each day. But no one uses it effectively. Here are 420 tips and tricks you should know TODAY to extract value from your data:.
11
29
336
I've just read that that Yihui Xie @xieyihui left @posit_pbc . he will continue to work on his packages as a contractor . that's really surprising (not in a good way) news to start 2024. .
22
75
313
I don't think I've ever read the paper that introduced the R programming language. here it is #RStats
6
62
278
#RStats anecdote: When I started learning econometrics at Uni in France, I asked the professor which software was the standard. He said "Air". That evening at home I Google "Air econometrics software" but didn't find anything. I asked him the next day to clarify. .
10
13
251
Little #RStats function I use when I quickly need to open the result of various %>% operations in Excel. as in : mtcars %>% filter(am == 1) %>% show_in_excel()
8
34
251
amazing read by @nntaleb and providing a cheap black and white version of the book as print on demand is a great idea
6
14
230
If you use #rstats and are interested in reproducibility but confused by all of the ecosystem around it, I'm currently working on an ebook that should. Called "Building reproducible analytical pipelines with R" you can start reading it here
8
31
185
Let me take this opportunity to shill the infer #RStats package. read the vignette which explains why charts like the one below can be ignored nowadays (we have very good computers these days).
4
27
198
for those of you that might not understand why the claim in this tweet is so ridiculous, here’s the @duckdb benchmark showing how many popular data analysis packages perform on data with 1000000000 rows.
@JuddLegum @elonmusk In my initial run, which processed the first 60,000 rows, I did not find these awards—my hard drive overheated long before I could complete a full pass through the database. In a later run, which I referenced in another post, I did identify two such awards. That discrepancy is a.
6
22
197
My new book project, "Building reproducible analytical pipelines with #rstats", will teach you how to make projects reproducible. Manifest your interest here: . just want to read the WIP version? 👇. if you prefer physical 1/2.
5
25
171
I really love this . The {targets} #RStats package is probably one of the most underrated packages out there! Give it a try! You won't regret it!
8
19
153
Are you struggling to see how #Docker can make your #datascience pipelines reproducible? Want to learn how to make projects reproducible?Take a look at the latest chapter of my free book:. Comments more than welcome! #RStats #DataScience
1
44
158
a very cool illustration showing why you shouldn’t use loops in #RStats : remember, many operations in R are vectorized and will be very quick if you take advantage of this!.
More languages, more insights!. A few interesting takeaways:. * Java and Kotlin are quick! Possible explanation: Google is heavily invested in performance here. * Js is really fast as far as interpreted / jit languages go. * Python is quite slow without things like PyPy.
10
13
161
It is with great joy that I announce today that our (together with @PhilippBauman15 ) package rix is now on CRAN:. #RStats.
8
27
149
Really happy with how it looks! I keep looking at it!
The proof is in, and it's looking niiiiiice!. Building reproducible analytical pipelines with #rstats will be available on Amazon on June 20th!
2
15
135
Just finished the chapter on testing in my book. If you’re interested in upping your data science game and build reproducible analytical pipelines, give my book a read, it’s free:. Comments more than welcome! #RStats #DataScience
2
29
148
Today is my birthday, so I thought I'd make a coupon code so you can get a DRM-free Epub and Pdf of Building reproducible analytical pipelines with #rstats for 14.99 . Physical books are coming next week to Amazon !
7
15
118
The proof is in, and it's looking niiiiiice!. Building reproducible analytical pipelines with #rstats will be available on Amazon on June 20th!
3
12
121
New blog post: Machine learning with {tidymodels} (by @topepos ). Learn how to tune several models in parallel using list of models, list of grids and furrrr::future_map() (by @dvaughan32 )! #rstats #datascience
2
28
120
Python is the best language for statistical analysis and data science hands down. I'm currently porting all my reproducible pipelines (R in Nix shells) over to Jupyter notebooks (pip is all you need) running inside docker containers.long live Pandas!.this now a #pydata account.
23
4
131
You can now get physical copies of my book, "Building reproducible analytical pipelines with #RStats" on (USA only for now):. other marketplaces (FR, DE, ES, and more) will follow in the coming days!.
1
25
121
Please, if you have an idea for a package, just write it and publish it on CRAN. The availability of high quality packages are one of #RStats greatest strengths, just package stuff please
5
9
125
I love the {withr} #RStats package. Need to create a single plot? Just use withr::with_package("ggplot2", . )
10
18
119
You can get the Docker image with all the code and slides here: the slides are also online here:
𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗿𝗲𝗽𝗿𝗼𝗱𝘂𝗰𝗶𝗯𝗹𝗲 𝗮𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝗮𝗹 𝗽𝗶𝗽𝗲𝗹𝗶𝗻𝗲𝘀 𝘄𝗶𝘁𝗵 𝗥 . Join #ReproT 𝟭𝟵 𝗝𝘂𝗹𝘆, where Dr Rodrigues @brodriguesco will show how to make research more reproducible: @jessiebaldwin.@SwissRN.@ukrepro.@ReproducibiliT
0
29
117
@Cor3ntin I ask myself following question: has there ever been any vaccine, any at all ever, where several decades after a vaccination campaign people developed any disease due to said vaccine?.
29
6
91
My new (invited) blog post for @rdata_lu : Skip errors in R loops by not writing loops #rstats #purrr #DataScience @k_rosamont
2
29
100
#RStats 4.4.3, released 40 minutes ago, is already available through rix by using today’s date!
0
7
108
I'm very happy to announce that {chronicler}, an #RStats 📦 I've been working on for 3+ months is now available on CRAN! The following blog post introduces it very quickly:. but continue scrolling down if you want an even TL;DR summary!
4
12
92
Building reproducible analytical pipelines with #rstats will be available on Amazon on June 20th. 34.99$, 29.99£, 29.99€, 44.99CAD. You can buy a DRM free epub and pdf at for 19.99 (min. price). You can read it for free forever at
0
30
98
new blog post: teaching the #tidyverse to beginners: . This blog post details what I covered and how the participants to the courses reacted to it! #rstats. @drob @strengejacke @hadleywickham.
0
15
83
#Python users, which R packages do you use, if any? Does it annoy you to have to use them? Why/why not? please RT.
14
9
90
It is now possible to easily pass data from and to Python and #RStats by serializing to json (but you can still convert more complex objects using rxp_py2r() and rxp_r2py() which uses {reticulate} under the hood for conversin)
0
12
88
New blog post: Reproducible #DataScience with #Nix, part 6 -- CI/CD has never been easier. or how to run an #RStats {targets} pipeline on Github Actions using #Nix
3
15
79
Need a last minute gift for the fan of reproducibility and #RStats fan in your life?. Get my ebook, "Building reproducible analytical pipelines with R" for $9.99. Merry Christmas!
0
17
82
PSA: If you're an R user and rely on MRAN for reproducibility, you should know MRAN is getting shutdown. You might want to start migrating to the @posit_pbc package manager which also allows you to use packages as of a specific date like MRAN.
3
24
81
Just changed like 100 grepl calls to stringi::stri_detect and my pipeline now runs 4 times faster #RStats.
7
5
74
Even though I think #RStats is the superior choice when it comes to data *anything*, I think I might work on a python edition of the book. Those notebooks hosted on Github without any requirements.txt or Dockerfiles need to stop!.
Need a last minute gift for the fan of reproducibility and #RStats fan in your life?. Get my ebook, "Building reproducible analytical pipelines with R" for $9.99. Merry Christmas!
2
5
73
New blog post: Reproducible data science with Nix, part 13 -- {rix} is on CRAN!. #RStats #DataScience #nix
0
18
72
Looking for a nice introduction to #reproducibility using #Docker and #RStats? @_ColinFay has you covered with this great blog post!
2
14
68
New release of rix on CRAN!.rix generates Nix expressions which are then used by the Nix package manager to build.reproducible #RStats development environments. Many new features in this release:. 1/3
2
13
74
R's native pipe is almost 4 years old, I think I'll start using it inside my packages (thus requiring at least #RStats 4.1.0 to be installed)
6
1
72
New #RStats x #Nix blog post: Reproducible data science with Nix, part 9 -- rix is looking for testers!. Check out the latest feature implemented by @PhilippBauman15 , with_nix(), it'll blow your mind!
1
14
59
Almost one year ago, I’ve ran all example scripts shipped in versions of R from 0.6.0 up to 4.2.2 on R version 4.2.2 . as you can see, #RStats 4.2.2 is still able to run more than three quarters of the examples of the oldest release of R!
0
6
60