Jacek Laskowski

@jaceklaskowski

Followers
7K
Following
15K
Media
4K
Statuses
26K

Freelance Data(bricks) Engineer • #ApacheSpark #DeltaLake #UnityCatalog #MLflow #Databricks #DSPy | Java Champion | @theASF | #DatabricksMVP

Warsaw, Poland
Joined May 2009
@jaceklaskowski
Jacek Laskowski
4 years
It turns out we as a society have learnt nothing since World War II if we allowed people like Vladimir Putin to start a war against #Ukraine! I'm very, very sorry, Ukraine, that you must suffer from our stupidity. I wish you all the best, and kick Russian asses off your soil! 🙏
1
2
40
@apachekafka
Apache Kafka
4 days
We are happy to announce the Apache Kafka 4.1.0 release! Most exciting, Queues for Kafka enters preview status, and the new “streams” rebalance protocol is available as early access. Thanks a lot to 167 contributors and our RM @MickaelMaison! https://t.co/DUiDrpIpPN
kafka.apache.org
Apache Kafka: A Distributed Streaming Platform.
1
35
91
@realpython
Real Python
11 days
🐍📺 Learn how to speed up your programs using concurrency and the asyncio module in the standard library. See step-by-step how to leverage concurrency and parallelism in your own apps. #python
realpython.com
Learn how to speed up your Python 3 programs using concurrency and the asyncio module in the standard library. See step-by-step how to leverage concurrency and parallelism in your own programs, all...
1
4
20
@swapnakpanda
Swapna Kumar Panda
11 days
Book Details - Mathematics for Computer Science - Eric Lehman, F Thomson Leighton, Albert R Meyer - From MIT Press - 2018 Edition - 1048 pages 🔗
1
22
101
@_avichawla
Avi Chawla
12 days
If the temperature is high, the probabilities start to look like a uniform distribution. This means the sampling process may select any token. This makes the generation process heavily stochastic, like we saw earlier. Check this👇
1
1
17
@_avichawla
Avi Chawla
12 days
If the temperature is low, the probabilities look like a max value instead of a “soft-max” value. This means the sampling process will almost certainly choose the token with the highest probability. This makes the generation process (nearly) greedy. Check this👇
1
2
22
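A minimal sketch of the temperature effect described in the two posts above, assuming a plain softmax over a hypothetical list of token scores (logits). The function name, the example logits, and the temperature values are illustrative assumptions and do not come from the posts.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities after dividing them by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical scores for three candidate tokens (illustration only).
logits = [2.0, 1.0, 0.1]

# High temperature: the probabilities move toward a uniform distribution.
hot = softmax_with_temperature(logits, 100.0)

# Low temperature: almost all probability mass lands on the top-scoring token.
cold = softmax_with_temperature(logits, 0.05)

print("high temperature:", [round(p, 3) for p in hot])
print("low temperature: ", [round(p, 3) for p in cold])
```

Sampling from `hot` could pick any of the three tokens with similar likelihood, while sampling from `cold` behaves almost like a greedy pick of the first token.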
@unitycatalog_io
Unity Catalog
14 days
In our latest article, learn how you can use the Unity Catalog REST API to store and manage your data and AI assets. There are two ways to use Unity Catalog: 🔹 the built-in Command Line Interface 🔹 the REST API The Unity Catalog REST API is a great alternative to using the
0
3
5
@jaceklaskowski
Jacek Laskowski
21 days
📢 New #meetup, folks 🔥 Learn #Python 🐍 through the functools module (and #OpenAI's Python API) ➡️ https://t.co/abIWQXPzkJ (Only in Polish 🇵🇱, yet the announcement is in English 🇬🇧 🤷‍♂️)
0
0
0
@jaceklaskowski
Jacek Laskowski
1 month
Give the #DeclarativePipelines framework (#ApacheSpark 4.1.0-SNAPSHOT) a serious try with the demo. Enjoy! ❤️ ➡️ https://t.co/nlgsc7l75y It's not released yet and is under heavy development.
0
1
3
@realpython
Real Python
1 month
🐍📰 Ruff: A Modern Python Linter for Error-Free and Maintainable Code — https://t.co/y2sZCtlDLM #python
1
3
26
@akshay_pachaar
Akshay 🚀
1 month
Stop using pip! Here’s a 10× faster alternative:
@akshay_pachaar
Akshay 🚀
1 month
uv in Python, clearly explained (with code):
4
26
144
@AndrewYNg
Andrew Ng
1 year
I'm teaching a new course! AI Python for Beginners is a series of four short courses that teach anyone to code, regardless of current technical skill. We are offering these courses free for a limited time. Generative AI is transforming coding. This course teaches coding in a way
464
2K
8K
@jaceklaskowski
Jacek Laskowski
1 month
BatchTableWrite Flow Execution in #ApacheSpark #DeclarativePipelines framework ➡️ https://t.co/7UEDsVCi2E ⚠️ Spark Declarative Pipelines are still in the works in Spark 4.1.0-SNAPSHOT
0
0
1
@jaceklaskowski
Jacek Laskowski
2 months
#TIL while working with #DeclarativePipelines in #ApacheSpark 4.1.0-SNAPSHOT ... #Scala 2.13 comes with Option.when for conditional evaluation 👏 (Great, but I'm mostly with #Python these days 🤷‍♂️) ➡️ https://t.co/7rTyBthYm0
0
0
1
@jaceklaskowski
Jacek Laskowski
2 months
My excitement is doubled 2️⃣ while working with @windsurf_ai and taking notes about #DeclarativePipelines in the not-yet-released #ApacheSpark 4.1.0-SNAPSHOT. There are so many repetitions, and Windsurf made it so pleasant 👏👏👏 ➡️ https://t.co/adceA0bU8M
0
0
3
@jaceklaskowski
Jacek Laskowski
2 months
Back to X after many many months away, and I found this seemingly great writeup about differentiables. ➡️ https://t.co/hM7P5Ey1pE And that's right after I'd found yet another writeup about the very same topic! ➡️ https://t.co/u4ACA7LGxT Enjoy! ❤️
@akshay_pachaar
Akshay 🚀
2 months
A fantastic, visually stunning intro to deep learning. This free book covers: - Maths for ML - Datasets & Losses - Linear Models - Fully Connected Models - CNNs for Images & Beyond - Transformers & LLMs - Graph Models Highly recommended!
0
0
4
@byte_array
Vinoth Chandar
9 months
🎉 We’re proud to announce the @apachehudi 1.0 release! This release has been the result of a massive community effort, with tons of new code (re)written. I want to thank all 60+ contributors who worked on ~180K lines of change. 🗒️ Release blog: https://t.co/H5VvCxsovH Hudi
0
5
41
@KyleJWeller
Kyle Weller
9 months
@byte_array One thing I did not expect when doing this research was coming to the unfortunate realization that you might need more than one catalog to cover all the bases for a complete data platform solution...
0
2
2
@byte_array
Vinoth Chandar
9 months
Data Catalogs are getting much-needed attention across #datalakehouse and #datawarehouse as the plot thickens, as they say. We are sharing some of the deep internal research we did to support our multi-catalog sync feature in the Onehouse product in this blog from @KyleJWeller.
1
2
10
@andy_pavlo
Andy Pavlo (@andypavlo.bsky.social)
10 months
Correction: @GlareDB is moving away from DataFusion! @LegitSeanSmith's excellent talk discusses problems with building a DBMS using off-the-shelf parts. Like @DuckDB, the GlareDB rewrite borrows ideas from @tum_db's HyPer system, but it's written in Rust:
@CMUDB
CMU Database Group
10 months
Today's Database Building Blocks Seminar Speaker: @LegitSeanSmith (Founder) will present the journey of rewriting @GlareDB to use @ApacheDataFusio. Zoom talk open to public at 4:30pm ET. YouTube video available after:
1
7
88