
Jacek Laskowski
@jaceklaskowski
Followers: 7K • Following: 15K • Media: 4K • Statuses: 26K
Freelance Data(bricks) Engineer • #ApacheSpark #DeltaLake #UnityCatalog #MLflow #Databricks #DSPy | Java Champion | @theASF | #DatabricksMVP
Warsaw, Poland
Joined May 2009
It turns out we as a society have learnt nothing since World War II if we allowed people like Vladimir Putin to start a war against #Ukraine! I'm very, very sorry, Ukraine, that you must suffer from our stupidity. Wish you all the best and kick Russian asses off your soil! 🙏
We are happy to announce the Apache Kafka 4.1.0 release! Most exciting, Queues for Kafka enters preview status, and the new “streams” rebalance protocol is available as early access. Thanks a lot to 167 contributors and our RM @MickaelMaison! https://t.co/DUiDrpIpPN
kafka.apache.org
Apache Kafka: A Distributed Streaming Platform.
🐍📺 Learn how to speed up your programs using concurrency and the asyncio module in the standard library. See step-by-step how to leverage concurrency and parallelism in your own apps. #python
realpython.com
Learn how to speed up your Python 3 programs using concurrency and the asyncio module in the standard library. See step-by-step how to leverage concurrency and parallelism in your own programs, all...
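A minimal sketch of the kind of speed-up the article above teases, assuming I/O-bound work that is simulated here with `asyncio.sleep`. The `fetch` coroutine, its names, and its delays are hypothetical stand-ins and do not come from the article. Note that asyncio gives concurrency on a single thread rather than parallelism across cores.

```python
import asyncio
import time


async def fetch(name, delay):
    """Hypothetical I/O-bound task: waits without blocking the event loop."""
    await asyncio.sleep(delay)
    return name


async def main():
    start = time.perf_counter()
    # gather() schedules all three coroutines concurrently and
    # returns their results in the order they were passed in.
    results = await asyncio.gather(
        fetch("a", 0.2),
        fetch("b", 0.2),
        fetch("c", 0.2),
    )
    elapsed = time.perf_counter() - start
    return results, elapsed


results, elapsed = asyncio.run(main())
print(results, f"{elapsed:.2f}s")
```

Run one after another, the three waits would add up to roughly 0.6 seconds; because they overlap, the total should stay close to the longest single wait.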
Book Details - Mathematics for Computer Science - Eric Lehman, F Thomson Leighton, Albert R Meyer - From MIT Press - 2018 Edition - 1048 pages 🔗
If the temperature is high, the probabilities start to look like a uniform distribution: This means the sampling process may select any token. This makes the generation process random and heavily stochastic, like we saw earlier. Check this👇
If the temperature is low, the probabilities look like a max value instead of a “soft-max” value. This means the sampling process will almost certainly choose the token with the highest probability. This makes the generation process (nearly) greedy. Check this👇
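The two temperature cases above can be illustrated with a minimal sketch of temperature-scaled softmax. The logits and the function name `softmax_with_temperature` are made up for illustration and do not come from the thread; dividing the logits by the temperature before applying softmax is the usual formulation and is assumed here.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities, scaled by a temperature > 0."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.1]  # hypothetical scores for three tokens

hot = softmax_with_temperature(logits, 100.0)   # high temperature
cold = softmax_with_temperature(logits, 0.01)   # low temperature

print("high T:", [round(p, 3) for p in hot])
print("low T: ", [round(p, 3) for p in cold])
```

With the high temperature the three probabilities come out nearly equal, so sampling may pick any token. With the low temperature almost all of the mass sits on the highest-scoring token, so sampling behaves almost like greedy decoding.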
In our latest article, learn how you can use the Unity Catalog REST API to store and manage your data and AI assets. There are two ways to use Unity Catalog: 🔹 the built-in Command Line Interface 🔹 the REST API The Unity Catalog REST API is a great alternative to using the
📢 New #meetup, folks 🔥 Learn #Python 🐍 through functools module (and #OpenAI's Python API) ➡️ https://t.co/abIWQXPzkJ (Only in Polish 🇵🇱 yet the announcement in English 🇬🇧 🤷♂️)
Give #DeclarativePipelines framework (#ApacheSpark 4.1.0-SNAPSHOT) a serious try with the demo. Enjoy! ❤️ ➡️ https://t.co/nlgsc7l75y It's not released yet, and under heavy development.
🐍📰 Ruff: A Modern Python Linter for Error-Free and Maintainable Code — https://t.co/y2sZCtlDLM
#python
I'm teaching a new course! AI Python for Beginners is a series of four short courses that teach anyone to code, regardless of current technical skill. We are offering these courses free for a limited time. Generative AI is transforming coding. This course teaches coding in a way
BatchTableWrite Flow Execution in #ApacheSpark #DeclarativePipelines framework ➡️ https://t.co/7UEDsVCi2E ⚠️ Spark Declarative Pipelines are still in the works in Spark 4.1.0-SNAPSHOT
#TIL while with #DeclarativePipelines in #ApacheSpark 4.1.0-SNAPSHOT ... #Scala 2.13 comes with Option.when for conditional evaluation 👏 (Great, but I'm mostly with #Python these days 🤷♂️) ➡️ https://t.co/7rTyBthYm0
My excitement is doubled 2️⃣ while with @windsurf_ai and taking notes about #DeclarativePipelines in the not-yet-released #ApacheSpark 4.1.0-SNAPSHOT. There are so many repetitions, and Windsurf made it so pleasant 👏👏👏 ➡️ https://t.co/adceA0bU8M
Back to X after many many months away, and I found this seemingly great writeup about differentiables. ➡️ https://t.co/hM7P5Ey1pE And that's right after I'd found yet another writeup about the very same topic! ➡️ https://t.co/u4ACA7LGxT Enjoy! ❤️
A fantastic, visually stunning intro to deep learning. This free book covers: - Maths for ML - Datasets & Losses - Linear Models - Fully Connected Models - CNNs for Images & Beyond - Transformers & LLMs - Graph Models Highly recommended!
🎉 We’re proud to announce the @apachehudi 1.0 release! This release has been the result of a massive community effort, with tons of new code (re)written. I want to thank all 60+ contributors who worked on ~180K lines of change. 🗒️ Release blog: https://t.co/H5VvCxsovH Hudi
Here's surprising data showing GitHub Copilot is no longer the IDE of choice among early adopters. All surging IDEs use Sonnet 3.5 primarily AFAIK https://t.co/CtA4lVHaSy
blog.pragmaticengineer.com
Software engineers shared their favorite IDEs with GenAI features on social media. The most-mentioned one by a comfortable margin was Cursor. WindSurf and Zed also seem to be getting traction at the...
@byte_array One thing I did not expect when doing this research was coming to the unfortunate realization that you might need more than one catalog to cover all the bases for a complete data platform solution...
Data Catalogs are getting much-needed attention across #datalakehouse and #datawarehouse as the plot thickens, as they say. We are sharing some of the deep internal research we did to support our multi-catalog sync feature in the Onehouse product in this blog from @KyleJWeller .
Correction: @GlareDB is moving away from DataFusion! @LegitSeanSmith's excellent talk discusses problems with building a DBMS using off-the-shelf parts. Like @DuckDB, the GlareDB rewrite borrows ideas from @tum_db's HyPer system but it's written in Rust:
Today's Database Building Blocks Seminar Speaker: @LegitSeanSmith (Founder) will present the journey of rewriting @GlareDB to use @ApacheDataFusio. Zoom talk open to public at 4:30pm ET. YouTube video available after: