
Hamel Husain
@HamelHusain
Followers: 35K | Following: 14K | Media: 1K | Statuses: 13K
AI Evals Course: https://t.co/S9ctbeyvCJ About Me: https://t.co/P6WyeKkyTa
Joined September 2012
"Open AI is not going to make it" @jeremyphoward called it yesterday 🤯. Really prescient. h/t @VanishingData podcast
50
259
2K
My thesis is that LLMs significantly boost your motivation to work on something. Getting a shitty prototype fast is immensely motivating. Code and time are often not the bottleneck; motivation is the bottleneck.
there is no market for way shittier versions of apps/games/software no matter how fast you and an LLM can "make" it.
69
128
2K
New businesses cannot trust @stripe. They are literally holding all my money hostage and you can't get through to meaningful support. Just an endless wall of red tape. I've never had a refund request. So mad about this.
190
61
1K
This talk by @bclavie is the highest value per second talk I have ever watched on RAG. Chapter summaries and additional links in the next tweet.
15
144
1K
Leetcode is completely dead to me. Absolute waste of time. Even if I wanted FAANG salary, independent consulting easily exceeds that with more autonomy.
Honestly, at this point if you give me a programming interview and don't let me use AI assistance, you won't get a very realistic idea of what I'm actually capable of.
31
46
1K
📢 @GitHub is releasing a large dataset for natural language processing and machine learning. It's a large parallel corpus of code and natural language, with benchmarks on IR tasks. Leaderboard is hosted by @weights_biases:
6
221
699
.@sh_reya's paper confirms what I see in practice. 1) Automated evals don't work (without semi-manual human alignment). 2) Most tools don't provide this alignment. 3) Automated evals add mostly noise. 4) You can only write good evals by looking at data and reacting to failures.
19
83
713
.@johnowhitaker's talk, Napkin Math For Fine Tuning, was so popular that we ended up doing an encore! He answers q's like: - When should I use LoRA? Quantization? GC? - What’s the cheapest option? Most accurate? - What hardware? - What batch size / context length, etc.?
16
129
616
Epic talk on RAG Basics by @jobergum from the LLM conf (links to YT, slides, etc. in next tweet)
6
77
592
Revenge of the Python dev. We are back, baby! As much as the TypeScript dev is pulling ML/data into their stack, the Python devs are experimenting with the opposite arrow. Prediction: large wave of small 1-2 person companies building AI products with stuff like this.
Announcing FastHTML. A new way to create modern interactive web apps. Scales down to a 6-line python file; scales up to complex production apps. Auth, DBs, caching, styling, etc built-in & replaceable and extensible. 1-click deploy to @Railway, @vercel, @huggingface, & more.
10
26
396
This is not the right way to use AI for learning/growing as a programmer! You need to ask the AI: 1) What are other ways we could write this code? What are the pros and cons of those approaches? 2) Explain this code from the perspective of a <your language> programmer. What.
This is why having AI write code for you sucks. Literally just have the training wheels on forever. Great point by @dhh
33
64
576
Just re-discovered this code from @jeremyphoward on how to create schemas from functions w/ 4 lines of code, and no deps except pydantic
8
41
555
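The idea of deriving a schema from a function can be sketched roughly as below. This is not the code referenced in the post (which uses pydantic); it is a dependency-free approximation built on the standard library's `inspect` module, and the names `schema`, `JSON_TYPES`, and `add` are hypothetical, chosen only for illustration.

```python
import inspect
from typing import get_type_hints

# Hypothetical mapping from Python types to JSON Schema type names.
JSON_TYPES = {int: "integer", float: "number", str: "string", bool: "boolean"}

def schema(f):
    """Build a JSON-schema-like description of f's parameters."""
    hints = get_type_hints(f)
    props, required = {}, []
    for name, p in inspect.signature(f).parameters.items():
        # Unknown or missing annotations fall back to "string" (an assumption).
        props[name] = {"type": JSON_TYPES.get(hints.get(name), "string")}
        if p.default is inspect.Parameter.empty:
            required.append(name)
        else:
            props[name]["default"] = p.default
    params = {"type": "object", "properties": props, "required": required}
    return {"name": f.__name__, "description": f.__doc__, "parameters": params}

def add(a: int, b: int = 1) -> int:
    "Add two numbers."
    return a + b

print(schema(add))
```

A pydantic-based version would likely handle nested and generic types far better than this flat type table; the sketch only shows where the parameter names, types, and defaults come from.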
New blog post 📰: Understanding what LLM frameworks are doing is difficult (digging through code, docs, etc). I show how to intercept API calls with @mitmproxy to quickly understand how popular frameworks work and find interesting results. I walk through examples with Guardrails,
19
64
480
The biggest mistake in AI Eng hiring is focusing on dumb leetcode exercises when the most important skill is data literacy (cleaning and analyzing data). That’s why I love this talk from @BEBischof where he describes how he actively tests for this.
8
61
487
One of the least talked about subjects in AI/ML is how to clean, curate and look at data. It's the least sexy but most important topic. Good news, @vanstriendaniel & @davidberenstei have an entire talk on this. Links in next tweet
7
56
470
📢 Personal announcement: I'm over the moon to share that I'll be exploring commercial opportunities around the @fastdotai ecosystem full-time 🥳. I'm excited to work with @jeremyphoward and @wasimlorgat on this, as they both are brilliant people who I love working with 😍.
33
25
446
I WAS SO FRUSTRATED with the way @github renders Jupyter Notebooks, esp. on mobile (you cannot horizontally scroll code, even with nbviewer). So I made a thing: Source code is here
16
66
439
If you are on a Mac, these settings have given me back years of my life (they make keystrokes faster). I learned this from @mussondata
13
25
421
My colleagues and I distilled practical advice re: LLMs into this three-part series. Lots of bangers. One of my favorite excerpts from this part is in the screenshot. Advice from: @eugeneyan, @BEBischof, @charles_irl, @sh_reya, @jxnlco and myself. See:
15
55
425
Did you know you could create infographics with Python in @ProjectJupyter? Neither did I until @PratapVardhan created this amazing chart using Jinja and Pandas DataFrames. Really creative. See his work at @WillingCarol @betatim @jeremyphoward
8
113
404
This is the prompt they are using for Llama-3 function calling (seems to work well even though it's not specifically fine tuned for that):
📊 Delighted to welcome Command-R-Plus, Llama-3, and Gemini-Pro-1.5 into the Berkeley Function Calling Leaderboard. Check out how they stack up across different categories, P95 latency, and costs at Congratulations to @cohere, @AIatMeta, and
7
59
415
Lately, I've been wondering what the future of ML Eng looks like re: LLMs. - What skills to invest in? (SWE, more ML, etc.?) - Is AI Eng going to supplant ML Eng for LLMs? - What should new people do? - etc. I have no idea! But @hugobowne has put together this panel with.
23
43
412
This is Huge! 🥳 Half of the conversations around LLMs are “Will this fit on my GPU[s]?” And this calculator makes it much easier. There are some important nuances for LoRA and I took notes on that here:
Excited to announce a new @huggingface space to help with one of machine learning's biggest questions: How much space does {X} model take in vRAM? And most importantly: when using `device_map="auto"`.
7
67
391
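For a rough sense of the "will this fit on my GPU?" question, here is a back-of-the-envelope sketch. It is not how the calculator mentioned above works. It only estimates the memory needed to hold the weights (parameter count times bytes per parameter) and ignores activations, KV cache, gradients, and optimizer state. The function names and the 1.2x `overhead` factor are assumptions made for illustration.

```python
# Bytes per parameter for common dtypes.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1, "int4": 0.5}

def weights_gib(n_params: float, dtype: str = "float16") -> float:
    """Memory needed just to hold the weights, in GiB."""
    return n_params * BYTES_PER_PARAM[dtype] / 1024**3

def fits(n_params: float, dtype: str, gpu_gib: float, overhead: float = 1.2) -> bool:
    """Crude check: weights plus an assumed overhead margin vs. GPU memory."""
    return weights_gib(n_params, dtype) * overhead <= gpu_gib

# Example: a hypothetical 7B-parameter model on a 24 GiB card.
print(round(weights_gib(7e9, "float16"), 1))  # weights only, in GiB
print(fits(7e9, "float16", 24))
print(fits(7e9, "float32", 24))
```

Training needs considerably more than this estimate, and the LoRA nuances mentioned in the post are not modeled here.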
Leaving this here, and then going to hide behind the bush before the python purists come for me.
Greatest minds of my generation making millions of copies of a constructor init method that starts like this and has approximately 157 parameters: self.model = model; self.tokenizer = tokenizer
11
22
382
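One standard-library way to avoid that kind of hand-written `self.x = x` boilerplate is a dataclass. This is only a sketch of the general idea and not necessarily what the post above was pointing at; the `Trainer` class and its fields are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Trainer:
    # Each annotated field becomes an __init__ parameter and an attribute,
    # so no manual "self.model = model" assignments are needed.
    model: object
    tokenizer: object
    lr: float = 1e-4

t = Trainer(model="my-model", tokenizer="my-tokenizer")
print(t)
```

The generated `__init__` assigns every field, and the generated `__repr__` makes the configuration easy to inspect.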
Recently, @jeremyphoward, @fastdotai and I launched fastpages. This post goes behind the scenes to explain the magic that makes it work: @github Actions. We show you additional examples of how this can give you new superpowers for ML & Data Science:
3
104
373
Only a matter of time before the job title “AI Scientist” emerges. - Better than most AI Engineers at Evals, statistics, Data Analysis, Error Analysis, A/B testing, etc. - Better than most Software Engineers at AI Engineering. (😅 I hope we don’t need another job title.)
Data Scientist (n.): Person who is better at statistics than any software engineer and better at software engineering than any statistician.
36
34
372