Ben Hamner

@benhamner

Followers
32K
Following
19K
Media
735
Statuses
5K

Sumble co-founder and CTO. Learning high-quality, structured data about the world. Formerly @kaggle

SF
Joined March 2009
@benhamner
Ben Hamner
4 years
Wordle xxx 1/6 🟩🟩🟩🟩🟩 Get Wordle right on your first guess using the daily ⬛🟨🟩 tweet distribution
kaggle.com
Explore and run machine learning code with Kaggle Notebooks | Using data from Wordle Tweets
33
217
1K
@antgoldbloom
Anthony Goldbloom
2 months
Excited to share more publicly what @benhamner and I have been working on: https://t.co/QXvodhsM5e We are building a knowledge graph for GTM data & have early customers such as @elastic, @figma and @Snowflake
techcrunch.com
The founders of machine learning community Kaggle are trying to upend the crowded sales prospecting market with AI. And they're making headway.
3
7
23
@benhamner
Ben Hamner
2 months
english in 2030 : python :: python in 2020 : assembly
0
0
2
@benhamner
Ben Hamner
3 months
WOW - cold inbound hiring processes are the worst I've ever seen: - Huge volume of applications (probably triggered automatically) - Large volume of fake resumes - Extensive use of AI assistants in phone screens
2
0
2
@benhamner
Ben Hamner
5 months
You could release a crazy stat on the number of times (and number of employees) ClosedAI companies have downloaded open models from HuggingFace
@ClementDelangue
clem 🤗
5 months
@KamStaszewski All closed-source frontier labs use tons of open-source all over the stack, starting from python, @PyTorch, @huggingface all the way down to RoPE, GQA, flash-attention, or any tiny improvements released by open-source players. The whole transformers architecture (the T in gpT)
1
0
3
@benhamner
Ben Hamner
5 months
Assumptions: 8 billion people * 20k spoken/written words per day * 1.3 tokens per word
0
0
1
@benhamner
Ben Hamner
5 months
This is around 5x the ~200 trillion natural language tokens humans generate every month
@OfficialLoganK
Logan Kilpatrick
5 months
Google is processing 980 trillion+ monthly tokens across our products and APIs (up from 480T in May) 🤯 No slowdown in sight, intelligence is everywhere.
2
0
3
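The assumptions in the two posts above can be multiplied out directly. The sketch below is only that arithmetic: the three inputs are the post's own assumptions, not measured values, and the constant names are made up for illustration. Because the word count is stated per day, the product is a per-day figure.

```python
# Inputs copied from the posts above; all three human-side values are
# stated assumptions, not measurements.
PEOPLE = 8_000_000_000             # 8 billion people
WORDS_PER_PERSON_PER_DAY = 20_000  # spoken/written words per day
TOKENS_PER_WORD = 1.3              # tokens per word
GOOGLE_MONTHLY_TOKENS = 980e12     # 980 trillion, from the quoted post

# Tokens generated by humans per day under those assumptions.
human_tokens_per_day = PEOPLE * WORDS_PER_PERSON_PER_DAY * TOKENS_PER_WORD

# How many times larger the quoted 980 trillion figure is.
ratio = GOOGLE_MONTHLY_TOKENS / human_tokens_per_day

print(f"{human_tokens_per_day / 1e12:.0f} trillion tokens per day")
print(f"ratio: {ratio:.1f}x")
```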
@antgoldbloom
Anthony Goldbloom
5 months
Kaggle launched an LLM eval product https://t.co/lJ8uSI2HmE This has the potential to solve the biggest challenge in the LLM ecosystem: strong and diverse evals
kaggle.com
Use and download benchmarks for your machine learning projects.
2
53
265
@benhamner
Ben Hamner
10 months
Getting a bunch of 429 rate limit errors with a quota exceeded message from @OpenAI's API, in spite of being well under quota and published rate limits
2
0
5
@benhamner
Ben Hamner
11 months
NLC: natural language cron
@karinanguyen_
Karina Nguyen
11 months
We're excited to introduce Tasks! For the first time, ChatGPT can manage tasks asynchronously on your behalf—whether it's a one-time request or an ongoing routine. Here are my favorite use cases: 1/ ChatGPT checks stock price every morning!
0
0
2
@benhamner
Ben Hamner
1 year
Just had the most obnoxious interaction I’ve had in a long time Jogging in quiet Palo Alto neighborhood. Middle-aged man and woman block away walking towards me Half a block away they start screaming “RUN ON THE FUCKING STREET SIDEWALKS ARE FOR WALKING NOT RUNNING. GET OFF THE
8
0
15
@benhamner
Ben Hamner
1 year
Marimo's a delight to use for both exploratory Python notebooks and rapidly prototyping reactive data applications with a rich UI. It's replaced Jupyter notebooks for me. Excited to see what Akshay and team build from here!
@akshaykagrawal
Akshay Agrawal
1 year
My co-founder @themylesfiles and I have started Marimo Inc. to keep building the @marimo_io notebook and other Python data tools. We've raised a $5M seed round led by @antgoldbloom and @shyammani_ at @aixventureshq. Excited for the journey ahead! https://t.co/3TEJyKcBHU
1
0
12
@antgoldbloom
Anthony Goldbloom
1 year
Looking at job posts over the last year, it looks like Amazon is making the broadest AI investment. 228 teams across the company posted jobs with GenAI projects over the past year, considerably more than any other company. https://t.co/z2mhUikaFH Most interest is the breadth of
1
16
47
@benhamner
Ben Hamner
1 year
Fast hack to improve the factual accuracy of your LLM calls? Multi-LLM and consensus (e.g. majority voting) across them among the leading foundation models
2
0
5
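A minimal sketch of the consensus idea in the post above, assuming each model is wrapped as a plain function from prompt to answer string. The function name `majority_answer` and the stub models are hypothetical; real use would replace the stubs with calls to whichever providers you use, and free-form answers would need better normalization than lowercasing.

```python
from collections import Counter
from typing import Callable, Optional, Sequence

# Hypothetical type: any callable that maps a prompt to an answer string.
Model = Callable[[str], str]


def majority_answer(prompt: str, models: Sequence[Model]) -> Optional[str]:
    """Return the answer given by a strict majority of models, else None."""
    # Light normalization so "Paris" and "paris " count as the same vote.
    answers = [model(prompt).strip().lower() for model in models]
    if not answers:
        return None
    winner, votes = Counter(answers).most_common(1)[0]
    needed = len(answers) // 2 + 1  # strict majority
    return winner if votes >= needed else None


# Stub models standing in for real LLM calls (assumption for illustration).
stub_models = [
    lambda prompt: "Paris",
    lambda prompt: "paris ",
    lambda prompt: "Lyon",
]

print(majority_answer("Capital of France?", stub_models))
```

Returning `None` when no answer reaches a majority lets the caller fall back to a retry or a human check instead of trusting a split vote.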
@antgoldbloom
Anthony Goldbloom
1 year
The overlooked GenAI use case: cleaning, processing, and analyzing data. https://t.co/klQjXiyODl Job post data tell us what companies plan to do with GenAI. The most common use case is data analytics projects. Examples: - AstraZeneca: using LLMs on freeform documents to
23
116
649
@benhamner
Ben Hamner
1 year
10 data quality challenges in relational tables: 1. Duplicates 2. 3. Duplicates 4. Cḧarαctěr Énçødïng 5. 1/1/24 (ambiguous dates) 6. 7am (ambiguous times w/o timezones) 7. John A. Smith, john smith (fuzzy joining) 8. asdfasdfa junk record 9. Company: Apple Url:
0
0
8
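Item 7 in the list above (fuzzy joining) can be illustrated with a small sketch. It assumes that lowercasing, stripping punctuation, and dropping single-letter tokens such as middle initials is enough to line up the two spellings; `normalize_name` and the sample records beyond the two names in the post are hypothetical, and real data usually needs more than this.

```python
import re


def normalize_name(name: str) -> str:
    """Build a crude join key: lowercase, no punctuation, no initials."""
    lowered = name.lower()
    no_punct = re.sub(r"[^\w\s]", " ", lowered)
    tokens = no_punct.split()
    # Dropping single-letter tokens removes middle initials, but it would
    # also drop a genuine one-letter name, so treat this as a heuristic.
    return " ".join(tok for tok in tokens if len(tok) > 1)


left = ["John A. Smith", "Ada Lovelace"]
right = ["john smith", "ADA  LOVELACE", "Grace Hopper"]

# Index the right-hand table by normalized key, then look up each left row.
right_by_key = {normalize_name(n): n for n in right}
matches = {n: right_by_key.get(normalize_name(n)) for n in left}

print(matches)
```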
@benhamner
Ben Hamner
1 year
Close to 100% of machine code is now compiler-written. Higher levels of abstraction are how we move forward, faster
0
0
8
@benhamner
Ben Hamner
1 year
I was born into the first computer-native generation My 2yo son’s in the first AI-native generation
0
0
2
@benhamner
Ben Hamner
1 year
Admin consoles have their own circle of UX hell. If you can read this Google GSuite admin setting and tell me the difference between "Trusted" and "Limited", your English understanding is much better than mine
3
0
6
@benhamner
Ben Hamner
1 year
SQL is the one use for the legacy CAPS LOCK key
0
0
7
@benhamner
Ben Hamner
1 year
For all the marketers that want to slap AI on a product name: as soon as I see "AI" in a product, I assume it's not going to work as advertised
0
1
7