Harry Mellor @hmellor_ X Profile

Harry Mellor

@hmellor_

Followers

167

Following

147

Media

8

Statuses

44

ML Engineer @huggingface maintaining @vllm_project, prev @graphcoreai, @uniofoxford

Joined September 2022

Don't wanna be here? Send us removal request.

Harry Mellor

@hmellor_

8 days

RT @dylan_ebert_: Hugging Face Explained in 45 seconds.

0

29

0

Harry Mellor

@hmellor_

9 days

RT @ClementDelangue: And just like that, @OpenAI gpt-oss is now the number one trending model on @huggingface, out of almost 2M open models….

0

97

0

Harry Mellor

@hmellor_

9 days

RT @dylan_ebert_: OpenAI just released GPT-OSS: An Open Source Language Model on Hugging Face. Open source meaning:.💸 Free.🔒 Private.🔧 Cust….

0

39

0

Harry Mellor

@hmellor_

9 days

RT @ngxson: Welcome back, @OpenAI !. Day-0 support llama.cpp with MXFP4, let it rock 🚀🤘

0

6

0

Harry Mellor

@hmellor_

9 days

RT @reach_vb: For all you lazy lads! OpenAI's latest model in a short video by Dylan! 💥

0

7

0

Harry Mellor

@hmellor_

9 days

Head over to to try the official demo of @OpenAI's gpt-oss models powered by @huggingface!

0

1

Harry Mellor

@hmellor_

9 days

RT @ClementDelangue: When @sama told me at the AI summit in Paris that they were serious about releasing open-source models & asked what wo….

0

259

0

Harry Mellor

@hmellor_

17 days

RT @ariG23498: Did you know you can now run your own AI Job on the Hugging Face infrastructure?. Introducing `hf jobs`, the latest addition….

0

4

0

Harry Mellor

@hmellor_

20 days

RT @LysandreJik: The new transformers release comes w/ a surprise: kernels support ⚡️. It integrates deeply with precompiled kernels on the….

0

17

0

Harry Mellor

@hmellor_

20 days

RT @mervenoyann: We have recently merged fast processors for many models, the speed-up in Qwen-VL series is 🔥. you get speed-up up to 3x on….

0

25

0

Harry Mellor

@hmellor_

23 days

RT @vllm_project: The @huggingface Transformers ↔️ @vllm_project integration just leveled up: Vision-Language Models are now supported out….

0

47

0

Harry Mellor

@hmellor_

23 days

RT @casper_hansen_: vLLM is finally addressing a long-standing problem: startup times. 35s -> 2s for CUDA graph capture is a great reductio….

0

42

0

Harry Mellor

@hmellor_

28 days

RT @ErikKaum: We just released native support for @sgl_project and @vllm_project in Inference Endpoints 🔥. Inference Endpoints is becoming….

0

7

0

Harry Mellor

@hmellor_

1 month

RT @LysandreJik: BOOOM! transformers now has a baked-in http server w/ OpenAI spec compatible API. Launch it with `transformers serve` and….

0

30

0

Harry Mellor

@hmellor_

3 months

📖 has had a makeover!. Over the past week I wrote a script to port all of @vllm_project's docs from Sphinx ➡️ MkDocs. The DX and UX are so much nicer! 😌

3

58

Harry Mellor

@hmellor_

3 months

As of the 🤗 Transformers backend for @vllm_project supports interleaved sliding window attention architectures like Gemma 2 and Cohere 2! 🚀.

github.com

Tested with Gemma 2 by asking it to summarise >4k tokens of the Wikipedia page on frogs. vLLM reference: Generated text: '\n\nFrogs are a diverse group of amphibians with a wide rang...

1

0

7

Harry Mellor

@hmellor_

3 months

Looks like the crawlers have found ChatGPT is about to become a lot more knowledgeable about @vllm_project 😅

0

3

Harry Mellor

@hmellor_

3 months

I never thought I'd be saying this but. the @vllm_project forum ( now has light mode!

0

1

Harry Mellor

@hmellor_

4 months

RT @OpenAIDevs: Announcing the first Codex open source fund grant recipients:. ⬩vLLM - inference serving engine @vllm_project.⬩OWASP Nettac….

0

147

0

Harry Mellor

@hmellor_

4 months

RT @vanstriendaniel: Need blazing-fast classifier inference with minimal code?. ModernBERT now runs on @vllm_project — fast enough to proce….

danielvanstrien.xyz

Modern Inference for modern classifier models. Using vLLM to scale inference for classifiers to clean and curate datasets

0

60

0