GCResearchTeam Profile Banner
Graphcore Research Profile
Graphcore Research

@GCResearchTeam

Followers
370
Following
45
Media
23
Statuses
171

Our mission is to follow and contribute to the advancement of AI research, aiming to characterise the computational requirements of machine intelligence.

United Kingdom
Joined January 2024
Don't wanna be here? Send us removal request.
@GCResearchTeam
Graphcore Research
6 days
Our picks for October’s Papers of the Month are here. Out of 49 shortlisted papers, we spotlight 4 that stand out for their clever ideas on making #LLMs faster, smarter, and more efficient! 📊 First up, Grouped Lattice Vector Quantisation introduces a novel technique for a
1
4
5
@GCResearchTeam
Graphcore Research
27 days
LLM using too many reasoning tokens? 😕 Generation slow? 🐌 Or simply too many steps before EOS? 🪜🪜🪜 Douglas Orr (@douglasahorr), our beloved research scientist, has got you covered! He will tell you the remedies to all of the above in the shortest time possible. Registration
3
5
9
@GCResearchTeam
Graphcore Research
1 month
September's Papers of the Month is here, and this month is all about LLMs! 🧠 Out of all papers released this month, our editor @robhu92 has curated: 📊 "FlowRL: Matching Reward Distributions for LLM Reasoning“ (review by @samot_gc): A clever usage of #GFlowNets to align an #RL
1
3
8
@GCResearchTeam
Graphcore Research
2 months
Next, Guiding Diffusion Models with RL for Stable Molecule Generation introduces reinforcement learning with physical feedback to accomplish exactly as its name suggests! Summary:
Tweet card summary image
graphcore-research.github.io
August, even with its heat waves and holidays, left no shortage of exciting research. Our top papers for this month are the following: ADMIRE-BayesOpt that investigates how to weight different data...
1
0
0
@GCResearchTeam
Graphcore Research
2 months
First up, ADMIRE-BayesOpt addresses the question of finding the optimal mixture of multiple datasets. And the answer, sequential iterative search using Multi-Fidelity Bayesian Optimization! Summary:
Tweet card summary image
graphcore-research.github.io
August, even with its heat waves and holidays, left no shortage of exciting research. Our top papers for this month are the following: ADMIRE-BayesOpt that investigates how to weight different data...
1
0
0
@GCResearchTeam
Graphcore Research
2 months
Summer may be over, but Papers of the Month certainly isn’t! For August’s edition, we covered the following papers: ➡️ ADMIRE-BayesOpt ➡️ Guiding Diffusion Models with RL for Stable Molecule Generation ➡️ Graph-R1 🧵
1
1
3
@GCResearchTeam
Graphcore Research
3 months
Finally, DataRater addresses dataset quality: a ‘rater’ is meta-learned to curate training data without manual filtering. Summary:
Tweet card summary image
graphcore-research.github.io
As July brought tennis at Wimbledon, so too did the ML world serve up a volley of research. This month, we took an eagle-eyed approach—or, perhaps, Hawk Eyed approach—to three papers.
0
0
0
@GCResearchTeam
Graphcore Research
3 months
Mixture of Recursions brings a twist to token-level computation: the model learns to recurse adaptively, allocating compute per token dynamically. Summary:
Tweet card summary image
graphcore-research.github.io
As July brought tennis at Wimbledon, so too did the ML world serve up a volley of research. This month, we took an eagle-eyed approach—or, perhaps, Hawk Eyed approach—to three papers.
1
0
0
@GCResearchTeam
Graphcore Research
3 months
First up, Subliminal Learning explores a question in model distillation: “Can we control so that a student learns desirable, but avoids undesirable, traits?” Summary:
Tweet card summary image
graphcore-research.github.io
As July brought tennis at Wimbledon, so too did the ML world serve up a volley of research. This month, we took an eagle-eyed approach—or, perhaps, Hawk Eyed approach—to three papers.
1
0
0
@GCResearchTeam
Graphcore Research
3 months
July's Papers of the Month are here! 🧠 Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data 💽 Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation 📊 DataRater: Meta-Learned Dataset Curation 🧵⬇️
1
1
6
@GCResearchTeam
Graphcore Research
4 months
That's all for this month! To keep up with our monthly summaries, blog posts, and new research, follow us on @GCResearchTeam or subscribe here: https://t.co/YgQxvhvgHT
Tweet card summary image
graphcore-research.github.io
The official Graphcore Research blog.
0
0
0
@GCResearchTeam
Graphcore Research
4 months
For additional information, check out these excellent threads by the original authors ⬇️ @aaron_defazio https://t.co/eDTPNzuO3X @shizhediao https://t.co/v8GpDQjdOH @peter9863 https://t.co/lSsU0aOiku
@peter9863
Peter Lin
5 months
Introducing Seaweed APT2, a real-time, interactive, streaming video generation model. https://t.co/dBT7uQoFxz Adversarial training for autoregressive modeling! Streaming 1 minute videos, 1 diffusion step, 24fps real-time on 1xh100, with interactive controls!
1
0
0
@GCResearchTeam
Graphcore Research
4 months
Finally, we look at AAPT, a fresh approach from the ByteDance Seed team that turns pre-trained offline diffusion models into real-time video generators via adversarial post-training. https://t.co/zeurS9U1ro
Tweet card summary image
graphcore-research.github.io
This June not only brought us very hot and sunny days (at least here in the UK), but also an excellent selection of new and exciting ML research! Out of the many good candidates, this month we...
1
0
0
@GCResearchTeam
Graphcore Research
4 months
Next, in ProRL, NVIDIA researchers dive into the evolving topic of large language model reasoning, showing how prolonged reinforcement learning can indeed introduce novel reasoning abilities. https://t.co/YOlLuVyj8B
Tweet card summary image
graphcore-research.github.io
This June not only brought us very hot and sunny days (at least here in the UK), but also an excellent selection of new and exciting ML research! Out of the many good candidates, this month we...
1
0
0
@GCResearchTeam
Graphcore Research
4 months
Firstly, a researcher from FAIR explores the puzzling phenomenon of increasing gradient magnitudes during training, offering an elegant mathematical explanation and a simple remedy. https://t.co/zSryHkH07B
Tweet card summary image
graphcore-research.github.io
This June not only brought us very hot and sunny days (at least here in the UK), but also an excellent selection of new and exciting ML research! Out of the many good candidates, this month we...
1
0
0
@GCResearchTeam
Graphcore Research
4 months
It's time for June's Papers of the Month! This time, we cover: ➡️Why Gradients Rapidly Increase Near the End of Training ➡️ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries ➡️Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation 🧵
1
3
11