Graphcore Research
@GCResearchTeam
Followers
370
Following
45
Media
23
Statuses
171
Our mission is to follow and contribute to the advancement of AI research, aiming to characterise the computational requirements of machine intelligence.
United Kingdom
Joined January 2024
Our picks for October’s Papers of the Month are here. Out of 49 shortlisted papers, we spotlight 4 that stand out for their clever ideas on making #LLMs faster, smarter, and more efficient! 📊 First up, Grouped Lattice Vector Quantisation introduces a novel technique for a
1
4
5
✨ Check out our summaries and follow our blog ✨
graphcore-research.github.io
October was packed with insights into making language models faster and smarter. We reviewed four of our favorite papers for you in detail:
0
0
0
LLM using too many reasoning tokens? 😕 Generation slow? 🐌 Or simply too many steps before EOS? 🪜🪜🪜 Douglas Orr (@douglasahorr), our beloved research scientist, has got you covered! He will tell you the remedies to all of the above in the shortest time possible. Registration
3
5
9
September's Papers of the Month is here, and this month is all about LLMs! 🧠 Out of all papers released this month, our editor @robhu92 has curated: 📊 "FlowRL: Matching Reward Distributions for LLM Reasoning“ (review by @samot_gc): A clever usage of #GFlowNets to align an #RL
1
3
8
Finally, Graph-R1 is another addition to the stack of agentic RAG approaches, but this time, using knowledge hypergraphs! Summary:
graphcore-research.github.io
August, even with its heat waves and holidays, left no shortage of exciting research. Our top papers for this month are the following: ADMIRE-BayesOpt that investigates how to weight different data...
0
0
0
Next, Guiding Diffusion Models with RL for Stable Molecule Generation introduces reinforcement learning with physical feedback to accomplish exactly as its name suggests! Summary:
graphcore-research.github.io
August, even with its heat waves and holidays, left no shortage of exciting research. Our top papers for this month are the following: ADMIRE-BayesOpt that investigates how to weight different data...
1
0
0
First up, ADMIRE-BayesOpt addresses the question of finding the optimal mixture of multiple datasets. And the answer, sequential iterative search using Multi-Fidelity Bayesian Optimization! Summary:
graphcore-research.github.io
August, even with its heat waves and holidays, left no shortage of exciting research. Our top papers for this month are the following: ADMIRE-BayesOpt that investigates how to weight different data...
1
0
0
Summer may be over, but Papers of the Month certainly isn’t! For August’s edition, we covered the following papers: ➡️ ADMIRE-BayesOpt ➡️ Guiding Diffusion Models with RL for Stable Molecule Generation ➡️ Graph-R1 🧵
1
1
3
Finally, DataRater addresses dataset quality: a ‘rater’ is meta-learned to curate training data without manual filtering. Summary:
graphcore-research.github.io
As July brought tennis at Wimbledon, so too did the ML world serve up a volley of research. This month, we took an eagle-eyed approach—or, perhaps, Hawk Eyed approach—to three papers.
0
0
0
Mixture of Recursions brings a twist to token-level computation: the model learns to recurse adaptively, allocating compute per token dynamically. Summary:
graphcore-research.github.io
As July brought tennis at Wimbledon, so too did the ML world serve up a volley of research. This month, we took an eagle-eyed approach—or, perhaps, Hawk Eyed approach—to three papers.
1
0
0
First up, Subliminal Learning explores a question in model distillation: “Can we control so that a student learns desirable, but avoids undesirable, traits?” Summary:
graphcore-research.github.io
As July brought tennis at Wimbledon, so too did the ML world serve up a volley of research. This month, we took an eagle-eyed approach—or, perhaps, Hawk Eyed approach—to three papers.
1
0
0
July's Papers of the Month are here! 🧠 Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data 💽 Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation 📊 DataRater: Meta-Learned Dataset Curation 🧵⬇️
1
1
6
That's all for this month! To keep up with our monthly summaries, blog posts, and new research, follow us on @GCResearchTeam or subscribe here: https://t.co/YgQxvhvgHT
graphcore-research.github.io
The official Graphcore Research blog.
0
0
0
For additional information, check out these excellent threads by the original authors ⬇️ @aaron_defazio
https://t.co/eDTPNzuO3X
@shizhediao
https://t.co/v8GpDQjdOH
@peter9863
https://t.co/lSsU0aOiku
Introducing Seaweed APT2, a real-time, interactive, streaming video generation model. https://t.co/dBT7uQoFxz Adversarial training for autoregressive modeling! Streaming 1 minute videos, 1 diffusion step, 24fps real-time on 1xh100, with interactive controls!
1
0
0
Finally, we look at AAPT, a fresh approach from the ByteDance Seed team that turns pre-trained offline diffusion models into real-time video generators via adversarial post-training. https://t.co/zeurS9U1ro
graphcore-research.github.io
This June not only brought us very hot and sunny days (at least here in the UK), but also an excellent selection of new and exciting ML research! Out of the many good candidates, this month we...
1
0
0
Next, in ProRL, NVIDIA researchers dive into the evolving topic of large language model reasoning, showing how prolonged reinforcement learning can indeed introduce novel reasoning abilities. https://t.co/YOlLuVyj8B
graphcore-research.github.io
This June not only brought us very hot and sunny days (at least here in the UK), but also an excellent selection of new and exciting ML research! Out of the many good candidates, this month we...
1
0
0
Firstly, a researcher from FAIR explores the puzzling phenomenon of increasing gradient magnitudes during training, offering an elegant mathematical explanation and a simple remedy. https://t.co/zSryHkH07B
graphcore-research.github.io
This June not only brought us very hot and sunny days (at least here in the UK), but also an excellent selection of new and exciting ML research! Out of the many good candidates, this month we...
1
0
0
It's time for June's Papers of the Month! This time, we cover: ➡️Why Gradients Rapidly Increase Near the End of Training ➡️ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries ➡️Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation 🧵
1
3
11