Graphcore Research @GCResearchTeam X Profile

Graphcore Research

@GCResearchTeam

Followers

370

Following

45

Media

23

Statuses

171

Our mission is to follow and contribute to the advancement of AI research, aiming to characterise the computational requirements of machine intelligence.

https://t.co/FlJmqwvBWR

United Kingdom

Joined January 2024

Don't wanna be here? Send us removal request.

Graphcore Research

@GCResearchTeam

6 days

Our picks for October’s Papers of the Month are here. Out of 49 shortlisted papers, we spotlight 4 that stand out for their clever ideas on making #LLMs faster, smarter, and more efficient! 📊 First up, Grouped Lattice Vector Quantisation introduces a novel technique for a

1

4

5

Graphcore Research

@GCResearchTeam

6 days

✨ Check out our summaries and follow our blog ✨

graphcore-research.github.io

October was packed with insights into making language models faster and smarter. We reviewed four of our favorite papers for you in detail:

0

Graphcore Research

@GCResearchTeam

27 days

Learn more by signing up here!

luma.com

CodeWords Presents: AI Lightning Talk Night #2 Join London's brightest minds for an evening of lightning talks from experts from OpenAI & DeepMind. Connect…

0

Graphcore Research

@GCResearchTeam

27 days

LLM using too many reasoning tokens? 😕 Generation slow? 🐌 Or simply too many steps before EOS? 🪜🪜🪜 Douglas Orr (@douglasahorr), our beloved research scientist, has got you covered! He will tell you the remedies to all of the above in the shortest time possible. Registration

3

5

9

Graphcore Research

@GCResearchTeam

1 month

✨ https://t.co/USO1ctukVr ✨

graphcore-research.github.io

For September, the research team reviewed a whopping 22 papers! Needless to say, competition was fierce, and only four made the final cut for this month’s edition, which is LLM-themed:

0

1

Graphcore Research

@GCResearchTeam

1 month

September's Papers of the Month is here, and this month is all about LLMs! 🧠 Out of all papers released this month, our editor @robhu92 has curated: 📊 "FlowRL: Matching Reward Distributions for LLM Reasoning“ (review by @samot_gc): A clever usage of #GFlowNets to align an #RL

1

3

8

Graphcore Research

@GCResearchTeam

2 months

Finally, Graph-R1 is another addition to the stack of agentic RAG approaches, but this time, using knowledge hypergraphs! Summary:

graphcore-research.github.io

August, even with its heat waves and holidays, left no shortage of exciting research. Our top papers for this month are the following: ADMIRE-BayesOpt that investigates how to weight different data...

0

Graphcore Research

@GCResearchTeam

2 months

Next, Guiding Diffusion Models with RL for Stable Molecule Generation introduces reinforcement learning with physical feedback to accomplish exactly as its name suggests! Summary:

graphcore-research.github.io

August, even with its heat waves and holidays, left no shortage of exciting research. Our top papers for this month are the following: ADMIRE-BayesOpt that investigates how to weight different data...

1

0

Graphcore Research

@GCResearchTeam

2 months

First up, ADMIRE-BayesOpt addresses the question of finding the optimal mixture of multiple datasets. And the answer, sequential iterative search using Multi-Fidelity Bayesian Optimization! Summary:

graphcore-research.github.io

August, even with its heat waves and holidays, left no shortage of exciting research. Our top papers for this month are the following: ADMIRE-BayesOpt that investigates how to weight different data...

1

0

Graphcore Research

@GCResearchTeam

2 months

Summer may be over, but Papers of the Month certainly isn’t! For August’s edition, we covered the following papers: ➡️ ADMIRE-BayesOpt ➡️ Guiding Diffusion Models with RL for Stable Molecule Generation ➡️ Graph-R1 🧵

1

3

Graphcore Research

@GCResearchTeam

3 months

Finally, DataRater addresses dataset quality: a ‘rater’ is meta-learned to curate training data without manual filtering. Summary:

graphcore-research.github.io

As July brought tennis at Wimbledon, so too did the ML world serve up a volley of research. This month, we took an eagle-eyed approach—or, perhaps, Hawk Eyed approach—to three papers.

0

Graphcore Research

@GCResearchTeam

3 months

Mixture of Recursions brings a twist to token-level computation: the model learns to recurse adaptively, allocating compute per token dynamically. Summary:

graphcore-research.github.io

As July brought tennis at Wimbledon, so too did the ML world serve up a volley of research. This month, we took an eagle-eyed approach—or, perhaps, Hawk Eyed approach—to three papers.

1

0

Graphcore Research

@GCResearchTeam

3 months

First up, Subliminal Learning explores a question in model distillation: “Can we control so that a student learns desirable, but avoids undesirable, traits?” Summary:

graphcore-research.github.io

As July brought tennis at Wimbledon, so too did the ML world serve up a volley of research. This month, we took an eagle-eyed approach—or, perhaps, Hawk Eyed approach—to three papers.

1

0

Graphcore Research

@GCResearchTeam

3 months

July's Papers of the Month are here! 🧠 Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data 💽 Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation 📊 DataRater: Meta-Learned Dataset Curation 🧵⬇️

1

6

Graphcore Research

@GCResearchTeam

4 months

That's all for this month! To keep up with our monthly summaries, blog posts, and new research, follow us on @GCResearchTeam or subscribe here: https://t.co/YgQxvhvgHT

graphcore-research.github.io

The official Graphcore Research blog.

0

Graphcore Research

@GCResearchTeam

4 months

For additional information, check out these excellent threads by the original authors ⬇️ @aaron_defazio https://t.co/eDTPNzuO3X @shizhediao https://t.co/v8GpDQjdOH @peter9863 https://t.co/lSsU0aOiku

Peter Lin

@peter9863

5 months

Introducing Seaweed APT2, a real-time, interactive, streaming video generation model. https://t.co/dBT7uQoFxz Adversarial training for autoregressive modeling! Streaming 1 minute videos, 1 diffusion step, 24fps real-time on 1xh100, with interactive controls!

1

0

Graphcore Research

@GCResearchTeam

4 months

Finally, we look at AAPT, a fresh approach from the ByteDance Seed team that turns pre-trained offline diffusion models into real-time video generators via adversarial post-training. https://t.co/zeurS9U1ro

graphcore-research.github.io

This June not only brought us very hot and sunny days (at least here in the UK), but also an excellent selection of new and exciting ML research! Out of the many good candidates, this month we...

1

0

Graphcore Research

@GCResearchTeam

4 months

Next, in ProRL, NVIDIA researchers dive into the evolving topic of large language model reasoning, showing how prolonged reinforcement learning can indeed introduce novel reasoning abilities. https://t.co/YOlLuVyj8B

graphcore-research.github.io

This June not only brought us very hot and sunny days (at least here in the UK), but also an excellent selection of new and exciting ML research! Out of the many good candidates, this month we...

1

0

Graphcore Research

@GCResearchTeam

4 months

Firstly, a researcher from FAIR explores the puzzling phenomenon of increasing gradient magnitudes during training, offering an elegant mathematical explanation and a simple remedy. https://t.co/zSryHkH07B

graphcore-research.github.io

This June not only brought us very hot and sunny days (at least here in the UK), but also an excellent selection of new and exciting ML research! Out of the many good candidates, this month we...

1

0

Graphcore Research

@GCResearchTeam

4 months

It's time for June's Papers of the Month! This time, we cover: ➡️Why Gradients Rapidly Increase Near the End of Training ➡️ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries ➡️Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation 🧵

1

3

11