josh yan 👍🏼
@josh1yan
Followers
552
Following
150
Media
10
Statuses
72
perspicating @uwaterloo | @janestreetgroup @yutori_ai @ollama https://t.co/SBaguaFFqS
Joined June 2024
recently, i spent some time working on cross-lingual alignment for LLMs via encoder injection! treating languages as modalities is a compute-efficient way to extend understanding of low-resource languages without extending pre-training or the tokenizer
12
11
111
hypothetically, if one wanted to research at a frontier lab in 1.5 years (hypothetically winter 2027) and wanted to know how to develop the necessary skills and credentials to do so, what would you suggest to them (asking for a friend)
31
27
1K
we built PokeOS. Poke now - interacts with your iMessage proactively - fills forms/searches on your local browser - can write/debug code directly in your IDE
24hr Poke Automations Challenge 🌴 In addition to existing integrations like @NotionHQ, @Linear, and @Cognition, you can now add your own, custom Poke Integrations/MCPs. Custom Poke Integrations can be used for querying data, taking action, and triggering Poke Automations.
13
19
359
built with @_rajanagarwal in under 2hrs at cursor hackathon
github.com
cursor can now read your terminal in realtime. Contribute to rajansagarwal/observe development by creating an account on GitHub.
0
0
4
i built shadow: a powerful open-source background coding agent with a real-time interface. for async, parallel work on long-running coding tasks! made in a few weeks with @_rajanagarwal @elijahkurien
30
22
325
🤯 Gemma 3 is available on Ollama! Multimodal is here for Gemma. 1B (text-only): ollama run gemma3:1b 4B: ollama run gemma3:4b 12B: ollama run gemma3:12b 27B: ollama run gemma3:27b
80
319
2K
Introducing Outlook: A novel ML indicator that consistently outperforms both individual stocks and major indices. Leverages weighted future returns and custom decay functions to identify market trends across all conditions. Thread below
7
9
53
Day 13: ChatGPT Wrapped End 2024 with another wrapper. Built in 48 hours with @sdand
https://t.co/xKbCzVahIY
4
4
51
we're turning @uwaterloo into a supercomputer! arceus is a cross-device distributed compute network for training large models, using model/tensor/pipeline parallelism. you can train anything, from deep neural networks to language models on the network. oss & deployment soon!
38
35
539
We're turning @UWaterloo into a supercomputer 💻 Arceus is a distributed training marketplace for renting unused MacBook compute to a global network, giving developers cheap compute to train large models on consumer hardware clusters! With @_rajanagarwal @josh1yan @simerusm
16
8
197