
Ruoming Pang
@ruomingpang
Followers
2K
Following
823
Media
9
Statuses
112
Earlier today at #WWDC24, we introduced Apple Intelligence, the personal intelligence system integrated deeply into iPhone, iPad, and Mac, to enable powerful capabilities across language, images, actions, and personal context. We’re excited to share more about how Apple.
4
35
114
Our team is developing cutting-edge foundation models that power Apple Intelligence. Join our close-knit and fast-moving efforts as researchers and engineers in Cupertino, New York, or Seattle. Be at the heart of shaping the future of Apple Intelligence. Learn more at.
Earlier today at #WWDC24, we introduced Apple Intelligence, the personal intelligence system integrated deeply into iPhone, iPad, and Mac, to enable powerful capabilities across language, images, actions, and personal context. We’re excited to share more about how Apple.
0
15
100
We would appreciate feedback from our users and the research community. I'd also like to take this opportunity to thank our team (including @NoughtAleph, @MrZiruiWang, @cw_aabc, and many others) and collaborators. It has been a privilege to work with you all!.
1
2
21
Which architecture is better at speech recognition, convolution or transformer? Check out Conformer with 1.9/3.9 on Librispeech: @anmol01gulati.
1
8
19
A new SOTA on Librispeech with an end-to-end convolution model, 2.1/4.6 without external LM, 1.9/4.1 with LM: #ContextNet.
2
7
19
Also calling out to a few more Apple FM team members on X: @markblee @XiangKong4 @chenqibin99 @gyin94 @bwzhang_usc @vivekrathod @yapdianang @DpacGopinath @Phyyysalis @_samwiseman.
1
0
12
@Lingling_Wei @WSJ That’s my personal experience too: “I realized the essence of being American: You’re always welcome, no matter where you were born. The U.S. is built on inclusiveness. It’s one of the biggest factors in American competitiveness and what drew me here two decades ago.”.
1
0
6
@elizashapiro This conclusion is flawed in a number of ways: (1) GPA is not very meaningful across schools; (2) the gap between 4.1 and 3.9 is not spelled out in terms of percentage; (3) it uses the same metric for both selection and evaluation.
1
0
5
Amazing efficiency and scalability with Jax + GSPMD + XLA + TPU. For more details on GSPMD see by @ukoxyz
JAX + GSPMD + TPU v4 achieves (to our knowledge) the highest Model Flop Utilization (MFU) across a range of Transformer LLMs:
1
0
3
My child was lucky enough to study in Ms Feurtado's math program. I hope we can preserve it for future kids! @NYCSchools @nyclabschool.
0
1
2
@elizashapiro On the last point, imagine that one claims that height can be used to predict academic performance. After selecting students based on height, we find that indeed they are all quite tall! A better metric would be to compare GPA from SHS in an A-B study.
0
0
2
@elizashapiro The problem is not racial integration, but "academic" integration. The proposed admission criteria contain a number of bad ideas, such as assuming that middle schools across the city have comparable academic levels, even if some are screened.
0
0
2
@NYCMayor As a parent, I know that it's years of hard work, not test prep, that gets students into specialized high schools. Address the real achievement gaps, do not just paper over them!.
0
0
2
@giffmana MMLU 5-shot? GPT-4 reports 86.4 and Flan PaLM 2 at 81.2. Not sure whether the methodology is the same though.
0
0
2
Chinese Embassy in the US, please reinstate visas and remove obstacles for visiting China - Sign the Petition! via @Change.
0
0
1
@anmol01gulati @Tim_Dettmers I think the overall lessons from flash attention also hold for TPU, since there’s a similar HBM/SRAM memory hierarchy. OTOH, the software stack is different. We rely on XLA to apply the optimization instead of writing our own kernels.
0
0
1