UCBEPIC Profile Banner
UC Berkeley EPIC Lab Profile
UC Berkeley EPIC Lab

@UCBEPIC

Followers
467
Following
9
Media
70
Statuses
88

Effective Programming, Interaction, and Computation with Data Lab @UCBerkeley

Berkeley CA
Joined November 2021
Don't wanna be here? Send us removal request.
@CagatayDemiralp
Çağatay Demiralp
2 years
This was a great retreat put together by the @UCBEPIC team. As expected, applications of LLMs were central to many talks and discussions. Identifying the core problems invariant to the particularities of the next big LLM is key for research ROI in these applications.
@UCBEPIC
UC Berkeley EPIC Lab
2 years
We’re off to the races at the first EPIC data lab retreat in Napa!
0
2
5
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Can you explore the space using LLMs - but do it in a way that is efficient? How do we find the high error regions?
Tweet media one
0
1
2
@UCBEPIC
UC Berkeley EPIC Lab
2 years
It’s hard to robustly test edge cases in a model and make user defined concepts explicit
Tweet media one
1
0
1
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Fereshte Khani from Microsoft describes how to collaboratively develop NLP models, ensuring alignment and safety
Tweet media one
1
2
3
@UCBEPIC
UC Berkeley EPIC Lab
2 years
A new system they are working on is Humboldt for data discovery. You shouldn’t have to ask experts about what data you should explore!
Tweet media one
0
1
3
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Alex Bauerle from Sigma Computing tells us about what’s hard when building a spreadsheet for cloud data warehouses
Tweet media one
2
3
8
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Can you fuse structural understanding of API programs with LLM techniques? Naman provides a way! Parametric templates for the win!
Tweet media one
0
0
2
@UCBEPIC
UC Berkeley EPIC Lab
2 years
LLMs by themselves are insufficient for this task - brittle and hard to control
Tweet media one
1
0
2
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Naman Jain explores how to summarize data transformation scripts using a template-based approach, informed by LLMs
Tweet media one
1
1
2
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Flor allows users to travel back in time to help debug ML training. You can also inspect and “jump into” another user’s training history. Time travel and shapeshifting!
0
0
3
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Rolando Garcia @rogarcia_sanz describes the next generation of Flor, a tool for rapid iteration during ML training via a live notebook demo!
Tweet media one
1
1
3
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Haotian leverages large language models to identify visualization intent (variants of BERT) and prior work on automatically translating visualization intent into actual visualizations (eg Lux).
0
0
1
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Haotian Li describes how to support conversation with data via visualization - why write code when you can just talk to your data!
Tweet media one
1
1
6
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Can we check extensional equality (ie two programs have similar outputs) for constrained domains like biology? So that we can automatically rewrite and make code more performant — component by component?
Tweet media one
0
0
1
@UCBEPIC
UC Berkeley EPIC Lab
2 years
There is a trade off between easy to understand code (eg one that loops through arrays) and those that are performant (eg one that manipulates arrays in NumPy)
1
0
1
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Biologists, like many other non computer scientists, struggle to write performant code, especially on large datasets, such as genome sequences
1
0
1
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Justin Lubin @jplubin embedded himself in a “wet lab” biology group to identify their programming challenges
Tweet media one
1
1
3
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Yet more challenges in Machine Learning - operationalizing, explaining and trusting it.
0
0
1
@UCBEPIC
UC Berkeley EPIC Lab
2 years
More open challenges in helping novice users through the data science workflow - so that one can go from “zero to hero”
Tweet media one
1
0
1
@UCBEPIC
UC Berkeley EPIC Lab
2 years
Open challenges in data prep - even with sophisticated GUI tools, users often want to inspect and tweak underlying scripts - in tandem. Current tools don’t support seamless transitions and sensemaking
1
0
1