jasmine wang Profile
jasmine wang

@jasminechenwang

Followers
45
Following
124
Media
1
Statuses
70

Yoga, Cognitive Science, Open Source technology, Startups, Sunset at the Beach

Joined June 2022
Don't wanna be here? Send us removal request.
@lancedb
LanceDB
1 month
1/7 ๐ŸŽจ In a world of infinite scroll, discovering art still feels like searching for a needle in a haystack. With SemanticDotArt, we flipped the question: What if you searched by mood, not just metadata? See how we did this in @lancedb ๐Ÿ‘‡๐Ÿฝ
3
11
19
@changhiskhan
changhiskhan
2 months
This is a big milestone for Lance format. The F3 paper ( https://t.co/hVREwxykSn) verified that Lance has THE fastest random access, essential for search, shuffle, and many other AI workloads. But it incorrectly assumed it was because of lack of compression. With 2.1, we show
Tweet card summary image
dl.acm.org
Columnar storage formats are the foundation for modern data analytics systems. The proliferation of open-source file formats (i.e., Parquet, ORC) allows seamless data sharing across disparate...
@lancedb
LanceDB
2 months
๐Ÿ’พ Lance File 2.1 Is Now Stable ๐Ÿฅณ Big news from the LanceDB team โ€” Lance File Format 2.1 is officially stableโ—๏ธ This release solves one of the biggest challenges from 2.0: ๐Ÿ‘‰ adding compression without sacrificing *random access performance.
3
12
45
@ApacheSpark
Apache Spark
3 months
Join us for our webinar onย Apache Sparkโ„ข and Lance Spark Connectorย with Jack Ye (@lancedb) on September 25! ๐Ÿ‘ Learn how the Lance Spark Connector enables Apache Sparkโ„ข to work with Lanceโ€™s AI-native multimodal storage. โœ… Weโ€™ll look at how Spark can handle embeddings, images,
1
1
4
@lancedb
LanceDB
3 months
When building a columnar file reader, it becomes clear that ๐˜€๐˜๐—ฟ๐˜‚๐—ฐ๐˜๐˜‚๐—ฟ๐—ฒ ๐—ถ๐˜€ ๐—ป๐—ผ๐˜ ๐—ท๐˜‚๐˜€๐˜ ๐—ฎ๐—ป ๐—ฎ๐—ฏ๐˜€๐˜๐—ฟ๐—ฎ๐—ฐ๐˜ ๐—ฐ๐—ผ๐—ป๐—ฐ๐—ฒ๐—ฝ๐˜.ย ( https://t.co/9TGr34de1n) It is the set of rules that determines how every byte of data is stored and accessed on disk. A few months ago,
1
3
19
@lancedb
LanceDB
4 months
The data prep bottleneck for fine-tuning LLMs is a common challenge. ๐—ข๐˜‚๐—ฟ ๐—ป๐—ฒ๐˜„ ๐—ถ๐—ป๐˜๐—ฒ๐—ด๐—ฟ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐˜„๐—ถ๐˜๐—ต ๐— ๐—ฒ๐˜๐—ฎ'๐˜€ ๐—ฆ๐˜†๐—ป๐˜๐—ต๐—ฒ๐˜๐—ถ๐—ฐ ๐——๐—ฎ๐˜๐—ฎ ๐—ž๐—ถ๐˜ ๐—ต๐—ฒ๐—ฟ๐—ฒ ๐˜๐—ผ ๐—ณ๐—ถ๐˜… ๐˜๐—ต๐—ฎ๐˜! It simplifies the entire workflow with a ๐˜€๐˜๐—ฟ๐—ฎ๐—ถ๐—ด๐—ต๐˜๐—ณ๐—ผ๐—ฟ๐˜„๐—ฎ๐—ฟ๐—ฑ ๐—–๐—Ÿ๐—œ for
0
1
3
@lancedb
LanceDB
4 months
๐Ÿš€ Video from @TMLS_TO : @character_ai x @LanceDB on building a unified multimodal data lake , a single system for text, audio, video & image retrieval. @changhiskhan @ryanvilim Simpler pipelines, lower infra costs, faster AI dev. ๐ŸŽฅ Watch: https://t.co/bt21gm8dwZ #AI #LLM
0
3
7
@andriy_mulyar
Andriy Mulyar
4 months
@swyx @jxmnop - built by solid db people and hackable (we have a contributor at nomic to it) - used by top ai companies / labs / products for it's nice properties when used in a training loops (e.g. midjourney has been using it since 2023) so probably not going anywhere - feels like the right
1
2
11
@charles_irl
Charles ๐ŸŽ‰ Frye
5 months
q from the audience: "Is Lance the next big thing in data?" answer: "Yes" ๐Ÿ‘€
@lancedb
LanceDB
5 months
Thanks for @charles_irl , live from NY! Ethan from @runwayml is giving lots of love for LanceDB!
2
3
20
@lancedb
LanceDB
5 months
We just published a ๐—ป๐—ฒ๐˜„ ๐—ฏ๐—น๐—ผ๐—ด ( https://t.co/nT0lF1sbmH) on what the ๐— ๐˜‚๐—น๐˜๐—ถ๐—บ๐—ผ๐—ฑ๐—ฎ๐—น ๐—Ÿ๐—ฎ๐—ธ๐—ฒ๐—ต๐—ผ๐˜‚๐˜€๐—ฒ actually does. The Lakehouse is ๐—ณ๐—ผ๐—ฟ ๐˜๐—ต๐—ผ๐˜€๐—ฒ working with a mix of text, images, audio, and structured data - ๐˜„๐—ต๐—ผ ๐˜„๐—ถ๐˜€๐—ต ๐˜๐—ผ ๐—ฎ๐˜ƒ๐—ผ๐—ถ๐—ฑ ๐˜๐—ต๐—ฒ ๐—ฝ๐—ฎ๐—ถ๐—ป of
1
3
12
@lancedb
LanceDB
5 months
Today weโ€™re announcing ourย $30 million Series A. This round is led byย @Theoryvc with support fromย @CRV , @ycombinator, @databricks, @runwayml , @ZeroPrimeVC , @swift_vc,ย and more. Your belief in a future powered by multimodal dataย brings us one step closer to that reality.
16
39
200
@lancedb
LanceDB
5 months
Missed Ethanโ€™s talk at @DataCouncilAI 2025? ๐ŸŽค He shares how @RunwayML tackles multimodal data challengesโ€”and how LanceDB helps store, query, and retrieve it all efficiently. ๐ŸŽฅ Watch here: https://t.co/MhNXKY7sx0 Ethan's slides: https://t.co/sxWcAfnWUb #LanceDB
0
6
24
@lancedb
LanceDB
6 months
Live at #DataAISummit from @databricks @DbrxMosaicAI and @lancedb . A joint talk by @changhiskhan and Zero Qu Congrats to both teams on the newly announced storage optimized vector search. Now we take billion vector scale to the moon!
0
1
5
@lancedb
LanceDB
6 months
Join @character_ai and @lancedb at the upcoming @TMLS_TO for a joint talk on "๐˜ผ ๐™๐™ฃ๐™ž๐™›๐™ž๐™š๐™™ ๐™ˆ๐™ช๐™ก๐™ฉ๐™ž๐™ข๐™ค๐™™๐™–๐™ก ๐˜ฟ๐™–๐™ฉ๐™– ๐™‡๐™–๐™ ๐™š ๐™›๐™ค๐™ง ๐™‰๐™š๐™ญ๐™ฉ-๐™‚๐™š๐™ฃ๐™š๐™ง๐™–๐™ฉ๐™ž๐™ค๐™ฃ ๐˜ผ๐™„" @changhiskhan will be there! Time: June 13th, virtual talk Register: https://t.co/XJlr77Lo3E Btw,
0
2
4
@loldedxd
Ayush Chaurasia
7 months
We just released a walkthrough on how to ingest the ๐—ณ๐˜‚๐—น๐—น ๐—ช๐—ถ๐—ธ๐—ถ๐Ÿฐ๐Ÿญ๐—  ๐—ฑ๐—ฎ๐˜๐—ฎ๐˜€๐—ฒ๐˜ โ€” that's ๐Ÿฐ๐Ÿญ ๐—บ๐—ถ๐—น๐—น๐—ถ๐—ผ๐—ป ๐—ฟ๐—ผ๐˜„๐˜€ ๐—ผ๐—ณ Wikipedia โ€” into @lancedb ~11 minutes. ๐Ÿ”ง What youโ€™ll learn: โ€ข How to generate embeddings at scale โ€ข Ingest massive datasets into LanceDB Cloud
3
12
76
@lancedb
LanceDB
7 months
๐Ÿ” LanceDB is now SOC 2 Type II, HIPAA, and GDPR compliant. Weโ€™re built for secure, privacy-conscious AI applications โ€” from startups to enterprises. ๐Ÿ” #AIInfrastructure #DataPrivacy #GDPR #HIPAA #SOC2
0
2
4
@changhiskhan
changhiskhan
7 months
The weather is supposed to be much nicer tmr in NYC - who wants to hang out near Bryant Park?
@lancedb
LanceDB
7 months
If you are in #NYC going to the AI Agent Conference. Catch @changhiskhan at his session!
0
2
9
@lancedb
LanceDB
7 months
๐Ÿง  Monthly Newsletter update from LanceDB: โœ… Research paper on arXiv โš”๏ธ New Lancelots knighted โš™๏ธ Guides on rerankers + embeddings ๐Ÿ’ผ Case studies: @continuedev & @AnythingLLM ๐Ÿ”ง Big product upgrades https://t.co/iFPQBIi6tc
0
2
10
@jasminechenwang
jasmine wang
7 months
The AI infra team at @character_ai is particularly special to me personally. When I joined LanceDB, this was the first frontier modal company that I worked closely with. An exceptionally talented team that saw the value in what we were building at @lancedb. Thank you guys :)
@lancedb
LanceDB
7 months
Say hello to Nat Roth โ€” our newest Lancelot! ๐Ÿ›ก๏ธ He was the first engineer at @character_ai to work on #lance, contributing to our FTS and retrieval stack early on. Big Boston sports fan, trivia champ, and @TomBrady loyalist. Hi Tom, if you're reading this. ๐Ÿ‘€
0
0
1
@lancedb
LanceDB
7 months
This work not only addresses the challenges faced by current storage solutions but also sets the stage for future innovations in data management. If you're interested in the intersection of AI, data storage, and performance optimization, I invite you to read our paper and explore
1
1
1