CommonCrawl Profile Banner
Common Crawl Foundation Profile
Common Crawl Foundation

@CommonCrawl

Followers
8K
Following
612
Media
46
Statuses
1K

Common Crawl is a non-profit foundation dedicated to the Open Web.

San Francisco, CA
Joined February 2010
Don't wanna be here? Send us removal request.
@CommonCrawl
Common Crawl Foundation
5 days
We are pleased to announce the release of the web graphs based on the crawls of September, October, and November of 2025, consisting of 235.7 million nodes and 9.5 billion edges at the host level, and 100.7 million nodes and 6.6 billion edges at the domain level.
2
1
5
@CommonCrawl
Common Crawl Foundation
8 days
This is an abridged version of a keynote given by @jedsundwall at the 2025 Chan Zuckerberg Initiative Open Science Meeting. https://t.co/HmxxwdfODd
Tweet card summary image
radiant.earth
The language we use to talk about data is keeping us from realizing its full potential.
0
6
8
@bookofjoe
bookofjoe
23 days
"You shouldn't have put your content on the internet if you didn't want it to be on the internet." — Common Crawl's executive director Rich Skrenta
1
1
8
@citizens_sanity
Citizens for Sanity
19 days
Their words. Their videos. No excuses. Expose the woke agenda to everyone you know. Follow and share.
14
51
265
@CommonCrawl
Common Crawl Foundation
24 days
Common Crawl Celebrates World Digital Preservation Day CCF celebrates World Digital Preservation Day, which invites the community to unite in answering a powerful question: Why Preserve?
1
2
7
@skrenta
Rich Skrenta
24 days
@MediaSentinelle @CommonCrawl @mart1oeil France is trying to delete French from the Internet
2
1
5
@CommonCrawl
Common Crawl Foundation
25 days
Setting the Record Straight: Common Crawl’s Commitment to Transparency, Fair Use, and the Public Good
3
6
23
@CommonCrawl
Common Crawl Foundation
25 days
Identifying Rare Languages in Common Crawl Data is a Needles-in-a-Haystack Problem
1
0
4
@CommonCrawl
Common Crawl Foundation
25 days
Common Crawl Foundation October/November 2025 Newsletter! Montreal, SF and Stanford. :)
1
0
1
@GameMillEnt
GameMill Entertainment
8 days
There’s no case Snoopy can’t solve. Save 25% on Snoopy & The Great Mystery Club! Available Now.
0
1
42
@CommonCrawl
Common Crawl Foundation
28 days
0
0
3
@CommonCrawl
Common Crawl Foundation
29 days
https://t.co/blSBfgR4rx This Stanford HAI seminar featured Common Crawl Foundation’s work on preserving humanity's knowledge and making it accessible through its free public web dataset.
0
0
2
@CommonCrawl
Common Crawl Foundation
1 month
PDF slides here :) https://t.co/MS6bHRT8VP
0
0
0
@CommonCrawl
Common Crawl Foundation
1 month
Common Crawl Foundation would like to thank Stanford HAI for the opportunity to present this week: "Preserving Humanity's Knowledge and Making it Accessible:" We appreciate Patrick Hynes and Professor Diyi Yang for hosting us! (link to followup post and PDF slides in replies)
1
2
13