jepsen_io Profile Banner
Jepsen Profile
Jepsen

@jepsen_io

Followers
6K
Following
39
Media
23
Statuses
312

Distributed systems safety analysis. No longer maintained; see https://t.co/GS4QrEFo8P

Joined March 2018
Don't wanna be here? Send us removal request.
@jepsen_io
Jepsen
3 years
Shutting down the cross-poster. If you'd like Jepsen updates, they'll be on Mastodon from now on:
Tweet card summary image
mastodon.jepsen.io
219 Posts, 2 Following, 1.88K Followers · Breaking distributed systems, one fault at a time.
0
7
27
@jepsen_io
Jepsen
3 years
multi-query optimization yield dramatic speedups in analyses: histories of hundreds of millions or even billions of operations are now tractable. Hopefully it makes your life better!. (2/2).
Tweet card summary image
github.com
This release replaces many of Jepsen's internals with faster or more scalable data structures. It introduces significant new datatypes and adds new support libraries. Core generators are much f...
0
1
11
@jepsen_io
Jepsen
3 years
Phew! Jepsen 0.3.0 is ready, and it's a doozy. 1-2 order of magnitude throughput improvements across multiple benchmarks. Generators are way faster. Histories are now streamed incrementally to and from disk. Linear and concurrent folds over histories with stream fusion and (1/2).
1
6
34
@jepsen_io
Jepsen
3 years
If anyone out there works on Stripe eng and has a few minutes to spare, I could use a little help with something! DM me or email aphyr@jepsen.io, please. 😃.
2
0
3
@jepsen_io
Jepsen
3 years
Sneak preview of a new library for checking Jepsen histories. This will be shipping with the new disk-backed histories in the next release.
Tweet card summary image
github.com
Support library for working with Jepsen histories. Contribute to jepsen-io/history development by creating an account on GitHub.
0
2
9
@jepsen_io
Jepsen
3 years
Also delighted to share that there's now a (limited) Jepsen test for local filesystems, which we've been using to find bugs in lazyfs:
Tweet card summary image
github.com
Jepsen tests for local filesystems, running on a single node. - jepsen-io/local-fs
0
2
13
@jepsen_io
Jepsen
3 years
Jepsen 0.2.7 is now available! Includes a (known-buggy) preview of lazyfs: a filesystem which can intentionally lose un-fsynced writes!.
Tweet card summary image
github.com
LazyFS: A FUSE Filesystem with an internal dedicated page cache, which can be used to simulate data loss on unsynced writes - dsrhaslab/lazyfs
1
12
35
@jepsen_io
Jepsen
3 years
Ayyyyyy, congratulations! 🎉.
0
1
12
@jepsen_io
Jepsen
3 years
Hey y'all! Doing an hour-long free webinar on May 25th with @redpandadata to talk about what we found in the last Jepsen analysis (. Come learn about streaming systems safety!.
Tweet card summary image
go.redpanda.com
Kyle Kingsbury has made a career of improving the safety of distributed systems. With his Jepsen test, he explores whether a system lives up to its claims, files new bugs, and makes recommendations...
0
13
30
@jepsen_io
Jepsen
3 years
Cheers to @redpandadata on a delightful collaboration, and congratulations on their new release. :-).
0
2
34
@jepsen_io
Jepsen
3 years
Redpanda has addressed most of these issues in the just-released 21.11.15, and the upcoming 22.1.1 fixes aborted reads and lost writes with transactions--lost/stale messages are still under investigation. A few more issues require only documentation to address.
1
3
20
@jepsen_io
Jepsen
3 years
A new #Jepsen report! We analyzed @redpandadata (a Kafka-compatible distributed queue) and discuss crashes, aborted reads, inconsistent offsets, and lost/stale messages, along with some potentially surprising aspects of the Kafka transaction protocol.
2
49
205
@jepsen_io
Jepsen
3 years
I am begging the cryptocurrency community to consider alternative ways of knowing, such as "emailing someone to ask them questions instead of speculating in chat" and "submitting a handful of transactions and seeing if they show up".
3
4
19
@jepsen_io
Jepsen
3 years
<sigh> No, Radix folks, Jepsen will not be accepting a follow-up engagement with any Radix-related entities. Y'all can stop suggesting there's going to be a follow-up analysis on Xi'an now. I take my ethical commitments seriously.
4
2
28
@jepsen_io
Jepsen
3 years
They also stressed the importance of end-to-end verification of safety properties, because APIs are how exchanges and users actually interact with DLTs. This is a challenge in traditional databases as well: composition of (e.g.) serializable transactional DBs is nontrivial!.
1
0
5
@jepsen_io
Jepsen
3 years
I'm not sure how widespread this understanding is in the DLT space (still looking for a citation for RDX Works's definition) but the researchers I've talked to were unanimous: losing committed transactions *is* a safety error, even if every validator agrees to throw away data.
1
0
5
@jepsen_io
Jepsen
3 years
Since the release I've had the chance to chat with a handful of analysts working specifically on verification of blockchain/cryptocurrency/DLT systems, and can confirm that they also use the usual distsys sense of "safety property"--namely: "something bad does not happen".
1
0
5
@jepsen_io
Jepsen
3 years
Some helpful and much-better-informed comments from @trianglesphere on tendermint/hotstuff latency, including a nicely drawn Lamport diagram.
@trianglesphere
Joshua
3 years
@jepsen_io I’m pretty sure it’s 7 delays. 1 to validator, 7 to finalize, 1 from any validator back to the client. By this metric, PBFT/tender mint is 3. Ignore the new view, but each set of arrows is a hop.
0
1
9
@jepsen_io
Jepsen
3 years
RT @trianglesphere: @jepsen_io I'm not at all familiar with Radix DLT, but I've got a bunch of thoughts on consensus algorithms and improvi….
0
2
0
@jepsen_io
Jepsen
3 years
Thing is that none of this is even remotely close to saturating disk or network bandwidth. It's a fresh, empty cluster and request volumes are *tiny*, so like. page cache should be able to hold most if not all of this data. I dunno. Software is a ~rich tapestry~.
1
0
19