#36 · 2015-03-25 · 7 min read · Distributed Systems

You Cannot Have Exactly-Once Delivery

I’m often surprised that people continually have fundamental misconceptions about how distributed systems behave. I myself shared many of these misconceptions, so I try not to demean or dismiss but rather educate and enlighten, hopefully while sounding less preachy than that just did. I continue to learn only by following in the footsteps of others. In retrospect, it shouldn’t be surprising that folks buy into these fallacies as I once did, but it can be frustrating when trying to communicate certain design decisions and constraints.

Within the context of a distributed system, you cannot have exactly-once message delivery. Web browser and server? Distributed. Server and database? Distributed. Server and message queue? Distributed. You cannot have exactly-once delivery semantics in any of these situations.

As I’ve described in the past, distributed systems are all about trade-offs. This is one of them. There are essentially three types of delivery semantics: at-most-once, at-least-once, and exactly-once. Of the three, the first two are feasible and widely used. If you want to be super anal, you might say at-least-once delivery is also impossible because, technically speaking, network partitions are not strictly time-bound. If the connection from you to the server is interrupted indefinitely, you can’t deliver anything. Practically speaking, you have bigger fish to fry at that point—like calling your ISP—so we consider at-least-once delivery, for all intents and purposes, possible. With this model of thinking, network partitions are finitely bounded in time, however arbitrary this may be.

So where does the trade-off come into play, and why is exactly-once delivery impossible? The answer lies in the Two Generals thought experiment or the more generalized Byzantine Generals Problem, which I’ve looked at extensively. We must also consider the FLP result, which basically says that in an asynchronous system, given the possibility of even one faulty process, no deterministic protocol can guarantee that the processes will ever agree on a decision.

In the letter I mail you, I ask you to call me once you receive it. You never do. Either you really didn’t care for my letter or it got lost in the mail. That’s the cost of doing business. I can send the one letter and hope you get it, or I can send 10 letters and assume you’ll get at least one of them. The trade-off here is quite clear (postage is expensive!), but sending 10 letters doesn’t really provide any additional guarantees. In a distributed system, we try to guarantee the delivery of a message by waiting for an acknowledgement that it was received, but all sorts of things can go wrong. Did the message get dropped? Did the ack get dropped? Did the receiver crash? Are they just slow? Is the network slow? Am I slow? FLP and the Two Generals Problem are not design complexities; they are impossibility results.
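The sender's dilemma can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `ScriptedLink` and `send_until_acked` are hypothetical names, and the link's failures are scripted rather than random so the run is repeatable.

```python
from dataclasses import dataclass, field


@dataclass
class ScriptedLink:
    """Hypothetical link whose failures are scripted so the run is repeatable."""
    script: list  # one outcome per attempt: "drop_msg", "drop_ack", or "ok"
    received: list = field(default_factory=list)  # copies the receiver saw

    def send(self, msg):
        outcome = self.script.pop(0) if self.script else "ok"
        if outcome == "drop_msg":
            return False          # message lost: the receiver saw nothing
        self.received.append(msg)
        if outcome == "drop_ack":
            return False          # receiver has the message, but the ack is lost
        return True               # the ack made it back to the sender


def send_until_acked(link, msg, max_attempts=5):
    """Resend until an ack arrives. Returns attempts used, or None on giving up."""
    for attempt in range(1, max_attempts + 1):
        if link.send(msg):
            return attempt
    return None


link = ScriptedLink(["drop_msg", "drop_ack", "ok"])
attempts = send_until_acked(link, "hello")
print(attempts, link.received)
```

From the sender's side, a dropped message and a dropped ack look identical: both are just silence. Retrying is the only way to get the message through, and it is also exactly what produces the duplicate on the receiving end.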

People often bend the meaning of “delivery” in order to make their system fit the semantics of exactly-once, or in other cases, the term is overloaded to mean something entirely different. State-machine replication is a good example of this. Atomic broadcast protocols ensure messages are delivered reliably and in order. The truth is, we can’t deliver messages reliably and in order in the face of network partitions and crashes without a high degree of coordination. This coordination, of course, comes at a cost (latency and availability), while still relying on at-least-once semantics. Zab, the atomic broadcast protocol which lays the foundation for ZooKeeper, enforces idempotent operations.

> State changes are idempotent and applying the same state change multiple times does not lead to inconsistencies as long as the application order is consistent with the delivery order. Consequently, guaranteeing at-least once semantics is sufficient and simplifies the implementation.

“Simplifies the implementation” is the authors’ attempt at subtlety. State-machine replication is just that, replicating state. If our messages have side effects, all of this goes out the window.

We’re left with a few options, all equally tenuous. The first is to acknowledge a message as soon as it is delivered, before processing it. The sender receives the ack and calls it a day. However, if the receiver crashes before or during processing, that data is lost forever. Customer transaction? Sorry, looks like you’re not getting your order. This is the worldview of at-most-once delivery. To be honest, implementing at-most-once semantics is more complicated than this, depending on the situation. If there are multiple workers processing tasks or the work queues are replicated, the broker must be strongly consistent (or CP in CAP theorem parlance) so as to ensure a task is not delivered to any other workers once it’s been acked. Apache Kafka, for example, has historically used ZooKeeper to handle this coordination.
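Here is a minimal Python sketch of the ack-before-processing flow. The in-memory list standing in for the broker, the `Crash` exception, and the name `consume_at_most_once` are all assumptions made for illustration; this is not any real queue's API.

```python
class Crash(Exception):
    """Stands in for the receiver process dying mid-task."""


def consume_at_most_once(queue, handler):
    """Ack (remove) each message BEFORE handling it.

    If the handler crashes, the message has already been acked,
    so nobody will ever redeliver it.
    """
    processed = []
    while queue:
        msg = queue.pop(0)          # ack first: the broker forgets the message
        try:
            handler(msg)
            processed.append(msg)
        except Crash:
            pass                    # receiver restarts; msg is gone for good
    return processed


def flaky_handler(msg):
    if msg == "order-2":
        raise Crash("receiver died mid-processing")


done = consume_at_most_once(["order-1", "order-2", "order-3"], flaky_handler)
print(done)  # order-2 never shows up
```

No message is ever handled twice, which is the whole appeal. The price is that `order-2` is silently lost.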

On the other hand, we can acknowledge messages after they are processed. If the process crashes after handling a message but before acking (or the ack isn’t delivered), the sender will redeliver. Hello, at-least-once delivery. Furthermore, if you want to deliver messages, in order, to more than one site, you need an atomic broadcast, which is a huge burden on throughput. Fast or consistent. Welcome to the world of distributed systems.
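The ack-after-processing flow looks like this as a rough Python sketch. Again, the list-as-broker, `Crash`, `FlakyHandler`, and `consume_at_least_once` are hypothetical names chosen for illustration, and the `max_deliveries` cap exists only to keep the toy loop from spinning forever.

```python
class Crash(Exception):
    """Stands in for the receiver dying after processing but before acking."""


class FlakyHandler:
    """Applies the side effect, then crashes once before the ack can be sent."""

    def __init__(self):
        self.effects = []       # every side effect that actually happened
        self.crashed = False

    def __call__(self, msg):
        self.effects.append(msg)            # the side effect happens here
        if not self.crashed:
            self.crashed = True
            raise Crash("died after processing, before acking")


def consume_at_least_once(queue, handler, max_deliveries=10):
    """Ack only AFTER the handler succeeds; unacked messages are redelivered."""
    deliveries = 0
    while queue and deliveries < max_deliveries:
        msg = queue[0]          # peek: the message stays queued until acked
        deliveries += 1
        try:
            handler(msg)
        except Crash:
            continue            # no ack was sent, so the broker redelivers
        queue.pop(0)            # ack: now the broker may forget the message
    return deliveries


handler = FlakyHandler()
count = consume_at_least_once(["charge-card"], handler)
print(count, handler.effects)
```

Nothing is lost this time, but the card gets charged twice. That duplicate is the price of at-least-once.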

Every major message queue in existence which provides any guarantees will market itself as at-least-once delivery. If one claims exactly-once, either its vendors are lying to your face in hopes that you will buy it, or they themselves do not understand distributed systems. Either way, it’s not a good indicator.

RabbitMQ attempts to provide guarantees along these lines:

> When using confirms, producers recovering from a channel or connection failure should retransmit any messages for which an acknowledgement has not been received from the broker. There is a possibility of message duplication here, because the broker might have sent a confirmation that never reached the producer (due to network failures, etc). Therefore consumer applications will need to perform deduplication or handle incoming messages in an idempotent manner.

The way we achieve exactly-once delivery in practice is by faking it. Either the messages themselves should be idempotent, meaning they can be applied more than once without adverse effects, or we remove the need for idempotency through deduplication. Ideally, our messages don’t require strict ordering and are commutative instead. There are design implications and trade-offs involved with whichever route you take, but this is the reality in which we must live.
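Deduplication on the consumer side might look something like the following Python sketch. The `DedupConsumer` class and its message IDs are assumptions for illustration. The sketch also assumes the set of seen IDs is durable and is updated atomically with the side effect; a real system would have to arrange that itself (for example, by writing both in one database transaction).

```python
class DedupConsumer:
    """Toy consumer that ignores any message ID it has already applied."""

    def __init__(self):
        self.seen = set()   # IDs already applied; assumed durable in real life
        self.balance = 0

    def handle(self, msg_id, amount):
        """Apply the message once. Returns False if it was a duplicate."""
        if msg_id in self.seen:
            return False            # redelivery: skip the side effect
        self.balance += amount      # the side effect we must not repeat
        self.seen.add(msg_id)       # assumed atomic with the line above
        return True


consumer = DedupConsumer()
consumer.handle("m1", 50)
consumer.handle("m1", 50)   # redelivered by an at-least-once broker
consumer.handle("m2", 25)
print(consumer.balance)
```

Delivery is still at-least-once underneath. The consumer just makes the duplicates harmless, which is the "faking it" part.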

Rethinking operations as idempotent actions might be easier said than done, but it mostly requires a change in the way we think about state. This is best described by revisiting the replicated state machine. Rather than distributing operations to apply at various nodes, what if we just distribute the state changes themselves? Rather than mutating state, let’s just report facts at various points in time. This is effectively how Zab works.
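The difference between shipping operations and shipping state changes can be shown with a small Python sketch. The function names and the `version` field are my own assumptions for illustration; the version is a stand-in for "application order is consistent with delivery order," not a description of how Zab itself represents state.

```python
def apply_operation(state, op):
    """Operation-style message ("add 10"): applying it twice changes the result."""
    return {**state, "balance": state["balance"] + op["add"]}


def apply_fact(state, fact):
    """State-style message ("balance is 110 as of version 2").

    Re-applying the same fact, or an older one, leaves the state untouched.
    """
    if fact["version"] <= state["version"]:
        return state
    return {"balance": fact["balance"], "version": fact["version"]}


start = {"balance": 100, "version": 1}

# The same operation delivered twice double-counts.
op = {"add": 10}
print(apply_operation(apply_operation(start, op), op))

# The same fact delivered twice is harmless.
fact = {"balance": 110, "version": 2}
print(apply_fact(apply_fact(start, fact), fact))
```

With facts, a duplicate is simply a message that tells you something you already know.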

Imagine we want to tell a friend to come pick us up. We send him a series of text messages with turn-by-turn directions, but one of the messages is delivered twice! Our friend isn’t too happy when he finds himself in the bad part of town. Instead, let’s just tell him where we are and let him figure it out. If the message gets delivered more than once, it won’t matter. The implications are wider reaching than this, since we’re still concerned with the ordering of messages, which is why solutions like commutative and convergent replicated data types are becoming more popular. That said, we can typically solve this problem through extrinsic means like sequencing, vector clocks, or other partial-ordering mechanisms. It’s usually causal ordering that we’re after anyway. People who say otherwise don’t quite realize that there is no now in a distributed system.
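For the ordering side of the problem, here is a minimal vector clock sketch in Python. Clocks are plain dicts mapping a node name to a counter, and all of the function names are hypothetical; it shows only the happened-before comparison, not a full messaging layer.

```python
def vc_tick(clock, node):
    """Return a copy of the clock with this node's counter incremented."""
    return {**clock, node: clock.get(node, 0) + 1}


def vc_merge(a, b):
    """Element-wise max of two clocks, used when receiving a message."""
    return {n: max(a.get(n, 0), b.get(n, 0)) for n in a.keys() | b.keys()}


def happened_before(a, b):
    """True if a causally precedes b: a <= b everywhere, and a < b somewhere."""
    nodes = a.keys() | b.keys()
    return all(a.get(n, 0) <= b.get(n, 0) for n in nodes) and any(
        a.get(n, 0) < b.get(n, 0) for n in nodes
    )


def concurrent(a, b):
    """True if neither event causally precedes the other."""
    nodes = a.keys() | b.keys()
    same = all(a.get(n, 0) == b.get(n, 0) for n in nodes)
    return not same and not happened_before(a, b) and not happened_before(b, a)


a1 = vc_tick({}, "alice")                   # alice does something
b1 = vc_tick({}, "bob")                     # bob does something independently
b2 = vc_tick(vc_merge(b1, a1), "bob")       # bob receives alice's message

print(concurrent(a1, b1))       # neither one knew about the other
print(happened_before(a1, b2))  # b2 has seen a1
```

Note what this buys us: a partial order. Two concurrent events have no "correct" order at all, which is why commutative operations pair so well with it.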

To reiterate, there is no such thing as exactly-once delivery. We must choose the lesser of two evils, which is at-least-once delivery in most cases. This can be used to simulate exactly-once semantics by ensuring idempotency or otherwise eliminating side effects from operations. Once again, it’s important to understand the trade-offs involved when designing distributed systems. Asynchrony abounds, which means you cannot expect synchronous, guaranteed behavior. Design for failure and for resiliency against this asynchronous nature.

Comments

Comments are from this blog's WordPress era and are preserved read-only.

Confused March 25, 2015

“I can send 10 letters and assume you’ll get at least one of them (at-least-once).”

How is it you can assume 1 of 10 gets through? Isn’t this really “at most 10”?

If you can assume 1-of-N will get through, then why not just take N=1 and call it exactly-once?

Tyler Treat March 25, 2015

Yes, you’re correct. It’s not really an example of at-least-once delivery which requires an acknowledgement. Unintentionally misleading :)

Updated the wording to hopefully make it clear (and correct).

Jason Dusek March 26, 2015

The FLP result comes with a caveat — it applies to a “completely asynchronous” protocol.

> In this paper, we show the surprising result that no completely asynchronous consensus protocol can tolerate even a single unannounced process death.

With tightly bounded clock drift (hard to bound in practice), it seems reasonable that we can guarantee once-and-only-once delivery, because we can perform consensus.

Joubin Houshyar March 26, 2015

It seems the conceptual root cause is subscription to the illusion of continuums and a stubborn belief in the fairy dust of (meaningful) instantaneity. We need to accept the reality of a discrete view of the world, which naturally promotes to first-class design concerns the notions of ‘precision’ & (time) ‘granularities’.

The fact of the matter is that all of our computational systems are operating on data from ‘the past’.

(re. clock bounds: Google has done it with Spanner — someone needs to commoditize the necessary h/w.)

Nicolas Correard February 24, 2016

CockroachDB does the equivalent of Spanner without the hardware. Look for the blog post containing “CockroachDB was designed to work without atomic clocks or GPS clocks. It’s an open source database intended to be run on arbitrary collections of nodes: from physical servers in a corp development cluster to public cloud infrastructure using the flavor-of-the-month virtualization layer. It’d be a showstopper to require an external dependency on specialized hardware for clock synchronization.”

lmm March 26, 2015

Doesn’t the existence of three-phase commit contradict this? If I make a change and commit it with 3PC, retrying until it does, how is that not exactly-once delivery?

J W November 5, 2021

FLP literally does not apply to message passing lol. It is important for writers to actually understand the proof of the theorem they’re talking about before they make wild claims like those in this article. In fact, FLP assumes reliable links! You read that right–*it literally assumes the existence of exactly-once delivery between correct nodes*, but proceeds to show that this doesn’t affect its main result. If you are claiming that one of the core system assumptions of FLP (the existence of reliable links) is disproved by FLP, you *might* just not understand distributed systems as well as you think you do.

But what about in practice? Well, in practice it’s really easy to get exactly-once delivery between nodes, as long as you’re in a system strong enough to eventually solve consensus. In most contexts in which people want exactly-once delivery (such as between managed nodes in datacenters) this is a completely reasonable assumption. So this is a dumb post in practice, too.

haha November 6, 2024

So true… I was shocked to see so many blogs claiming exactly-once is impossible, citing the two-general problem and the FLP paper.

Those impossibility results only say that algorithms guaranteeing liveness (i.e., terminate within a bounded number of steps) do not exist; there can still be algorithms that guarantee safety (i.e., produce correct consensus outcomes once terminated). Paxos and 2PC are such algorithms and they work just fine in practice.

Webhiker March 26, 2015

All your objections to Exactly-once also apply to AtLeastOnce.
And ExactlyOnce is possible…all your objections are based on the current design flaw of messaging systems which decouple delivery into a message queue.

If you don’t decouple, the act of the recipient “reading” the message can be easily detected, including its failure. But then all the investment in expensive message queues looks stupid, so no-one will be able to sell their superior knowledge of straw man arguments on why message delivery cannot be guaranteed. :)

MC April 18, 2022

You have absolutely no clue what you’re babbling about.

Michael Chermside March 26, 2015

Although it is impossible to create a system that guarantees “exactly once” delivery, it *is* possible to create a system that guarantees that EITHER (1) it will deliver exactly once, OR (2) it will report an error to a human being. This is also the technique best used for attempts at “at least once” delivery which fail over an extended period of time.

None of this invalidates anything you said about the usefulness of idempotency, but I like to point it out because it emphasizes two things: the impossibility of perfect message delivery (like “exactly once” being impossible) and the need to have someone monitor the error queues of your messaging system.

sumit March 27, 2015

Hey Michael,

It would be really helpful if you could point out some simple and relevant article(s) that supports the guarantee you mentioned.

Michael Ho June 5, 2015

Fancy meeting you here, MCherm!

Stu March 26, 2015

You’re right, but I’m not sure about the melodrama. Accusing folks like IBM or BEA/Oracle of lying for 20 years is a reach, it’s more like you weren’t there when they coined the term.

“Exactly once” has *always* meant “at least once but dupe-detected”. Mainly because we couldn’t convince customers to send idempotent and commutative state changes.

Sebastien Lorber April 5, 2015

With event-sourcing/stream processing for example you would version every event/message so that it’s easy to dedup. If your friend receives 2 messages with id=456 telling him to turn left then it is easy for him to ignore one of them.

Another problem is about message ordering. If you have multiple datacenters and want to keep allowing local writes during a network partition it seems impossible to guarantee global event ordering.
Kafka does only guarantee ordering across a single Kafka partition for example.
See how Eventuate is trying to solve this with causal consistency: https://github.com/RBMHTechnology/eventuate

Mike Spooner June 8, 2015

Well said, nice article. But this has been well-understood since at least 1986… sigh

John B September 9, 2015

yes, it has been well understood by certain people for a long time, but there’s been an entire new generation of software developers since 1986.

While it may be tedious for those of us who have been around for a long time, re-introducing key concepts to young developers is incredibly valuable work.

Abhinav Singh September 6, 2015

Great post as always. Wanted to leave my 2 cents here.

Let’s forget engineering and take a real world example like you did. Assume a distributed system of 2 nodes, me and my wife sitting in next room. If I want to communicate with her, I shout out her name and wait for response. Well, if I don’t hear back from her, we can assume:

– probably she didn’t hear me (partitioned by walls)
– simply ignored my message coz she is busy
– received my message but it wasn’t clear to her what to do with it
– received and she did shout back, but I just couldn’t hear her due to partition caused by walls and due to her soft voice
– maybe I did hear her, but I am not sure
– may be I was too busy when she called out to me
– ….. We can go on here

Now, if I seriously want her attention and mean business, I will have to move past this “exactly-once” melodrama and shout out to her again.

Kunal December 12, 2016

You can achieve the intent of “exactly-once” i.e. no duplicates and no data-loss on failures by making the receiver (client) state aware (i.e. offset, IDs); the client de-dupes.

There is business intent to building technology always. Religiously speaking, it is correct that exactly-once is not possible on the network protocol level in a distributed system; I don’t think anyone will argue that. When people say we need exactly once, they really are speaking from a business or application intent.

Mike Spooner December 12, 2016

Although sequence-numbers/IDs does mean that, like the 787 Dreamliner, you have to poweroff or restart the entire system-universe every so often, at least until we get systems that really can count to infinity (at least “A0”, not necessarily as far as Cantors number).

Homesh Rawat February 17, 2018

Great read!

mark June 18, 2018

I thinks these works like deduplication and… must execute on consumer side

Nicholas February 11, 2019

“The way we achieve exactly-once delivery in practice is by faking it. Either the messages themselves should be idempotent, meaning they can be applied more than once without adverse effects, or we remove the need for idempotency through deduplication.”

If the message only writes exactly once then that’s successful exactly-once semantics. Back when this article was written it was a hard problem a lot of people struggled with, but today Kafka has a system for exactly-once, and I work at a different company that does it in a different way. You can call it “fake” but in that case we have stable, well-functioning “fake” exactly-once semantics, and a lot of customers use that “fake” system successfully to solve real problems.

Mehedi May 22, 2020

How Gmail ensure only one email?

temporal user January 13, 2022

temporal does exactly once delivery

Alexey Stogny February 23, 2023

Nice post! Thanks! Wouldn’t post useless comment, but there’s no other way to subscribe to new posts ;)

Simon Boddy March 2, 2023

Absolutely. Of course. And there’s a simple pattern for dealing with this… use a request/response protocol rather than messages, then make sure all unsafe requests have an application-level id. The receiving process, the server application, should store all responses. If it sees a request for the first time it does the work, then sends and stores the response. If it sees a request for which it has a stored response, it just replays the response. Reliability is an application level responsibility, and uniquely identified requests can be linked to uniqueness in the application context (1 shopping cart can have 1 payment request that the shopping cart app can repeat endlessly until it gets a response)

Rody April 18, 2023

What about modern day chat applications, they use sockets and messages are delivered once and they come under distributed systems right?
isnt exactly once not achieved, or is it with the sender and receiver being the same services giving an edge?

abc123 September 18, 2023

Gyula Csom February 7, 2025

I think the reasoning here is not totally precise. From FLP it follows that “Exactly once” is impossible in network (where partitions can occur). However it does not seem to be impossible at the application level (if we assume the machine reliable where partition cannot occur). How? Through idempotency:

— Draft protocol

Here is a simple protocol using shared message log. Lets say the message broker is Kafka, which BTW also states it can do “Exactly once”:

Step 1. Kafka server->client: Server sends, client Receives new message
Step 2. Kafka client: Logs message as being processed
Step 3. Kafka client->Application: Client sends, Application receives new message
Step 4. Application: Process message
Step 5. Application: Logs message as processed

— Case of network failure:

Step 2. Check log before forwarding and only forward message to application iff it is not yet processed.

— Case of process failure of Kafka client (after the message is already processed):

Restart Kafka client: Consult log and only forward messages which are not marked as processed

Case of process failure of Application:

Restart Application: Consult log and process messages which are not marked as processed

— Case of process failure of Kafka client (while message is being processed by the application):

No need nothing (single threaded Application) or some intra-app coordination (multi threaded)

— Case of process failure of both Kafka client and Application:

No need nothing.

A thing which seems to be interesting here, in case of business applications (DB): what happens with the transaction? Then idempotency should be solved at that side as well, such as recording a version (such as e.g. optimistic lock) both at the local log and the DB.

Then the only problem remains when Application crashes during transaction, after transaction started but before it is committed. To my understanding client-crash-survival DB sessions might be theoretically possible, but I am not aware of any RDBMS implementing it. Someone else? :-)

As always I can be wrong; these are just rough thoughts. Any feedback is more than welcome. :-)

Gyula Csom February 9, 2025

So to clarify… the above protocol is still not “Exactly once” at the network level. That is impossible due to FLP. However a smart protocol may hide this from the Application and present itself as “Exactly once”.

Bottom line: If network partitions (resending lost messages) are an issue, then “Exactly once” can be misleading; one should not forget about the physical level. However, if partitions are not a big issue (network is robust enough, fast, etc.), then one may forget about the physical level (network) and just think that at the Application level (i.e. logically) it is “Exactly once”. Question here: what is the added value in comparison with “Idempotency”, which sometimes can still be an issue at the Application level as well, such as when an impatient user resends a message, which cannot be (easily) handled by infrastructure? What are your experiences in this regard?

Heth Gala January 14, 2026

This article was just a rant. Nothing useful came out of it. I was expecting how to achieve at-least once in notification systems here but its total rant.