March 25, 2015April 4, 2017 by Tyler Treat

You Cannot Have Exactly-Once Delivery

I’m often surprised that people continually have fundamental misconceptions about how distributed systems behave. I myself shared many of these misconceptions, so I try not to demean or dismiss but rather educate and enlighten, hopefully while sounding less preachy than that just did. I continue to learn only by following in the footsteps of others. In retrospect, it shouldn’t be surprising that folks buy into these fallacies as I once did, but it can be frustrating when trying to communicate certain design decisions and constraints.

Within the context of a distributed system, you cannot have exactly-once message delivery. Web browser and server? Distributed. Server and database? Distributed. Server and message queue? Distributed. You cannot have exactly-once delivery semantics in any of these situations.

As I’ve described in the past, distributed systems are all about trade-offs. This is one of them. There are essentially three types of delivery semantics: at-most-once, at-least-once, and exactly-once. Of the three, the first two are feasible and widely used. If you want to be super anal, you might say at-least-once delivery is also impossible because, technically speaking, network partitions are not strictly time-bound. If the connection from you to the server is interrupted indefinitely, you can’t deliver anything. Practically speaking, you have bigger fish to fry at that point—like calling your ISP—so we consider at-least-once delivery, for all intents and purposes, possible. With this model of thinking, network partitions are finitely bounded in time, however arbitrary this may be.

So where does the trade-off come into play, and why is exactly-once delivery impossible? The answer lies in the Two Generals thought experiment or the more generalized Byzantine Generals Problem, which I’ve looked at extensively. We must also consider the FLP result, which basically says, given the possibility of a faulty process, it’s impossible for a system of processes to agree on a decision.

In the letter I mail you, I ask you to call me once you receive it. You never do. Either you really didn’t care for my letter or it got lost in the mail. That’s the cost of doing business. I can send the one letter and hope you get it, or I can send 10 letters and assume you’ll get at least one of them. The trade-off here is quite clear (postage is expensive!), but sending 10 letters doesn’t really provide any additional guarantees. In a distributed system, we try to guarantee the delivery of a message by waiting for an acknowledgement that it was received, but all sorts of things can go wrong. Did the message get dropped? Did the ack get dropped? Did the receiver crash? Are they just slow? Is the network slow? Am I slow? FLP and the Two Generals Problem are not design complexities, they are impossibility results.

People often bend the meaning of “delivery” in order to make their system fit the semantics of exactly-once, or in other cases, the term is overloaded to mean something entirely different. State-machine replication is a good example of this. Atomic broadcast protocols ensure messages are delivered reliably and in order. The truth is, we can’t deliver messages reliably and in order in the face of network partitions and crashes without a high degree of coordination. This coordination, of course, comes at a cost (latency and availability), while still relying on at-least-once semantics. Zab, the atomic broadcast protocol which lays the foundation for ZooKeeper, enforces idempotent operations.

State changes are idempotent and applying the same state change multiple times does not lead to inconsistencies as long as the application order is consistent with the delivery order. Consequently, guaranteeing at-least once semantics is sufficient and simplifies the implementation.

“Simplifies the implementation” is the authors’ attempt at subtlety. State-machine replication is just that, replicating state. If our messages have side effects, all of this goes out the window.

We’re left with a few options, all equally tenuous. When a message is delivered, it’s acknowledged immediately before processing. The sender receives the ack and calls it a day. However, if the receiver crashes before or during its processing, that data is lost forever. Customer transaction? Sorry, looks like you’re not getting your order. This is the worldview of at-most-once delivery. To be honest, implementing at-most-once semantics is more complicated than this depending on the situation. If there are multiple workers processing tasks or the work queues are replicated, the broker must be strongly consistent (or CP in CAP theorem parlance) so as to ensure a task is not delivered to any other workers once it’s been acked. Apache Kafka uses ZooKeeper to handle this coordination.

On the other hand, we can acknowledge messages after they are processed. If the process crashes after handling a message but before acking (or the ack isn’t delivered), the sender will redeliver. Hello, at-least-once delivery. Furthermore, if you want to deliver messages in order to more than one site, you need an atomic broadcast which is a huge burden on throughput. Fast or consistent. Welcome to the world of distributed systems.

Every major message queue in existence which provides any guarantees will market itself as at-least-once delivery. If it claims exactly-once, it’s because they are lying to your face in hopes that you will buy it or they themselves do not understand distributed systems. Either way, it’s not a good indicator.

RabbitMQ attempts to provide guarantees along these lines:

When using confirms, producers recovering from a channel or connection failure should retransmit any messages for which an acknowledgement has not been received from the broker. There is a possibility of message duplication here, because the broker might have sent a confirmation that never reached the producer (due to network failures, etc). Therefore consumer applications will need to perform deduplication or handle incoming messages in an idempotent manner.

The way we achieve exactly-once delivery in practice is by faking it. Either the messages themselves should be idempotent, meaning they can be applied more than once without adverse effects, or we remove the need for idempotency through deduplication. Ideally, our messages don’t require strict ordering and are commutative instead. There are design implications and trade-offs involved with whichever route you take, but this is the reality in which we must live.

Rethinking operations as idempotent actions might be easier said than done, but it mostly requires a change in the way we think about state. This is best described by revisiting the replicated state machine. Rather than distributing operations to apply at various nodes, what if we just distribute the state changes themselves? Rather than mutating state, let’s just report facts at various points in time. This is effectively how Zab works.

Imagine we want to tell a friend to come pick us up. We send him a series of text messages with turn-by-turn directions, but one of the messages is delivered twice! Our friend isn’t too happy when he finds himself in the bad part of town. Instead, let’s just tell him where we are and let him figure it out. If the message gets delivered more than once, it won’t matter. The implications are wider reaching than this, since we’re still concerned with the ordering of messages, which is why solutions like commutative and convergent replicated data types are becoming more popular. That said, we can typically solve this problem through extrinsic means like sequencing, vector clocks, or other partial-ordering mechanisms. It’s usually causal ordering that we’re after anyway. People who say otherwise don’t quite realize that there is no now in a distributed system.

To reiterate, there is no such thing as exactly-once delivery. We must choose between the lesser of two evils, which is at-least-once delivery in most cases. This can be used to simulate exactly-once semantics by ensuring idempotency or otherwise eliminating side effects from operations. Once again, it’s important to understand the trade-offs involved when designing distributed systems. There is asynchrony abound, which means you cannot expect synchronous, guaranteed behavior. Design for failure and resiliency against this asynchronous nature.

Follow @tyler_treat

62 Replies to “You Cannot Have Exactly-Once Delivery”

Confused says:

March 25, 2015 at 6:57 pm

“I can send 10 letters and assume you’ll get at least one of them (at-least-once).”

How is it you can assume 1 of 10 gets through? Isn’t this really “at most 10”?

If you can assume 1-of-N will get through, then why not just take N=1 and call it exactly-once?

Reply
1. Tyler Treat says:
  
  March 25, 2015 at 7:27 pm
  
  Yes, you’re correct. It’s not really an example of at-least-once delivery which requires an acknowledgement. Unintentionally misleading :)
  
  Updated the wording to hopefully make it clear (and correct).
  
  Reply
Jason Dusek says:

March 26, 2015 at 2:32 am

The FLP result comes with a caveat — it applies to a “completely asynchronous” protocol.

> In this paper, we show the surprising result that no completely asynchronous consensus protocol can tolerate even a single unannounced process death.

With tightly bounded clock drift (hard to bound in practice), it seems reasonable that we can guarantee once-and-only-once delivery, because we can perform consensus.

Reply
1. Joubin Houshyar says:
  
  March 26, 2015 at 1:59 pm
  
  It seems the conceptual root cause is subscription to the illusion of continuums and a stubborn belief in the fairy dust of (meaningful) instantaneity. We need to accept the reality of a discreet view of the world– which naturally promotes to first-class design concerns the notions of ‘precision’ & (time) ‘granularities’.
  
  The fact of the matter is that all of our computational systems are operating on data from ‘the past’.
  
  (re. clock bounds: Google has done it with Spanner — someone needs to commoditize the necessary h/w.)
  
  Reply
  1. Nicolas Correard says:
    
    February 24, 2016 at 4:30 pm
    
    CockroachDB do the equivalent of spanner without the hardware. Look for the blog post containing “CockroachDB was designed to work without atomic clocks or GPS clocks. It’s an open source database intended to be run on arbitrary collections of nodes: from physical servers in a corp development cluster to public cloud infrastructure using the flavor-of-the-month virtualization layer. It’d be a showstopper to require an external dependency on specialized hardware for clock synchronization.”
    
    Reply
lmm says:

March 26, 2015 at 3:43 am

Doesn’t the existence of three-phase commit contradict this? If I make a change and commit it with 3PC, retrying until it does, how is that not exactly-once delivery?

Reply
1. J W says:
  
  November 5, 2021 at 11:32 pm
  
  FLP literally does not apply to message passing lol. It important for writers to actually understand the proof of the theorem they’re talking about before they make wild claims like those in this article. In fact, FLP assumes reliable links! You read that right–*it literally assumes the existence of exactly-once delivery between correct nodes*, but proceeds to show that this doesn’t affect its main result. If you are claiming that one of the core system assumptions of FLP (the existence of reliable links) is disproved by FLP, you *might* just not understand distributed systems as well as you think you do.
  
  But what about in practice? Well, in practice it’s really easy to get exactly-once delivery between nodes, as long as you’re in a system strong enough to eventually solve consensus. In most contexts in which people want exactly-once delivery (such as between managed nodes in datacenters) this is a completely reasonable assumption. So this is a dumb post in practice, too.
  
  Reply
Webhiker says:

March 26, 2015 at 4:01 am

All your objections to Exactly-once also apply to AtLeastOnce.
And ExactlyOnce is possible…all your objections are based on the current design flay of messaing systems which decouple delivery into a message queue.

If you don’t decouple, the act of the recipient “reading” the message can be easily detected, including it’s failure. But then all the investment in expensive message queues looks stupid, so no-one will be able sell their superior knowledge of straw man arguments on why message delivery cannot be guaranteed. :)

Reply
1. MC says:
  
  April 18, 2022 at 10:37 pm
  
  You have absolutely no clue what you’re babbling about.
  
  Reply
Michael Chermside says:

March 26, 2015 at 8:58 am

Although it is impossible to create a system that guarantees “exactly once” delivery, it *is* possible to create a system that guarantees that EITHER (1) it will deliver exactly once, OR (2) it will report an error to a human being. This is also the technique best used for attempts at “at least once” delivery which fail over an extended period of time.

None of this invalidates anything you said about the usefulness of idempotency, but I like to point it out because it emphasizes two things: the impossibility of perfect message delivery (like “exactly once” being impossible) and the need to have someone monitor the error queues of your messaging system.

Reply
1. sumit says:
  
  March 27, 2015 at 2:43 am
  
  Hey Michael,
  
  It would be really helpful if you could point out some simple and relevant article(s) that supports the guarantee you mentioned.
  
  Reply
2. Michael Ho says:
  
  June 5, 2015 at 11:05 am
  
  Fancy meeting you here, MCherm!
  
  Reply
Stu says:

March 26, 2015 at 9:16 am

You’re right, but I’m not sure about the melodrama. Accusing folks like IBM or BEA/Oracle of lying for 20 years is a reach, it’s more like you weren’t there when they coined the term.

“Exactly once” has *always* meant “at least once but dupe-detected”. Mainly because we couldn’t convince customers to send idempotent and communitative state changes.

Reply
1. Sebastien Lorber says:
  
  April 5, 2015 at 7:10 pm
  
  +1
  
  With event-sourcing/stream processing for example you would version every event/message so that it’s easy to dedup. If your friend receive 2 messages with id=456 telling him to turn left them it is easy for him to ignore one of them.
  
  Another problem is about message ordering. If you have multiple datacenters and want to keep allow local writes during a network partition it seems impossible to guarantee global event ordering.
  Kafka does only guarantee ordering across a single Kafka partition for example.
  See how Eventuate is trying to solve this with causal consistency: https://github.com/RBMHTechnology/eventuate
  
  Reply
Pingback: Process Focus vs. System Architecture | Thinking Matters
Pingback: Endnu en god artikel om udvikling | Hennings blog
Pingback: Service-Disoriented Architecture | Brave New Geek
Mike Spooner says:

June 8, 2015 at 3:11 am

Well said, nice article. But this has been well-understood since at least 1986… sigh

Reply
1. John B says:
  
  September 9, 2015 at 9:06 am
  
  yes, it has been well understood by certain people for a long time, but there’s been an entire new generation of software developers since 1986.
  
  While it may be tedious for those of us who have been around for a long time, re-introducing key concepts to young developers is incredibly valuable work.
  
  Reply
Abhinav Singh says:

September 6, 2015 at 2:05 am

Great post as always. Wanted to leave my 2 cents here.

Let’s forget engineering and take a real world example like you did. Assume a distributed system of 2 nodes, me and my wife sitting in next room. If I want to communicate with her, I shout out her name and wait for response. Well, if I don’t hear back from her, we can assume:

– probably she didn’t hear me (partitioned by walls)
– simply ignored my message coz she is busy
– received my message but it wasn’t clear to her what to do with it
– received and she did shout back, but I just couldn’t hear her due to partition caused by walls and due to her soft voice
– may be I did likely heard her, but I am not sure
– may be I was too busy when she called out to me
– ….. We can go on here

Now, if I seriously want her attention and mean business, I will have to move past this “exactly-once” melodrama and shout out to her again.

Reply
Pingback: You cannot have at-least-once broadcast - 250bpm
Pingback: » Reportáž z GeeCON Praha 2015 Myšlenky dne otce Fura
Pingback: Use Cases für Apache Kafka: "Viele Data-Probleme sind gar nicht so big" - JAXenter
Pingback: Exactly Once Stream Processing Semantics ? Not Exactly | Mawazo
Pingback: 分布式系统互斥性与幂等性问题的分析与解决 – 大耳狐
Pingback: 分布式系统互斥性与幂等性问题的分析与解决 | 九悦
Pingback: Monzo是如何从0开始构建7*24小时不停歇的银行后台系统的？ - 莹莹之色
Pingback: Monzo是如何从0开始构建7*24小时不停歇的银行后台系统的？-zoues
Pingback: performance tuning kafka – Technology Musings
Kunal says:

December 12, 2016 at 1:22 pm

You can achieve the intent of “exactly-once” i.e. no duplicates and no data-loss on failures by making the receiver (client) state aware (i.e. offset, IDs); the client de-dupes.

There is business intent to building technology always. Religiously speaking, it is correct that exactly-once is not possible on the network protocol level in a distributed system; I don’t think anyone will argue that. When people say we need exactly once, they really are speaking from a business or application intent.

Reply
1. Mike Spooner says:
  
  December 12, 2016 at 6:12 pm
  
  Although sequence-numbers/IDs does mean that, like the 787 Dreamliner, you have to poweroff or restart the entire system-universe every so often, at least until we get systems that really can count to infinity (at least “A0”, not necessarily as far as Cantors number).
  
  Reply
Pingback: Tuning Kafka – Technology Musings
Pingback: Building a modern bank backend – At Monzo – Advance
Pingback: Delivering Billions of Messages Exactly Once · Segment Blog | Artificia Intelligence
Pingback: 如何做到“恰好一次”地传递数十亿条消息 - 莹莹之色
Pingback: 转：分布式系统互斥性与幂等性问题的分析与解决 – 东京钱爷的博客
Pingback: Apache Kafka gets 'exactly-once' message delivery - and that's a big deal - SiliconANGLE
Pingback: Apache Kafka 1.0 Released Exactly Once – IoT up2date
Pingback: Apache Kafka
Homesh Rawat says:

February 17, 2018 at 4:12 am

Great read!

Reply
mark says:

June 18, 2018 at 5:45 am

I thinks these works like deduplication and… must execute on consumer side

Reply
Pingback: Reliable notifications between two apps or microservices
Pingback: 1 – A victim of its own popularity: Scaling our CloudWatch integration | Traffic.Ventures Social
Pingback: 1 – Nancy Lynch – Distributed Systems Pioneer | Traffic.Ventures Social
Pingback: Reliable notifications between two apps or microservices – NICE CODE
Nicholas says:

February 11, 2019 at 12:11 pm

“The way we achieve exactly-once delivery in practice is by faking it. Either the messages themselves should be idempotent, meaning they can be applied more than once without adverse effects, or we remove the need for idempotency through deduplication.”

If the message only writes exactly once then that’s successful exactly-once semantics. Back when this article was written it was a hard problem a lot of people struggled with, but today Kafka has a system for exactly-once, and I work at a different company that does it in a different way. You can call it “fake” but in that case we have stable, well-functioning “fake” exactly-once semantics, and a lot of customers use that “fake” system successfully to solve real problems.

Reply
Pingback: A Game of Snakes & Ladders Called Microservices | APIscene
Pingback: Stream Deduplication with Hazelcast Jet | Hazelcast
Pingback: Stream Deduplication with Hazelcast Jet - Hazelcast Hazelcast the Leading In-Memory Data Grid
Mehedi says:

May 22, 2020 at 7:14 pm

How Gmail ensure only one email?

Reply
Pingback: Cassandra: difference between exactly-once and at-least-once guarantees - IZZIDB
Pingback: Handling access token expiry when internet is unstable - Tutorial Guruji
Pingback: Exactly-Once Delivery，這個需求做不了 - 水源一二三
Pingback: Exactly-Once Delivery，這個需求做不了 - 水源一二三
temporal user says:

January 13, 2022 at 9:05 pm

temporal does exactly once delivery

Reply
Pingback: 高并发-业务-如何构建幂等性接口 - 算法网
Pingback: 分布式系统互斥性与幂等性问题的分析与解决 – 源码巴士
Pingback: Part 3: Processing Payments – Ethereum Payment | Code Capsule
Alexey Stogny says:

February 23, 2023 at 10:15 am

Nice post! Thanks! Wouldn’t post useless comment, but there’s no other way to subscribe to new posts ;)

Reply
Simon Boddy says:

March 2, 2023 at 2:07 am

Absolutely. Of course. And there’s a simple pattern for dealing with this… use a request/response protocol rather than messages, then make sure all unsafe requests have an application-level id. The receiving process, the server application, should store all responses. If it sees a request for the first time it does the work, then sends and stores the response. If it sees a request for which it has a stored response, it just replays the response. Reliability is an application level responsibility, and uniquely identified requests can be linked to uniqueness in the application context (1 shopping cart can have 1 payment request that the shopping cart app can repeat endlessly until it gets a response)

Reply
Rody says:

April 18, 2023 at 9:10 pm

What about modern day chat applications, they use sockets and messages are delivered once and they come under distributed systems right?
isnt exactly once not achieved, or is it with the sender and receiver being the same services giving an edge?

Reply
abc123 says:

September 18, 2023 at 9:48 am

Reply

You Cannot Have Exactly-Once Delivery

Related

62 Replies to “You Cannot Have Exactly-Once Delivery”

Leave a Reply Cancel reply

Share this:

Related

62 Replies to “You Cannot Have Exactly-Once Delivery”

Leave a Reply Cancel reply