Superrationality and DAOs | Ethereum Basis Weblog

14 September 2025

1

Warning: this put up accommodates loopy concepts. Myself describing a loopy concept ought to NOT be construed as implying that (i) I’m sure that the thought is right/viable, (ii) I’ve a fair >50% likelihood estimate that the thought is right/viable, or that (iii) “Ethereum” endorses any of this in any means.

One of many widespread questions that many within the crypto 2.0 house have concerning the idea of decentralized autonomous organizations is an easy one: what are DAOs good for? What elementary benefit would a company have from its administration and operations being tied all the way down to laborious code on a public blockchain, that would not be had by going the extra conventional route? What benefits do blockchain contracts supply over plain outdated shareholder agreements? Notably, even when public-good rationales in favor of clear governance, and guarnateed-not-to-be-evil governance, may be raised, what’s the incentive for a person group to voluntarily weaken itself by opening up its innermost supply code, the place its rivals can see each single motion that it takes and even plans to take whereas themselves working behind closed doorways?

There are various paths that one may take to answering this query. For the precise case of non-profit organizations which are already explicitly dedicating themselves to charitable causes, one can rightfully say that the shortage of particular person incentive; they’re already dedicating themselves to bettering the world for little or no financial achieve to themselves. For personal corporations, one could make the information-theoretic argument {that a} governance algorithm will work higher if, all else being equal, everybody can take part and introduce their very own info and intelligence into the calculation – a slightly cheap speculation given the established outcome from machine studying that a lot bigger efficiency beneficial properties may be made by growing the information dimension than by tweaking the algorithm. On this article, nevertheless, we’ll take a unique and extra particular route.

What’s Superrationality?

In sport idea and economics, it’s a very broadly understood outcome that there exist many courses of conditions through which a set of people have the chance to behave in one in all two methods, both “cooperating” with or “defecting” in opposition to one another, such that everybody can be higher off if everybody cooperated, however no matter what others do every indvidual can be higher off by themselves defecting. Consequently, the story goes, everybody finally ends up defecting, and so folks’s particular person rationality results in the worst potential collective outcome. The commonest instance of that is the celebrated Prisoner’s Dilemma sport.

Since many readers have doubtless already seen the Prisoner’s Dilemma, I’ll spice issues up by giving Eliezer Yudkowsky’s slightly deranged model of the sport:

Let’s suppose that 4 billion human beings – not the entire human species, however a major a part of it – are at the moment progressing by means of a deadly illness that may solely be cured by substance S.

Nevertheless, substance S can solely be produced by working with [a strange AI from another dimension whose only goal is to maximize the quantity of paperclips] – substance S will also be used to supply paperclips. The paperclip maximizer solely cares concerning the variety of paperclips in its personal universe, not in ours, so we won’t supply to supply or threaten to destroy paperclips right here. We’ve by no means interacted with the paperclip maximizer earlier than, and can by no means work together with it once more.

Each humanity and the paperclip maximizer will get a single probability to grab some further a part of substance S for themselves, simply earlier than the dimensional nexus collapses; however the seizure course of destroys a few of substance S.

The payoff matrix is as follows:

	People cooperate	People defect
AI cooperates	2 billion lives saved, 2 paperclips gained	3 billion lives, 0 paperclips
AI defects	0 lives, 3 paperclips	1 billion lives, 1 paperclip

From our perspective, it clearly is sensible from a sensible, and on this case ethical, standpoint that we should always defect; there isn’t a means {that a} paperclip in one other universe may be price a billion lives. From the AI’s perspective, defecting at all times results in one further paperclip, and its code assigns a price to human lifetime of precisely zero; therefore, it’s going to defect. Nevertheless, the result that this results in is clearly worse for each events than if the people and AI each cooperated – however then, if the AI was going to cooperate, we may save much more lives by defecting ourselves, and likewise for the AI if we have been to cooperate.

In the actual world, many two-party prisoner’s dilemmas on the small scale are resolved by means of the mechanism of commerce and the power of a authorized system to implement contracts and legal guidelines; on this case, if there existed a god who has absolute energy over each universes however cared solely about compliance with one’s prior agreements, the people and the AI may signal a contract to cooperate and ask the god to concurrently stop each from defecting. When there isn’t a skill to pre-contract, legal guidelines penalize unilateral defection. Nevertheless, there are nonetheless many conditions, significantly when many events are concerned, the place alternatives for defection exist:

Alice is promoting lemons in a market, however she is aware of that her present batch is low high quality and as soon as clients attempt to use them they are going to instantly need to throw them out. Ought to she promote them anyway? (Observe that that is the type of market the place there are such a lot of sellers you possibly can’t actually preserve monitor of status). Anticipated achieve to Alice: $5 income per lemon minus $1 delivery/retailer prices = $4. Anticipated value to society: $5 income minus $1 prices minus $5 wasted cash from buyer = -$1. Alice sells the lemons.
Ought to Bob donate $1000 to Bitcoin improvement? Anticipated achieve to society: $10 * 100000 folks – $1000 = $999000, anticipated achieve to Bob: $10 – $1000 = -$990, so Bob doesn’t donate.
Charlie discovered another person’s pockets, containing $500. Ought to he return it? Anticipated achieve to society: $500 (to recipient) – $500 (Charlie’s loss) + $50 (intangible achieve to society from everybody having the ability to fear rather less concerning the security of their wallets). Anticipated achieve to Charlie: -$500, so he retains the pockets.
Ought to David minimize prices in his manufacturing facility by dumping poisonous waste right into a river? Anticipated achieve to society: $1000 financial savings minus $10 common elevated medical prices * 100000 folks = -$999000, anticipated achieve to David: $1000 – $10 = $990, so David pollutes.
Eve developed a treatment for a sort of most cancers which prices $500 per unit to supply. She will be able to promote it for $1000, permitting 50,000 most cancers sufferers to afford it, or for $10000, permitting 25,000 most cancers sufferers to afford it. Ought to she promote on the increased worth? Anticipated achieve to society: -25,000 lives (together with Alice’s revenue, which cancels’ out the wealthier patrons’ losses). Anticipated achieve to Eve: $237.5 million revenue as a substitute of $25 million = $212.5 million, so Eve expenses the upper worth.

In fact, in lots of of those instances, folks typically act morally and cooperate, despite the fact that it reduces their private scenario. However why do they do that? We have been produced by evolution, which is mostly a slightly egocentric optimizer. There are various explanations. One, and the one we’ll concentrate on, includes the idea of superrationality.

Superrationality

Contemplate the next rationalization of advantage, courtesy of David Friedman:

I begin with two observations about human beings. The primary is that there’s a substantial connection between what goes on inside and outdoors of their heads. Facial expressions, physique positions, and a wide range of different indicators give us at the very least some concept of our pals’ ideas and feelings. The second is that we’ve got restricted mental ability–we can’t, within the time accessible to decide, take into account all choices. We’re, within the jargon of computer systems, machines of restricted computing energy working in actual time.

Suppose I want folks to consider that I’ve sure characteristics–that I’m sincere, form, useful to my pals. If I actually do have these traits, projecting them is easy–I merely do and say what appears pure, with out paying a lot consideration to how I seem to exterior observers. They may observe my phrases, my actions, my facial expressions, and draw fairly correct conclusions.

Suppose, nevertheless, that I shouldn’t have these traits. I’m not (for instance) sincere. I often act truthfully as a result of performing truthfully is often in my curiosity, however I’m at all times prepared to make an exception if I can achieve by doing so. I have to now, in lots of precise choices, do a double calculation. First, I have to determine how you can act–whether, for instance, it is a good alternative to steal and never be caught. Second, I have to determine how I might be considering and performing, what expressions can be going throughout my face, whether or not I might be feeling blissful or unhappy, if I actually have been the particular person I’m pretending to be.

In the event you require a pc to do twice as many calculations, it slows down. So does a human. Most of us are usually not superb liars.
If this argument is right, it implies that I could also be higher off in narrowly materials terms–have, as an illustration, a better income–if I’m actually sincere (and sort and …) than if I’m solely pretending to be, just because actual virtues are extra convincing than fake ones. It follows that, if I have been a narrowly egocentric particular person, I would, for purely egocentric causes, wish to make myself a greater person–more virtuous in these ways in which others worth.

The ultimate stage within the argument is to look at that we may be made better–by ourselves, by our mother and father, maybe even by our genes. Folks can and do attempt to prepare themselves into good habits–including the habits of robotically telling the reality, not stealing, and being form to their pals. With sufficient coaching, such habits develop into tastes–doing “unhealthy” issues makes one uncomfortable, even when no one is watching, so one doesn’t do them. After some time, one doesn’t even need to determine to not do them. You would possibly describe the method as synthesizing a conscience.

Basically, it’s cognitively laborious to convincingly pretend being virtuous whereas being grasping at any time when you may get away with it, and so it makes extra sense so that you can truly be virtuous. A lot historical philosophy follows related reasoning, seeing advantage as a cultivated behavior; David Friedman merely did us the customary service of an economist and transformed the instinct into extra simply analyzable formalisms. Now, allow us to compress this formalism even additional. Briefly, the important thing level right here is that people are leaky brokers – with each second of our motion, we basically not directly expose components of our supply code. If we are literally planning to be good, we act a method, and if we’re solely pretending to be good whereas truly desiring to strike as quickly as our pals are weak, we act in a different way, and others can usually discover.

This would possibly appear to be a drawback; nevertheless, it permits a type of cooperation that was not potential with the straightforward game-theoretic brokers described above. Suppose that two brokers, A and B, every have the power to “learn” whether or not or not the opposite is “virtuous” to some extent of accuracy, and are taking part in a symmetric Prisoner’s Dilemma. On this case, the brokers can undertake the next technique, which we assume to be a virtuous technique:

Attempt to decide if the opposite occasion is virtuous.
If the opposite occasion is virtuous, cooperate.
If the opposite occasion shouldn’t be virtuous, defect.

If two virtuous brokers come into contact with one another, each will cooperate, and get a bigger reward. If a virtuous agent comes into contact with a non-virtuous agent, the virtuous agent will defect. Therefore, in all instances, the virtuous agent does at the very least in addition to the non-virtuous agent, and sometimes higher. That is the essence of superrationality.

As contrived as this technique appears, human cultures have some deeply ingrained mechanisms for implementing it, significantly regarding mistrusting brokers who strive laborious to make themselves much less readable – see the widespread adage that you must by no means belief somebody who does not drink. In fact, there’s a class of people who can convincingly fake to be pleasant whereas truly planning to defect at each second – these are known as sociopaths, and they’re maybe the first defect of this technique when carried out by people.

Centralized Guide Organizations…

This type of superrational cooperation has been arguably an essential bedrock of human cooperation for the final ten thousand years, permitting folks to be sincere to one another even in these instances the place easy market incentives would possibly as a substitute drive defection. Nevertheless, maybe one of many principal unlucky byproducts of the fashionable beginning of enormous centralized organizations is that they permit folks to successfully cheat others’ skill to learn their minds, making this type of cooperation tougher.

Most individuals in trendy civilization have benefited fairly handsomely, and have additionally not directly financed, at the very least some occasion of somebody in some third world nation dumping poisonous waste right into a river to construct merchandise extra cheaply for them; nevertheless, we don’t even understand that we’re not directly taking part in such defection; companies do the soiled work for us. The market is so highly effective that it will possibly arbitrage even our personal morality, inserting essentially the most soiled and unsavory duties within the arms of these people who’re prepared to soak up their conscience at lowest value and successfully hiding it from everybody else. The companies themselves are completely in a position to have a smiley face produced as their public picture by their advertising departments, leaving it to a very completely different division to sweet-talk potential clients. This second division could not even know that the division producing the product is any much less virtuous and candy than they’re.

The web has usually been hailed as an answer to many of those organizational and political issues, and certainly it does do an amazing job of lowering info asymmetries and providing transparency. Nevertheless, so far as the reducing viability of superrational cooperation goes, it will possibly additionally typically make issues even worse. On-line, we’re a lot much less “leaky” whilst people, and so as soon as once more it’s simpler to seem virtuous whereas truly desiring to cheat. That is a part of the rationale why scams on-line and within the cryptocurrency house are extra widespread than offline, and is maybe one of many main arguments in opposition to shifting all financial interplay to the web a la cryptoanarchism (the opposite argument being that cryptoanarchism removes the power to inflict unboundedly massive punishments, weakening the energy of a big class of financial mechanisms).

A a lot higher diploma of transparency, arguably, provides an answer. People are reasonably leaky, present centralized organizations are much less leaky, however organizations the place randomly info is consistently being launched to the world left, proper and heart are much more leaky than people are. Think about a world the place in case you begin even eager about how you’ll cheat your pal, enterprise accomplice or partner, there’s a 1% probability that the left a part of your hippocampus will insurgent and ship a full recording of your ideas to your supposed sufferer in alternate for a $7500 reward. That’s what it “feels” prefer to be the administration board of a leaky group.

That is basically a restatement of the founding ideology behind Wikileaks, and extra lately an incentivized Wikileaks different, slur.io got here out to push the envelope additional. Nevertheless, Wikileaks exists, and but shadowy centralized organizations additionally proceed to nonetheless exist and are in lots of instances nonetheless fairly shadowy. Maybe incentivization, coupled with prediction-like-mechanisms for folks to revenue from outing their employers’ misdeeds, is what’s going to open the floodgates for higher transparency, however on the identical time we will additionally take a unique route: supply a means for organizations to make themselves voluntarily, and radically, leaky and superrational to an extent by no means seen earlier than.

… and DAOs

Decentralized autonomous organizations, as an idea, are distinctive in that their governance algorithms are usually not simply leaky, however truly fully public. That’s, whereas with even clear centralized organizations outsiders can get a tough concept of what the group’s temperament is, with a DAO outsiders can truly see the group’s complete supply code. Now, they don’t see the “supply code” of the people which are behind the DAO, however there are methods to write down a DAO’s supply code in order that it’s closely biased towards a specific goal no matter who its contributors are. A futarchy maximizing the common human lifespan will act very in a different way from a futarchy maximizing the manufacturing of paperclips, even when the very same persons are operating it. Therefore, not solely is it the case that the group will make it apparent to everybody in the event that they begin to cheat, however slightly it is not even potential for the group’s “thoughts” to cheat.

Now, what would superrational cooperation utilizing DAOs appear like? First, we would want to see some DAOs truly seem. There are a number of use-cases the place it appears not too far-fetched to count on them to succeed: playing, stablecoins, decentralized file storage, one-ID-per-person information provision, SchellingCoin, and many others. Nevertheless, we will name these DAOs kind I DAOs: they’ve some inside state, however little autonomous governance. They can’t ever do something however maybe modify a number of of their very own parameters to maximise some utility metric through PID controllers, simulated annealing or different easy optimization algorithms. Therefore, they’re in a weak sense superrational, however they’re additionally slightly restricted and silly, and they also will usually depend on being upgraded by an exterior course of which isn’t superrational in any respect.

So as to go additional, we want kind II DAOs: DAOs with a governance algorithm able to making theoretically arbitrary choices. Futarchy, varied types of democracy, and varied types of subjective extra-protocol governance (ie. in case of considerable disagreement, DAO clones itself into a number of components with one half for every proposed coverage, and everybody chooses which model to work together with) are the one ones we’re at the moment conscious of, although different elementary approaches and intelligent combos of those will doubtless proceed to seem. As soon as DAOs could make arbitrary choices, then they are going to be capable of not solely interact in superrational commerce with their human clients, but in addition doubtlessly with one another.

What sorts of market failures can superrational cooperation remedy that plain outdated common cooperation can’t? Public items issues could sadly be exterior the scope; not one of the mechanisms described right here remedy the massively-multiparty incentivization drawback. On this mannequin, the rationale why organizations make themselves decentralized/leaky is in order that others will belief them extra, and so organizations that fail to do that might be excluded from the financial advantages of this “circle of belief”. With public items, the entire drawback is that there isn’t a method to exclude anybody from benefiting, so the technique fails. Nevertheless, something associated to info asymmetries falls squarely throughout the scope, and this scope is massive certainly; as society turns into increasingly advanced, dishonest will in some ways develop into progressively simpler and simpler to do and more durable to police and even perceive; the fashionable monetary system is only one instance. Maybe the true promise of DAOs, if there’s any promise in any respect, is exactly to assist with this.

Superrationality and DAOs | Ethereum Basis Weblog

What’s Superrationality?

Superrationality

Centralized Guide Organizations…

… and DAOs

The P + epsilon Assault

The Subjectivity / Exploitability Tradeoff

Gav’s Ethereum ÐΞV Replace V

Most Popular

5 Important Suggestions for Profitable Crowdfunding for Startups

Ethereum Each day Chart Turns Inexperienced As ETHBTC Prepares For Raise-Off

The P + epsilon Assault

Methods to Begin a YouTube Channel in 2024

Wall Road Veteran Suggestions TradFi To Bolster Bitcoin Allocations

The Subjectivity / Exploitability Tradeoff

Ansarada Expands in Center East as UAE Deal Exercise Rises in Q2

A Clear Roadmap To $0.35

What to Count on Throughout New Rent Orientation?

Unclaimed Belongings Value ₹1 Lakh Crore Trapped in Authorities Paperwork

Recent Comments

ABOUT US

POPULAR POSTS

5 Important Suggestions for Profitable Crowdfunding for Startups

Ethereum Each day Chart Turns Inexperienced As ETHBTC Prepares For Raise-Off

The P + epsilon Assault

POPULAR CATEGORY

Superrationality and DAOs | Ethereum Basis Weblog