Is Artificial Consciousness Possible? A Summary of Selected Books
Ali Ladak
Researcher
June 13, 2022

Many thanks to Tobias Baumann, Max Carpendale, and Michael St. Jules for their helpful comments and discussion. Thanks to Jacy Reese Anthis and Janet Pauketat for more extensive discussion and editing.

Table of Contents

Introduction

John Searle, The Rediscovery of the Mind, 1992

Daniel Dennett, Consciousness Explained, 1993

David Chalmers, The Conscious Mind: In Search of a Fundamental Theory, 1995

Thomas Metzinger, The Ego Tunnel: The Science of the Mind and the Myth of the Self, 2010

Stanislas Dehaene, Consciousness and the Brain: Deciphering How the Brain Codes Our Thoughts, 2014

Michael Tye, Tense Bees and Shell-Shocked Crabs: Are Animals Conscious?, 2017

Susan Blackmore, Consciousness: An Introduction, 2018

Susan Schneider, Artificial You: AI and the Future of Your Mind, 2019

Michael Graziano, Rethinking Consciousness: A Scientific Theory of Subjective Experience, 2019

Philip Goff, Galileo’s Error: Foundations for a New Science of Consciousness, 2019

Simona Ginsburg and Eva Jablonka, The Evolution of the Sensitive Soul: Learning and the Origins of Consciousness, 2019

Peter Godfrey-Smith, Metazoa: Animal Life and the Birth of the Mind, 2020

Christof Koch, The Feeling of Life Itself: Why Consciousness Is Widespread but Can't Be Computed, 2020

Anil Seth, Being You: A New Science of Consciousness, 2021

David Chalmers, Reality+: Virtual Worlds and the Problems of Philosophy, 2022

Introduction

Many philosophers and scientists have written about whether artificial sentience or consciousness is possible.[1] In this blog post we summarize discussions of the topic from 15 books.[2] The books were chosen based on their popularity and representation of a range of perspectives on artificial consciousness. They were not randomly sampled from all of the books written on the topic. For brevity, we simply summarize the claims made by the authors, rather than critique or respond to them.

While the books contain a wide variety of terminology, we can categorize the ways they assess the possibility of artificial consciousness into three broad approaches:[3][4]

  1. The computational approach abstracts away from the specific implementation details of a cognitive system, such as whether it is implemented in carbon versus silicon substrate. Instead, it focuses on a higher level of analysis: the computations, algorithms, or programs that a cognitive system runs to generate its behavior. Another way of putting this is that it focuses on the software a system is running, rather than on the system’s hardware. The computational approach is standard in the field of cognitive science (e.g., Cain, 2015) and suggests that if artificial entities implement certain computations, they will be conscious. The specific algorithms or computations that are thought to give rise to or be constitutive of consciousness differ. For example, Metzinger (2010) emphasizes the importance of an internal self-model, whereas Dehaene (2014) emphasizes the importance of a “global workspace,” in which information becomes available for use by multiple subsystems. Out of the three approaches, the computational approach typically projects the largest number of conscious artificial entities existing in the future because computational criteria are arguably easiest for an AI system to achieve.
  2. The physical approach focuses on the physical details of how a cognitive system is implemented; that is, it focuses on a system’s hardware rather than its software.[5] For example, Koch (2020) defends Integrated Information Theory (IIT), in which the degree of consciousness in a system depends on its degree of integrated information, that is, the degree to which the system is causally interconnected such that it is not reducible to its individual components. This integrated information needs to be present at the physical, hardware level of a system.[6] According to Koch, the hardware of current digital computers has very little integrated information, so they could not be conscious no matter what cognitive system they implement at the software level (e.g., a whole brain emulation). However, only the physical organization matters, not the specific substrate the system is implemented in. Thus, although artificial consciousness is possible on the physical approach, it typically predicts fewer conscious artificial entities than the computational approach.[7]
  3. The biological approach also focuses on the physical details of how a cognitive system is implemented, but it additionally emphasizes some specific aspect of biology as important for consciousness. For example, Godfrey-Smith (2020) suggests that it would be very difficult to have a conscious system that isn’t physically very similar to the brain because of some of the dynamic patterns involved in consciousness in brains. However, when pressed, even these views tend to allow for the possibility of artificial consciousness. Godfrey-Smith says that future robots with “genuinely brain-like control systems” could be conscious, and John Searle, perhaps the most well-known proponent of a biological approach, has said, “The fact that brain processes cause consciousness does not imply that only brains can be conscious. The brain is a biological machine, and we might build an artificial machine that was conscious; just as the heart is a machine, and we have built artificial hearts. Because we do not know exactly how the brain does it we are not yet in a position to know how to do it artificially.” Still, the biological approach is skeptical of the possibility of artificial consciousness and the number of future conscious artificial entities is predicted to be smaller than on both the computational and physical approaches; a physical system would need to closely resemble biological brains to be conscious.

Overall, there is a broad consensus among the books that artificial consciousness is possible. According to the computational approach, which is the mainstream view in cognitive science, artificial consciousness is not only possible, but is likely to come about in the future, potentially in very large numbers. The physical and biological approaches predict that artificial consciousness will be far less widespread. Artificial sentience as an effective altruism cause area is, therefore, more likely to be promising if one favors the computational approach over the physical and biological approaches.

Which approach should we favor? Several of the books provide arguments. For example, Chalmers (1995) uses a Silicon Chip Replacement thought experiment to argue that a functionally identical silicon copy of a human brain would have the same conscious experience as a biological human brain, and from there goes on to defend a general computational account. Searle (1992) uses the Chinese Room thought experiment to argue that computational accounts necessarily leave out some aspects of our mental lives, such as understanding. Schneider (2019) argues that we don’t yet have enough information to decide between different approaches and advocates for a “wait and see” approach. Which approach one subscribes to will depend on how convincing one finds these and other arguments.

Many of the perspectives summarized in this post consider the ethical implications of creating artificial consciousness. In a popular textbook on consciousness, Blackmore (2018) argues that if we create artificially sentient beings, they will be capable of suffering, and we will therefore have moral responsibilities towards them. Practical suggestions from the books for how to deal with the ethical issues range from an outright ban on developing artificial consciousness until we have more information (Metzinger, 2010), to the view that we should deliberately try to implement consciousness in AI as a way of reducing the likelihood that future powerful AI systems will cause us harm (Graziano, 2019). Figuring out which of these and other strategies will be most beneficial is an important topic for future research.

John Searle, The Rediscovery of the Mind, 1992

Searle argues against the computational approach to the mind. His most well-known objection is the Chinese Room argument. The argument asks us to imagine a non-Chinese speaker locked in a room with a large batch of (unbeknown to them) Chinese writing and a set of instructions written in a language they understand. The instructions tell the person how to match up and respond to inputs arriving through a slot in the door to the room, which are questions in Chinese. As the person responds with the appropriate outputs based on the instructions, and becomes increasingly good at this, it appears from the outside like they understand Chinese. However, Searle claims that the person clearly does not truly understand Chinese; from their perspective they are merely manipulating meaningless symbols based on syntactic rules. Since computer programs work in essentially the same way — operating at the level of syntax — they cannot have true understanding either. Searle concludes that the computational approach leaves out key aspects of the mind, such as understanding.

Searle’s second claim is that, on standard definitions of computation, we do not discover physical systems to be carrying out computations; rather, we assign this interpretation to them. That is, he argues that computation is not “intrinsic to physics” but observer-relative. If an observer is required to assign a computational interpretation to a system, Searle argues, we cannot discover that the brain intrinsically carries out computations. The field of cognitive science in its current form, and computational approaches in general, therefore cannot explain how brains or minds work intrinsically.

While Searle does not outright deny the possibility of artificial consciousness, he concludes that a theory of consciousness should focus on the neurobiological rather than the computational level. His view therefore falls under the biological approach in our categorization of approaches to artificial consciousness.

Daniel Dennett, Consciousness Explained, 1993

Dennett puts forward the “Multiple Drafts” model of consciousness, which proposes an alternative to the arguably commonly held idea that the various aspects of conscious experience come together and are projected in a “Cartesian Theater” in the brain. Instead, Dennett proposes that information from the various sensory inputs arrives in the brain and is processed by different subsystems at different times. Once processed, the information becomes immediately available for use without the need for further centralized processing, and out of these “drafts” a final draft that we call consciousness is selected depending on what information is recruited for use (e.g., information to respond to a question). Dennett considers that if his theory is correct, a computer that implements the right program would be conscious. Hence, his view falls under the computational approach in our categorization of approaches to artificial consciousness.

Dennett addresses critics who say that they “can’t imagine a conscious robot!” First, he suggests we do imagine it when we think of fictional robots, such as HAL from 2001: A Space Odyssey. What critics really mean, he suggests, is that they can’t imagine how a silicon brain could give rise to consciousness. He counters that the same is true of biological brains: we can’t imagine how they give rise to consciousness either, and yet they do. Dennett also considers Searle’s Chinese Room argument against strong AI, suggesting that the thought experiment asks us to imagine an extremely simplified version of the kind of computations that brains do. When we imagine the system with its full complexity, it’s no longer obvious that it does not understand Chinese. According to Dennett, complexity matters; otherwise, the simple argument that a hand calculator has no understanding would suffice to show that no computer, however advanced, can have understanding.

David Chalmers, The Conscious Mind: In Search of a Fundamental Theory, 1995

Chalmers argues in favor of the computational approach to the mind and the possibility of strong AI. His argument has two steps. First, he argues in favor of the “principle of organizational invariance,” that two systems with the same fine-grained functional organization will have identical conscious experiences.[8] He argues for this conclusion using two variations of the Silicon Chip Replacement thought experiment: Fading Qualia, which suggests that an entity whose brain is replaced with functionally equivalent silicon chips will have some kind of experience, and Dancing Qualia, which argues that the silicon-alternative brain will also have the same experience as the original biological brain.[9][10] After arguing that maintaining a conscious system’s functional organization is sufficient for maintaining its conscious experience, Chalmers provides a technical account of what it means for a physical system to implement a computation, which he argues avoids Searle’s objection that computation is observer-relative.[11] Chalmers suggests that a system’s functional organization is maintained when it is implemented computationally, for example, in a digital computer. If the system in question is a human brain, its computational implementation will therefore have the same mental states as a human brain, including consciousness.

He considers several objections, including Searle’s Chinese Room argument, to which he responds with variants on the Fading Qualia and Dancing Qualia arguments in which a real Chinese speaker has their neurons gradually replaced until the system eventually becomes the equivalent of the Chinese Room. As with the initial arguments, Chalmers considers the only plausible outcome to be the one where the entity’s conscious experience stays the same. He also addresses the question of whether the computed mind would be just a simulation rather than a replication of a mind with real mental states. He argues that for organizational invariants, such as minds, which are defined by their functional organization, simulations are the same as replications. This is why simulated minds are real minds, but simulated hurricanes are not real hurricanes.

Chalmers also responds to the criticism that his arguments only establish a weak form of strong AI, one that is closely tied to biology, because his arguments rely on replicating the neuron-level functional organization of the brain. Chalmers considers that his arguments remove the in-principle argument against computational approaches, and as a result, “the floodgates are then opened to a whole range of programs that might be candidates to support conscious experience.”

Thomas Metzinger, The Ego Tunnel: The Science of the Mind and the Myth of the Self, 2010

According to Metzinger’s theory, our experience of consciousness is due to our brain’s construction of a model of external reality and a self-model designed to help us interact with the world in a holistic way. These models are “transparent” in the sense that we cannot see them as models, and so we take them to be real. According to Metzinger, an AI that has the right kinds of models of external reality and of itself would be conscious. Metzinger’s approach to the question of artificial consciousness is, therefore, computational in our categorization. However, he notes that engineering such an entity is a difficult technical challenge. He considers the self-model to be crucial — without it, there may be a constructed world but there would not be anyone to experience it.[12] He notes that the implementation of consciousness in artificial entities turns them into entities that can suffer, and they therefore become objects of moral concern.

Metzinger thinks that the first sentient AIs that are built will likely have all kinds of deficits due to design errors as engineers refine their processes, and they will likely suffer greatly as a result. For example, early systems will likely have perceptual deficits, making it difficult to perceive themselves or the world around them. In some cases, we may not be able to recognize or understand their suffering; such systems may have the capacity to suffer in ways or degrees completely unimaginable to us. Metzinger argues that because of the probability of their suffering, we should avoid trying to create artificial consciousness, and he suggests that our attention would be better directed at understanding and neutralizing our own suffering.

He also asks whether, if we could, we should increase the overall amount of positive experience in the universe by colonizing it with artificial “bliss machines.” He argues that we should not, on the basis that there is more to an existence worth having than positive subjective experiences. He also considers whether we should have a broader pessimism about conscious experience — that the type of consciousness humans have is net negative in value, and that the evolution of consciousness has led to the expansion of suffering in the universe where before there was none. Metzinger considers that the fact that we do not have clear answers to considerations such as these gives us additional reasons to avoid trying to create artificial consciousness right now.

Stanislas Dehaene, Consciousness and the Brain: Deciphering How the Brain Codes Our Thoughts, 2014

Dehaene outlines the Global Neuronal Workspace theory of consciousness, which states that we become conscious of information when it enters a “global workspace” in the brain, where it is made available for use by various cognitive subsystems such as perception, action, and memory.[13] Dehaene sees no logical problem with the possibility of artificial consciousness, favoring a computational approach to the mind. He suggests that we are nowhere near having the capacity to build conscious machines today, but that this is an exciting avenue of scientific research for the coming decades. He thinks at least three key functions are still lacking from current computers: flexible communication between subsystems, the ability to learn and adapt to their environments, and greater autonomy to decide what actions to take to achieve their goals.

He considers objections made by philosophers such as Ned Block and David Chalmers that he only explains access consciousness and not phenomenal consciousness.[14] He argues that as science continues to make progress on understanding access consciousness, the more intractable problem of phenomenal consciousness will dissolve, similar to how the notion of a “life force” dissolved as biologists made progress in understanding the mechanics of life. He also considers the argument that his account leaves out free will. He argues that the variety of free will worth wanting is simply having the freedom and capacity to make decisions based on your higher-order thoughts, beliefs, values, and so on, and that this capacity can be implemented in a computer. He concludes that neither phenomenal consciousness nor free will poses an obstacle to the creation of artificial consciousness.

Michael Tye, Tense Bees and Shell-Shocked Crabs: Are Animals Conscious?, 2017

Tye uses “Newton’s Rule” — if two outcomes are the same, we are entitled to infer the same cause unless there is evidence that defeats the inference — to reason about the likelihood of consciousness in nonhuman animals. In one chapter he applies this rule to two artificial entities previously discussed in the philosophy literature: Commander Data from Star Trek, enhanced so that he is a silicon-brained functional isomorph of a human, and “Robot Rabbit,” a silicon-brained functional isomorph of a rabbit. He argues that in both cases the functional similarity is a reason to think they are conscious, and the physical difference weakens this inference but does not defeat it. He cites Chalmers’ Silicon Chip Replacement arguments in favor of the view that functional isomorphs of conscious brains are also conscious. He therefore considers it rational to prefer the view that both Commander Data and Robot Rabbit are conscious.

Tye further considers a thought experiment where we learn that humans are actually designed by an alien species with four different “brain types” (analogous to blood types), functionally identical but made of different substrates. On learning this, he asks whether it would be rational to consider that someone whose brain is made of a different substrate to yours is not conscious. He claims it would not be, and therefore that physical difference does not override functional similarity. However, Tye only considers cases of functional isomorphs and does not specify how he would judge entities with functional differences.

Susan Blackmore, Consciousness: An Introduction, 2018

Blackmore states that a machine trivially has the ability to be conscious because the brain is a machine and it is conscious. So, she refines the question: Can an artificial machine be conscious, and can we make one? She suggests the question matters because if artificial machines are conscious, they could suffer, and so we would have moral responsibilities towards them.

She considers several arguments from philosophers and scientists that artificial consciousness is impossible: consciousness is nonphysical and we can’t give something nonphysical to a machine; consciousness necessarily relies on biology; there are some things machines can’t do, such as original thinking. She considers Searle’s Chinese Room argument and notes various objections, such as the Systems Reply, on which the whole system rather than the individual in the room would have a true understanding of Chinese, and the Robot Reply, on which the system would truly understand Chinese if it were attached to a body that could ground the symbols in objects in the real world.

Blackmore discusses the possibility that artificial entities are already conscious — even a thermostat could be said to have beliefs (it’s too hot or cold in this room); similarly, artificial intelligences could be said to already have beliefs and other mental states. She then considers several approaches to building conscious machines: looking for criteria associated with consciousness and building them into artificial entities, building AI based on existing theories of consciousness, and building the illusion of consciousness into AIs.

Susan Schneider, Artificial You: AI and the Future of Your Mind, 2019

Schneider considers two views of consciousness: “biological naturalism” and the “techno-optimist view.” She defines biological naturalism as the view that consciousness depends on some specific feature that biological systems have and that non-biological systems lack. Schneider is skeptical of biological naturalism. She notes that no such special feature has been discovered, and even if such a feature were discovered in biological systems, there could be some other feature that gives rise to consciousness in non-biological systems. She considers John Searle’s Chinese Room argument, responding with a version of the Systems Reply and concluding that it does not provide an argument in favor of biological naturalism.

Schneider defines techno-optimism as the view that when humans develop highly sophisticated, general-purpose AI, the AI will be conscious. Schneider notes that while this view derives from thought experiments such as those described by Chalmers (1995) and hence allows for the possibility of artificial consciousness, Chalmers’ arguments only apply to systems that are functionally identical to human brains. AI systems aren’t and generally won’t be functionally identical to brains, so the techno-optimist view is too optimistic. Schneider is skeptical of the view that the mind is software, arguing that minds cannot be software because software is abstract and so cannot have any effects in the real world. She considers the view that the mind results when the right kind of software is physically implemented in hardware to be an improvement (e.g., Chalmers, 1995) but notes that this approach doesn’t help resolve deeper problems in philosophy about the nature of the mind (i.e., the mind-body problem).

Schneider advocates for a middle approach that she terms the “Wait and See Approach.” She notes several possible outcomes regarding artificial consciousness: consciousness may be present in some systems but not others; it may need to be deliberately engineered into systems; as AI systems become more advanced, there may be less need for consciousness in them; and the development of consciousness in artificial systems may be slowed by public relations considerations within organizations. Schneider suggests ways to test for artificial consciousness: asking AIs a variety of questions about their internal experiential states, and actually carrying out a form of the Silicon Chip Replacement thought experiment while asking whether any aspect of their experience changes.

Michael Graziano, Rethinking Consciousness: A Scientific Theory of Subjective Experience, 2019

Graziano outlines the Attention Schema Theory (AST) of consciousness, according to which our brains construct a simplified internal model of our attention (an attention schema), and we claim to be conscious as a result of the information provided when the attention schema is accessed by cognitive and linguistic systems. Graziano suggests that if you build a machine according to this theory, putting in the correct internal models and giving it cognitive and linguistic access to those models, the machine will believe and claim it has consciousness, and will do so with a high degree of certainty. On Graziano’s account, this is what consciousness is, including in humans. Therefore, this is a computational approach. He claims that this would need to be a relatively sophisticated model rather than a simple internal monitor of attention, which can already be programmed into computers.

Graziano thinks that we could witness artificial consciousness in the next decade, though this would be limited, for example, to artificial entities having only visual experiences (e.g., experiencing colors). He thinks machines with human-like consciousness are realistically 50 years away. He thinks the biggest question is not how we will treat artificial entities but how they will treat us; extremely powerful AI systems that can recognize consciousness in us are less likely to harm us. Graziano thinks that implementing consciousness in machines will lead to a better future with AI, and that AST sets out a path for building artificial consciousness.

On Graziano’s theory, mind uploading is also possible. He thinks that given the complexity of human brains, this technology is perhaps 100 years away or more, but he thinks it will definitely come at some point. He identifies several ethical considerations associated with mind uploading, including the possibility that simulated minds may be subjected to immense harm as the technology is developed and refined (also a consideration in Metzinger, 2010), the potential high turnover of artificial minds due to technological progress rendering earlier versions obsolete, and the possibility of the technology being put to harmful political uses. He thinks that mind uploading will be the technology that enables space travel, since uploads don’t face various limitations biological humans face, and he thinks that this will result in the exploration and dispersion of life across the galaxy, potentially for millions of years.

Philip Goff, Galileo’s Error: Foundations for a New Science of Consciousness, 2019

Goff outlines his panpsychist view of consciousness on which all physical entities, down to the most fundamental building blocks of the universe, have some basic form of consciousness. He argues that panpsychism resolves problems associated with both dualism, such as the problem of how the (purportedly) non-physical mind interacts with the physical brain and world in general, and materialism, such as that material explanations seem to exclude consciousness itself.

Goff briefly addresses the question of artificial consciousness. He considers Searle’s Chinese Room argument and argues that it is possible for a nonconscious computer program to implement the Chinese Room, which shows that computation and consciousness are separable — computation does not require consciousness. He emphasizes, however, that if human brains are conscious, there is no reason to think artificial entities cannot also be conscious. He considers the case where a computer is programmed to believe that it is conscious. Illusionists tend to argue that this is all there is to consciousness, even in humans (this view is related to Graziano, 2019). While he considers the illusionist perspective to be coherent, he argues against illusionism for separate reasons. For example, he argues that any evidence provided by illusionists that consciousness is an illusion appears in our conscious experience, undermining the (supposed) evidence for illusionism. Goff’s theory does not clearly fit into the three categories outlined above, though if all physical entities have some form of consciousness, that would include artificial entities.

Simona Ginsburg and Eva Jablonka, The Evolution of the Sensitive Soul: Learning and the Origins of Consciousness, 2019

Ginsburg and Jablonka outline an evolutionary approach for understanding which biological entities are conscious. They identify seven criteria that they consider to be agreed upon by neurobiologists as being jointly sufficient for consciousness.[15] They then look to identify an “evolutionary transition marker,” a trait that arose in evolutionary history that requires the presence of those seven criteria and thus indicates a transition from non-conscious to conscious life. The transition marker they propose is “unlimited associative learning” (UAL). Associative learning is where a subject learns to make an association between a stimulus and another stimulus or response behavior, such as in classical and operant conditioning. In UAL, the possible forms of associative learning are open-ended, because they include, for example, compound stimuli, second-order conditioning, and trace conditioning.[16] On the basis of this transition marker, they conclude that consciousness arose twice in evolutionary history, first in vertebrates and arthropods during the Cambrian Explosion, and then 250 million years later in mollusks.
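To make the simplest form of associative learning over compound stimuli concrete, here is a minimal sketch (our illustration, not a model from the book) using the classic Rescorla-Wagner learning rule, in which all stimuli presented on a trial share a single prediction error. UAL itself goes well beyond this, since it also covers open-ended stimulus combinations, second-order conditioning, and trace conditioning.

```python
# Minimal Rescorla-Wagner sketch (illustrative only): each stimulus presented
# on a trial moves its associative strength toward the outcome level (lam),
# with the prediction error shared across the whole compound.

def rescorla_wagner(trials, alpha=0.3, lam=1.0):
    """trials: list of sets of stimulus names presented together with a reward."""
    strengths = {}  # associative strength per stimulus
    for stimuli in trials:
        predicted = sum(strengths.get(s, 0.0) for s in stimuli)
        error = lam - predicted  # prediction error for the compound as a whole
        for s in stimuli:
            strengths[s] = strengths.get(s, 0.0) + alpha * error
    return strengths

# A compound of a light and a tone repeatedly paired with food:
print(rescorla_wagner([{"light", "tone"}] * 20))
```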

They do not think that an AI that has the capacity for UAL would necessarily be conscious; they stress that their theory refers to biological entities, and it may be possible to implement UAL in an AI without the seven criteria for consciousness. They emphasize the importance of a body for consciousness, though they note that it may be possible for such a body to be virtual, interacting in a virtual environment. They consider the requirement for a body and complex cognition to be a strong constraint on the development of conscious AI, and they expect that relatively few will be created. They do not think that mass-produced conscious AI will be a reality, and they think that conscious AI would need to go through a learning process in the way that animals do. They note that the fields of developmental and evolutionary robotics take an approach that they consider more feasible. Given that they emphasize various aspects of biology, their approach is probably best seen as biological in our categorization of approaches to the possibility of artificial consciousness.[17] They worry about the ethical implications of building conscious AI, noting that the “human record of horrendous and shameless cruelty towards other humans and towards conscious animals does not bode well for future conscious robots.”

Peter Godfrey-Smith, Metazoa: Animal Life and the Birth of the Mind, 2020

Godfrey-Smith considers the evolution of consciousness in animals, including a short section on artificial consciousness. He suggests that if the ideas in the book are right, you can’t create a mind by modeling brains on a computer. He argues that minds are tied to particular types of physical and biological substrate: you need to do more than represent or simulate the activity in brains; the interactions between the parts actually need to be physically present. He claims this would be especially difficult for some of the brain’s higher-level dynamic patterns and suggests that it could be very hard to have a system with anything like the brain’s dynamic patterns that isn’t otherwise physically very similar to the brain.

He suggests that current computers can create the illusion of agency and consciousness well, but that they are currently completely different devices to brains. His biggest disagreement is with the notion of simulated brains, such as in mind uploading — biological beings are (physically) very different to a simulation of a biological being implemented on a present-day computer. On the other end of the spectrum, he considers future robots with “genuinely brain-like control systems,” suggesting that these could be conscious. Depending on how brain-like such control systems must be, this view could be classified as either a physical or biological approach in our categorization of approaches to artificial consciousness.

Christof Koch, The Feeling of Life Itself: Why Consciousness Is Widespread but Can't Be Computed, 2020

Koch defends the Integrated Information Theory (IIT) of consciousness, which states that the degree of consciousness in a system depends on its degree of “integrated information,” which can be understood as the degree to which a system is causally interconnected such that it is not reducible to its individual components. Koch is skeptical of artificial consciousness and computationalism as an approach to studying the mind more generally, as the subtitle of the book indicates.

He notes several differences between biological organisms and existing classical computers, such as the vastly more complex design of biological organisms, and differences on dimensions such as signal type, speed, connectivity, and robustness. He claims that today’s most successful artificial neural networks are feedforward networks, with information flowing in only one direction, whereas the networks in the cortex involve a great deal of feedback processing. He claims that feedforward networks have no integrated information, and so according to IIT have no consciousness.[18] Networks with feedback loops, by contrast, can have high integrated information. In addition, he notes that present-day digital computers operate with very little integrated information. Even if they were used to simulate systems with a high degree of integrated information, such as a human brain, the computers themselves would have minimal integrated information and so would only be minimally conscious.
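Koch’s structural point, that a strictly feedforward network contains no feedback loops whereas cortical networks do, can be illustrated with a toy check for directed cycles in a connectivity graph (our sketch, not anything from the book). It only captures the wiring distinction; it does not compute IIT’s actual measure of integrated information (Φ), which is a far more involved calculation.

```python
# Toy illustration of the wiring distinction Koch describes: a strictly
# feedforward network has no directed cycles, while a recurrent (feedback)
# network does. This is NOT a computation of integrated information (phi).

def has_feedback(connections):
    """connections: dict mapping each unit to the list of units it projects to."""
    visiting, done = set(), set()

    def dfs(node):
        visiting.add(node)
        for nxt in connections.get(node, []):
            if nxt in visiting:  # back edge: we found a feedback loop
                return True
            if nxt not in done and dfs(nxt):
                return True
        visiting.discard(node)
        done.add(node)
        return False

    return any(dfs(node) for node in connections if node not in done)

feedforward = {"in": ["h1", "h2"], "h1": ["out"], "h2": ["out"], "out": []}
recurrent = {"in": ["h1"], "h1": ["out", "in"], "out": []}  # h1 projects back to "in"
print(has_feedback(feedforward), has_feedback(recurrent))  # False True
```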

In principle, however, Koch considers artificial consciousness to be possible. It would just require computers with a very different design to today’s computers — what he calls “neuromorphic electronic hardware,” where “individual logic gates receive inputs from tens of thousands of logic gates” and “these massive input and output streams would overlap and feed back onto each other.” The emphasis that Koch places on the physical organization of the system but not on any specific biological features puts his approach under the physical approach in our categorization of approaches to artificial consciousness.

Anil Seth, Being You: A New Science of Consciousness, 2021

Seth details a theory of consciousness grounded in the biological drive of embodied, living beings towards staying alive. Seth considers that artificial consciousness depends on two assumptions: (1) functionalism, the view that mental states are defined by the role they play in a cognitive system, towards which he holds a stance of “suspicious agnosticism,” and (2) that the kind of computation that gives rise to artificial intelligence is the same as the kind that would give rise to consciousness. He thinks the issues associated with these views are glossed over, and that artificial consciousness is unlikely to be “around the corner.” He describes a silicon-based machine that is functionally equivalent to a human. Based on the theory developed in the book, he says he is uncertain whether such a machine would be conscious, but he doubts that it would be. He notes, however, that he is relying somewhat on his intuition that consciousness depends on some aspect of biology rather than on computation.

While Seth’s approach seems closest to a purely biological approach of all the books considered, he still does not completely rule out artificial consciousness and considers the question to be an important one to be concerned about. He makes a distinction between how we ought to treat entities that appear conscious but are not and what we should do if true artificial sentience is created. With the former, there may be unintended consequences, such as granting them moral consideration at the expense of genuinely conscious entities like nonhuman animals. With the latter, he thinks that we would be obligated to minimize their suffering. He also notes that there is a problem that we might not understand what AI conscious states are like. For example, they may experience entirely new forms of suffering for which we have no conception, or some may have no conception of, or distinction between, positive and negative experiential states. He considers whether biotechnology rather than AI is what will bring us closest to artificial consciousness, asking whether “cerebral organoids” — brain-like structures made of real neurons used as models to study brains — could have some basic level of consciousness. He thinks ethical considerations around cerebral organoids could be very important due to the number that are created and the possibility that they become more complex in the future.

David Chalmers, Reality+: Virtual Worlds and the Problems of Philosophy, 2022

In this book Chalmers defends the ideas that virtual realities are genuine realities, that we cannot know whether we are in such a reality, and that it is possible to live meaningful lives in virtual realities. A key issue relating to these ideas is whether entirely simulated entities living in virtual worlds would be conscious. Chalmers discusses the ethical implications of this question: if such entities are conscious, then shutting down a virtual world containing many simulated brains would be an atrocity; on the other hand, if the simulated brains are not conscious, it would seem to be no worse than turning off an ordinary computer game. To understand whether simulated entities would be conscious, Chalmers asks us to first consider the case of a perfect simulation of a human brain running on a digital computer. He then uses the Fading Qualia argument,[19] adapted to the case of simulations, to argue that a gradually uploaded simulation of a human brain would be conscious. As with his original Fading Qualia argument, Chalmers argues that since the gradually uploaded simulation of a human brain would be conscious, we can know there is no in-principle reason that simulated entities can’t be conscious, and that this “opens the floodgates” to many different possible types of conscious simulated entities. These claims fall under the computational approach.


[1] While we are mainly concerned about artificial sentience, many books consider the question of artificial consciousness. These terms are used in different ways, but we prefer to use “sentience” to refer to the capacity for positive and/or negative experiences and “consciousness” as a broader term for this and other experiences (e.g., visual experience). See our blog post on the terminology of artificial sentience for more details about how different terms have been used by various stakeholders.

[2] Note that in some cases we only read the relevant sections of the book rather than the whole book cover-to-cover. In most cases the topic of artificial sentience is not central to the books; the summaries should therefore be read as summaries of the specific points relevant to the topic of artificial sentience.

[3] Not every perspective falls under one of these three categories. For example, Tye (2017) uses a more general methodology rather than relying on a specific theory or approach that can be categorized in this way. These categories are also not made precise in most works. Another way to think of them is as two spectrums, one from an emphasis on low-level criteria to high-level criteria, and one from an emphasis on the contingencies of biology (particularly low-level biology) to an emphasis on a priori reasoning about consciousness. Each perspective can thus be placed somewhere in this two-dimensional space, as well as on other similar dimensions. Thanks to Jacy Reese Anthis for making this point.

[4] Our categories do not make commitments about the metaphysical nature of consciousness. For example, each of the three approaches is compatible with dualism.

[5] We are using “physical” primarily to refer to the level of analysis or description at which a cognitive system is evaluated (e.g., hardware versus software), rather than to contrast the physical with the nonphysical in a dualist sense, though these are not always clearly distinguished in this literature.

[6] This is the approach taken in chapter 13 of Koch (2020) and sections 5g and 5h of Tononi and Koch (2015), though it is ambiguous whether this is a core requirement of IIT or just some applications or interpretations of it.

[7] This category includes approaches that rely on quantum physics, such as those of David Pearce (2021) and Roger Penrose (1989), since these theories suggest we should look at the physical make-up of systems rather than the higher computational level to understand whether they are conscious. We have not covered any of these views in this post.

[8] “Functional organization” refers to the position in philosophy of mind known as functionalism, in which mental states are defined by the roles they play in cognitive systems rather than their physical make-up. A system’s functional organization refers to a description of the causal roles played by each of its components. Chalmers includes the “fine-grained” qualifier to refer to the level of detail at which the two systems produce the same behavior.

[9] Briefly, the Fading Qualia argument runs as follows: Suppose there is a functional isomorph of a human brain made from silicon, and suppose for argument’s sake that the isomorph has no subjective experience. We can then imagine a series of intermediate stages between the human brain and the silicon isomorph, where at each stage a small part of the human brain is replaced with a functionally equivalent silicon alternative. If the entity at the end has no conscious experience, there are two possibilities: 1) either their experience gradually fades away as the parts are replaced, or 2) it suddenly disappears. Chalmers considers both of these outcomes to be logically possible but highly implausible. He therefore concludes that subjective experience must remain as the parts of the brain are replaced, and that the functional isomorph made from silicon would be conscious.

[10] Briefly, the Dancing Qualia argument runs as follows: Suppose the human brain and the silicon functional isomorph described above have different conscious experiences. Again, consider the intermediate stages between them. There must be two stages that are sufficiently different that the two entities’ conscious experiences are different. Chalmers asks us to additionally imagine that at each stage the silicon replacement is also built as a backup circuit in the human brain, and a switch is installed that enables switching between the neural and silicon circuits. At some stage, then, the switch could be flipped back and forth, and the person’s conscious experience would “dance” back and forth, but they would not notice any change, even if they were paying full attention. Chalmers considers this outcome to be highly implausible, and therefore concludes that the conscious experiences of the human brain and the silicon isomorph must be the same.

[11] While Chalmers accepts Searle’s argument that every system implements some computation, he argues that his account avoids the result that every physical system implements every computation, and it is only the latter result that is problematic for computational accounts. There is ongoing discussion of these issues and their implications.

[12] While he claims that humans are probably unique among animals in that we can think about our self-models, he thinks many animals have self-models and that the evidence for animal consciousness is now “far beyond any reasonable doubt.”

[13] The global workspace is not a single area of the brain; it consists of a distributed set of cortical neurons.

[14] Block (2002) introduced this distinction. Mental content is access conscious when it is available to the system for use, e.g., in reasoning, speech, or action. Phenomenal consciousness refers to subjective experience. A mental state is phenomenally conscious when there is “something it is like” to be in that state.

[15] The criteria are global accessibility and activity; binding and unification; selection, plasticity, learning, and attention; intentionality; temporal thickness; values, emotions, and goals; embodiment, agency, and a notion of “self.”

[16] A compound stimulus is where the conditioned stimulus in classical conditioning is a compound of features, for example, from different senses. Second-order conditioning is where a conditioned stimulus is associated with another conditioned stimulus, allowing for a long chain between stimuli and actions. Trace conditioning is where there is a time gap between the conditioned and unconditioned stimuli.

[17] However, given that they also say that a conscious artificial entity could exist in a virtual environment, their approach, uniquely among the books, has aspects of both the computational and biological approaches.

[18] Koch suggests that even atoms may have some degree of integrated information and so some degree of consciousness, so presumably what is technically meant is that feedforward networks as a whole do not have consciousness over and above the degree of consciousness in the parts that make them up.

[19] See Footnote 9.

