writing teaching about
|
Neural rhythmicity, feature binding, and serotonin: A hypothesis Abstract Natural selection favors animals that make successful predictive theories about the world. The first step in the formation of these theories is the construction of complex, multi-feature percepts. This process requires resolution of the binding problem, possibly via rhythmic cortical oscillations as suggested by von der Malsburg, Singer, Koch & Crick, and others. If the binding process were made rewarding, animals might enjoy theory-making and spontaneously become "smarter." I argue that the serotonergic raphe may have been used by evolution to link cortical binding with limbic reward centers and so serve as a neural substrate for the enjoyment of successful theory-making. I present evidence from the study of disorders like OCD and autism and drugs like LSD and MDMA suggesting that rhythmicity, reward, and pattern recognition are causally linked. I also propose that the genus Homo has tied powerful symbol manipulation hardware ("language") to the binding/theory-making circuits, allowing the construction, rehearsal, and communication of sophisticated models of the world. I suggest that many interesting phenomena, such as music-induced euphoria, déjà vu, and the so-called "temporal lobe personality" can be explained by the interactions between these systems. Introduction Why do brains look for patterns? What is the hardware that causes us to theorize about events we experience? I use the word "theory" simply to mean the identification of percepts as belonging to some already understood class. For example, the classification of a particular Poodle as a dog is a "static" theory, predicting that the attributes of this particular object can be predicted by the attributes of the class Dog. Each assignment of an object into a class is in fact an experiment. We experience a few features of an object, and based on this, we tentatively place the object into a previously known class. There are also "dynamic" theories. If a ball is thrown, we use our in-house model of ballistics to predict the ball's trajectory. We could be wrong; perhaps the ball has a spin, generating a force causing the ball to deviate from a normal path. Scientific theories are of both sorts, static and dynamic: The Linnean system of biological classification is a static theory, Newtonian mechanics is a dynamical theory. That we try to make sense of the world in these ways is undeniable; that it is advantageous is intuitive. If we have good predictive ability, we can choose behaviors leading to the most desirable states of affairs. Even simple successes in theory-making are rewarding. We are all familiar with the satisfaction of remembering a location, fixing a broken device, or generally just being right. Can we explain why, neurobiologically, our brains cause us to behave and feel this way? What neural mechanisms underlie the search for order? Feature binding and rhythmicity The richness of our environment requires any would-be theory maker to segment the sensory world into discrete objects. However, there is a problem: If each complex object in the environment activates large sets of cells simultaneously, how will the brain know where the representation for one object ends and another begins? If we are presented with a gray square and a gray circle, three feature detectors activate: gray, square, and circle. In this case, downstream neurons observing this obtain a more-or-less clear interpretation of the world. Suppose instead that we see a gray square and a black circle. Now, four feature detectors are activated: gray, square, black, circle. If these detectors are all simultaneously active, the downstream interpretation is ambiguous. Which shape is which color? This is the so-called "binding problem" (Fig. 1).
As first suggested by Milner (1), and later clarified by von der Malsburg (2), a possible solution to the binding problem is to have sets of feature detectors rhythmically firing together in correlated bursts, i.e. (gray + square), then (black + circle), then (gray + square), etc. At any moment, the set of active feature detectors represents a single percept. This scheme predicts that the perception of complex objects (required for useful theory-building) will be accompanied by cortical oscillations, as subpopulations of feature detectors go on and off in their allotted time windows. Such oscillations have been observed (3-11), and indeed, there is a popular opinion that the oscillation solution to the binding problem may be correct (12-16). Crick and Koch (12,15) have even argued that such oscillations are the neural basis of consciousness itself. Symbolic representations Imagine an animal theorizing about lions. If the area of the brain housing the model activates the area of the brain which is activated when a lion is actually seen, this will be indistinguishable from actually seeing a lion in the world. Using the primary representation for model-building will cause hallucinations. To prevent this, some intermediary placeholder, or token, is required which will be used in all reasoning, predicting, all model building in general. Token use will, in fact, permit model building in the first place and will be a major evolutionary advantage. Using models, an animal could internally rehearse proposed actions and anticipate the consequences without taking any real risks. The system of tokens and token manipulation that evolution and associative learning have produced constitute what I will call "language." Note that linguistic tokens are not necessarily things that can be said. It is quite common to use non-verbal tokens for mental objects, as when comparing two tastes. You can think of them as "this taste" and "that taste" - though you probably do not go to the actual trouble of doing so - but there is a richness to each idea beyond its "thisness" or "thatness." Somehow, an idea without a name still has a mentally graspable handle, its token. This general ability to juggle named and unnamed concepts is what I mean by "language." That is, it refers to all symbolic manipulation, not just that which can be verbalized, gestured, or written. As the richness of the internal representational scheme grows, tokens attach themselves to clouds of sense-data and ideas of causal flow are extracted from the dynamics of the world. As this happens, tokens ("words") can be combined into strings ("sentences") representing experiences, both actual and imagined, and used in communication. In this way, the lexicon of a language is the collection of our static theories, and the grammar is the skeleton of our dynamical theories, both extracted from sensory experience. Although simplifying the world by regarding objects as members of a class entails information loss, throwing away complexity and reducing the entropy of experience is what enhances survival and makes language useful. On the road to symbolic theory-making, then, there are three crucial steps. First, mental objects must be bound together and assembled from their constituent perceptual features. Second, each object must linked to a memory, so that the learned characteristics of that class of objects can be reconstructed; some of these characteristics may not be superficially apparent. Finally, the compound sense-memory object must be replaced by a token. Token labeling is usually learned, as each instance of experience is described or given a name. Eventually, a token's "meaning" is the center of mass of its associated perceptual cloud (Fig. 2).
Serotonin A simple way to produce smart animals is to make the process of getting smarter enjoyable. If theory-making requires cortical binding and, in turn, rhythmic neural oscillations, then evolution could potentially have produced smart animals by (1) creating a neural subsystem which has broad sensory-cortical connectivity, which is (2) specifically activated by rhythmic input, and which (3) modulates the areas of the limbic system involved in reward. The serotonergic system of the raphe nucleus is a prime candidate. The broad connectivity of serotonergic cells is almost unmatched, each cell contacting 500,000 cortical targets and each cortical cell receiving approximately 200 serotonergic varicosities (17). Interestingly, this cortical projection is selectively larger in primates than in rats (18). The densest collections of serotonergic fibers are those associated with sensory information processing and the limbic system (17,19,20). The limbic projections allow serotonin to control reward, which it appears to do via modulation of dopamine's effects in the nucleus accumbens (19,21,22). A major proposal for the preferred stimulus of serotonergic cells is rhythmic activity (17,23,24), from the observation that while raphe activity increases with behavioral arousal, spiking frequencies are highest during rhythmic motor activities such as licking, chewing, and running on a treadmill. Because of serotonin's dual roles, Jacobs (23) suggested that "normal" people engage in repetitive activities such as knee-bouncing, chair-rocking, and head-bobbing in order to self-medicate with serotonin and obtain the associated reward. This bold proposal has many implications for various disorders, as described below, and is a central element in the construction of the present hypothesis, described next. The hypothesis The hypothesis is as follows. Rhythmic neural activity is induced by the binding of a set of features into a unitary percept (Figs. 1, 2). This percept is compared to and possibly bound to an object stored in long-term memory. The percept-memory binding activates the serotonergic raphe, which then activates, via the thalamus, three areas (Fig. 3). First is the frontal cortex, possibly the lateral orbitofrontal region, which uses the percept to update its model of the state of the world. Second is the temporal cortex, possibly Wernicke's area, which retrieves the corresponding linguistic token and brings its representation into the binding circuit. The more the current percept resembles the prototypical object, the stronger this involvement will be. Third, serotonin released by the raphe enhances the reward associated with dopamine release in the nucleus accumbens (NACC). This makes the recognition process pleasurable. Finally, I argue that positive feedback from the cortex, through the raphe and thalamus and back to cortex, allows serotonin release to facilitate oscillatory binding as well as be facilitated by oscillatory binding.
I propose that the release of serotonin during the cortical binding associated with theory-making was used by evolution to make animals smarter. When an animal's brain releases serotonin during successful theorizing, the animal will enjoy this process and will, over time, learn to understand and predict the world. This advantage will enhance its survival and will be favored by natural selection. I believe that there was at least one sharp increase in the quality of this system during the evolution of genus Homo, and that this increased quality resulted in our ability to create and use language and other forms of symbolic representation. At the same time, the relationship between rhythmicity and reward has allowed us to commandeer the underlying neural hardware and obtain pleasure from evolutionarily neutral repetitive stimulation, as proposed by Jacobs (23). In this hypothesis, the frontal cortex houses acquired knowledge concerning projections of the current world state into the future. When several possible futures are available, a feedback loop with the basal ganglia resolves conflicting predictions (25,26). Because the tokens for each world-state are activated this process, the loop from frontal and temporal cortex to basal ganglia and back houses a generative grammar used in the production of sentences. Drugs of abuse One set of evidence for the rhythmicity hypothesis comes from the study of LSD and MDMA (Ecstasy), both serotonergic agonists (27-29). LSD users report hallucinations reminiscent of synesthesia - cross-modal binding of colors or textures with sounds, for example. They also report feelings of déjà vu, intense insight, and enhanced pleasure with rhythmic activity, especially music. The subjective effects of MDMA are less hallucinatory and more euphorigenic, but there are similar effects regarding reactions to music. Reconsider the model: serotonin is the link between oscillations underlying feature binding and reward, and there is positive feedback between serotonin and binding (Fig. 3). Because LSD and MDMA facilitate serotonin, binding should become enhanced, and features which do not actually belong to the same object get spuriously bound in cortex. Because higher serotonin levels are associated with mood elevation, this should be pleasurable. Finally, the positive feedback to oscillatory neural activity and the enhancements of neural reward centers would make rhythmic activity more rewarding than normal, hence the elevated appreciation for music. The recent emergence of the musical/drug phenomenon known as "raves" highlights this effect. If the process which binds percepts to memories became artificially enhanced, what would result? If binding were sufficiently diffuse, the subjective impression would perhaps be familiarity with the overall state of affairs. It would resemble, I believe, a perception that the entire perceptual snapshot had been experienced before. Electrophysiologically, certain parts of the cortex would be in a state of abnormally high rhythmicity, i.e. a seizure. I argue that this is in fact the basis for déjà vu, which explains why this unusual mnemonic/perceptual experience is often reported by patients with certain seizure disorders, and is also induced by LSD. Neuropsychiatric disorders If this hypothesis is correct, then there should be correlations between various phenomena connected to serotonin, neural rhythmicity, reward, and pattern classification. Some data along these lines come from the study of obsessive-compulsive disorder (OCD), Gilles de la Tourette Syndrome (GTS), autism, depression, and epilepsy. A common symptom of OCD, GTS, and autism is rhythmic motor activity. Patients with OCD and GTS report having to repeat an action over and over until they get it "just right" (perhaps until it exactly matches an action prototype?) and patients with autism spend hours alone, rocking or flapping their hands. Excessive rhythmicity actually extends beyond the behavioral scale into the neurological: there is high comorbidity of seizure disorders with each of these pathologies, and in some cases (GTS) symptoms are reminiscent of temporal lobe epilepsy (30-32). A second point is that these disorders are often treated with serotonergic agonists (33-38). This is noteworthy because this is the same class of drugs used to treat patients with chronic depression. In fact, unmedicated OCD and GTS patients prevented from engaging in rhythmic or compelling behaviors become depressed. This might be interpreted as suggesting that they suffer from abnormally low levels of serotonin and reward, which they return to normal by engaging in rhythmic activities. Compulsions might then represent emotional pressures to make the sensory environment conform to learned prototypes, a subjective experience reminiscent of reports from the autism literature. Notably, serotonergic agonists are most effective in specifically reducing the repetitive behaviors and stereotypy pressures while not noticeably improving, for example, panic/anxiety symptoms seen in autism (36,39-42). Depression is also treated with rapid eye movement (REM) sleep deprivation (43) and electroconvulsive therapy (ECT) (44,45). The therapeutic effect that has been reported with REM deprivation is presumably due to serotonergic quiescence during REM. By reducing REM, quiescence may be prevented so that serotonin levels rise [but see (46)]. ECT, on the other hand, is the paragon of rhythmic neural activity, and must strongly activate the raphe. A generalized seizure would resemble a state of intense cortical binding, raphe cells would fire vigorously, and serotonin levels would be predicted to rise. A popularized though empirically rare aspect of autism [and GTS - see (47)] is the savant syndrome, characterized by exceptional ability in one narrow domain (47-49). A speculative explanation is that elevated serotonin levels (40,42,50,51) facilitate binding and pattern recognition skills. A corollary prediction, stemming from the ostensible connection between the elevated 5-HT levels of hyperserotonemia and cortical rhythmicity, is that there should be high comorbidity of autism and seizure disorders, which is the case (30-32). Beauty and the religious experience As mentioned earlier, the more a percept resembles a category prototype, the stronger the binding. This implies that oscillatory activity, and consequently reward, should be more pronounced as things tend more toward averageness or genericity. Thus, this hypothesis accounts for the finding that average faces are judged more attractive (52,53). If the phenomenon of averagenes preference is a general one - as this hypothesis would predict - then the most average head of broccoli, the most average rose, and the most average country farmhouse will also be judged as aesthetically superior. From another perspective, genericity implies low entropy and high predictability, which would be expected to make the world simple and safe. Clinical observations have suggested an association between temporal lobe seizures on one hand and hyperreligiosity and hypergraphia on the other (54-57). While organized/ institutional religion is a complex social and psychological phenomenon, it is possible that the canonical transcendental religious experience, at least in part, has a simple underlying neural foundation. Consider the subjective experience that would be associated with large-scale cortical rhythmicity. The corresponding percept would be an object which possessed all features, universal in character. It would be impossible to describe adequately, as many of its features would be mutually contradictory, but nevertheless the neural circuitry for language would be strongly activated. Awareness of this concept would be predicted to be highly pleasurable, because strong binding would facilitate the serotonergic modulation of reward and pleasure centers in the NACC. I propose that something like the rhythmicity hypothesis explains the connection between seizure disorders (excessive rhythmicity), hyperreligiosity (global feature binding), and hypergraphia (excessive linguistic token activation). In crude terms, God is a seizure. Consistent with this proposal is the fact that in some societies, religious rituals involve the ingestion of a hallucinogen and the performance of rhythmic drumming or chanting. I suggest that because both of these stimuli would be expected to enhance cortical rhythmicity, they would indeed facilitate the religious experience. Discussion Obviously, this is a controversial hypothesis with many possible objections. One of the most immediate is the disparity between the frequency of oscillations thought to be involved in sensory binding, i.e. the 40 Hz gamma-band, and that involved in the behaviors known to activate the raphe, i.e. 1-5 Hz. On one hand, this objection might be investigated empirically by examining raphe response to 40 Hz cortical oscillations. On the other hand, there are defenses available in the absence of this evidence. One possibility is that the raphe responds only to every n'th peak in the binding oscillations. This would simply be a kind of frequency division, where the raphe is tuned to the power in lower harmonics of the gamma band. A second possibility is that there is another piece of neural hardware that is specifically designed to do frequency division on modules of activated cortex. Evidence for such a system is given by the hippocampal model of Jensen and Lisman (58,59), in which working memory scans through its current contents by packing multiple gamma-band peaks into each theta cycle in the hippocampus (58,59) and allowing each gamma peak to excite a population of cells. Thus, it could be hippocampal theta-oscillations and not cortical gamma-oscillations that excite the raphe. I have greatly simplified serotonergic physiology in order to make a compact point in this paper. Before closing, several additional points should be raised. First is a finding that serotonergic cells fall silent abruptly when an unexpected event occurs, as when an experimenter suddenly enters the room of a grooming cat (23). Gradually, unit activity recovers to normal. When the experimenter leaves the room, there is a brief lull, less pronounced than that seen on the way in. This finding is at least circumstantially supportive of the rhythmicity hypothesis. When the cat is surprised by the opening door, the world is transformed into unpredictability. Binding is correspondingly low, and serotonergic unit activity crashes. As the world becomes comprehensible again, unit activity recovers, and because surprise is less pronounced when the experimenter leaves, the lull is less pronounced. A second point regarding serotonin concerns dreaming. As mentioned earlier, raphe activity is reduced during REM sleep. If it is true that serotonin facilitates the binding of percepts to tokens and modulates reward, its quiescence during REM might imply that sensory-cortical experiences during REM never become "token-ized" or reinforced by NACC DA release. Because no language-based narrative would then occur on-line, it is possible that these experiences would therefore be difficult to reconstruct after the fact, accounting for the subjective effect that dreams are hard to remember. This is especially interesting in light of the hippocampus's role in memory and the work of Jensen and Lisman (58,59). Conclusion The desire to make the world comprehensible is one of humanity's deepest idiosyncrasies. It seems reasonable to assume that evolution would impel animals with the drive not only to survive, but to make survival easier; and to do this it would have to create neural hardware for the purpose. What is the nature of that hardware? What are the consequences of its failure? Even when the system is functioning perfectly, there are opportunities to take advantage of the brain's rhythmic solution to the binding problem and the reward that results. Astonishingly, the pursuit of these opportunities has resulted in some of our greatest achievements. "Music is a higher revelation than philosophy." - Ludwig von Beethoven "Beauty is truth, truth beauty - that is all ye know on earth, and all ye need to know." - John Keats "In every true searcher of nature there is a kind of religious reverence, for he finds it impossible to imagine that he is the first to have thought out the exceedingly delicate threads that connect his perceptions." - Albert Einstein "You all, healthy people, can't imagine the happiness which we epileptics feel during the moments before our fit…I don't know if this felicity lasts for seconds, hours, or months, but believe me, I would not exchange it for all the joys that life may bring." - Fyodor Dostoyevsky Acknowledgements Barry Jacobs introduced me to the idea that rhythmic activity might be rewarding because it releases serotonin. For helpful discussions and comments I also wish to thank Brian Craft, Ole Jensen, Larry Abbott, John Lisman, Paul Mineiro, Jon Horvitz, and Kamal Sen. Through the course of this work, I was supported by an Office of Naval Research predoctoral fellowship, a Princeton University Honorific Fellowship, an NIH postdoctoral training grant, and a Sloan Fellowship for Theoretical Neuroscience. References [1] Milner, P. A model for visual shape recognition. Psychol. Rev. 1974;81:521-535. |
idea sketchbook
|
| copyright © 2007 - Joshua Fost, all rights reserved. contact: josh at joshuafost dot com |
||