When asked why certain sound patterns are preferred to others—why, for instance, people blog, not lbog—many scholars appeal to the phonetic substance. Lbog, they argue, is disliked because its phonetic substance is harder to articulate and decode from the speech signal (e.g., Ohala 1983; Hayes et al. 2004; Pulvermüller et al. 2005). Phonology, in this view, is substance-based. Phonological representations encode the analog and continuous properties of acoustic and articulatory substance (e.g., voice onset time, amplitude, gesture overlap). Furthermore, those aspects of the “substance” aren’t just “sitting there”, passively. Rather, the phonetic substance plays a causal role in the generation of phonological patterns. Phonological patterns, then, arise not from abstract rules and constraints but solely from bodily action—that of the ear and mouth.

In this paper, I argue against this position. To do so, I first consider recent developments in brain and cognitive science that seemingly support the substance-based view. The facts, to be clear, are indisputable. But, in my view, they do not challenge the possibility that phonology is substance-free; the second section motivates this assertion. Finally, I present three arguments that phonology is indeed abstract, algebraic, and amodal, and support them by extensive experimental evidence. My conclusions converge with the other papers in this volume and elsewhere in the literature (Andersson 2025; Chabot 2025; Gorman & Reiss 2025; Hale & Reiss 2008; Reid et al. 2017; Shen 2025; Tanaka et al. 2011; Volenec & Reiss 2020) to suggest that phonology is substance-free (Berent 2013; Hale & Reiss 2008).

1 Substance-free phonology: still a viable hypothesis

Linguists have long noticed that phonological patterns aren’t fully arbitrary. Rather, marked phonological structures typically pose greater demands on the articulatory and motor system (e.g., Stampe 1973; Hayes et al. 2004). These observations could suggest that synchronic phonological processes are guided by sensorimotor pressures, rather than by abstract rules. Phonology, in that view, is substance-based

To illustrate the view, consider, again, restrictions on onset structure (e.g., bla> lba). In an influential paper, John Ohala and Haruko Kawasaki-Fukimori (1990) argued that syllable structure ought to be captured not by abstract phonological restrictions on sonority, but by acoustic phonetic parameters, such as amplitude, periodicity, spectral shape and fundamental frequency. Recent evidence from neuroscience seems to vindicate this proposal.

To evaluate the role of phonetic substance, Organian and Chang (2019) implanted electrodes in patients’ brains and traced their responses to spoken language. Results showed that areas in the superior temporal gyrus record various acoustic correlates of phonological structure. Moreover, computational analyses confirmed that those acoustic properties are sufficient to recover syllable structure and lexical stress. Their results would seem to imply that, for the human brain, syllable structure is substance-based.

Other results link phonology to the articulatory system. In this account, the preference for unmarked syllables (e.g., ba > bla) arises from the desire to optimize the parallel transmission of consonants and vowels (Kawasaki-Fukumori 1992; Mattingly 1981; Wright 2004). In line with this proposal, the evidence from neuroscience suggests that articulatory action occurs covertly and, as such, may constrain speech perception even when people do not engage in overt talking. The idea is that, to perceive speech, people must act: they must simulate what it would be for them to articulate the relevant speech sound, and in so doing, they engage both the relevant articulators (e.g., lips) as well as the motor areas that control them (Fadiga et al. 2002; Liberman et al. 1967).

One piece of evidence comes from the role of the speech motor system in speech perception. It is well known that primary motor cortex is topographically organized—certain areas of M1 control action by specific bodily organs, from the leg and hand all the way to the articulatory motor system, including the lips and tongue (e.g., Pulvermüller et al. 2006; for review Badino et al. 2014; Skipper et al. 2017). Thus, as people produce labial sounds, they engage the lip motor area (more than the tongue motor area), whereas the production of coronal sounds engages the tongue motor area (more than the lip; e.g., Pulvermüller et al. 2006). Crucially, those articulatory motor regions also charge selectively as people listen to speech sounds. Hearing ba, for instance, elicits stronger activation of the lip motor area, whereas hearing da selectively charges the tongue motor area (e.g., Pulvermüller et al. 2006).

This activation, moreover, is not epiphenomenal but causal: the activation of the motor system is necessary for speech perception to take place. This much becomes clear from experiments that selectively disrupt activation in these motor regions, using a technique called transcranial magnetic stimulation (TMS). TMS temporarily disrupts activity in specific brain regions (e.g., the lip motor area) by altering the brain’s electromagnetic activity (e.g., Pascual-Leone et al. 1993). In so doing, one can gauge whether the area of interest, such as the lip motor area, is necessary for the encoding of labial speech sounds. If it is, then its disruption via TMS ought to selectively impair the perception of labial sounds relative to controls (e.g., coronal sounds). This is precisely what the results show (e.g., D’Ausilio et al. 2012; D’Ausilio et al. 2009; Möttonen & Watkins 2009; Smalle et al. 2014).

Other experiments find the same when motor activity is disrupted mechanically—by having participants bite on their lips, tongues, or on an object that engages these articulators. Results suggest that these manipulations selectively impair the perception of the relevant speech sounds. For example, when people lightly bite on their lip, they are less able to identify labial sounds (e.g., Berent et al. 2020b). Likewise, when young infants bite on a teething toy that engages their tongue, they lose their ability to perceive coronal speech sounds (Bruderer et al. 2015).

Together, these results suggest that as we listen to spoken language, we tacitly engage in motor action—we simulate what it would “be like” to articulate those speech sounds. Given that (as shown above), linguistic structure can be directly extracted from the acoustic input, and given further that the brain response to this acoustic information engages motor action, it would appear that phonology is subsumed by acoustic and articulatory “substance”; there is no abstract, substance-free phonology in human brains.

These challenges notwithstanding, I believe the hypothesis of substance-free phonology remains utterly plausible. To be clear, this is not to say that the previously cited results aren’t real. Human brains do encode the substance of speech—that much is undeniable. But the positive evidence for the coding of phonetic substance does not show that the brain doesn’t encode abstract algebraic structure. Thus, the evidence discussed is perfectly compatible with the hypothesis that human brains encode speech along two parallel processing streams. One stream registers continuous, analog aspects of speech substance; another encodes abstract, substance-free phonological structure.

To be clear, the evidence I will present is based primarily on behavioral experiments. As such, this evidence suggests a functional distinction between the substance-based and substance-free streams; their precise neural implementations remain unknown. But given that the functional cognitive architecture arises from the organization of the brain, it is quite likely that these streams correspond to two distinct networks in human brains.

2 Substance-free phonology is adaptive

Why bother, you wonder? If the brain already registers phonetic detail, why worry about abstraction? Furthermore, if the brain does encode abstraction, why does phonological structure often mimic phonetic form? For example, why does the sonority cline of the syllable recapitulate the peaks and valleys of acoustic modulation and articulatory aperture? In short, why be wasteful? If the brain encodes phonetic substance, why bother forming abstraction alongside the phonetic detail?

In what follows, I suggest that substance-free, algebraic phonology confers adaptive benefits. Nonetheless, an adaptive phonological system ought to also be “mindful” of the sensorimotor limitations on the human body: its outputs should be readily articulated and perceived by the human mouth and ear. Phonological structure, I suggest, has evolved to satisfy these two competing constraints. Let me explain these claims.

2.1 Algebraic phonology is adaptive

To appreciate why a substance-free phonology is adaptive, it is helpful to consider the role of phonology within the language system. All sides agree that language is productive and that an adaptive phonological system must therefore support linguistic productivity. Natural systems, however, form new elements by two distinct methods—blending and combination (Abler 1989). So, in principle, either method could guide phonological productivity.

Blending systems generate new forms by mixing existing substances, akin to baking a cake from flour, sugar, and butter, or creating the color green by mixing blue and yellow. In both cases—the cake and the color green—a new instance is formed by blending existing substances. Blending systems, then, can certainly yield new forms, and the ingredients do matter—omitting the sugar from one’s cake certainly makes a noticeable difference. Still, once blended, the individual ingredients interact and are no longer recognizable in the whole (e.g., you cannot distinguish the bits of sugar in your cake, or the blue in the green).

But in combinatorial systems, you can. These systems generate new forms by combining discrete elements. DNA, for instance, combines four bases—adenine (A), thymine (T), guanine (G), and cytosine (C). Like the blending system, combinatorial systems are productive. For example, the four DNA bases can combine productively to form new life forms. To attain productivity, however, the building blocks must remain discrete and unaltered when combined. Indeed, it is precisely the sequencing of discrete DNA bases that guides gene expression, regulation and the formation of new proteins. In combinatorial systems, then, productivity is attained by a system of “discrete infinity” (Chomsky 2017).

Language is arguably a combinatorial system, and this is also the case for phonology. To support discrete infinity in language, phonology, then, must minimally combine discrete elements without blending. And, to attain the full power of linguistic productivity, the system must further exhibit systematicity, and compositionality (Fodor & Pylyshyn 1988), akin to how algebra forms powerful expressions by operating on the constituent structure of discrete, abstract variables (Marcus 2001). By hypothesis, then, phonology is not only substance-free but algebraic (Berent 2013).

But phonetic substance is neither discrete nor combinatorial; phonetic cues are, by definition, analog and continuous, and they are known to interact (e.g., Miller & Volaitis 1989; Keating 1988). Phonetic substance, then, fails to “deliver the goods” necessary to attain the discrete infinity of language. And it is for this reason that I claim that phonology must be substance-free.

2.2 Phonetically-sensible phonology is adaptive

Being substance-free, however, does not entail that phonological patterns are phonetically arbitrary. To be transmitted and perceived by the human body, phonological structure must be ultimately expressed in phonetic form. And the processing of phonetic form is surely constrained by the human body. Certain phonetic properties simply won’t register by the human sensorimotor system, and thus, they won’t survive the “telephone game” that underlies the cultural evolution of language (Blevins 2004). Phonology, then, cannot afford to be utterly arbitrary—it must favor phonological structures that can be expressed in “sensible” phonetic forms.

This, to be clear, does not mean that every phonological process is “phonetically sensible”. Indeed, several authors have identified systematic phonological alternations that violate sensorimotor pressures. Examples of such “crazy rules” include the preference of Buriat stems with front vowels to take suffixes with back vowels (as opposed to the expected harmony on frontness/backness; Chabot 2021), and the unexpected role of m as a trigger for realization of dental stops as fricatives in Attic Greek (Sommerstein 1973). Thus, the pressure for “phonetically sensible” output is clearly a soft violable constraint, not an inviolable “must”. Still, all things being equal, one would not expect such “phonetically senseless” processes to be prevalent.

2.3 Algebraic optimization

Phonology, then, must abide by two conflicting pressures. To survive cultural evolution, its outputs must be phonetically sensible, as “senseless” phonological outputs are less likely to persist across generations. But to attain productivity, the phonological grammar must also be algebraic and substance-free. How can the system satisfy these two conflicting constraints?

The answer, I suggest, is not at once. The phonological system can abide by each of these pressures as long as each such constraint is met at a different time scales. Phonetic constraints apply diachronically, during language evolution (via natural selection and cultural evolution), and it is these substance-based restrictions that give rise to “phonetically sensible” constraints, such as a preference for sonority rises. But at the synchronic level, within the brains and minds of individual speakers, phonological representations and grammatical processes are algebraic and substance-free.

Phonology, then, works much like the blind watchmaker in Dawkins’ account of natural selection (Dawkins 1987). In this analogy, the watchmaker is the phonological grammar, and its creations are phonological representations (the clocks). As the watchmaker assembles its phonological “clocks”, each one of its actions is algebraic and substance-free, and so are its outputs. But unbeknownst to the clockmaker, the plan it follows is not arbitrary; its distinct algebraic steps were selected to render the phonological “clock” resilient to phonetic pressures. When seen from afar, it becomes evident that the actions and their sequences make “phonetic sense”. And yet, the immediate causes of each of the watchmaker’s actions—of each computation within the phonological grammar—are strictly algebraic and substance-free.

Thus, at the synchronic level, phonological representations and causes are strictly algebraic—the sensorimotor substance plays no direct causal role in phonological computations; this aspect of the proposal agrees with the substance-free hypothesis advanced by Hale and Reiss (2008). Unlike Hale and Reiss, however, in my proposal, sensorimotor “substance” could nonetheless play a role in shaping the design of phonological systems diachronically and phylogenetically. I refer to this design as algebraic optimization (Berent 2013). The seemingly bipolar nature of phonology, then, is not a bug, but an adaptive feature. This is precisely what would be expected given its function in the language system.

3 Phonology is substance-free: three arguments

What evidence is there to support this proposal? Since the phonological literature is full of evidence for the phonetic grounding of phonology, in this short piece, I will not seek to defend that aspect of the proposal. Instead, I strive to show that phonological constraints—the immediate causes of phonological computation—are irreducible to sensorimotor demands: they are abstract, algebraic, and amodal, hence, substance-independent.

3.1 Phonology is irreducible to sensorimotor demands

To show that phonology is irreducible to sensorimotor demands, consider, again, the restrictions on syllable structure. Language surveys show that onsets with large rises in sonority are preferred to small rises, which, in turn, are preferred to plateaus; least preferred are sonority falls (blif > bnif > bdif > lbif; Berent et al. 2007, data from Greenberg 1978; on codas showing the mirror preference for falls, see Bat-El 2012; Clements 1990; Maïonchi-Pino & Runge 2024.). Furthermore, a large body of literature shows that people are sensitive to the syllable hierarchy, even when all syllable types are unattested in their language: the better-formed the syllable, the more faithfully it is encoded. These preferences have been reported among adult speakers of various languages (e.g., Berent et al. 2007; Berent 2008; Zhao & Berent 2016; Tamasi & Berent 2015), children (e.g., Berent et al. 2011; Jarosz 2017; Ohala 1999; Berent et al. 2008; Berent et al. 2012a), and even newborn infants (Gómez et al. 2014). Figure 1A presents representative data from adult English speakers (data from Berent et al. 2007; Experiment 1).

Figure 1: The restriction on syllable structure. Marked monosyllables are harder to identify when the materials are either spoken (A) or printed (B; shown are responses to non-identical trials, e.g., lbif-lebif). These difficulties persist when the contribution of covert articulation is disrupted, by applying TMS to the lip motor area (relative to a sham condition; C) or via mechanical suppression (i.e., pressing on the lips, tongues, or Control; D). While auditory and articulatory constraints cannot explain the restriction on the syllable hierarchy (A-D), abstract grammatical constraints do, as the syllable hierarchy engages Broca’s area (BA 45; E) and the disruption of this region (specifically, the pars triangularis, PT) disrupts syllable count (relative to the disruption of the lip motor area, the orbicularis oris, OO). Data from Berent et al. 2007 (A); Berent & Lennertz 2010(B); Berent et al. 2015(C); Berent & Platt 2022 (D); Berent et al. 2014a (E); Berent et al. 2023(F).

These facts can be readily captured by the hypothesis that all speakers possess universal restrictions on syllable structure (e.g., Prince & Smolensky 1993/2004). The nature of these restrictions, however, is controversial. One view asserts that the constraints on syllable structure are abstract. Whether or not these abstract constraints concern a (substance-free) sonority feature specifically or other features (c.f., Clements 1990; Smolensky 2006) is immaterial; either way, the constraint is substance-free.

Critically, this proposal further predicts that syllables that violate this constraint are repaired by the grammar (e.g., lbif→lebif). Since the formal analysis is provided elsewhere (Berent et al. 2009), here, I will not delve into the details. My goal is to underscore the logical argument: misidentification does not necessarily arise from a “substance problem”.

I say “not necessarily” because some forms of misidentification can undoubtedly result from “mis-hearing” (i.e., failure to encode the auditory or phonetic form) or “mis-talking” (an articulation failure) and “mis-talking” could occur even when people passively listen to speech (since speech perception engages the articulatory system, as shown in the previous section). Substance-failures, then, can certainly contribute to repair. Critically, however, they aren’t its sole cause. In my proposal, repair (and thus, misidentification) could also arise from grammatical constraints that are algebraic. Paradoxically, then, misidentification is the hallmark of a substance-free computation.

To support this proposal, my lab has systematically examined various substance-based explanations for repair and showed that they cannot fully account for misidentification. Let me briefly consider these explanations and the counter-evidence, in turn.

a. Repair does not only arise from “mishearing”. To evaluate the auditory/phonetic account, one can examine whether people remain sensitive to syllable structure when the experimental materials are presented in a printed form—in the absence of any auditory or phonetic demands. At first blush, reading seems irrelevant to phonology. But a large literature shows that people do extract phonological structure in silent reading (e.g., Perfetti et al. 1992; Van Orden et al. 1990). Building on this literature, we thus expect participants to decode the phonological structure of the syllables. The critical question is whether people will remain sensitive to their syllable structure. Since the phonological representations of printed words do not require hearing, evidence for the repair of printed words cannot possibly arise from a failure to encode the acoustic input, as the auditory/phonetic account asserts1. Are marked syllables, then, still repaired in print?

To find out, participants were presented with two successive nonwords, either identical (e.g., lbif-LBIF), or nonidentical and epenthetically related (e.g., lbif-LEBIF); the task is to determine whether the two items are identical. If marked syllables are typically misidentified due to a simple auditory/phonetic failure, then now that the materials are printed (and thus, impose no auditory/phonetic demands), misidentification ought to vanish. But the results show that it doesn’t: marked syllables still take longer for readers to decode from print. Figure 1B provides evidence from printed words, modeled after the spoken items in 1A (Berent & Lennertz 2010; similar results are reported in Berent et al. 2009; Lennertz & Berent 2015; Tamasi & Berent 2015). These results suggest that the demands exacted by these syllables aren’t auditory.

b. Repair is not solely due to “mis-talking”. Could these difficulties arise from articulatory reasons? Could lbif be taxing because, as people perceive speech, they attempt to covertly articulate it? To evaluate this idea, one can disrupt the articulatory system. If the troubles with lbif (and other marked syllables) arise from a failed articulatory attempt, then once articulation is disrupted, the demands of marked syllables should vanish; persistence would suggest that a source that is substance-free.

One set of experiments disrupted the articulatory motor system in the brain (specifically, the lip motor area, the orbicularis oris) using transcranial motor simulation; Figure 1C presents the results (from Berent et al. 2015). Other experiments disrupted articulation mechanically, by having people bite on either their lips or tongues as they heard either labial or coronal stimuli. Figure 1D presents syllable count responses to labial stimuli (e.g., plik, pnik, ptik) as participants bite on their lips, tongue, or do nothing (Control; data from Berent & Platt 2022; for similar results, see Zhao & Berent 2013). In all cases, sensitivity to the syllable hierarchy persisted.

This, to be sure, is not because the articulatory manipulations were ineffective. Recall that the perception of phonetic contrasts is exquisitely sensitive to articulatory manipulations (as discussed previously). Indeed, when articulatory suppression was applied in a phonetic task (the perception of voicing continua), speech perception was affected in the expected, selective manner: the disruption of lip articulation selectively disrupted the perception of labial continua more than coronal continua, and this was the case when suppression was applied either mechanically, by biting on the lips (Berent et al. 2020b) or by disrupting the lip motor area via TMS (Berent et al. 2023). This confirms that phonetic processing is highly dependent on articulatory simulation. Phonology, however, is not.

A neuroimaging study further bears this conclusion out (Berent et al. 2014a). Here, participants engaged in a syllable-counting task while their brains were scanned using fMRI. If syllable structure arises from sensorimotor demands, then as syllable structure becomes more marked, activation in the auditory and motor areas ought to increase. This was not the case. Instead, activation increased in Broca’s area (specifically, the Pars Triangularis, PT)—a primary language hub. Thus, as a monosyllable became worse-formed, activation in the PT increased (for results, Figure 1E).

A final TMS experiment confirmed that this activation of Broca’s area isn’t just epiphenomenal. Rather, Broca’s area plays a causal role in the computation of syllable structure. To gauge the role of Broca’s area in the syllable hierarchy, one can disrupt its activation by TMS and examine its effect on a syllable-counting task. If the activation of Broca’s area is necessary for the computation of syllable structure, then once it’s disrupted, the extraction of syllable structure should be impaired; this is exactly what was found: participants became biased to identify all stimuli as disyllabic, regardless of whether they were monosyllables or disyllables (Berent et al. 2023; see Figure 1F). Summarizing, the results show that while phonetics is clearly sensitive to acoustic and articulatory substance, phonology is not.

3.2 Phonology is algebraic

If phonology isn’t based on sensorimotor substance, then what is it based on? What principles drive computations in the phonological grammar? Above, I suggested these principles are algebraic and substance-free. We now turn to test this hypothesis.

a. What are algebraic rules? One critical hallmark of algebraic operations is that they generalize across the board. Consider, for example, a constraint on identical elements, such as *XX. In this constraint, X is a variable that stands for an entire class of phonological elements, such as “any feature” or “any syllable”. Expressions such as *XX are akin to mathematical algebraic expressions (y = 2X, where X is “any integer”), insofar as both apply to entire classes (e.g., any integer; any syllable), as opposed to their specific instances (e.g., 3, “ba”). Since, by definition, these classes do not specify phonetic substance, it follows that if the phonological mind is algebraic, then it is also substance-free. But is it?

Upon a cursory inspection, it appears that it is, as identity restrictions (in phonology), and reduplication (in morphophonology) are frequent across languages—spoken (Suzuki 1998; Walter 2007) and signed (Wilbur 2009); in what follows, I will use the term “doubling” to refer to identity and reduplication, collectively. And, doubling, as noted, is widely attested across languages. Still, the fact that one can describe doubling in algebraic terms does not prove that human brains work that way. And indeed, on an alternative account, what would appear to present an algebraic restriction could arise from the co-occurrence of specific instances. For example, a preference for dada does not concern reduplication per se (i.e., XX where X stands for any syllable). Rather, people only track the statistical co-occurrence of specific syllable tokens (e.g., baba, dada), as they demonstrably can (Saffran et al. 1996). It is the statistical frequency of specific phonological substances that drive their preference, not algebraic structure.

To adjudicate between these explanations, one can examine whether people still constrain doubling (XX) when XX and XY forms include elements with novel feature values—features that are unattested in one’s language. These test items challenge statistical models for two reasons. First, in these cases, the frequency of both XX and XY forms is vanishingly low, so they are statistically indistinguishable from each other. Second, generalization to unattested features is known to pose serious challenges for statistical computational models (Marcus 1998; Berent et al. 2012b). If phonology only tracks the frequency of specific phonological tokens, then the distinction between XX and XY forms should not extend to unattested features. In contrast, if the ban is substance-free and algebraic, then people could still contrast them by virtue of their structure.

b. Evidence for algebraic phonological rules. To test this prediction, participants were presented with identical elements comprising feature values that are utterly unattested in their language. This test was applied in two language modalities—spoken (Hebrew) and signed (ASL).

Consider first the case of Hebrew. Like all Semitic languages, Hebrew bans *XXY roots, but allows YXX forms. Thus, dideb is banned, whereas bided (he isolated) is attested. To determine whether this restriction concerns reduplication per se (as opposed to the statistical infrequency), one can examine whether Hebrew speakers extend this restriction to th (/θ/)—a phoneme that is unattested in Hebrew, and its place feature—tongue tip constriction area ‘wide’ (Gafos 1999) is likewise unattested. Hebrew speakers, however, systematically favored kathath to thathak forms (Berent et al. 2002).

Another set of experiments tested the same in ASL. To this end, these experiments compared the responses of native ASL signers to novel XX and XY forms, whose handshape feature was unattested in ASL. Thus, both forms (XX and XY) included two unattested features, but in the XX items, these features were identical. Since ASL has a productive reduplicative morphology (Wilbur 2009), signers are expected to prefer novel reduplicative signs over non-reduplicative ones. The critical question was whether they would extend this preference to unattested features.

Results showed that they do, and this was the case in both offline rating (Berent et al. 2014b) as well as in online procedures, such as a lexical decision study (Berent et al. 2014b) and even the Stroop task (Dupuis & Berent 2015). Moreover, control experiments showed that nonsigners exhibited no such preference. In fact, nonsigners showed a reliable doubling aversion (i.e., XX < XY). Below, we discuss this aversion in detail, but for now, we can conclude that the reduplication preference of signers arises from their knowledge of language, and not from sensorimotor factors alone. Summarizing, then, these results show that speakers of reduplicative morphologies—Hebrew and ASL—each project their phonological knowledge to novel features. These results suggest that phonology is an algebraic, and thus, substance-free.

3.3 Phonology is amodal

So far, we have shown that phonological restrictions are substance-free and, for this reason, they can be freely extended to novel features. All these demonstrations, however, are bound by language modality. Hebrew speakers, for instance, generalize to novel words in their spoken language, whereas ASL signers project generalizations to novel manual signs. In principle, however, the scope of algebraic phonology could be even broader. An algebraic restriction, such as *XX (where X is “any syllable”), can theoretically apply to any syllable—spoken or signed. So, if phonology is algebraic, people might be able to generalize doubling restrictions from their spoken language to a sign language, and do so spontaneously, despite having no command of a sign language.

Before we evaluate this prediction, we must first address two challenges. To constrain the identity of signed syllables, sign-naïve speakers must first be able to identify syllables in a sign language. And of course, they must also possess the requisite doubling constraints and apply them within their L1 (e.g., English). Can speakers rise to these challenges?

In what follows, I first show that indeed, they do. I next proceed to examine amodal projection by adults and infants.

a. Speakers extract syllables from ASL signs. Sign-naïve speakers can track the number of syllables in ASL signs (contrasting monosyllables and disyllables, akin to the English contrast between pen and pencil) and distinguish them from the number of morphemes (akin to the contrast between pen and pens; Berent et al. 2013). Speakers can readily do so because, like spoken syllables, signed syllables are marked by sonority peaks, such as movement (Sandler & Lillo-Martin 2006), which are phonetically salient to sign-naïve speakers. Interestingly, sign-naïve speakers spontaneously expect signed syllables to be marked by sonority peaks, akin to the structure of syllables in their spoken language, and it is this amodal principle that explains their ability to extract syllables from signs (Berent et al. 2013). Given that signed syllables are salient to speakers, we can now return to the critical question of whether they further constrain their identity.

b. English speakers demonstrably constraint phonological identity and reduplication in English. As noted, in languages like Hebrew and ASL, doubling is parsed as reduplication ({X}Xc), and it is systematically preferred. In English, by contrast, XX has no systematic morphological function. Thus, XX in forms like paper presents phonological identity, and identity is banned (by the Obligatory Contour Principle; Leben 1973; McCarthy 1979). It follows that the doubling (generally, XX) is structurally ambiguous, as it is amenable to two competing parses—phonological or morphological. When XX is parsed by the phonology, as identity, XX is banned. But when the same surface form is parsed morphologically, as reduplicative ({X}Xc), XX is now systematically preferred (Inkelas & Zoll 2005; Berent et al. 2016 ).

Before we tested generalizations to ASL signs, we first wished to ascertain that English speakers apply these doubling restrictions to novel English words. Results showed that when presented with bare phonological forms, English speakers indeed systematically disprefer doubling to controls (e.g., slaflaf < slafmat: Berent et al. 2016; Berent et al. 2017; and panana < panaka: Berent et al. 2021a). But when the exact same surface forms were presented as reduplication (e.g., slaf = one ball; slaflaf = a set of balls), doubling was now preferred (e.g., slaflaf > slafmat: Berent et al. 2016; Berent et al. 2017; and panana>panaka: Berent et al. 2021a).

c. Amodal projections. The results discussed thus far suggest that English speakers (a) possess the requisite grammatical knowledge of doubling; and (b) they can extract the requisite constituents (syllables) from ASL signs. With these results in hand, we can now turn to the critical question: can English speakers spontaneously project grammatical constraints from the grammar of their (spoken) language to signs?

To find out, a series of experiments presented speakers who are sign-native with novel ASL signs—either XX and XY, in two contrastive contexts: phonological or morphological (Figure 2A). In the phonological context, people saw a photo of a novel object and were asked to choose which ASL sign makes a better name for it—XX or XY; repetition, then, had no bearing on meaning, so it is purely phonological, i.e., identity.

Figure 2: Cross-modal generalizations. Panel A illustrates the presentation of XX and XY signs in a phonological vs. morphological context; Panel B presents the responses of English speakers to doubling in novel signs; each symbol captures the mean doubling preference of an individual participant; the dotted line captures chance level (.5); data from Berent et al. 2016.

In the morphological condition, by contrast, repetition conveyed a systematic change in meaning, such as plurality. To this end, people were presented with a base X, along with an object (e.g., a ball); next they saw either a set of identical objects (e.g., a set of three balls), to suggest semantic plurality; the task was to choose which of two signs—XX or XY—is a better name for that set in ASL (Figure 2A).

Results showed that the doubling preference shifted across conditions. In the phonological condition, people dispreferred identity (i.e., XX < XY), in line with the ban on identity in their spoken language phonology (Berent et al. 2016; Berent et al. 2021a). But when the context suggested morphological reduplication, they preferred reduplicative signs (i.e., XX > XY; Figure 1B)).

It is unlikely that these preferences only arise from nonlinguistic strategies, such as iconicity. First, iconicity does not explain why doubling is banned, in the phonological context. Second, iconicity cannot explain the doubling preference in the morphological context. This is the case because speakers’ preference for ASL signs demonstrably depends on the morphological structure of their spoken language. English speakers, as noted, preferred XX signs to express semantic plurality, which is productively marked by their L1 morphology, whereas speakers of Mandarin did not—in line with the absence of plural morphology in their spoken language (Berent et al. 2020a). Further evidence for the role of linguistic constraints is provided by the types of semantic relations expressible by reduplication. In Hebrew morphology, reduplication productively expresses diminution (e.g., kelev-klavalav; ‘dog-puppy’), whereas in English, it does not. Correspondingly, Hebrew speakers preferred XX signs when the context suggested diminution (X=a ball; XX=a small ball; Berent et al. 2016) whereas English speakers did not (Berent et al. 2016). Together, these results demonstrate that phonological structure is not only substance-free but modality-free.

Similar conclusions emerge when doubling is explored in the brains of young infants. A large body of literature suggests that young infants (Marcus et al. 1999; see also Bouchon et al. 2015; Gerken 2006, 2010; Gerken et al. 2015; Gervain et al. 2012; Gervain et al. 2008; Gervain & Werker 2013; Marcus et al. 2007; Wagner et al. 2011) can track the abstract algebraic structure of identity, distinguish it from nonidentical elements (e.g., XX vs. XY) and constrain its location (e.g., XXY vs. XYY); indeed, this ability is present at birth (Gervain et al. 2012). Unknown, however, is whether young infants who have had no previous experience with a sign language can project these restrictions to ASL signs (for preliminary conflicting results, see Rabagliati et al. 2012; Rabagliati et al. 2019).

To find out, researchers presented 6-month-old infants who were sign-naïve with novel ASL signs, either XX and XY, and gauged their brain response using near-infrared spectroscopy (NIRS)—a technique that tracks hemodynamic response by measuring the absorption of infrared light by the brain (Berent et al. 2021b).

To determine whether responses to signs arise from their linguistic structure or from visual characteristics alone, the experiment further compared the responses to signs with matched visual controls—cartoons of trees, fitted with hand-shaped leaves, such that the spatial and temporal properties precisely matched the configuration of the signers’ body (torso and hands) and its movement. In the experiment, participants were presented with blocks of XX and XY stimuli (either ASL signs or visual controls), and their brain response was monitored (Figure 3a).

Figure 3: Amodal phonology in infants’ brains. Panel A illustrates doubling in novel ASL signs and in visual analogs; Panel B provides infants; responses to doubling in signs, speech, and visual analog; means are difference scores in brain response to doubling and no-doubling; OxyHb and deoxyHb are the concentrations of oxygenated and deoxygenated hemoglobin. Figure from Berent et al. 2021b, Scientific Reports, used under a Creative Commons Attribution 4.0 International License.

If responses to signs only arise from their visual properties, then doubling ought to elicit similar responses in signs and visual controls; in contrast, if the infant’s brain extracts linguistic structure from ASL signs, then doubling in ASL signs and controls ought to differ.

To further ascertain whether doubling in ASL signs engaged an amodal language network, the experiment also contrasted brain responses to reduplication in signs and in speech stimuli (in another set of infants). Results showed that infants extracted the reduplicative rule (AA) from ASL signs, and the neural responses to reduplication in signs differed from reduplication in the visual controls, but were similar to the response to reduplication in speech stimuli (see Figure 3B; Berent et al. 2021b). These results suggest that infant brains possess a powerful phonological system that differentially responds to all linguistic stimuli, speech or signs.

4 Conclusions

In this piece, I outlined three arguments in support of the hypothesis that phonology is substance-free, algebraic, and amodal. First, I argued that speakers’ demonstrably sensitivity to phonetic detail does not challenge the possibility that phonology is substance-free. Second, I explained why an algebraic, substance-free phonology is evolutionarily adaptive. Third, I reviewed three powerful experimental demonstrations suggesting that, indeed, phonology is abstract, algebraic, and amodal. I conclude that the phonological grammar is substance-free (Andersson 2025; Berent 2013; Chabot 2025; Gorman & Reiss 2025; Hale & Reiss 2008; Reid et al. 2017; Shen 2025; Tanaka et al. 2011; Volenec & Reiss 2020).

Competing interests

The author has no competing interests to declare.

Notes

  1. Could readers still extract phonetic forms from print? They certainly can (Abramson & Goldinger 1997). But this fact does not undermine my argument. The “mishearing” account, recall, states that repair arises from failures to encode the auditory input (speech). The repair of printed words counters this argument, as the input to reading is visual (not auditory). So, if people still repair printed words, then repair cannot arise from “mis-hearing”. Critically, this would remain the case even if readers converted visual input to phonetic detail; phonetics, here, is output, not input, so this possibility is irrelevant to my argument. [^]

References

Abler, William L. 1989. On the particulate principle of self-diversifying systems. Journal of Social and Biological Systems 12(1). 1–13. DOI:  http://doi.org/10.1016/0140-1750(89)90015-8

Abramson, Marianne & Goldinger, Stephen D. 1997. What the reader’s eye tells the mind’s ear: silent reading activates inner speech. Perception and Psychophysics 59(7). 1059–1068. DOI:  http://doi.org/10.3758/BF03205520

Andersson, Samuel. 2025. A Substance-Free Analysis of Assimilatory Processes in Võro. Manusciprt submitted for publication.

Badino, Leonardo & D’Ausilio, Alessandro & Fadiga, Luciano & Metta, Giorgio. 2014. Computational validation of the motor contribution to speech perception. Topics In Cognitive Science 6(3). 461–475. DOI:  http://doi.org/10.1111/tops.12095

Bat-El, Outi. 2012. The Sonority Dispersion Principle in the acquisition of Hebrew word final codas. In Parker, S. (ed.), The Sonority Controversy (pp. 319–344). Berlin: Mouton de Gruyter. DOI:  http://doi.org/10.1515/9783110261523.319

Berent, Iris. 2008. Are phonological representations of printed and spoken language isomorphic? Evidence from the restrictions on unattested onsets. Journal of Experimental Psychology: Human Perception & Performance 34(5). 1288–1304. DOI:  http://doi.org/10.1037/0096-1523.34.5.1288

Berent, Iris. 2013. The phonological mind. Cambridge; New York: Cambridge University Press. DOI:  http://doi.org/10.1017/CBO9781139049610

Berent, Iris & Bat-El, Outi & Andan, Qatherine & Brentari, Diane & Vaknin-Nusbaum, Vered. 2021a. Amodal phonology. Journal of Linguistics 57. 199–529. DOI:  http://doi.org/10.1017/S0022226720000298

Berent, Iris & Bat-El, Outi & Brentari, Diane & Dupuis, Amanda & Vaknin-Nusbaum, Vered. 2016. The double identity of linguistic doubling. Proceedings of the National Academy of Sciences 113(48). 13702–13707. DOI:  http://doi.org/10.1073/pnas.1613749113

Berent, Iris & Bat-El, Outi & Brentari, Diane & Platt, Melanie. 2020a. Knowledge of language transfers from speech to sign: Evidence from doubling. Cognitive Science 44. 1. DOI:  http://doi.org/10.1111/cogs.12809

Berent, Iris & Bat-El, Outi & Vaknin-Nusbaum, Vered. 2017. The double identity of doubling: Evidence for the phonology-morphology split. Cognition 161. 117–128. DOI:  http://doi.org/10.1016/j.cognition.2017.01.011

Berent, Iris & Brem, Anna-Katherine & Zhao, Xu & Seligson, Erica & Pan, Hong & Epstein, Jane & Galaburda, Albert M. & Pascual-Leone, Alvaro. 2015. Role of the motor system in language knowledge. Proceedings of the National Academy of Sciences 112. 1983–1988. DOI:  http://doi.org/10.1073/pnas.1416851112

Berent, Iris & de la Cruz-Pavía, Irene & Brentari, Diane & Gervain, Judit. 2021b. Infants differentially extract rules from language. Scientific reports 11. DOI:  http://doi.org/10.1038/s41598-021-99539-8

Berent, Iris & Dupuis, Amanda & Brentari, Diane. 2013. Amodal aspects of linguistic design. Plos One 8(4). DOI:  http://doi.org/10.1371/annotation/935f97a6-67f9-4331-a998-f94a62826194

Berent, Iris & Dupuis, Amanda & Brentari, Diane. 2014b. Phonological reduplication in sign language: Rules rule. Frontiers in Language Sciences 5. 560. DOI:  http://doi.org/10.3389/fpsyg.2014.00560

Berent, Iris & Fried, Peter J. & Theodore, Rachel M. & Manning, Daniel & Pascual-Leone, Alvaro. 2023. Phonetic categorization relies on motor simulation, but combinatorial phonological computations are abstract. Scientific reports 13(1). 874. DOI:  http://doi.org/10.1038/s41598-023-28099-w

Berent, Iris & Harder, Katherine & Lennertz, Tracy. 2011. Phonological universals in early childhood: Evidence from sonority restrictions. Language Acquisition 18. 281–293. DOI:  http://doi.org/10.1080/10489223.2011.580676

Berent, Iris & Lennertz, Tracy. 2010. Universal constraints on the sound structure of language: Phonological or acoustic? Journal of Experimental Psychology: Human Perception & Performance 36(1). 212–223. DOI:  http://doi.org/10.1037/a0017638

Berent, Iris & Lennertz, Tracy & Jun, Jongho & Moreno, Miguel A. & Smolensky, Paul. 2008. Language universals in human brains. Proceedings of the National Academy of Sciences 105(14). 5321–5325. DOI:  http://doi.org/10.1073/pnas.0801469105

Berent, Iris & Lennertz, Tracy & Rosselli, Monica. 2012a. Universal linguistic pressures and their solutions: Evidence from Spanish. The Mental Lexicon 13(2). 275–305. DOI:  http://doi.org/10.1075/ml.7.3.02ber

Berent, Iris & Lennertz, Tracy & Smolensky, Paul & Vaknin-Nusbaum, Vered. 2009. Listeners’ knowledge of phonological universals: Evidence from nasal clusters. Phonology 26(1). 75–108. DOI:  http://doi.org/10.1017/S0952675709001729

Berent, Iris & Marcus, Gary F. & Shimron, Joseph & Gafos, Adamantios I. 2002. The scope of linguistic generalizations: evidence from Hebrew word formation. Cognition 83(2). 113–139. DOI:  http://doi.org/10.1016/S0010-0277(01)00167-6

Berent, Iris & Pan, Hong & Zhao, Xu & Epstein, Jane & Bennett, Monica L. & Deshpande, Vibhas & Seethamraju, Ravi & Stern, Emily. 2014a. Language universals engage Broca’s area. Plos One 9(4). e95155. DOI:  http://doi.org/10.1371/journal.pone.0095155

Berent, Iris & Platt, Melanie. 2022. Is Phonology Embodied? Evidence from Mechanical Stimulation. Journal of Psycholinguistic Research 51(3). 597–626. DOI:  http://doi.org/10.1007/s10936-022-09871-x

Berent, Iris & Platt, Melanie & Theodore, Rachel M. & Balaban, Evan & Fried, Peter & Pascual-Leone, Alvaro. 2020b. Speech perception triggers articulatory action:Evidence from mechanical stimulation. Frontiers in communication. DOI:  http://doi.org/10.3389/fcomm.2020.00034

Berent, Iris & Steriade, Donca & Lennertz, Tracy & Vaknin, Vered. 2007. What we know about what we have never heard: Evidence from perceptual illusions. Cognition 104. 591–630. DOI:  http://doi.org/10.1016/j.cognition.2006.05.015

Berent, Iris & Wilson, Colin & Marcus, Gary & Bemis, Doug. 2012b. On the role of variables in phonology: Remarks on Hayes and Wilson. Linguistic Inquiry 43(1). 97–119. DOI:  http://doi.org/10.1162/LING_a_00075

Blevins, Juliette. 2004. Evolutionary phonology. Cambridge: Cambridge University Press. DOI:  http://doi.org/10.1017/CBO9780511486357

Bouchon, Camillia & Nazzi, Thierry & Gervain, Judit. 2015. Hemispheric Asymmetries in Repetition Enhancement and Suppression Effects in the Newborn Brain. Plos One 10(10). e0140160. DOI:  http://doi.org/10.1371/journal.pone.0140160

Bruderer, Alison G. & Danielson, D. Kyle & Kandhadai, Padmapriya & Werker, Janet F. 2015. Sensorimotor influences on speech perception in infancy. Proceedings of the National Academy of Sciences. DOI:  http://doi.org/10.1073/pnas.1508631112

Chabot, Alex. 2021. Possible and Impossible Languages: Naturalness, Third Factors, and Substance-Free Phonology in the Light of Crazy Rules. (Dissertation). Universite Cote d’Azur.

Chabot, Alex. 2025. Form with no substance: Substance-Free Phonology and amodal grammars. Manusciprt submitted for publication.

Chomsky, Noam. 2017. The language capacity: architecture and evolution. Psychon Bull Rev 24(1). 200–203. DOI:  http://doi.org/10.3758/s13423-016-1078-6

Clements, George N. 1990. The role of the sonority cycle in core syllabification. In Kingston, J. & Beckman, M. (eds.), Papers in laboratory phonology I: Between the grammar and physics of speech (pp. 282–333). Cambridge: Cambridge University Press.

D’Ausilio, Alessandro & Bufalari, Ilaria & Salmas, Paola & Fadiga, Luciano. 2012. The role of the motor system in discriminating normal and degraded speech sounds. Cortex 48(7). 882–887. DOI:  http://doi.org/10.1016/j.cortex.2011.05.017

D’Ausilio, Alessandro & Pulvermüller, Friedemann & Salmas, Paola & Bufalari, Ilaria & Begliomini, Chiara & Fadiga, Luciano. 2009. The motor somatotopy of speech perception. Current Biology: CB 19(5). 381–385. DOI:  http://doi.org/10.1016/j.cub.2009.01.017

Dawkins, Richard. 1987. The blind watchmaker ; why the evidence of evolution reveals a universe without design. New York: W.W. Norton & co.

Dupuis, A. & Berent, Iris. 2015. Signs are symbols: Evidence from the Stroop task. Language, Cognition and Neuroscience 30(10). 1339–1344. DOI:  http://doi.org/10.1080/23273798.2015.1069363

Fadiga, L. & Craighero, L. & Buccino, G. & Rizzolatti, G. 2002. Speech listening specifically modulates the excitability of tongue muscles: a TMS study. European Journal of Nueroscience 15. 399–402. DOI:  http://doi.org/10.1046/j.0953-816x.2001.01874.x

Fodor, Jerry & Pylyshyn, Zenon. 1988. Connectionism and cognitive architecture: A critical analysis. Cognition 28. 3–71. DOI:  http://doi.org/10.1016/0010-0277(88)90031-5

Gafos, Adamantios I. 1999. The articulatory basis of locality in phonology. New York: Garland Publishers.

Gerken, LouAnn. 2006. Decisions, decisions: Infant language learning when multiple generalizations are possible. Cognition 98(3). B67–B74. DOI:  http://doi.org/10.1016/j.cognition.2005.03.003

Gerken, LouAnn. 2010. Infants use rational decision criteria for choosing among models of their input. Cognition 115(2). 362–366. DOI:  http://doi.org/10.1016/j.cognition.2010.01.006

Gerken, LouAnn & Dawson, Colin & Chatila, Razanne & Tenenbaum, Josh. 2015. Surprise! Infants consider possible bases of generalization for a single input example. Dev Sci 18(1). 80–89. DOI:  http://doi.org/10.1111/desc.12183

Gervain, Judit & Berent, Iris & Werker, Janet. 2012. Binding at birth: Newborns detect identity relations and sequential position in speech. Journal of Cognitive Neuroscience 24(3). 564–574. DOI:  http://doi.org/10.1162/jocn_a_00157

Gervain, Judit & Macagno, Francesco & Cogoi, Silvia & Peña, Marcela & Mehler, Jacques. 2008. The neonate brain detects speech structure. Proc Natl Acad Sci U S A 105(37). 14222–14227. DOI:  http://doi.org/10.1073/pnas.0806530105

Gervain, Judit & Werker, Janet F. 2013. Learning non-adjacent regularities at age 0 ; 7. Journal of Child Language 40(4). 860–872. DOI:  http://doi.org/10.1017/S0305000912000256

Gómez, David Maximiliano & Berent, Iris & Benavides-Varela, Silvia & Bion, Ricardo A. H. & Cattarossi, Luigi & Nespor, Marina & Mehler, Jacques. 2014. Language universals at birth. Proceedings of the National Academy of Sciences 111(16). 5837–5341. DOI:  http://doi.org/10.1073/pnas.1318261111

Gorman, Kyle & Reiss, Charles. 2025. Language Change in a Substance-Free Framework. Manusciprt submitted for publication.

Greenberg, Joseph H. 1978. Some generalizations concerning initial and final consonant clusters. In Greenberg, J. H. & Ferguson, C. A. & Moravcsik, E. A. (eds.), Universals of Human Language (Vol. 2, pp. 243–279). Stanford, CA: Stanford University Press.

Hale, Mark & Reiss, Charles. 2008. The phonological enterprise. Oxford; New York: Oxford University Press. DOI:  http://doi.org/10.1093/oso/9780199533961.001.0001

Hayes, Bruce & Kirchner, Robert Martin & Steriade, Donca. 2004. Phonetically based phonology. Cambridge; New York: Cambridge University Press. DOI:  http://doi.org/10.1017/CBO9780511486401

Inkelas, Sharon & Zoll, Cheryl. 2005. Reduplication: Doubling in Morphology. Cambridge, UK: Cambridge University Press. DOI:  http://doi.org/10.1017/CBO9780511627712

Jarosz, Gaja. 2017. Defying the stimulus: acquisition of complex onsets in Polish. Phonology 34(2). 269–298. DOI:  http://doi.org/10.1017/S0952675717000148

Kawasaki-Fukumori, Haruko. 1992. An acoustical basis for universal phonotactic constraints. Language and Speech 35. 73–86. DOI:  http://doi.org/10.1177/002383099203500207

Keating, Patricia A. 1988. The Phonology-Phonetics Interface. In Newmeyer, F. (ed.), Linguistics: The Cambridge Survey (Vol. I, pp. 281–302). Cambridge; New York: Cambridge University Press.

Leben, William. 1973. Suprasegmental phonology. Cambridge, MA: MIT press.

Lennertz, Tracy & Berent, Iris. 2015. On the sonority levels of fricatives and stops. The Mental Lexicon 10(1). 88–132. DOI:  http://doi.org/10.1075/ml.10.1.04len

Liberman, A. M. & Cooper, F. S. & Shankweiler, D. P. & Studdert-Kennedy, M. 1967. Perception of the speech code. Psychological Review 74(6). 431–461. DOI:  http://doi.org/10.1037/h0020279

Maïonchi-Pino, Norbert & Runge, Élise. 2024. A gradient-based preference for sonority markedness constraints in reading: evidence for intact phonological grammar in developmental dyslexia. Applied Psycholinguistics 45(6). 1016–1048. DOI:  http://doi.org/10.1017/S0142716424000353

Marcus, G. F. & Vijayan, S. & Bandi Rao, S. & Vishton, P. M. 1999. Rule learning by seven-month-old infants. Science 283(5398). 77–80. DOI:  http://doi.org/10.1126/science.283.5398.77

Marcus, Gary. 2001. The algebraic mind : integrating connectionism and cognitive science. Cambridge, Mass.: MIT Press. DOI:  http://doi.org/10.7551/mitpress/1187.001.0001

Marcus, Gary F. 1998. Rethinking eliminative connectionism. Cognitive Psychology 37(3). 243–282. DOI:  http://doi.org/10.1006/cogp.1998.0694

Marcus, Gary F. & Fernandes, Keith J. & Johnson, Scott P. 2007. Infant rule learning facilitated by speech. Psychol Sci 18(5). 387–391. DOI:  http://doi.org/10.1111/j.1467-9280.2007.01910.x

Mattingly, Ignatius G. 1981. Phonetic representation and speech synthesis by rule. In Myers, T. & Laver, J. & Anderson, J. (eds.), The cognitive representation of speech (pp. 415–420). Amsterdam: North Holland. DOI:  http://doi.org/10.1016/S0166-4115(08)60217-4

McCarthy, John J. 1979. Formal problems in Semitic phonology and morphology. Doctoral dissertation, MIT. New York, 1985: Garland Press.

Miller, Joanne L. & Volaitis, Lydia E. 1989. Effect of speaking rate on the perceptual structure of a phonetic category. Perception & Psychophysics 46(6). 505–512. DOI:  http://doi.org/10.3758/BF03208147

Möttonen, Riikka & Watkins, Kate E. 2009. Motor representations of articulators contribute to categorical perception of speech sounds. The Journal of Neuroscience 29(31). 9819–9825. DOI:  http://doi.org/10.1523/JNEUROSCI.6018-08.2009

Oganian, Yulia & Chang, Edward F. 2019. A speech envelope landmark for syllable encoding in human superior temporal gyrus. Science Advances 5(11). eaay6279. DOI:  http://doi.org/10.1126/sciadv.aay6279

Ohala, Diane K. 1999. The influence of sonority on children’s cluster reductions. Journal of Communication Disorders 32. 397–421. DOI:  http://doi.org/10.1016/S0021-9924(99)00018-0

Ohala, John J. 1983. The origin of sound patterns in vocal tract constraints. In MacNeilage, P. F. (ed.), The production of speech (pp. 189–216). New-York: Springer-Verlag. DOI:  http://doi.org/10.1007/978-1-4613-8202-7_9

Ohala, John J. & Kawasaki-Fukumori, Haruko. 1990. Alternatives to the sonority hierarchy for explaining segmental sequential constraints. Papers from the Regional Meetings, Chicago Linguistic Society 2. 319–338.

Pascual-Leone, A. & Houser, C. M. & Reese, K. & Shotland, L. I. & Grafman, J. & Sato, S. & Valls-Solé, J. & Brasil-Neto, J. P. & Wassermann, E. M. & Cohen, L. G., et al. 1993. Safety of rapid-rate transcranial magnetic stimulation in normal volunteers. Electroencephalography And Clinical Neurophysiology 89(2). 120–130. DOI:  http://doi.org/10.1016/0168-5597(93)90094-6

Perfetti, Charles A. & Zhang, Sulan & Berent, Iris. 1992. Reading in English and Chinese: Evidence for a “universal” phonological principle. In Frost, R. & Katz, L. (eds.), Orthography, phonology, morphology, and meaning (pp. 227–248). Amsterdam: North-Holland. DOI:  http://doi.org/10.1016/S0166-4115(08)62798-3

Prince, Alan & Smolensky, Paul. 1993/2004. Optimality theory: Constraint interaction in generative grammar. Malden, MA: Blackwell Publishing. DOI:  http://doi.org/10.1002/9780470759400

Pulvermüller, Friedemann & Hauk, Olaf & Nikulin, Vadim V. & Ilmoniemi, Risto J. 2005. Functional links between motor and language systems. The European Journal Of Neuroscience 21(3). 793–797. DOI:  http://doi.org/10.1111/j.1460-9568.2005.03900.x

Pulvermüller, Friedemann & Huss, Martina & Kherif, Ferath & Moscoso del Prado Martin, Fermin & Hauk, Olaf & Shtyrov, Yury. 2006. Motor cortex maps articulatory features of speech sounds. Proceedings of the National Academy of Sciences 103(20). 7865–7870. DOI:  http://doi.org/10.1073/pnas.0509989103

Rabagliati, Hugh & Ferguson, Brock & Lew-Williams, Casey. 2019. The Profile of Abstract Rule Learning in Infancy: Meta-Analytic and Experimental Evidence. Dev Sci 22(1). DOI:  http://doi.org/10.1111/desc.12704

Rabagliati, Hugh & Senghas, Ann & Johnson, Scott & Marcus, Gary F. 2012. Infant rule learning: Advantage language, or advantage speech? Plos One 7(7). DOI:  http://doi.org/10.1371/journal.pone.0040517

Reid, Vincent M. & Dunn, Kirsty & Young, Robert J. & Amu, Johnson & Donovan, Tim & Reissland, Nadja. 2017. The Human Fetus Preferentially Engages with Face-like Visual Stimuli. Current biology : CB 27(12). 1825. DOI:  http://doi.org/10.1016/j.cub.2017.05.044

Saffran, Jenny R. & Aslin, Richard N. & Newport, Elissa L. 1996. Statistical learning by 8-month-old infants. Science 274(5294). 1926–1298. DOI:  http://doi.org/10.1126/science.274.5294.1926

Sandler, Wendy & Lillo-Martin, Diane C. 2006. Sign language and linguistic universals. Cambridge: Cambridge University Press. DOI:  http://doi.org/10.1017/CBO9781139163910

Shen, D. Ta-Chun. 2025. Menominee Mid Vowel Alternation: A Substance-Free Search Account. Manusciprt submitted for publication.

Skipper, Jeremy I. & Devlin, Joseph T. & Lametti, Daniel R. 2017. The hearing ear is always found close to the speaking tongue: Review of the role of the motor system in speech perception. Brain and Language 164. 77–105. DOI:  http://doi.org/10.1016/j.bandl.2016.10.004

Smalle, Eleonore H. M. & Rogers, Jack & Möttönen, Riikka. 2014. Dissociating Contributions of the Motor Cortex to Speech Perception and Response Bias by Using Transcranial Magnetic Stimulation. Cerebral Cortex. DOI:  http://doi.org/10.1093/cercor/bhu218

Smolensky, Paul. 2006. Optimality in Phonology II: Harmonic completeness, local constraint conjunction, and feature domain markedness. In Smolensky, P. & Legendre, G. (eds.), The harmonic mind: From neural computation to Optimality-theoretic grammar (Vol. 2: Linguistic and philosophical implications, pp. 27–160). Cambridge, MA: MIT Press.

Sommerstein, Alan H. 1973. The sound pattern of ancient Greek. Oxford: Basil Blackwell.

Stampe, David. 1973. A dissertation on natural phonology. (Ph.D). The University of Chicago, Chicago.

Suzuki, Keiichiro. 1998. A typological investigation of dissimilation. (Ph.D.). University of Arizona, Tucson, AZ.

Tamasi, Katalin & Berent, Iris. 2015. Sensitivity to phonological universals: The case of fricatives and stops. Journal of Psycholinguistic Research 44(4). 59–81. DOI:  http://doi.org/10.1007/s10936-014-9289-3

Tanaka, Hiroko & Black, Jessica M. & Hulme, Charles & Stanley, Leanne M. & Kesler, Shelli R. & Whitfield-Gabrieli, Susan & Reiss, Allan L. & Gabrieli, John D. E. & Hoeft, Fumiko. 2011. The brain basis of the phonological deficit in dyslexia is independent of IQ. Psychological Science 22(11). 1442–1451. DOI:  http://doi.org/10.1177/0956797611419521

Van Orden, Guy C. & Pennington, Bruce F. & Stone, Gregory O. 1990. Word identification in reading and the promise of subsymbolic psycholinguistics. Psychological review 97. 488–522. DOI:  http://doi.org/10.1037/0033-295X.97.4.488

Volenec, Veno & Reiss, Charles. 2020. Formal generative phonology. Radical: a journal of phonology.

Wagner, Jennifer B. & Fox, Sharon E. & Tager-Flusberg, Helen & Nelson, Charles A. 2011. Neural processing of repetition and non-repetition grammars in 7- and 9-month-old infants. Frontiers in Psychology 2. DOI:  http://doi.org/10.3389/fpsyg.2011.00168

Walter, Mary Ann. 2007. Repetition avoidance in human language. (Ph.D). Massachusetts Institute of Technology, Cambridge.

Wilbur, Ronnie B. 2009. Productive reduplication in a fundamentally monosyllabic language. Language Sciences 31(2–3). 325–342. DOI:  http://doi.org/10.1016/j.langsci.2008.12.017

Wright, Richard. 2004. A review of perceptual cues and robustness. In Steriade, D. & Kirchner, R. & Hayes, B. (eds.), Phonetically based phonology (pp. 34–57). Cambridge: Cambridge University Press. DOI:  http://doi.org/10.1017/CBO9780511486401.002

Zhao, Xu & Berent, Iris. 2013. Universal constraints on the sound structure of language: Beyond articulation. Paper presented at the 2013 Annual meeting of the Psychonomic society. Toronoto, Canada. DOI:  http://doi.org/10.1037/e505772014-048

Zhao, Xu & Berent, Iris. 2016. Universal Restrictions on Syllable Structure: Evidence From Mandarin Chinese. Journal of Psycholinguistic Research 45. 795–811. DOI:  http://doi.org/10.1007/s10936-015-9375-1