<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.2" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id journal-id-type="issn">2397-1835</journal-id>
<journal-title-group>
<journal-title>Glossa: a journal of general linguistics</journal-title>
</journal-title-group>
<issn pub-type="epub">2397-1835</issn>
<publisher>
<publisher-name>Open Library of Humanities</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.16995/glossa.23979</article-id>
<article-categories>
<subj-group>
<subject>Research article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Compositional parsing in adjective-noun phrases: The role of adjective semantics</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-3023-1611</contrib-id>
<name>
<surname>Iwan</surname>
<given-names>Oliwia Anna</given-names>
</name>
<email>Iwan_Oliwia@phd.ceu.edu</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-3188-6145</contrib-id>
<name>
<surname>Wittenberg</surname>
<given-names>Eva</given-names>
</name>
<email>WittenbergE@ceu.edu</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
</contrib-group>
<aff id="aff-1"><label>1</label>Central European University, Vienna, Austria</aff>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2026-04-25">
<day>25</day>
<month>04</month>
<year>2026</year>
</pub-date>
<pub-date pub-type="collection">
<year>2026</year>
</pub-date>
<volume>11</volume>
<issue>1</issue>
<fpage>1</fpage>
<lpage>31</lpage>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2026 The Author(s)</copyright-statement>
<copyright-year>2026</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="https://www.glossa-journal.org/articles/10.16995/glossa.23979/"/>
<abstract>
<p>A central question in language comprehension is how the mind combines the meanings of individual words. Minimal phrases such as adjective-noun pairs (<italic>red cup, big boat</italic>) provide a tractable model for studying how linguistic and perceptual features are integrated during comprehension. Previous research has often assumed a uniform compositional process, independent of adjective meaning. Across three experiments, we examined how adjective type (intersective vs. subsective) and word order influence visual verification performance. Experiment 1 used intersective colour adjectives and replicated the established facilitation for double-feature cues, showing faster responses to adjective&#8211;noun phrases than to single-word cues. Experiment 2, using subsective size adjectives, produced a similar but weaker pattern, with the advantage no longer significant once individual variability was modelled. Experiment 3 directly contrasted both adjective types within participants and revealed slower overall responses for subsective adjectives but no reliable interaction between adjective type and feature number. Taken together, these results indicate that the apparent &#8220;compositional advantage&#8221; in this paradigm depends on the semantic transparency of the features involved rather than on a categorical compositional mechanism. Intersective adjectives provide directly alignable perceptual information, yielding faster visual&#8211;linguistic integration, whereas subsective adjectives rely on context-dependent interpretation and show less consistent facilitation. Word order did not systematically affect comprehension in any experiment.</p>
</abstract>
</article-meta>
</front>
<body>
<sec>
<title>1 Introduction</title>
<p>A defining characteristic of human cognition is the capacity to generate an unbounded array of complex ideas through the combination of elements from a finite set of discrete building blocks (<xref ref-type="bibr" rid="B18">Humboldt 1876</xref>; <xref ref-type="bibr" rid="B14">Chomsky 2006</xref>). Consequently, compositional representations are widely recognized across diverse cognitive domains, from social and visual cognition to language. They are associated with efficient learning, enabling productive thought and generalization to novel contexts (<xref ref-type="bibr" rid="B29">Phillips &amp; Wilson 2011</xref>; <xref ref-type="bibr" rid="B20">Kemp 2012</xref>; <xref ref-type="bibr" rid="B12">Chang et al. 2018</xref>). Understanding how compositional representations are formed and maintained in working memory is particularly crucial for capturing the dynamic nature of language processing, where we must rapidly combine individual word meanings into a complex whole that can be readily interpreted and understood. In this paper, we investigate how specific exemplars of intersective (e.g., <italic>red</italic>) and subsective (e.g., <italic>big</italic>) adjectives, together with order of presentation (e.g., <italic>red boat</italic> vs. <italic>boat red</italic>), influence the processing of adjective&#8211;noun combinations in visual matching tasks.</p>
</sec>
<sec>
<title>2 Compositionality in language</title>
<p>Compositionality, though evident across diverse cognitive domains (perhaps with domainspecific characteristics; <xref ref-type="bibr" rid="B5">Barsalou 2017</xref>), holds a particularly significant, and often debated, position in the study of language (<xref ref-type="bibr" rid="B13">Chomsky 1956</xref>; <xref ref-type="bibr" rid="B19">Jackendoff &amp; Pinker 2005</xref>; <xref ref-type="bibr" rid="B4">Barker &amp; Jacobson 2007</xref>). While the principle of compositionality, which states that the meaning of a complex expression is determined by the meanings of its parts and the rules governing their combination, provides a theoretical framework, its application to real-world language use faces numerous challenges (<xref ref-type="bibr" rid="B13">Chomsky 1956</xref>; <xref ref-type="bibr" rid="B15">Frege 1963</xref>; <xref ref-type="bibr" rid="B28">Partee 1995</xref>). For instance, phenomena like compounds, idioms, or semi-transparent constructions such as light verbs (e.g., <xref ref-type="bibr" rid="B40">Wittenberg 2016</xref>; <xref ref-type="bibr" rid="B42">Ziegler, Snedeker &amp; Wittenberg 2018</xref>) and metaphors (e.g., <xref ref-type="bibr" rid="B17">Holyoak &amp; Stamenkovi&#263; 2018</xref>) seem to challenge the straightforward application of compositional principles. The frequency of these phenomena underscores the continuing need for empirical investigation into the underlying mechanisms of compositionality in language. Nevertheless, the fact that language allows both transparent and non-transparent combinations highlights the importance of understanding when and how compositional mechanisms operate efficiently.</p>
<p>In the domain of simple modification, a growing body of work demonstrates that combining words into simple phrases can facilitate visual target matching. Early work by Potter &amp; Faulconer (<xref ref-type="bibr" rid="B30">1979</xref>), for instance, used a picture-probe technique to examine how adjectival information influences the understanding of a noun. Participants heard sentences containing either adjective-noun phrases (e.g., <italic>burning house</italic>) or nouns alone, then saw a picture and indicated whether it matched the previous description. Responses were fastest when the picture and phrase corresponded: when cued with <italic>burning house</italic> participants responded fastest to an image of a burning house, whereas when cued with <italic>house</italic> they responded fastest to an image of a typical house.</p>
<p>The authors concluded that listeners rapidly integrate the meanings of adjectives and nouns to form a unified representation of the modified noun phrase. Comprehension involves accessing this integrated representation, rather than processing each word separately. The meaning of a noun is retrieved in conjunction with its modifying adjective, not independently, especially when the phrase describes a familiar concept. This argues against a model where noun meanings are retrieved in isolation and then combined with adjective meanings. Instead, the adjective influences retrieval of the noun&#8217;s meaning from the outset.</p>
<p>Later studies (many designed primarily to identify the neural correlates of composition) used similar minimal-phrase tasks and yielded converging behavioral patterns. In experiments where participants indicated whether visual stimuli matched written cues (e.g., <italic>red boat</italic> vs. <italic>xgf boat</italic>) reaction times were reliably faster for two-word expressions (<xref ref-type="bibr" rid="B7">Bemis &amp; Pylkk&#228;nen 2011</xref>; <xref ref-type="bibr" rid="B8">2013a</xref>; <xref ref-type="bibr" rid="B9">2013b</xref>). These effects demonstrate a consistent behavioral facilitation for matching minimal compositional phrases to visual targets compared to single words.</p>
<p>Subsequent behavioral work refined this phenomenon by identifying factors that constrain or modulate the compositional advantage. The facilitation observed for minimal phrases was found to diminish as phrase complexity increased, suggesting additional processing demands when relations among constituents become more intricate (<xref ref-type="bibr" rid="B33">Rabagliati, Doumas &amp; Bemis 2017</xref>). The timing of feature presentation also affects the advantage: sequential presentation of features leads to faster responses than simultaneous presentation, although prolonged exposure to simultaneous features can ultimately produce a similar effect (<xref ref-type="bibr" rid="B10">Bocanegra, Poletiek &amp; Zwaan 2022</xref>).</p>
<p>These behavioral patterns have motivated representational accounts that explain how multiple features are integrated during comprehension. According to these accounts, individuals form conjunctive (intersective) representations, treating multiple features as jointly necessary (such as combining <italic>red</italic> AND <italic>square</italic> into a single integrated concept) rather than maintaining them as fully independent throughout the matching process. Such representations create a precise mental target, allowing faster matching, with a congruent target, whereas disjunctive processing would require additional verification and thus slower responses (<xref ref-type="bibr" rid="B33">Rabagliati, Doumas &amp; Bemis 2017</xref>). Evidence from visual working memory supports this view: Visual representations include bound conjunctions of features rather than separate attributes (<xref ref-type="bibr" rid="B25">Luck &amp; Vogel 1997</xref>; <xref ref-type="bibr" rid="B1">Allen et al. 2006</xref>; <xref ref-type="bibr" rid="B11">Brockmole et al. 2008</xref>; <xref ref-type="bibr" rid="B24">Logie et al. 2009</xref>). Attending to a single feature tends to slow down responses, whereas linguistic cues specifying multiple bound features facilitate identification. The alignment between linguistic and visual representations therefore provides a plausible mechanism for the compositional advantage observed across these tasks.</p>
<p>Neuroimaging evidence provides converging support for rapid composition processes. Magnetoencephalography (MEG) studies have identified increased activation in the left anterior temporal lobe (LATL) approximately 200&#8211;250 ms after noun onset when participants process minimal adjective-noun phrases compared with unstructured or anomalous controls (<xref ref-type="bibr" rid="B7">Bemis &amp; Pylkk&#228;nen 2011</xref>; <xref ref-type="bibr" rid="B31">Pylkk&#228;nen et al. 2014</xref>; <xref ref-type="bibr" rid="B41">Ziegler &amp; Pylkk&#228;nen 2016</xref>). This &#8220;LATL-composition effect&#8221; has been replicated across languages and stimulus types and is typically interpreted as a neural signature of early combinatorial integration. While these findings do not specify representational mechanisms, they converge with the behavioral results summarized above in suggesting that even minimal expressions engage rapid and efficient compositional processing.</p>
</sec>
<sec>
<title>3 Generalizability across types of composition</title>
<p>While, as we have seen before, adjective-noun phrases containing intersective adjectives are processed faster than individual words, this advantage is modulated by several factors, including representational complexity, processing dynamics, and presentation format. The question, therefore, remains whether this advantage extends beyond commonly studied kinds of composition such as those involving, for example, colour terms. In particular, while there is robust behavioral evidence for a compositional advantage with intersective colour adjectives (<xref ref-type="bibr" rid="B7">Bemis &amp; Pylkk&#228;nen 2011</xref>; <xref ref-type="bibr" rid="B8">Bemis &amp; Pylkk&#228;nen 2013a</xref>; <xref ref-type="bibr" rid="B9">Bemis &amp; Pylkk&#228;nen 2013b</xref>; <xref ref-type="bibr" rid="B33">Rabagliati, Doumas &amp; Bemis 2017</xref>; <xref ref-type="bibr" rid="B10">Bocanegra, Poletiek &amp; Zwaan 2022</xref>), it remains unclear whether this advantage generalizes to other adjective classes, particularly those denoting different types of object features or dimensions.</p>
<p>Clarifying these processing asymmetries requires a closer examination of how adjective-noun composition is treated in formal semantics. Intersective adjectives are typically distinguished from a broader class of non-intersective adjectives, which includes subsective cases such as <italic>big</italic> or <italic>small</italic>, as well as privative and modal modifiers. The current study focuses on this subsective subclass, which, unlike intersective adjectives, cannot be interpreted in the same straightforward way (<xref ref-type="bibr" rid="B28">Partee 1995</xref>; <xref ref-type="bibr" rid="B22">Kennedy 2007</xref>; Toledo &amp; Sassoon 2011). Intersective adjectives, such as <italic>red</italic> in <italic>red boat</italic>, contribute a property that composes directly with the noun&#8217;s meaning. Subsective adjectives, by contrast, require contextual interpretation; their meaning depends on properties that are evaluated relative to the noun&#8217;s extension or a relevant comparison class (<xref ref-type="bibr" rid="B22">Kennedy 2007</xref>; <xref ref-type="bibr" rid="B37">Solt 2009</xref>; Lassiter &amp; Goodman 2017). While different classificatory schemes emphasize different aspects of adjectival meaning, they converge on the idea that some adjectives modify nouns independently, whereas others require contextual calibration.</p>
<p>In formal semantic theory, intersective adjectives are typically analyzed as predicates of type &#10216;e,t&#10217;, which combine with nouns via predicate modification, yielding the intersection of their denotations (<xref ref-type="bibr" rid="B16">Heim &amp; Kratzer, 1998</xref>). Subsective adjectives, in contrast, cannot be interpreted via predicate modification and are therefore classified as a subset of the broader non-intersective category. This category also encompasses adjectives such as <italic>fake</italic> or <italic>alleged</italic>, which are nonsubsective but share the key property of failing to combine via simple intersection, instead shifting or restricting the noun&#8217;s reference. These adjective classes can be distinguished using standard semantic diagnostics (<xref ref-type="bibr" rid="B28">Partee 1995</xref>; <xref ref-type="bibr" rid="B22">Kennedy 2007</xref>). For instance, intersective adjectives such as <italic>red</italic> pass the predicate conjunction test: From &#8220;That is a red boat,&#8221; one can validly infer both &#8220;That is red&#8221; and &#8220;That is a boat.&#8221; Subsective adjectives such as <italic>big</italic>, by contrast, fail this test: From &#8220;That is a big boat,&#8221; one cannot infer &#8220;That is big,&#8221; since <italic>bigness</italic> is evaluated relative to the noun&#8217;s comparison class. Similarly, only intersective adjectives support set-intersection readings, whereas subsective adjectives require context-dependent threshold evaluation.</p>
<p>These diagnostics highlight that the two classes differ not only in their logical type but also in their inferential and interpretive behaviour. Subsective adjectives in particular introduce contextual sensitivity, often modeled via comparison-class variables or scalar thresholds. This prevents them from functioning as simple &#10216;e,t&#10217; predicates and motivates analyses that treat them as higher-order functions (<xref ref-type="bibr" rid="B21">Kamp 2013</xref>), mapping predicates to predicates or introducing additional structure such as intensional or event-based arguments. These compositional differences, whether in semantic type, functional structure, or context dependence, reflect fundamental distinctions in how adjectives integrate with nouns.</p>
<p>In the present study we operationalize this distinction by contrasting colour adjectives (e.g., <italic>red, green</italic>), which exemplify intersective modification, with size adjectives (e.g., <italic>big, small</italic>) which instantiate subsective, context-dependent modification. This pairing provides an illustrative instantiation of the broader intersective-subsective contrast, enabling an empirical comparison of how these adjective types pattern in task performance. Importantly, colour and size adjectives differ not only in intersectivity but also in other semantic dimensions (including gradability, context sensitivity and perceptual feature type) which are likely to jointly shape their processing profiles (Kennedy &amp; McNally 2005; <xref ref-type="bibr" rid="B38">Solt 2015</xref>).</p>
<p>Because intersective and subsective adjectives differ in how they semantically combine with nouns, this distinction also predicts differences in how adjective-noun combinations are processed in real time. The present study investigates whether these theoretically grounded differences surface in real-time comprehension, using visual matching tasks to test how colour and size adjectives combine with nouns during language processing.</p>
<p>Magnetoencephalography studies have revealed that combinatory LATL effects arise selectively for context-insensitive intersective adjectives (e.g., dead), but not for scalar, context-dependent ones (e.g., large, <xref ref-type="bibr" rid="B41">Ziegler &amp; Pylkk&#228;nen 2016</xref>). These findings have been interpreted as supporting a two-stage model of semantic composition, in which early, rapid integration is characteristic of intersective adjectives, while subsective ones undergo delayed, contextsensitive interpretation. These results align with the hypothesis that intersective adjectives afford more efficient composition during language processing. Complementary evidence from eye-tracking-studies further shows that the integration of adjectival meaning is modulated by contextual demands and adjective class. In visual-world-paradigms, colour adjectives permit rapid referent identification, whereas gradable adjectives (both relative and absolute) yield later or more variable facilitation patterns. When contrastive visual contexts are available, facilitation emerges earlier for relative adjectives but only later for absolute ones. This asymmetry reflects the fact that absolute adjectives rely on fixed scalar standards and require threshold-based evaluation, which introduces additional processing demands (<xref ref-type="bibr" rid="B2">Aparicio, Xiang &amp; Kennedy 2015</xref>). Developmental findings reveal a comparable hierarchy: intersective adjectives such a s colour terms are processed more efficiently and accurately than relative or absolute adjectives, with children gradually approaching the adult pattern of earlier adjective integration (<xref ref-type="bibr" rid="B34">Redolfi &amp; Melloni 2025</xref>). While compositional advantages have been consistently observed for minimal phrases, these effects have been tested almost exclusively with intersective adjectives describing perceptual features such as colour. It remains an open question whether similar facilitation occurs for context-dependent adjectives such as size terms. The current study directly compares these classes, assessing how adjective type and word order affect behavioral performance in a visual matching paradigm.</p>
</sec>
<sec>
<title>4 Current studies</title>
<p>In what follows, we present three experiments using a visual matching task to investigate how the semantic type of adjectives, specifically the distinction between intersective and subsective modification, affects compositional processing. Building on previous work by Bocanegra et al. (<xref ref-type="bibr" rid="B10">2022</xref>), we ask whether the compositional advantage (faster responses to two-word cues compared to single-word cues) arises uniformly across adjective types, or whether it reflects deeper semantic differences in how adjective-noun combinations are interpreted.</p>
<p>Experiment 1 replicates Bocanegra et al.&#8217;s (<xref ref-type="bibr" rid="B10">2022</xref>) Experiment 2b using intersective adjectives, specifically colour terms. Participants are shown linguistic cues consisting of either one feature (e.g., <italic>red</italic> or <italic>square</italic>) or two features (e.g., <italic>red square</italic>). In dual-feature trials, we further manipulate word order by comparing an adjective-first sequence (<italic>red square</italic>) to a noun-first sequence (<italic>square red</italic>). We predict that intersective adjective-noun combinations will yield a compositional advantage, replicating prior results. We also examine whether word order influences performance in this minimal compositional context. If modifiers require a referent to be fully integrated, then noun-first order might support more efficient composition. Alternatively, if processing is shaped by structural markedness or linear parsing expectations, adjective-first order may produce faster responses.</p>
<p>Experiment 2 extends this design to subsective adjectives such as <italic>big</italic> or <italic>small</italic>, which do not contribute independent intersective properties but rather depend on the noun for interpretation. This experiment tests whether a compositional advantage can arise when adjective-noun meaning is computed through context-sensitive, function-argument composition. If the previously observed advantage is tied specifically to predicate conjunction, then we should not observe facilitation for subsective phrases. However, if other compositional mechanisms can support efficient integration, a processing benefit may still emerge. We also explore word order effects. Because subsective adjectives depend more strongly on the noun for interpretation, presenting the noun first could potentially support more efficient composition. On the other hand, if a noun-first sequence introduces additional processing cost, we may see slower responses in that condition.</p>
<p>Experiment 3 directly compares intersective and subsective adjectives within a single task. Participants encounter both adjective types in blocked trials, allowing for a direct test of whether the compositional advantage varies systematically with semantic type. This experiment does not manipulate word order and focuses solely on the presence or absence of the advantage across adjective categories. If the efficiency of composition is modulated by semantic type, we expect a stronger or more reliable advantage for intersective than for subsective modification. Together, these experiments investigate whether the observed compositional advantage is a general feature of adjective-noun composition or whether it is modulated by differences in semantic type that are central to formal theories of modification. The results speak to the interface between lexical semantics and real-time language comprehension.</p>
</sec>
<sec>
<title>5 Experiment 1: Colour adjectives (replication of <xref ref-type="bibr" rid="B10">Bocanegra et al. 2022</xref>)</title>
<p>Experiment 1 examined the speed of visual verification following linguistic descriptions, replicating the design of Bocanegra et al., (<xref ref-type="bibr" rid="B10">2022</xref>). Participants were presented with images of varying shapes and colours and tasked with rapidly identifying targets matching either singlefeature (e.g., <italic>red</italic>) or double-feature (e.g., <italic>red square</italic>) descriptions. This design investigated whether the presence of two features facilitates visual search, reflecting efficient integration of word meanings during comprehension.</p>
<sec>
<title>5.1 Participants</title>
<p>Fifty-nine proficient English speakers with normal or corrected-to-normal vision were recruited through [university name redacted] Research Participation System and were compensated 5 Euros for their participation. To obtain the pre-registered sample of 48 complete data sets, additional participants were tested to replace cases with missing data. Participants were on average 25 years old.</p>
</sec>
<sec>
<title>5.2 Materials and procedures</title>
<p>The study was conducted in a laboratory setting. Stimuli were presented and responses recorded using PsychoPy (version 2023.2.3). Closely following Bocanegra et al.&#8217;s design, the experiment used the same adjective and noun sets (<italic>red, green; diamond, square</italic>) and trial structure. Each trial began with a 500 ms fixation cross, followed by the linguistic stimulus. In double-feature trials, participants saw an adjective and noun presented sequentially for 1 s each, separated by a 2 s blank screen (e.g., <italic>red diamond</italic> or <italic>diamond red</italic>). In single-feature trials, a single word was presented twice (1 s each, separated by a 2 s blank screen) to match the duration of doublefeature trials. Adjective-noun combinations were presented sequentially for one second per word, separated by a two second blank screen. In single-feature trials, the word was presented twice (for one second each presentation, separated by a two second blank screen) to maintain equal trial durations across all trial types.</p>
<p>After the linguistic stimulus a 2 s blank screen preceded the presentation of a single icon, which remained onscreen until the participant responded. Participants pressed the &#8220;A&#8221; key to indicate a match and the &#8220;L&#8221; key to indicate a mismatch between the linguistic and visual stimulus. Mismatch trials were designed such that the presented icon always mismatched the linguistic stimuli on all relevant semantic dimensions. For example, if the linguistic stimulus was <italic>red diamond</italic>, the icon would be a green square. If the linguistic stimulus was <italic>red</italic>, the icon would be either a <italic>green square</italic> or a <italic>green diamond</italic>, balanced across trials. Similarly, if the linguistic stimulus was <italic>diamond</italic>, the icon would be either a <italic>red square</italic> or a <italic>green square</italic>, balanced across trials.</p>
<p>The experiment comprised 128 trials (32 trials &#215; 4 blocks). The four blocks corresponded to the four conditions (adjective&#8211;noun order, noun&#8211;adjective order, adjective-only, noun-only); blocks were presented once per participant in randomized order, and trials were randomized within blocks. Each experimental session consisted of an equal number of match and mismatch trials (50% each). On match trials, the visual display corresponded to the linguistic cue (e.g., red square &#8594; red square), whereas on mismatch trials, the display and cue did not correspond (e.g., red square &#8594; green diamond, see <xref ref-type="fig" rid="F1">Figure 1</xref>). The preregistration is available at <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.17605/OSF.IO/M4ZPT">https://doi.org/10.17605/OSF.IO/M4ZPT</ext-link>.</p>
<fig id="F1">
<caption>
<p><bold>Figure 1:</bold> Illustration of the task design in Experiment 1. The top-left panel shows an example of a single-feature trial with an adjective cue (<italic>red</italic>) and the bottom-left panel shows a noun cue trial (<italic>square</italic>). The top-right panel depicts a double-feature trial with an adjective-first order (<italic>red square</italic>), and the bottom-right panel shows a doublefeature trial with a noun-first order (<italic>square red</italic>).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-11-23979-g1.png"/>
</fig>
</sec>
<sec>
<title>5.3 Data analysis and results</title>
<p>Accuracy rates for match<xref ref-type="fn" rid="n1">1</xref> trials were overall high, with 95.8% and 95.1% correct responses in the double-feature and single-feature trials, respectively. Reaction times faster than 100ms and slower than 2000 ms were excluded and only the mean reaction times from correct trials were analysed (&lt;1% of all valid trials).</p>
<p>Following Bocanegra et al. (<xref ref-type="bibr" rid="B10">2022</xref>), a repeated-measures analysis of variance (ANOVA) was conducted to examine the effects of feature number (single vs. double) and headedness (adjective-first [noun-headed] vs. noun-first [colour-headed] word order) on reaction times.<xref ref-type="fn" rid="n2">2</xref> All analyses were conducted in R (<xref ref-type="bibr" rid="B32">R Core Team 2023</xref>). ANOVA models were fitted using the afex package (<xref ref-type="bibr" rid="B36">Singmann et al. 2021</xref>), which reports generalized eta squared (<inline-formula>
<alternatives>
<mml:math id="Eq001-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>3</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
<tex-math id="M1">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^3
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e1.gif"/>
</alternatives>
</inline-formula>) as a measure of effect size.<xref ref-type="fn" rid="n3">3</xref></p>
<p>The analysis revealed a significant main effect of feature number, <italic>F</italic>(1, 47) = 15.89, <italic>p</italic> &lt; .001, <inline-formula>
<alternatives>
<mml:math id="Eq002-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mn>.036</mml:mn></mml:mrow>
</mml:math>
<tex-math id="M2">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^2 = .036
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e2.gif"/>
</alternatives>
</inline-formula>, with slower responses for single-feature trials (<italic>M</italic> = 610 ms, <italic>SD</italic> = 191 ms) compared to double-feature trials (<italic>M</italic> = 565 ms, <italic>SD</italic> = 183 ms). A significant main effect of headedness also emerged, <italic>F</italic>(1, 47) = 9.93, <italic>p</italic> = .003, <inline-formula>
<alternatives>
<mml:math id="Eq003-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mn>.011</mml:mn></mml:mrow>
</mml:math>
<tex-math id="M3">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^2  = .011
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e3.gif"/>
</alternatives>
</inline-formula>, with faster responses to colour-headed cues (<italic>M</italic> = 572 ms, <italic>SD</italic> = 183 ms) than to noun-headed cues (<italic>M</italic> = 602 ms, <italic>SD</italic> = 183 ms, see <xref ref-type="fig" rid="F2">Figure 2</xref>).</p>
<fig id="F2">
<caption>
<p><bold>Figure 2:</bold> Mean reaction times in Experiment 1 for double-feature trials (adjective-first vs. noun-first order) and single-feature trials (colour vs. noun cues). Asterisks indicate statistically significant effects based on the overall ANOVA and post hoc pairwise comparisons.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-11-23979-g2.png"/>
</fig>
<p>The interaction between feature number and headedness was significant, <italic>F</italic>(1, 47) = 4.78, <italic>p</italic> = .034, <inline-formula>
<alternatives>
<mml:math id="Eq004-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mn>.005</mml:mn></mml:mrow>
</mml:math>
<tex-math id="M4">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^2 = .005
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e4.gif"/>
</alternatives>
</inline-formula>, indicating that the effect of feature number on reaction time also depended on cue headedness. In particular, the cost of single-feature trials was greater when the cue was noun-headed (<italic>M</italic> = 627 ms, <italic>SD</italic> = 191 ms) compared to colour-headed (<italic>M</italic> = 592 ms, <italic>SD</italic> = 190 ms), indicating that the effect of feature number varied slightly as a function of cue headedness. While all effects reached statistical significance, the effect of feature number (<inline-formula>
<alternatives>
<mml:math id="Eq005-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mn>.036</mml:mn></mml:mrow>
</mml:math>
<tex-math id="M5">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^2 = .036
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e2.gif"/>
</alternatives>
</inline-formula>) was notably the strongest, suggesting that the primary influence on reaction times was the number of features rather than the headedness itself.</p>
<p>To assess the robustness of these effects while accounting for random variation across participants and items we analyzed log-transformed reaction times, using a linear mixed-effects model (LMM) fitted in R (<xref ref-type="bibr" rid="B32">R Core Team, 2023</xref>) with the <italic>lme4</italic> package (<xref ref-type="bibr" rid="B6">Bates et al. 2015</xref>). The model included fixed effects of feature number (single vs. double), headedness (noun-headed vs. colour-headed), and their interaction, with random slopes for both predictors by participant and random intercepts for items.</p>
<p>The model replicated the ANOVA findings, revealing a significant main effect of feature number, <italic>t</italic>(45) = 3.72, <italic>p</italic> &lt; .001 and headedness (<italic>t</italic>(41) = 3.38, <italic>p</italic> = .002), as well as a significant interaction between the two factors (<italic>t</italic>(2207) = 2.84, <italic>p</italic> = .005).</p>
<p>Post hoc pairwise comparisons were conducted using the Kenward&#8211;Roger method for degrees of freedom estimation, with Tukey adjustments applied to control the family-wise error rate (via the <italic>emmeans</italic> package; <xref ref-type="bibr" rid="B23">Lenth 2023</xref>).</p>
<p>Participants responded significantly slower in the single-feature noun-headed condition (e.g., <italic>square, M</italic> = 667 ms) compared to the single-feature colour-headed condition (e.g., <italic>red, M</italic> = 626 ms), <italic>t</italic>(2378) = 4.40, <italic>p</italic> &lt; .001. Reaction times for single-feature noun-headed trials were also significantly slower than for both the double-feature colour-headed condition (e.g., <italic>red square, M</italic> = 595 ms), <italic>t</italic>(2379) = 7.74, <italic>p</italic> &lt; .001, and the double-feature noun-headed condition (e.g., <italic>square red, M</italic> = 607 ms), <italic>t</italic>(2374) = 6.59, <italic>p</italic> &lt; .001.</p>
<p>In contrast, responses to <italic>red</italic> (<italic>M</italic> = 626 ms) were significantly slower than to <italic>square red</italic> (<italic>M</italic> = 552 ms), <italic>t</italic>(2379) = 3.46, <italic>p</italic> = .003, but did not differ significantly from <italic>red square</italic> (<italic>M</italic> = 577 ms), <italic>t</italic>(2378) = 2.20, <italic>p</italic> = .123. Together, these comparisons show that the apparent advantage for double-feature cues was primarily driven by slower responses to nouns presented alone; for adjectives, combining them with a second feature only led to significantly faster responses in adjective-first(e.g., <italic>red square</italic>), but not in adjective second phrases (e.g., <italic>square red</italic>).</p>
</sec>
<sec>
<title>5.4 Discussion of experiment 1</title>
<p>This study investigated how feature number (single vs. double) and word order (adjective-first vs. noun-first) influence reaction times in a visual feature-matching task. Replicating prior work (<xref ref-type="bibr" rid="B10">Bocanegra et al. 2022</xref>; <xref ref-type="bibr" rid="B7">Bemis &amp; Pylkk&#228;nen 2011</xref>; <xref ref-type="bibr" rid="B8">2013a</xref>; <xref ref-type="bibr" rid="B9">2013b</xref>; <xref ref-type="bibr" rid="B33">Rabagliati et al. 2017</xref>), responses were faster in double-feature than in single-feature trials, indicating a facilitation effect due to redundancy rather than increased syntactic complexity.</p>
<p>Both the repeated-measures ANOVA and linear mixed-effects analyses revealed a robust main effect of feature number, with faster responses when both features were available. Although greater linguistic complexity might intuitively predict slower responses, this result aligns with evidence that redundant or congruent cues can facilitate visual processing and decision making (<xref ref-type="bibr" rid="B35">Rubio-Fern&#225;ndez 2016</xref>). A double-feature cue provides a more precise and format-congruent template for the two-dimensional visual target, allowing the linguistic representation to map more efficiently onto the perceptual input. In contrast, a single-feature cue under-specifies the visual object and leaves an irrelevant dimension to be filtered, increasing decision time. The interaction between feature number and headedness reached statistical significance but appears to reflect lexical category differences (adjective vs. noun) rather than a genuine order effect. In single-feature trials, headedness corresponds to lexical category (adjective vs. noun), whereas in double-feature trials it marks word order. Post hoc comparisons confirmed that word order did not reliably affect performance when both features were present. The small interaction likely reflects slower access to shape nouns relative to colour adjectives rather than differences in compositional processing.</p>
<p>The absence of a word-order effect is theoretically informative. Despite English grammar favouring the adjective-noun sequence, participants responded equally quickly to noun-first phrases. This pattern suggests that, under rapid matching conditions, participants relied primarily on feature-based integration of visual information rather than full syntactic composition. Both words independently activated relevant conceptual features, and their linear order had little influence on the efficiency of visual comparison. Because English adjectives lack agreement morphology, both sequences are locally interpretable and can serve as effective retrieval cues. Future work could test this further in morphologically richer languages or with tasks requiring deeper syntactic processing, where word order constraints might exert stronger effects.</p>
<p>The asymmetry between lexical categories (specifically, slower responses to nouns than to adjectives) likely reflects lexical&#8211;semantic and perceptual factors. Colour terms are less semantically complex, more visually diagnostic, and more rapidly accessed than nouns denoting shapes or objects (<xref ref-type="bibr" rid="B26">Nicholson &amp; Humphrey 2004</xref>; <xref ref-type="bibr" rid="B39">Werning 2010</xref>). This interpretation also accounts for the absence of a difference between single-colour and double-feature cues: the inclusion of a salient colour cue in either case already maximizes efficiency.</p>
<p>In sum, the present findings highlight that in rapid visual matching, performance depends primarily on feature-based congruency and lexical salience. The observed &#8220;composition effect&#8221; is best understood as a behavioural consequence of redundant and perceptually diagnostic cues, rather than evidence of explicit syntactic composition.</p>
</sec>
</sec>
<sec>
<title>6 Experiment 2: Size adjectives</title>
<p>The results of Experiment 1 indicated that redundant, converging linguistic cues facilitate efficient visual matching. Experiment 2 replicated this paradigm using size adjectives (<italic>big, small</italic>) instead of colour terms to test whether the facilitation observed in Experiment 1 generalizes across feature dimensions. If the effect reflects a general benefit of redundant feature specification, similar performance patterns should emerge for size-based descriptions. We used a similar verification paradigm as in Experiment 1. Participants judged whether presented icons corresponded to single- or double-feature descriptions. Crucially, this experiment presented only subsective adjectives (e.g., <italic>big, small</italic>), enabling investigation into whether facilitated processing of double-feature descriptions is a universal phenomenon in adjective-noun- phrase processing or restricted to specific adjective types.</p>
<p>We use a limited space (four icons) and simplified size distribution (two size adjectives) to create a processing environment similar to that of intersective adjectives. Thus, the integrated representations formed during composition should yield a processing advantage regardless of adjective type.</p>
<sec>
<title>6.1 Participants</title>
<p>Fifty native English speakers were recruited via Prolific (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://www.prolific.co">www.prolific.co</ext-link>) and compensated at a rate of &#163;9.00 per hour. The experiment took an average of 22 minutes to complete. To obtain the pre-registered sample of 48 complete data sets, additional participants were tested to replace cases with missing reaction time data. No demographic information (e.g., age, gender) was collected, as participants were recruited anonymously via Prolific.<xref ref-type="fn" rid="n4">4</xref></p>
</sec>
<sec>
<title>6.2 Materials and procedure</title>
<p>To maintain consistency across experiments, Experiment 2 adapted the replication procedure, utilizing new adjectives (<italic>big, small</italic>), nouns (<italic>car, boat</italic>), and corresponding images. Data collection took place online using Pavlovia (version 2024.1.4).</p>
<p>Simple purple line drawings of these objects in two sizes served as icons. The small car icon was approximately 1.5 cm wide (&#8776;130 px), and the large car icon was approximately 10 cm wide (&#8776;370 px) when displayed on a 1920 &#215; 1080n pixel screen. The boat icons were similarly scaled.</p>
<p>As before, the experiment consisted of 128 trials (32 trials per block &#215; 4 blocks), equally divided among the four trial types: adjective-first order, noun-first order, adjective-only, and noun-only. The four blocks corresponded to these conditions and were presented once per participant in a randomized order. For double-feature trials (adjective-noun phrases), word order was fixed within each block (adjective&#8211;noun vs. noun&#8211;adjective). Single-feature trials were presented in dedicated adjective-only and noun-only blocks. Within each block, trials were randomized. Stimuli were presented and responses recorded using PsychoPy (version 2023.2.3) and Pavlovia (platform version 2024.1.4). Each experimental session consisted of an equal number of match and mismatch trials (50 % each). On match trials, the visual display corresponded to the linguistic cue (e.g., <italic>big boat</italic> &#8594; big boat, see <xref ref-type="fig" rid="F3">Figure 3</xref>), whereas on mismatch trials, the display and cue did not correspond (e.g., <italic>big boat</italic> &#8594; small car). The preregistration is available at <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.17605/OSF.IO/DQUGW">https://doi.org/10.17605/OSF.IO/DQUGW</ext-link>.</p>
<fig id="F3">
<caption>
<p><bold>Figure 3:</bold> Illustration of the task design in Experiment 2. The top-left panel shows an example of a single-feature trial with an adjective cue (<italic>big</italic>) and the bottom-left panel shows a single-feature trial with a noun cue (<italic>boat</italic>). The top-right panel depicts a double-feature trial with an adjective-first order (<italic>big boat</italic>), and the bottom-right panel shows a double-feature trial with a noun-first order (<italic>boat big</italic>).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-11-23979-g3.png"/>
</fig>
</sec>
<sec>
<title>6.3 Data analysis and results</title>
<p>Data were analysed using the same procedures and packages as in Experiment 1. Accuracy was high overall. On match<xref ref-type="fn" rid="n5">5</xref> trials participants responded correctly on 92.0% of double-feature trials and 89.9% of single-feature trials (24.1% of correct match trials were excluded due to RT trimming &lt;100 ms or &gt;2000 ms).</p>
<p>The ANOVA revealed a significant main effect of feature number, <italic>F</italic>(1, 47) = 8.48, <italic>p</italic> = .005, <inline-formula>
<alternatives>
<mml:math id="Eq006-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mn>.014</mml:mn></mml:mrow>
</mml:math>
<tex-math id="M6">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^2 = .014
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e5.gif"/>
</alternatives>
</inline-formula>, with faster responses in the double-feature condition (<italic>M</italic> = 666 ms, <italic>SD</italic> = 257 ms) than in the single-feature condition (<italic>M</italic> = 698 ms, <italic>SD</italic> = 260 ms).</p>
<p>No significant main effect of headedness was observed, <italic>F</italic>(1, 47) = 0.46, <italic>p</italic> = .549, <inline-formula>
<alternatives>
<mml:math id="Eq007-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mtext>&#x00A0;</mml:mtext><mml:mo>&#x003C;</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mn>.001</mml:mn></mml:mrow>
</mml:math>
<tex-math id="M7">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^2 &lt; .001
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e6.gif"/>
</alternatives>
</inline-formula>, indicating no reliable difference in response times between size-headed cues (<italic>M</italic> = 678 ms, <italic>SD</italic> = 263 ms) and noun-headed cues (<italic>M</italic> = 692 ms, <italic>SD</italic> = 243 ms).</p>
<p>The interaction between feature number and headedness was not significant, <italic>F</italic>(1, 47) = 1.99, <italic>p</italic> = .17, <inline-formula>
<alternatives>
<mml:math id="Eq008-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mtext>&#x00A0;</mml:mtext><mml:mo>&#x003C;</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mn>.001</mml:mn></mml:mrow>
</mml:math>
<tex-math id="M8">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^2 &lt; .001
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e6.gif"/>
</alternatives>
</inline-formula> (see <xref ref-type="fig" rid="F2">Figure 2</xref>). Across conditions, mean RTs were 677 ms for single&#8211;size trials, 717 ms for single&#8211;noun trials, 666 ms for double&#8211;size trials, and 666 ms for double&#8211;noun trials, indicating that responses were generally slower for single cues and especially for single nouns (see <xref ref-type="fig" rid="F4">Figure 4</xref>).</p>
<fig id="F4">
<caption>
<p><bold>Figure 4:</bold> Mean reaction times in Experiment 2 for double-feature trials (adjective-first e.g., <italic>big boat</italic> vs. nounfirst order e.g., <italic>boat big</italic>) and single-feature trials (size e.g., <italic>big</italic> vs. noun cues e.g., <italic>boat</italic>). Asterisks indicate statistically significant effects based on the overall ANOVA.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-11-23979-g4.png"/>
</fig>
<p>To verify the robustness of these effects, a linear mixed-effects model was fitted with byparticipant random slopes for both predictors and a by-item random intercept.</p>
<p>This model did not yield any reliable fixed effects (all <italic>p</italic> &gt; .20), indicating that once individual variability was explicitly modelled, no contrasts reached statistical significance.</p>
<p>Estimated marginal means (back-transformed from log RTs) ranged from 679 ms (single&#8211;size) to 663 ms (double&#8211;noun). Pairwise comparisons confirmed the absence of significant differences between any conditions (all Tukey-adjusted <italic>p</italic> &gt; .49). Together, the ANOVA and mixed-effects analyses converge on a numerically consistent but attenuated double-feature advantage. Participants were, on average, faster when both features were present, yet this effect was small and variable across individuals. The lack of a significant interaction or word-type difference suggests that the advantage was not modulated by lexical category (adjective vs. noun) and does not reflect sensitivity to word order.</p>
</sec>
<sec>
<title>6.4 Discussion of experiment 2</title>
<p>The results of Experiment 2 replicated the visual-matching paradigm of Experiment 1 using size adjectives (<italic>big, small</italic>) paired with nouns denoting concrete objects (<italic>boat, car</italic>). The repeatedmeasures ANOVA showed a significant main effect of feature condition, with faster responses for double-feature than single-feature cues. This pattern numerically replicated the facilitation observed in Experiment 1, but the linear mixed-effects model, which incorporated byparticipant random slopes, did not show any reliable fixed effects. Thus, once individual variability was fully modelled, the advantage for double-feature cues no longer reached statistical significance.</p>
<p>The attenuation of the effect is theoretically meaningful. Subsective adjectives such as big and small express relational properties that depend on comparison classes (a <italic>big car</italic> differs from a <italic>big boat</italic>). Unlike intersective colour terms, which map directly onto perceptual dimensions, size adjectives require an internal standard of comparison before evaluation can occur. For subsective adjectives such as <italic>big</italic> and <italic>small</italic>, efficient use of the noun information requires establishing a contextually appropriate comparison standard (e.g., <italic>big for a car</italic> vs. <italic>big for a boat</italic>). This evaluative process cannot be completed until the noun concept is fully accessed, which likely offsets any potential facilitation from having both words present. In other words, while the noun provides essential interpretive information, it also introduces a processing bottleneck: integrating size adjectives with their comparison class is slower and more variable than the direct perceptual mapping available for intersective (colour) adjectives.</p>
<p>As a result, their meanings cannot be mapped directly onto the visual input but must be interpreted relative to context, making the linguistic&#8211;visual correspondence inherently noisier. This attenuation is best understood as a consequence of semantic indeterminacy, not of syntactic composition. Whereas colour adjectives denote more fixed, perceptually grounded features, size adjectives are context-sensitive and therefore provide less stable and less redundant cues for matching. Participants could not simply align each word with an independent visual feature, since the notion of &#8220;bigness&#8221; or &#8220;smallness&#8221; is graded and depends on object category. Consequently, double-feature cues offered less additive benefit, because the size term did not contribute a discrete, perceptually verifiable dimension in the way colour did in Experiment 1. Importantly, word order again had no measurable effect, even though subsective adjectives were hypothesised to depend more on grammatical adjective-noun structure for interpretation. If participants had engaged in full syntactic composition to derive phrase-level meaning (e.g., <italic>big boat</italic>), the adjective-first order should have conferred an advantage over <italic>boat big</italic>.</p>
<p>The absence of such an effect suggests that even subsective adjectives did not trigger syntactic composition under these task conditions. Instead, participants relied on direct, feature-based mappings between linguistic and visual representations, with both words processed in parallel as partially independent semantic cues. The additional variability observed for subsective adjectives thus reflects greater ambiguity in conceptual mapping, not the engagement of syntactic mechanisms. Taken together, the results show that the double-feature advantage generalises only weakly to subsective adjectives. Whereas intersective adjectives support rapid, perceptually grounded feature matching, subsective adjectives engage more variable, contextdependent semantics. This pattern thus reinforces the view that the &#8220;composition effect&#8221; in this paradigm arises from redundant feature overlap and perceptual congruency.</p>
</sec>
</sec>
<sec>
<title>7 Experiment 3: Colour and size adjectives within participants</title>
<p>Experiment 3 combined the designs of the previous studies to directly compare intersective and subsective adjectives within the same participants. Using the same image-verification task, participants encountered trials containing both adjective types, allowing a within-subject test of whether the redundancy-based facilitation observed earlier differed by semantic class. Word-order manipulations were omitted to focus specifically on the impact of adjective type on performance in single- and double-feature cues.</p>
<sec>
<title>7.1 Participants</title>
<p>Ninety-nine native English speakers were recruited via Prolific (<ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://www.prolific.co">www.prolific.co</ext-link>) and compensated at a rate of &#163;10.80 per hour. The experiment took an average of approximately 21 minutes to complete. To obtain the pre-registered sample of 98 complete data sets, one additional participant was added to replace a case with missing reaction time data. After exclusions based on pre-registered criteria, 96 complete data sets were retained for analysis. No demographic information (e.g., age, gender) was collected, as participants were recruited anonymously via Prolific.</p>
</sec>
<sec>
<title>7.2 Materials and procedure</title>
<p>Experiment 3 largely mirrored the design and procedure of Experiments 1 and 2, closely following Bocanegra et al., (<xref ref-type="bibr" rid="B10">2022</xref>) but with key modifications. Most importantly, adjective type (colour vs. size) was manipulated within-subjects, allowing for direct comparison of processing differences while controlling for individual variability. Stimuli were drawn from Experiment 2 and the replication study, with <italic>orange</italic> replacing <italic>red</italic> in the intersective condition. Intersective trials used <italic>orange</italic> and <italic>green</italic> while subsective trials used <italic>big</italic> and <italic>small</italic>, both paired with the same nouns (<italic>boat</italic> and <italic>car</italic>). This design enabled a direct assessment of the compositional advantage for each adjective type within the same participants.</p>
<p>The experiment included blocks of adjective-noun phrases and blocks of single-feature trials presenting only the adjective (one block for each adjective type). A 2000 ms blank screen separated the linguistic stimuli and the presentation of a coloured line drawing of the object. Participants judged whether the icon matched or mismatched the preceding linguistic stimuli by pressing &#8216;A&#8217; for match or &#8216;L&#8217; for mismatch. Two trial types, intersective and subsective, employed different visual icons. Intersective trials used green or orange cars and boats presented in a medium size (approximately 12 cm or 400 px wide on a 1920 px-wide screen). Subsective trials used purple versions of the same objects in two sizes: small (approximately 4 cm or 133 px wide) and large (36 cm or 1200 px wide). These icons consisted of simple line drawings of the respective object categories. Prior to the main experiment, participants completed four practice trials.</p>
<p>The experiment consisted of 128 trials (32 trials per block &#215; 4 blocks), equally divided among the four trial types: intersective adjective-noun phrase, subsective adjective-noun phrase, intersective-only and subsective-only. The four blocks corresponded to these conditions and were presented once per participant in a randomized order. Within each block, trials were randomized. Stimuli were presented and responses recorded using PsychoPy (version 2023.2.3) and Pavlovia (platform version 2024.1.4). Each experimental session consisted of an equal number of match and mismatch trials (50% each). On match trials, the visual display corresponded to the linguistic cue (e.g., <italic>big boat</italic> &#8594; big boat), whereas on mismatch trials, the display and cue did not correspond (e.g., <italic>big boat</italic> &#8594; small car, see <xref ref-type="fig" rid="F5">Figure 5</xref>).</p>
<p>In Experiment 3, single-feature trials were restricted to adjectives rather than including nouns. This choice was motivated by two considerations. First, Experiment 1 already established that object-noun cues elicit slower responses than adjectives, likely reflecting differences in conceptual complexity and perceptual diagnosticity. Including additional single-noun trials would therefore not have provided new theoretical leverage. Second, limiting single-feature cues to adjectives allowed us to maintain a balanced number of trials across conditions and to keep total session length within feasible limits, given that Experiment 3 introduced both intersective and subsective adjective types within participants. This ensured that the experiment targeted the intended lexical&#8211;semantic contrast without imposing excessive task duration or fatigue. The preregistration is available at <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.17605/OSF.IO/J8K92">https://doi.org/10.17605/OSF.IO/J8K92</ext-link>.</p>
<fig id="F5">
<caption>
<p><bold>Figure 5:</bold> Illustration of the task design in Experiment 3. The top-left panel shows an example of a double-feature trial with a colour adjective (e.g., <italic>green boat</italic>), and the top-right panel shows the double-feature trial featuring a size adjective (e.g., <italic>big boat</italic>). The bottom-left panel depicts a single-feature trial with a colour adjective (e.g., <italic>green</italic>), and the bottom-right panel shows a single-feature trial with a size adjective (e.g., <italic>big</italic>).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-11-23979-g5.png"/>
</fig>
</sec>
<sec>
<title>7.3 Data analysis and results</title>
<p>Data were analysed using the same procedures and packages as in Experiments 1 and 2. Accuracy in match<xref ref-type="fn" rid="n6">6</xref> trials was high across all conditions (91&#8211;94%) and did not differ systematically by feature condition or adjective type, indicating that response-time patterns were not confounded by speed&#8211;accuracy trade-offs (<italic>21.3% of correct match trials were excluded due to RT trimming &lt;100 ms or &gt;2000 ms</italic>). Analyses of correct match trials revealed a consistent main effect of adjective type. A repeated-measures ANOVA with factors feature condition (single vs. double) and adjective type (colour vs. size) showed significantly slower responses for size than for colour adjectives, <italic>F</italic>(1, 95) = 23.97, <italic>p</italic> &lt; .001, <inline-formula>
<alternatives>
<mml:math id="Eq009-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
<mml:mtext>&#x00A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mn>.024</mml:mn></mml:mrow>
</mml:math>
<tex-math id="M9">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^2 = .024
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e7.gif"/>
</alternatives>
</inline-formula>. The main effect of feature condition was not significant, <italic>F</italic>(1, 95) = 3.07, <italic>p</italic> = .083, and there was no interaction, <italic>F</italic>(1, 95) = 0.28, <italic>p</italic> = .60. Mean reaction times (based on participant averages) were 691 ms and 661 ms for single- and double-feature intersective trials, and 745 ms and 728 ms for single- and double-feature subsective trials, respectively (see <xref ref-type="fig" rid="F6">Figure 6</xref>).</p>
<fig id="F6">
<caption>
<p><bold>Figure 6:</bold> Mean reaction times in Experiment 3 for double-feature trials featuring size adjectives (e.g., <italic>big boat</italic>) colour adjectives (e.g., <italic>orange boat</italic>) and single-feature trials (size e.g., <italic>big</italic> vs. colour e.g., <italic>orange</italic>). Asterisks indicate statistically significant effects based on the overall ANOVA.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-11-23979-g6.png"/>
</fig>
<p>A linear mixed-effects model including random slopes for both within-subject factors replicated the ANOVA pattern. Responses were reliably slower for size than colour adjectives (&#946; = &#8211;0.038 &#177; 0.009, <italic>t</italic>(19.9) = &#8211;4.11, <italic>p</italic> = .001), while neither the effect of feature condition (&#946; = 0.013, <italic>p</italic> = .18) nor the interaction (&#946; = 0.006, <italic>p</italic> = .46) reached significance. These converging results indicate that participants responded more slowly overall to size than to colour adjectives, but the advantage of double- over single-feature cues did not differ reliably between adjective types.</p>
<p>Given the absence of an interaction, exploratory follow-up models were fit within each adjective type to illustrate, rather than test, potential differences in magnitude. For colour adjectives, responses tended to be faster for double- than single-feature cues (&#946; = 0.022 &#177; 0.008, <italic>t</italic>(78) = 2.67, <italic>p</italic> = .009), corresponding to an advantage of roughly 35 ms (&#8776; 4&#8211;5 %). For size adjectives, no difference was observed (&#946; = 0.005 &#177; 0.016, <italic>t</italic>(7.5) = 0.31, <italic>p</italic> = .77). This pattern should be interpreted cautiously, as the between-type contrast was not statistically reliable.</p>
</sec>
<sec>
<title>7.4 Discussion of experiment 3</title>
<p>Experiment 3 directly compared intersective (colour) and subsective (size) adjectives within the same participants, allowing a within-subject assessment of whether the redundancy-based facilitation observed in the previous experiments generalizes across adjective types. Overall, participants responded more slowly to subsective adjectives than to intersective ones, but there was no reliable interaction between adjective type and feature number. This indicates that both adjective types supported broadly similar patterns of visual&#8211;linguistic matching, with only modest numerical differences in facilitation that did not reach statistical significance. The consistent main effect of adjective type likely reflects differences in semantic specificity rather than differences in compositional processing. Intersective adjectives such as <italic>green</italic> denote stable, perceptually grounded properties that can be directly mapped onto visual features, facilitating rapid comparison between linguistic and visual representations. Subsective adjectives such as <italic>big</italic>, in contrast, describe relational or context-dependent properties whose interpretation depends on the object category (e.g., <italic>big for a car</italic> vs. <italic>big for a boat</italic>). Because these meanings are evaluated relative to an internal comparison standard, their mapping to visual input is less direct, resulting in slower and more variable responses overall.</p>
<p>The absence of a reliable interaction supports the interpretation that redundancy effects in this paradigm arise from the availability of directly alignable perceptual information rather than from syntactic or compositional integration. When both adjectives and nouns refer to perceptually separable dimensions, as in colour&#8211;shape combinations, redundant cues yield modest facilitation. When one of the cues is semantically context-dependent, as with size adjectives, this facilitation is attenuated, not because compositional mechanisms fail to apply, but because the relevant features are less perceptually determinate.</p>
<p>Together with Experiments 1 and 2, these findings indicate that apparent &#8220;combination&#8221; effects in this task are best understood as reflecting perceptual alignment and cue redundancy, rather than obligatory syntactic composition. The efficiency of linguistic&#8211;visual matching thus depends on the perceptual transparency and diagnosticity of the features being described, rather than on their grammatical configuration.</p>
</sec>
</sec>
<sec>
<title>8 General discussion</title>
<p>This study investigated how different types of adjectives influence the processing of adjective&#8211; noun combinations in real time. Across three experiments, we examined whether the semantic properties of adjectives, specifically the distinction between intersective and subsective types, modulate the efficiency of integrating linguistic and visual information during comprehension. Across experiments, a consistent but graded pattern emerged.</p>
<p>In Experiment 1, which used intersective (colour) adjectives, participants were faster to verify visual targets following adjective&#8211;noun phrases than after single-word cues. This replicated the previously observed redundancy-based facilitation effect. In Experiment 2, which tested subsective (size) adjectives, a similar but weaker trend was observed, and the facilitation no longer reached significance once random variation was accounted for in a mixed-effects model. In Experiment 3, which directly contrasted both adjective types within the same participants, responses to subsective adjectives were generally slower than to intersective ones, but no reliable interaction between feature number and adjective type was found. Taken together, these results suggest that the efficiency of visual&#8211;linguistic matching varies with the semantic transparency of the features involved, rather than reflecting a categorical difference in linguistic compositional processes between adjective types.</p>
<p>Intersective adjectives, such as <italic>green</italic>, denote stable, perceptually grounded properties that can be mapped directly onto visual features. Subsective adjectives, such as <italic>big</italic>, describe relational or context-dependent properties that require a comparison class and therefore depend more heavily on contextual information. This difference likely underlies the overall slowdown observed for subsective adjectives, but it does not constitute evidence for qualitatively distinct compositional mechanisms.</p>
<p>The present findings therefore do not demonstrate a processing dissociation between intersective and subsective adjectives. Instead, they suggest that redundancy and perceptual alignment, rather than differences in linguistic compositional processes, determine the speed and efficiency of performance in this task. When cues provide multiple directly alignable features, as in colour&#8211;shape combinations, responses are faster. When one feature is semantically less determinate, as in size&#8211;object combinations, this facilitation is attenuated, not because compositional mechanisms fail to apply, but because the relevant features are less perceptually determinate. This interpretation is consistent with recent proposals that composition-like effects in such paradigms may reflect feature-based mappings between linguistic cues and visual dimensions, rather than syntactic assembly alone (e.g., <xref ref-type="bibr" rid="B7">Bemis &amp; Pylkk&#228;nen 2011</xref>; Flick et al. 2022). Participants appear to rely on feature-based mappings between words and visual dimensions, processing both components in parallel without necessarily computing a fully <italic>linguistically</italic> compositional meaning.</p>
<p>Accordingly, the absence of word order effects across all three experiments indicates that participants did not engage in syntactic composition under these task conditions, likely because both cues were sufficient to identify the visual referent.</p>
<p>Across all three experiments, exploratory mismatch trial analyses<xref ref-type="fn" rid="n7">7</xref> showed uniformly high accuracy (typically above 90%) and closely mirrored the reaction-time patterns observed for match trials. Participants reliably rejected nonmatching displays, with minimal variability across conditions. In Experiment 1, mismatch responses were faster and more accurate for double-feature cues and for adjective-headed (colour-based) phrases, reflecting redundancydriven facilitation similar to that observed in match trials. In Experiments 2 and 3, which included size adjectives, accuracy again remained near ceiling, and the same small advantage for double cues appeared only with intersective (colour) adjectives. Subsective (size) adjectives showed no such effect. Taken together, the mismatch data confirm that the facilitation associated with redundant or perceptually diagnostic features is not limited to affirmative (match) responses but extends to rejection trials as well. These converging patterns strengthen the interpretation that the observed effects reflect general efficiency in feature-based mapping rather than speed&#8211;accuracy trade-offs or response biases.</p>
<p>Although the current results align with formal semantic distinctions between adjective types, they do not imply that these distinctions have direct and categorical processing consequences. Instead, they point to a more graded relationship between semantic determinacy and processing efficiency: adjectives that denote concrete, perceptually grounded properties support faster visual matching than those that encode relational or context-sensitive ones. This graded pattern may reconcile prior theoretical and neurocognitive findings by suggesting that differences between adjective types emerge only under conditions requiring detailed compositional analysis.</p>
<p>It is worth noting that in the size-adjective conditions, the objects were depicted in a uniform purple hue that was not mentioned in the linguistic cue (for example, <italic>big boat</italic>). While this visual&#8211;linguistic asymmetry could have introduced a minor perceptual mismatch, all objects necessarily possessed some colour, and the hue was constant across conditions. Future work could directly test whether such incidental colour incongruities contribute to the slower responses observed for subsective adjectives. The consistent processing advantage for intersective adjectives likely arises from their direct perceptual grounding: their meaning can be mapped onto visual features without additional contextual computation. In contrast, subsective adjectives require evaluating the noun relative to a comparison class, introducing variability and slowing identification even when both words are available.</p>
</sec>
<sec>
<title>Supplementary Materials</title>
<sec>
<title>Experiment 1</title>
<sec>
<title>Supplement A: Trial and Learning Effects</title>
<p>To evaluate potential practice or learning effects introduced by blocking, we re-analyzed the preregistered 48-participant dataset using three linear mixed-effects models (LMMs) on logtransformed reaction times (RTs). Each model included by-participant random intercepts and a random slope for the relevant trial index (<italic>lme4</italic>; <xref ref-type="bibr" rid="B6">Bates et al. 2015</xref>).</p>
<p>We defined three trial indices capturing learning at different levels of granularity (global, within-block, and within-condition), as described in Experiment 1.</p>
</sec>
</sec>
<sec>
<title>Results</title>
<p>Participants showed reliable speed-up over time:</p>
<list list-type="bullet">
<list-item><p><bold>Global model:</bold> <italic>&#946;</italic> = &#8211;0.050 (SE = 0.0105), <italic>t</italic>(&#8776;35.1) = &#8211;4.79, <italic>p</italic> &lt; .001.</p>
<p>Interactions with feature number and headedness were <bold>not</bold> significant (all <italic>p</italic> &gt; .15) <bold>except</bold> a small three-way interaction, &#946; = 0.0167 (SE = 0.0073), <italic>t</italic>(&#8776;1288) = 2.29, <italic>p</italic> = .022.</p></list-item>
<list-item><p><bold>Within-block model:</bold> <italic>&#946;</italic> = &#8211;0.0249 (SE = 0.0061), <italic>t</italic>(&#8776;42.1) = &#8211;4.09, <italic>p</italic> &lt; .001. The interactions were <bold>not</bold> reliably different from zero (&#124;<italic>t</italic>&#124; &#8804; 1.69, <italic>p</italic> &#8805; .09).</p></list-item>
<list-item><p><bold>Within-condition model:</bold> <italic>&#946;</italic> = &#8211;0.0399 (SE = 0.0073), <italic>t</italic>(&#8776;42.4) = &#8211;5.45, <italic>p</italic> &lt; .001. Interactions with feature number, headedness, and their two-way were <bold>not</bold> significant (all <italic>p</italic> &#8805; .12).</p></list-item>
</list>
<p>Taken together, these analyses show a general practice effect (faster responses later in the session) that is similar across blocks and conditions. Thus, the blocking scheme did not introduce condition-specific learning that could explain the main results.</p>
<sec>
<title>Supplement B: Mismatch Data</title>
<p>Accuracy in mismatch trials was uniformly high across all conditions (95-98%; <italic>single-noun</italic>: 95.2%; <italic>single-adjective</italic>: 97.1%; <italic>double adjective-first</italic>: 97.7%; <italic>double noun-first</italic>: 98.3%). A generalized linear mixed-effects model with random intercepts for participants and items showed higher accuracy for double-feature cues than for single-feature cues (<italic>&#946;</italic> = &#8211;0.34, <italic>SE</italic> = 0.13, <italic>z</italic> = &#8211;2.54, <italic>p</italic> = .011). The main effect of headedness did not reach significance (<italic>&#946;</italic> = &#8211;0.23, <italic>SE</italic> = 0.13, <italic>z</italic> = &#8211;1.71, <italic>p</italic> = .087), although the numerical pattern suggested slightly higher accuracy for colour-headed than for shape-headed cues. There was no interaction (<italic>p</italic> = .69). For correct rejections, reaction times (RTs) were analyzed with a linear mixed-effects model on log-transformed RTs including random intercepts for participants and items. Responses were faster for double- than single-feature cues (&#946; = 0.039, <italic>SE</italic> = 0.006, <italic>t</italic>(7.02) = 6.13, <italic>p</italic> &lt; .001) and for adjective-headed (<italic>single-colour</italic> or <italic>noun-first</italic>) than for noun-headed <italic>(single-noun</italic> or <italic>adjective-first</italic>) cues (&#946; = 0.021, <italic>SE</italic> = 0.006, <italic>t</italic>(7.02) = 3.31, <italic>p</italic> = .013), with no interaction (<italic>p</italic> = .60). For correct rejections, reaction times (RTs) were analyzed with a linear mixed-effects model on log-transformed RTs including random intercepts for participants and items. Responses were faster for double- than single-feature cues (&#946; = 0.039, <italic>SE</italic> = 0.006, <italic>t</italic>(7.02) = 6.13, <italic>p</italic> &lt; .001) and for adjective-headed (<italic>single-colour</italic> or <italic>noun-first</italic>) than for noun-headed <italic>(single-noun</italic> or <italic>adjective-first</italic>) cues (&#946; = 0.021, <italic>SE</italic> = 0.006, <italic>t</italic>(7.02) = 3.31, <italic>p</italic> = .013), with no interaction (<italic>p</italic> = .60). Consistent with these effects, back-transformed estimated means showed faster responses for colour than for noun cues in single-feature trials (625 vs. 657 ms) and for adjective-first compared to noun-first sequences in double-feature trials (582 vs. 603 ms).</p>
</sec>
<sec>
<title>Supplement C: Accuracy Analysis Match Data</title>
<p>To complement the RT analyses, we examined accuracy patterns on <italic>match</italic> trials in Experiment 1. Accuracy was very high overall. In the full sample, mean accuracy ranged from 95.0 % to 96.2 % across conditions. A binomial GLMM predicting trial-level accuracy from featurecondition (single vs. double) and headedness (noun vs. colour), with random intercepts for participant and item, revealed no significant effects or interactions (all &#124;z&#124; &lt; 1, p &gt; .38).</p>
<p>Estimated marginal probabilities on the response scale confirmed near-ceiling performance (<italic>single-noun</italic>: 0.959 (SE = 0.009), <italic>double adjective-first</italic>: 0.964 (SE = 0.008), <italic>single-colour</italic>: 0.963 (SE = 0.008), <italic>double adjective-first</italic>: 0.970 (SE = 0.007). The model showed a boundary (singular) fit due to negligible item variance, consistent with uniformly high accuracy.</p>
</sec>
</sec>
<sec>
<title>Experiment 2</title>
<sec>
<title>Supplement D: Learning Effects</title>
<p>To evaluate potential practice or learning effects introduced by blocking, we re-analysed the preregistered 48-participant dataset with three linear mixed-effects models (LMMs) on logtransformed RTs. Each model included by-participant random intercepts and a random slope for the corresponding trial index (lme4; <xref ref-type="bibr" rid="B6">Bates et al. 2015</xref>). Three trial indices were constructed to capture learning at different levels of granularity: a global index counting trials across the whole session, a within-block index (for the four observed blocks: <italic>single-size, single-noun, double adjective-first, double noun-first</italic>), and a within-condition index collapsing across single vs. double trials.</p>
</sec>
</sec>
<sec>
<title>Results</title>
<p>No reliable evidence of systematic practice or learning effects was found.</p>
<list list-type="bullet">
<list-item><p><bold>Global model:</bold> <italic>&#946;</italic> = &#8211;0.0029 (SE = 0.0103), <italic>t</italic>(&#8776; 41) = &#8211;0.29, <italic>p</italic> = .78. A small interaction between trial and feature condition was significant (<italic>&#946;</italic> = 0.0184, <italic>SE</italic> = 0.0070, <italic>t</italic>(&#8776; 1882) = 2.63, <italic>p</italic> = .009), indicating a slightly flatter slope for double- than single-feature trials. The main effects reflected the expected baseline RT differences between conditions, but there was no evidence that these patterns changed systematically over trials.</p></list-item>
<list-item><p><bold>Within-block model:</bold> <italic>&#946;</italic> = 0.0079 (<italic>SE</italic> = 0.0056), <italic>t</italic>(&#8776; 526) = 1.42, <italic>p</italic> = .16. Interactions with block label were not significant (all <italic>p</italic> &#8805; .28). Mean RTs differed across blocks (single &gt; double; noun &gt; size), but these differences remained constant over trials.</p></list-item>
<list-item><p><bold>Within-condition model:</bold> <italic>&#946;</italic> = &#8211;0.0024 (<italic>SE</italic> = 0.0099), <italic>t</italic>(&#8776; 33.1) = &#8211;0.24, <italic>p</italic> = .81. A small trial &#215; feature-condition interaction (<italic>&#946;</italic> = 0.0111, <italic>SE</italic> = 0.0054, <italic>t</italic>(&#8776; 2125) = 2.06, <italic>p</italic> = .04) and a marginal trial &#215; word-type interaction (<italic>p</italic> = .058) suggested weak, inconsistent trends. The three-way interaction was not significant (<italic>p</italic> = .81).</p></list-item>
</list>
<p>Taken together, these analyses show no systematic RT reductions over time and no evidence of condition-specific learning. Participants&#8217; response speeds remained stable across the session, indicating that the observed processing asymmetries reflect persistent representational differences rather than cumulative practice effects.</p>
<sec>
<title>Supplement E: Mismatch Data</title>
<p>Accuracy in mismatch trials was uniformly high across all conditions, indicating that participants reliably rejected nonmatching displays.</p>
<p>For correct rejections, reaction times (RTs) were analyzed using a linear mixed-effects model on log-transformed RTs with random intercepts for participants and items, and by-participant slopes for both within-subject factors.</p>
<p>There was no reliable main effect of feature condition (<italic>&#946;</italic> = 0.012, SE = 0.007, <italic>t</italic>(&#8776;13) = 1.67, <italic>p</italic> = .12). Responses were faster overall for colour than for size adjectives (<italic>&#946;</italic> = &#8211;0.037, <italic>SE</italic> = 0.008, <italic>t</italic>(&#8776;17) = &#8211;4.87, <italic>p</italic> &lt; .001). The interaction between feature condition and adjective type was small and only marginally significant (<italic>&#946;</italic> = 0.014, <italic>SE</italic> = 0.006, <italic>t</italic>(&#8776;7) = 2.27, <italic>p</italic> = .056), suggesting that any double-feature-related facilitation may depend on adjective type.</p>
<p>Back-transformed estimated means (geometric means) mirrored this pattern: for colour adjectives, double cues were faster than single cues (663 vs. 697 ms), whereas size adjectives showed no difference (733 vs. 731 ms). In sum, mismatch performance remained near ceiling, and double features yielded only a modest, adjective-specific RT advantage</p>
</sec>
<sec>
<title>Supplement F: Accuracy Analysis Match Data</title>
<p>Accuracy in match trials was high across all conditions (87&#8211;92%; single-size: 87.4%; singlenoun: 92.4%; double-size: 91.9%; double-noun: 92.1%). A generalized linear mixed-effects model with random intercepts for participants and items showed no significant effects of feature condition, word type, or their interaction (<italic>&#946;</italic> = &#8211;0.14, <italic>SE</italic> = 0.14, <italic>z</italic> = &#8211;1.00, <italic>p</italic> = .32; <italic>&#946;</italic> = &#8211;0.17, <italic>SE</italic> = 0.14, <italic>z</italic> = &#8211;1.21, <italic>p</italic> = .23; interaction <italic>p</italic> = .26).</p>
<p>Estimated accuracy probabilities confirmed that responses were comparably accurate across all cue types (<italic>single-size</italic>: 0.87; <italic>single-noun</italic>: 0.92; <italic>double-size</italic>: 0.92; <italic>double-noun</italic>: 0.92). Thus, while size cues tended to produce slightly lower accuracy numerically than noun cues, this difference did not reach significance, and accuracy was uniformly high overall.</p>
</sec>
</sec>
<sec>
<title>Experiment 3</title>
<sec>
<title>Supplement G: Learning Effects</title>
<p>To assess potential practice or learning effects in Experiment 3, we analyzed log-transformed reaction times (RTs) from correct match trials (0.1&#8211;2 s) using three linear mixed-effects models (LMMs) with by-participant random intercepts and random slopes for the relevant trial index. Each model examined RT change over the course of the experiment at a different level of aggregation. Specifically, the <italic>global model</italic> assessed overall change across the full session; the <italic>within-block model</italic> captured possible practice effects within each of the four block types (<italic>singlecolour, single-size, double-colour, double-size</italic>); and the <italic>within-condition model</italic> tested learning within feature conditions (single vs. double) while retaining adjective type (colour vs. size) as a second within-subject factor. In all models, coefficients associated with trial indices indicate learning slopes, whereas cross-sectional condition effects serve only as reference baselines.</p>
</sec>
</sec>
<sec>
<title>Results</title>
<p>No evidence for systematic practice or fatigue effects that could account for the main experimental outcomes</p>
<list list-type="bullet">
<list-item><p><bold>Global model:</bold> Across the full session, there was no evidence of general practicerelated speed-up (<italic>&#946;</italic> = 0.0073, <italic>SE</italic> = 0.0075, <italic>t</italic> = 0.97, <italic>p</italic> = .34). Interactions of trial position with feature number or adjective type were not significant (all <italic>p</italic> &#8805; .12), indicating that learning slopes did not differ across conditions. The only reliable effects were constant RT differences between conditions (responses were faster in double-feature than single-feature trials and slower for size than colour adjectives) reflecting stable baseline differences rather than learning.</p></list-item>
<list-item><p><bold>Within-block model:</bold> When examining trial order within each block, there was again no reliable indication of within-block practice (<italic>&#946;</italic> = 0.0029, <italic>SE</italic> = 0.0052, <italic>t</italic> = 0.57, <italic>p</italic> = .57). Several main effects of block label mirrored the overall RT differences between conditions (e.g., faster responses for <italic>double-colour</italic> than <italic>double-size</italic> trials), but their interactions with the trial index were small, inconsistent, and did not form a systematic pattern across blocks.</p></list-item>
<list-item><p><bold>Within-condition model:</bold> A similar picture emerged when learning was modeled within each feature condition. The overall trend toward faster RTs over trials was weak and non-significant (<italic>&#946;</italic> = &#8211;0.0104, <italic>SE</italic> = 0.0059, <italic>t</italic> = &#8211;1.76, <italic>p</italic> = .08). However, a small but significant three-way interaction between trial order, feature condition, and adjective type (<italic>&#946;</italic> = &#8211;0.0161, <italic>SE</italic> = 0.0049, <italic>t</italic> = &#8211;3.28, <italic>p</italic> = .001) indicated that the minimal learning observed was modulated by both factors. Inspection of fitted trends suggested that responses to colour adjectives showed a slightly greater improvement over time than those to size adjectives, particularly in the double-feature condition. No other trial-related effects reached significance.</p></list-item>
</list>
<p>Across all analyses, reaction times did not generally decrease over the session or within blocks, and only a minor interaction indicated slightly greater improvement for colour adjectives. Thus, the observed condition differences in Experiment 3 are stable and cannot be attributed to differential learning across time.</p>
<sec>
<title>Supplement H: Mismatch Data</title>
<p>Accuracy in mismatch trials was uniformly high across all conditions, indicating that participants reliably rejected nonmatching displays.</p>
<p>Reaction times (RTs) for correct rejections were analyzed using a linear mixed-effects model on log-transformed RTs, with random intercepts for participants and items and by-participant slopes for both within-subject factors.</p>
<p>There was no reliable main effect of feature condition (<italic>&#946;</italic> = .012, <italic>SE</italic> = .007, <italic>t</italic>(&#8776;13) = 1.67, <italic>p</italic> = .12). Responses were faster overall for colour than size adjectives (<italic>&#946;</italic> = &#8211;.036, <italic>SE</italic> = .008, <italic>t</italic>(&#8776;17) = &#8211;4.73, <italic>p</italic> &lt; .001). The interaction between feature condition and adjective type was small and only marginally significant (<italic>&#946;</italic> = .014, <italic>SE</italic> = .006, <italic>t</italic>(&#8776;7) = 2.29, <italic>p</italic> = .055), suggesting that any redundancy-related facilitation depended on adjective type.</p>
<p>Back-transformed means showed that colour adjectives elicited faster responses for double- than single-feature cues (&#8776; 665 ms vs. 690 ms), whereas size adjectives showed virtually identical latencies across cue types (&#8776; 730 ms for both). In sum, mismatch performance remained near ceiling, and redundancy yielded only a modest, adjective-specific RT advantage.</p>
</sec>
<sec>
<title>Supplement I: Accuracy Analysis Match Data</title>
<p>Accuracy in match trials was uniformly high across all conditions (92&#8211;96%), indicating that participants responded reliably when the cue and display matched.</p>
<p>A generalized linear mixed-effects model (GLMM) with random intercepts for participants and items revealed significantly higher accuracy for double- than single-feature cues (<italic>&#946;</italic> = &#8211;0.12, <italic>SE</italic> = 0.05, <italic>z</italic> = &#8211;2.33, <italic>p</italic> = .020) and for size compared to colour adjectives (<italic>&#946;</italic> = 0.25, <italic>SE</italic> = 0.05, z = 4.84, <italic>p</italic> &lt; .001). The interaction between feature condition and adjective type was not significant (<italic>p</italic> = .21).</p>
</sec>
<sec>
<title>Supplement J: Global Analysis of Exp. 1 and 2</title>
<p>To assess the consistency of redundancy and adjective-type effects across experiments, reaction times (RTs) from Experiments 1 and 2 were combined and analyzed using a linear mixed-effects model on log-transformed RTs. The model included fixed effects of feature condition (single vs. double), adjective type (colour vs. size), and their interaction, with by-participant random slopes for feature condition and by-item random intercepts.</p>
<p>Responses were faster overall for double- than single-feature cues (<italic>&#946;</italic> = 0.032, <italic>SE</italic> = 0.011, <italic>t</italic>(&#8776; 29) = 2.79, <italic>p</italic> = .009), and for colour compared to size adjectives (<italic>&#946;</italic> = &#8211;0.068, <italic>SE</italic> = 0.024, <italic>t</italic>(&#8776; 115) = &#8211;2.79, <italic>p</italic> = .006). The interaction between feature condition and adjective type was not significant (<italic>p</italic> = .71).</p>
<p>These results replicate the main double-feature and adjective-type effects observed in the individual experiments, indicating that both the facilitation from multiple cues and the speed advantage for colour adjectives generalize across experiments.</p>
</sec>
</sec>
</sec>
</body>
<back>
<sec>
<title>Data availability</title>
<p>Data and analysis scripts are available in the Open Science Framework repository, which can be accessed using the following link: <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://osf.io/3nqa9/">https://osf.io/3nqa9/</ext-link>.</p>
</sec>
<sec>
<title>Ethics and consent</title>
<p>This study was approved by, and carried out in accordance with the recommendations of, the Central European University Psychological Research Ethics Board. All participants gave written informed consent in accordance with the Declaration of Helsinki.</p>
</sec>
<sec>
<title>Acknowledgments</title>
<p>We thank the editor and two anonymous reviewers for their thoughtful and constructive comments and Attila Balla for assistance with data collection.</p>
</sec>
<sec>
<title>Competing interests</title>
<p>The authors have no competing interests to declare.</p>
</sec>
<fn-group>
<fn id="n1"><p>A detailed analysis of the correct mismatch trials is provided in the <italic>Supplementary Materials</italic>; results closely paralleled those observed for the match trials.</p></fn>
<fn id="n2"><p>Some data sets were automatically excluded during ANOVA due to missing values.</p></fn>
<fn id="n3"><p>Effect sizes are reported as generalized eta squared <inline-formula>
<alternatives>
<mml:math id="Eq010-mml">
<mml:mrow>
<mml:msubsup>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi>g</mml:mi>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
</mml:math>
<tex-math id="M10">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
\eta_g^2
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-11-23979-e8.gif"/>
</alternatives>
</inline-formula>, as recommended for repeated-measures designs (<xref ref-type="bibr" rid="B27">Olejnik &amp; Algina 2003</xref>; <xref ref-type="bibr" rid="B3">Bakeman 2005</xref>).</p></fn>
<fn id="n4"><p>Given the challenges of recruiting native English speakers locally, Experiments 2 and 3 were conducted online. This change in recruitment strategy aimed to mitigate potential confounds arising from non-native language processing in a study investigating subtle semantic effects.</p></fn>
<fn id="n5"><p>A detailed analysis of the correct mismatch trials is provided in the <italic>Supplementary Materials</italic>; results paralleled those observed for the match trials.</p></fn>
<fn id="n6"><p>A detailed analysis of the correct mismatch trials is provided in the <italic>Supplementary Materials</italic>; results closely paralleled those observed for the match trials.</p></fn>
<fn id="n7"><p>See section <italic>Supplementary Materials</italic> for details.</p></fn>
</fn-group>
<ref-list>
<ref id="B1"><mixed-citation publication-type="journal"><string-name><surname>Allen</surname>, <given-names>Richard</given-names></string-name> &amp; <string-name><surname>Alan</surname>, <given-names>Baddeley</given-names></string-name> &amp; <string-name><surname>Hitch</surname>, <given-names>Graham</given-names></string-name>. <year>2006</year>. <article-title>Is the binding of visual features in working memory resource-demanding?</article-title> <source>Journal of Experimental Psychology: General</source> <volume>135</volume>. <fpage>298</fpage>&#8211;<lpage>313</lpage>. <pub-id pub-id-type="doi">10.1037/0096-3445.135.2.298</pub-id></mixed-citation></ref>
<ref id="B2"><mixed-citation publication-type="journal"><string-name><surname>Aparicio</surname>, <given-names>H.</given-names></string-name> &amp; <string-name><surname>Xiang</surname>, <given-names>M.</given-names></string-name>, &amp; <string-name><surname>Kennedy</surname>, <given-names>C.</given-names></string-name> (<year>2015</year>). <article-title>Processing gradable adjectives in context: A visual world study</article-title>. <pub-id pub-id-type="doi">10.3765/salt.v25i0.3128</pub-id></mixed-citation></ref>
<ref id="B3"><mixed-citation publication-type="journal"><string-name><surname>Bakeman</surname>, <given-names>Roger</given-names></string-name>. <year>2005</year>. <article-title>Recommended effect size statistics for repeated measures designs</article-title>. <source>Behavior Research Methods</source> <volume>37</volume>(<issue>3</issue>). <fpage>379</fpage>&#8211;<lpage>384</lpage>. <pub-id pub-id-type="doi">10.3758/BF03192707</pub-id></mixed-citation></ref>
<ref id="B4"><mixed-citation publication-type="book"><string-name><surname>Barker</surname>, <given-names>Chris</given-names></string-name> &amp; <string-name><surname>Jacobson</surname>, <given-names>Pauline</given-names></string-name>. <year>2007</year>. <source>Direct compositionality</source> (Oxford Linguistics 14). <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>. <pub-id pub-id-type="doi">10.1093/oso/9780199204373.001.0001</pub-id></mixed-citation></ref>
<ref id="B5"><mixed-citation publication-type="book"><string-name><surname>Barsalou</surname>, <given-names>Lawrence W.</given-names></string-name> <year>2017</year>. <chapter-title>Cognitively plausible theories of concept composition</chapter-title>. In <string-name><surname>Hampton</surname>, <given-names>James A.</given-names></string-name> &amp; <string-name><surname>Winter</surname>, <given-names>Yoad</given-names></string-name> (eds.), <source>Compositionality and Concepts in Linguistics and Psychology</source> (Language, Cognition, and Mind), vol. <volume>3</volume>, <fpage>9</fpage>&#8211;<lpage>30</lpage>. <publisher-loc>Cham</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>. <pub-id pub-id-type="doi">10.1007/978-3-319-45977-6_2</pub-id></mixed-citation></ref>
<ref id="B6"><mixed-citation publication-type="journal"><string-name><surname>Bates</surname>, <given-names>Douglas</given-names></string-name> &amp; <string-name><surname>M&#228;chler</surname>, <given-names>Martin</given-names></string-name> &amp; <string-name><surname>Bolker</surname>, <given-names>Ben</given-names></string-name> &amp; <string-name><surname>Walker</surname>, <given-names>Steve</given-names></string-name>. <year>2015</year>. <article-title>Fitting linear mixed-effects models using lme4</article-title>. <source>Journal of Statistical Software</source> <volume>67</volume>(<issue>1</issue>). <pub-id pub-id-type="doi">10.18637/jss.v067.i01</pub-id></mixed-citation></ref>
<ref id="B7"><mixed-citation publication-type="journal"><string-name><surname>Bemis</surname>, <given-names>Douglas K.</given-names></string-name> &amp; <string-name><surname>Pylkk&#228;nen</surname>, <given-names>Liina</given-names></string-name>. <year>2011</year>. <article-title>Simple composition: A magnetoencephalography investigation into the comprehension of minimal linguistic phrases</article-title>. <source>Journal of Neuroscience</source>. <volume>31</volume>(<issue>8</issue>). <fpage>2801</fpage>&#8211;<lpage>2814</lpage> <pub-id pub-id-type="doi">10.1523/JNEUROSCI.5003-10.2011</pub-id></mixed-citation></ref>
<ref id="B8"><mixed-citation publication-type="journal"><string-name><surname>Bemis</surname>, <given-names>Douglas K.</given-names></string-name> &amp; <string-name><surname>Pylkk&#228;nen</surname>, <given-names>Liina</given-names></string-name>. <year>2013a</year>. <article-title>Basic linguistic composition recruits the left anterior temporal lobe and left angular gyrus during both listening and reading</article-title>. <source>Cerebral Cortex</source> <volume>23</volume>(<issue>8</issue>). <fpage>1859</fpage>&#8211;<lpage>1873</lpage>. <pub-id pub-id-type="doi">10.1093/cercor/bhs170</pub-id></mixed-citation></ref>
<ref id="B9"><mixed-citation publication-type="journal"><string-name><surname>Bemis</surname>, <given-names>Douglas K.</given-names></string-name> &amp; <string-name><surname>Pylkk&#228;nen</surname>, <given-names>Liina</given-names></string-name>. <year>2013b</year>. <article-title>Flexible composition: MEG evidence for the deployment of basic combinatorial linguistic mechanisms in response to task demands</article-title>. <source>PLoS ONE</source> <volume>8</volume>(<issue>9</issue>). <elocation-id>e73949</elocation-id>. <pub-id pub-id-type="doi">10.1371/journal.pone.0073949</pub-id></mixed-citation></ref>
<ref id="B10"><mixed-citation publication-type="journal"><string-name><surname>Bocanegra</surname>, <given-names>Bruno R.</given-names></string-name> &amp; <string-name><surname>Poletiek</surname>, <given-names>Fenna H.</given-names></string-name> &amp; <string-name><surname>Zwaan</surname>, <given-names>Rolf A.</given-names></string-name> <year>2022</year>. <article-title>Language concatenates perceptual features into representations during comprehension</article-title>. <source>Journal of Memory and Language</source> <volume>127</volume>. <elocation-id>104355</elocation-id>. <pub-id pub-id-type="doi">10.1016/j.jml.2022.104355</pub-id></mixed-citation></ref>
<ref id="B11"><mixed-citation publication-type="journal"><string-name><surname>Brockmole</surname>, <given-names>James</given-names></string-name> &amp; <string-name><surname>Parra</surname>, <given-names>Mario</given-names></string-name> &amp; <string-name><surname>Sala</surname>, <given-names>Sergio Della</given-names></string-name> &amp; <string-name><surname>Logie</surname>, <given-names>Robert</given-names></string-name>. <year>2008</year>. <article-title>Do binding deficits account for age-related decline in working memory?</article-title> <source>Psychonomic Bulletin &amp; Review</source> <volume>15</volume>. <fpage>543</fpage>&#8211;<lpage>547</lpage>. <pub-id pub-id-type="doi">10.3758/PBR.15.3.543</pub-id></mixed-citation></ref>
<ref id="B12"><mixed-citation publication-type="webpage"><string-name><surname>Chang</surname>, <given-names>Michael</given-names></string-name> &amp; <string-name><surname>Gupta</surname>, <given-names>Abhishek</given-names></string-name> &amp; <string-name><surname>Levine</surname>, <given-names>S.</given-names></string-name> &amp; <string-name><surname>Griffiths</surname>, <given-names>T.</given-names></string-name> <year>2018</year>. <article-title>Automatically Composing representation transformations as a means for generalization</article-title>. <source>ArXiv</source>. <uri>https://www.semanticscholar.org/paper/1766648967f6206a944a4bd18bbbd92a74c164bd</uri> (<day>13</day> <month>May</month>, 2025).</mixed-citation></ref>
<ref id="B13"><mixed-citation publication-type="journal"><string-name><surname>Chomsky</surname>, <given-names>N.</given-names></string-name> <year>1956</year>. <article-title>Three models for the description of language</article-title>. <source>IRE Transactions on Information Theory</source> <volume>2</volume>(<issue>3</issue>). <fpage>113</fpage>&#8211;<lpage>124</lpage>. <pub-id pub-id-type="doi">10.1109/TIT.1956.1056813</pub-id></mixed-citation></ref>
<ref id="B14"><mixed-citation publication-type="book"><string-name><surname>Chomsky</surname>, <given-names>Noam</given-names></string-name>. <year>2006</year>. <source>Language and mind</source>. <publisher-name>Cambridge University Press</publisher-name>. <pub-id pub-id-type="doi">10.1017/CBO9780511791222</pub-id></mixed-citation></ref>
<ref id="B15"><mixed-citation publication-type="journal"><string-name><surname>Frege</surname>, <given-names>Gottlob</given-names></string-name>. <year>1963</year>. <article-title>Compound thoughts</article-title>. <source>Mind</source> <volume>72</volume>(<issue>285</issue>). <fpage>1</fpage>&#8211;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.1093/mind/LXXII.285.1</pub-id></mixed-citation></ref>
<ref id="B16"><mixed-citation publication-type="book"><string-name><surname>Heim</surname>, <given-names>Irene</given-names></string-name> &amp; <string-name><surname>Kratzer</surname>, <given-names>Angelika</given-names></string-name>. <year>1998</year>. <source>Semantics in generative grammar</source>. <edition>1st</edition> edn. <publisher-name>Blackwell</publisher-name></mixed-citation></ref>
<ref id="B17"><mixed-citation publication-type="journal"><string-name><surname>Holyoak</surname>, <given-names>Keith J.</given-names></string-name> &amp; <string-name><surname>Stamenkovi&#263;</surname>, <given-names>Du&#353;an</given-names></string-name>. <year>2018</year>. <article-title>Metaphor comprehension: A critical review of theories and evidence</article-title>. <source>Psychological Bulletin</source> <volume>144</volume>(<issue>6</issue>). <fpage>641</fpage>&#8211;<lpage>671</lpage>. <pub-id pub-id-type="doi">10.1037/bul0000145</pub-id></mixed-citation></ref>
<ref id="B18"><mixed-citation publication-type="book"><string-name><surname>Humboldt</surname>, <given-names>Wilhelm Freiherr von</given-names></string-name>. <year>1876</year>. <source>Ueber die Verschiedenheit des menschlichen Sprachbaues und ihren Einfluss auf die geistige Entwickelung des Menschengeschlechts</source>. <publisher-name>S. Calvary</publisher-name>.</mixed-citation></ref>
<ref id="B19"><mixed-citation publication-type="journal"><string-name><surname>Jackendoff</surname>, <given-names>Ray</given-names></string-name> &amp; <string-name><surname>Pinker</surname>, <given-names>Steven</given-names></string-name>. <year>2005</year>. <article-title>The nature of the language faculty and its implications for evolution of language (Reply to Fitch, Hauser, and Chomsky)</article-title>. <source>Cognition</source> <volume>97</volume>(<issue>2</issue>). <fpage>211</fpage>&#8211;<lpage>225</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2005.04.006</pub-id></mixed-citation></ref>
<ref id="B20"><mixed-citation publication-type="journal"><string-name><surname>Kemp</surname>, <given-names>Charles</given-names></string-name>. <year>2012</year>. <article-title>Exploring the conceptual universe</article-title>. <source>Psychological Review</source> <volume>119</volume>(<issue>4</issue>). <fpage>685</fpage>&#8211;<lpage>722</lpage>. <pub-id pub-id-type="doi">10.1037/a0029347</pub-id></mixed-citation></ref>
<ref id="B21"><mixed-citation publication-type="book"><string-name><surname>Kamp</surname>, <given-names>Hans</given-names></string-name>. <year>2013</year>. <chapter-title>Two theories about adjectives</chapter-title>. In <string-name><surname>von Heusinger</surname>, <given-names>Klaus</given-names></string-name> &amp; <string-name><surname>ter Meulen</surname>, <given-names>Alice G. B.</given-names></string-name> (eds.), <source>Meaning and the dynamics of interpretation: Selected papers of Hans Kamp</source> (Current Research in the Semantics/Pragmatics Interface volume <volume>29</volume>). <publisher-loc>Leiden; Boston</publisher-loc>: <publisher-name>Brill</publisher-name>. <pub-id pub-id-type="doi">10.1163/9789004252882</pub-id></mixed-citation></ref>
<ref id="B22"><mixed-citation publication-type="journal"><string-name><surname>Kennedy</surname>, <given-names>Christopher</given-names></string-name>. <year>2007</year>. <article-title>Vagueness and grammar: The semantics of relative and absolute gradable adjectives</article-title>. <source>Linguistics and Philosophy</source> <volume>30</volume>(<issue>1</issue>). <fpage>1</fpage>&#8211;<lpage>45</lpage>. <pub-id pub-id-type="doi">10.1007/s10988-006-9008-0</pub-id></mixed-citation></ref>
<ref id="B23"><mixed-citation publication-type="webpage"><string-name><surname>Lenth</surname>, <given-names>Russell V.</given-names></string-name> <year>2023</year>. <article-title>emmeans: Estimated marginal means, aka least-squares means (R package version 1.8.5)</article-title>. <source>Vienna: R Foundation for Statistical Computing</source>. <uri>https://CRAN.Rproject.org/package=emmeans</uri></mixed-citation></ref>
<ref id="B24"><mixed-citation publication-type="journal"><string-name><surname>Logie</surname>, <given-names>Robert H.</given-names></string-name> &amp; <string-name><surname>Brockmole</surname>, <given-names>James R.</given-names></string-name> &amp; <string-name><surname>Vandenbroucke</surname>, <given-names>Annelinde R. E.</given-names></string-name> <year>2009</year>. <article-title>Bound feature combinations in visual short-term memory are fragile but influence long-term learning</article-title>. <source>Visual Cognition</source> <volume>17</volume>(<issue>1&#8211;2</issue>). <fpage>160</fpage>&#8211;<lpage>179</lpage>. <pub-id pub-id-type="doi">10.1080/13506280802228411</pub-id></mixed-citation></ref>
<ref id="B25"><mixed-citation publication-type="journal"><string-name><surname>Luck</surname>, <given-names>Steven J.</given-names></string-name> &amp; <string-name><surname>Vogel</surname>, <given-names>Edward K.</given-names></string-name> <year>1997</year>. <article-title>The capacity of visual working memory for features and conjunctions</article-title>. <source>Nature</source> <volume>390</volume>(<issue>6657</issue>). <fpage>279</fpage>&#8211;<lpage>281</lpage>. <pub-id pub-id-type="doi">10.1038/36846</pub-id></mixed-citation></ref>
<ref id="B26"><mixed-citation publication-type="journal"><string-name><surname>Nicholson</surname>, <given-names>Karen G.</given-names></string-name> &amp; <string-name><surname>Humphrey</surname>, <given-names>G. Keith</given-names></string-name>. <year>2004</year>. <article-title>The effect of colour congruency on shape discriminations of novel objects</article-title>. <source>Perception</source> <volume>33</volume>(<issue>3</issue>). <fpage>339</fpage>&#8211;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1068/p5136</pub-id></mixed-citation></ref>
<ref id="B27"><mixed-citation publication-type="journal"><string-name><surname>Olejnik</surname>, <given-names>Stephen</given-names></string-name> &amp; <string-name><surname>Algina</surname>, <given-names>James</given-names></string-name>. <year>2003</year>. <article-title>Generalized eta and omega squared statistics: Measures of effect size for some common research designs</article-title>. <source>Psychological Methods</source> <volume>8</volume>(<issue>4</issue>). <fpage>434</fpage>&#8211;<lpage>447</lpage>. <pub-id pub-id-type="doi">10.1037/1082-989X.8.4.434</pub-id></mixed-citation></ref>
<ref id="B28"><mixed-citation publication-type="webpage"><string-name><surname>Partee</surname>, <given-names>Barbara</given-names></string-name>. <year>1995</year>. <chapter-title>Lexical semantics and compositionality</chapter-title>. In <source>An invitation to cognitive science: Language</source> <volume>1</volume>. <fpage>311</fpage>&#8211;<lpage>360</lpage>. <edition>2nd</edition> edn. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>. <uri>https://www.cs.brandeis.edu/~jamesp/classes/cs216-2009/readings2009/ParteeSemanticsAndCompositionality.pdf</uri> (13 May, 2025).</mixed-citation></ref>
<ref id="B29"><mixed-citation publication-type="journal"><string-name><surname>Phillips</surname>, <given-names>Steven</given-names></string-name> &amp; <string-name><surname>Wilson</surname>, <given-names>William H.</given-names></string-name> <year>2011</year>. <article-title>Categorial compositionality II: Universal constructions and a general theory of (quasi-)systematicity in human cognition</article-title>. <source>PLoS Computational Biology</source> <volume>7</volume>(<issue>8</issue>). <elocation-id>e1002102</elocation-id>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002102</pub-id></mixed-citation></ref>
<ref id="B30"><mixed-citation publication-type="journal"><string-name><surname>Potter</surname>, <given-names>Mary C.</given-names></string-name> &amp; <string-name><surname>Faulconer</surname>, <given-names>Barbara A.</given-names></string-name> <year>1979</year>. <article-title>Understanding noun phrases</article-title>. <source>Journal of Verbal Learning and Verbal Behavior</source> <volume>18</volume>(<issue>5</issue>). <fpage>509</fpage>&#8211;<lpage>521</lpage>. <pub-id pub-id-type="doi">10.1016/S0022-5371(79)90274-3</pub-id></mixed-citation></ref>
<ref id="B31"><mixed-citation publication-type="journal"><string-name><surname>Pylkk&#228;nen</surname>, <given-names>Liina</given-names></string-name> &amp; <string-name><surname>Bemis</surname>, <given-names>Douglas K.</given-names></string-name> &amp; <string-name><surname>Elorrieta</surname>, <given-names>Estibaliz Blanco</given-names></string-name>. <year>2014</year>. <article-title>Building phrases in language production: An MEG study of simple composition</article-title>. <source>Cognition</source> <volume>133</volume>(<issue>2</issue>). <fpage>371</fpage>&#8211;<lpage>384</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2014.07.001</pub-id></mixed-citation></ref>
<ref id="B32"><mixed-citation publication-type="webpage"><collab>R Core Team</collab>. <year>2023</year>. <source>R: A language and environment for statistical computing</source>. <publisher-loc>Vienna, Austria</publisher-loc>: <publisher-name>R Foundation for Statistical Computing</publisher-name>. <uri>https://www.R-project.org</uri></mixed-citation></ref>
<ref id="B33"><mixed-citation publication-type="journal"><string-name><surname>Rabagliati</surname>, <given-names>Hugh</given-names></string-name> &amp; <string-name><surname>Doumas</surname>, <given-names>Leonidas A. A.</given-names></string-name> &amp; <string-name><surname>Bemis</surname>, <given-names>Douglas K.</given-names></string-name> <year>2017</year>. <article-title>Representing composed meanings through temporal binding</article-title>. <source>Cognition</source> <volume>162</volume>. <fpage>61</fpage>&#8211;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2017.01.013</pub-id></mixed-citation></ref>
<ref id="B34"><mixed-citation publication-type="journal"><string-name><surname>Redolfi</surname>, <given-names>M.</given-names></string-name> &amp; <string-name><surname>Melloni</surname>, <given-names>C.</given-names></string-name> <year>2025</year>. <article-title>Processing adjectives in development: Evidence from eyetracking</article-title>. <source>Journal of Child Language</source> <volume>52</volume>(<issue>2</issue>). <fpage>270</fpage>&#8211;<lpage>293</lpage>. <pub-id pub-id-type="doi">10.1017/S0305000923000703</pub-id></mixed-citation></ref>
<ref id="B35"><mixed-citation publication-type="journal"><string-name><surname>Rubio-Fern&#225;ndez</surname>, <given-names>Paula</given-names></string-name>. <year>2016</year>. <article-title>How redundant are redundant color adjectives? An efficiency-based analysis of color overspecification</article-title>. <source>Frontiers in Psychology</source> <volume>7</volume>. <pub-id pub-id-type="doi">10.3389/fpsyg.2016.00153</pub-id></mixed-citation></ref>
<ref id="B36"><mixed-citation publication-type="webpage"><string-name><surname>Singmann</surname>, <given-names>Henrik</given-names></string-name> &amp; <string-name><surname>Bolker</surname>, <given-names>Ben</given-names></string-name> &amp; <string-name><surname>Westfall</surname>, <given-names>Jacob</given-names></string-name> &amp; <string-name><surname>Aust</surname>, <given-names>Frederik</given-names></string-name> &amp; <string-name><surname>Ben-Shachar</surname>, <given-names>Mattan S.</given-names></string-name> <year>2021</year>. <source>afex: Analysis of factorial experiments (R package version 1.0-1)</source>. <publisher-name>CRAN</publisher-name>. <uri>https://CRAN.Rproject.org/package=afex</uri></mixed-citation></ref>
<ref id="B37"><mixed-citation publication-type="webpage"><string-name><surname>Solt</surname>, <given-names>S.</given-names></string-name> <year>2009</year>. <source>The semantics of adjectives of quantity</source> (Order No. 3349494). Available from ProQuest Dissertations &amp; Theses Global: The Humanities and Social Sciences Collection. (304859514). Retrieved from <uri>https://www.proquest.com/dissertations-theses/semantics-adjectives-quantity/docview/304859514/se-2</uri></mixed-citation></ref>
<ref id="B38"><mixed-citation publication-type="journal"><string-name><surname>Solt</surname>, <given-names>Stephanie</given-names></string-name>. <year>2015</year>. <article-title>Vagueness and Imprecision: Empirical Foundations</article-title>. <source>Annual Review Linguistics</source> <volume>1</volume>. <fpage>107</fpage>&#8211;<lpage>127</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-linguist-030514-125150</pub-id></mixed-citation></ref>
<ref id="B39"><mixed-citation publication-type="journal"><string-name><surname>Werning</surname>, <given-names>Markus</given-names></string-name>. <year>2010</year>. <article-title>Complex first? On the evolutionary and developmental priority of semantically thick words</article-title>. <source>Philosophy of Science</source> <volume>77</volume>(<issue>5</issue>). <fpage>1096</fpage>&#8211;<lpage>1108</lpage>. <pub-id pub-id-type="doi">10.1086/656826</pub-id></mixed-citation></ref>
<ref id="B40"><mixed-citation publication-type="book"><string-name><surname>Wittenberg</surname>, <given-names>Eva</given-names></string-name>. <year>2016</year>. <source>With light verb constructions from syntax to concepts</source>. <publisher-name>Universit&#228;tsverlag Potsdam</publisher-name>.</mixed-citation></ref>
<ref id="B41"><mixed-citation publication-type="journal"><string-name><surname>Ziegler</surname>, <given-names>Jayden</given-names></string-name> &amp; <string-name><surname>Pylkk&#228;nen</surname>, <given-names>Liina</given-names></string-name>. <year>2016</year>. <article-title>Scalar adjectives and the temporal unfolding of semantic composition: An MEG investigation</article-title>. <source>Neuropsychologia</source> <volume>89</volume>. <fpage>161</fpage>&#8211;<lpage>171</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuropsychologia.2016.06.010</pub-id></mixed-citation></ref>
<ref id="B42"><mixed-citation publication-type="journal"><string-name><surname>Ziegler</surname>, <given-names>Jayden</given-names></string-name> &amp; <string-name><surname>Snedeker</surname>, <given-names>Jesse</given-names></string-name> &amp; <string-name><surname>Wittenberg</surname>, <given-names>Eva</given-names></string-name>. <year>2018</year>. <article-title>Event structures drive semantic structural priming, Not thematic roles: Evidence from idioms and light verbs</article-title>. <source>Cognitive Science</source> <volume>42</volume>(<issue>8</issue>). <fpage>2918</fpage>&#8211;<lpage>2949</lpage>. <pub-id pub-id-type="doi">10.1111/cogs.12687</pub-id></mixed-citation></ref>
</ref-list>
</back>
</article>