<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.2" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id journal-id-type="issn">2397-1835</journal-id>
<journal-title-group>
<journal-title>Glossa: a journal of general linguistics</journal-title>
</journal-title-group>
<issn pub-type="epub">2397-1835</issn>
<publisher>
<publisher-name>Open Library of Humanities</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.16995/glossa.16797</article-id>
<article-categories>
<subj-group>
<subject>Squibs</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Experimental evidence for variation across exclusive modifiers</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Ronai</surname>
<given-names>Eszter</given-names>
</name>
<email>ronai@northwestern.edu</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fagen</surname>
<given-names>Lucas</given-names>
</name>
<email>lfagen@uchicago.edu</email>
<xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
</contrib-group>
<aff id="aff-1"><label>1</label>Northwestern University, US</aff>
<aff id="aff-2"><label>2</label>The University of Chicago, US</aff>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2025-01-23">
<day>23</day>
<month>01</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<volume>10</volume>
<issue>1</issue>
<fpage>1</fpage>
<lpage>22</lpage>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2025 The Author(s)</copyright-statement>
<copyright-year>2025</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://www.glossa-journal.org/articles/10.16995/glossa.16797/"/>
<abstract>
<p>This paper is an investigation of variation across the English exclusive modifiers <italic>only, just</italic>, and <italic>merely</italic>&#8212;a domain that has received ample attention in the theoretical literature but has thus far not been subjected to experimental testing. Using scalar diversity as a testing ground, we report on two experiments: Experiment 1 tests the robustness of exclusionary inference calculation (e.g., <italic>merely intelligent</italic> &#8594; <italic>not brilliant</italic>), and Experiment 2 directly tests whether a rank-order (e.g., <italic>not brilliant</italic>) or complement-exclusion (e.g., <italic>not ambitious</italic>) reading is preferred with different exclusives. Our findings reveal that 1) <italic>just</italic> excludes less robustly than the other two exclusives, and 2) while <italic>only</italic> allows both complement-exclusion and rank-order readings, <italic>just</italic> has a weak and <italic>merely</italic> a strong preference for rank-order ones. These results bear on previous theoretical observations about exclusives, and they are also informative about the robust by-scale variation in inference calculation, i.e., scalar diversity. Lastly, we argue that the methodological success of our experiments opens up avenues for further examination of more precise predictions of competing theoretical accounts.</p>
</abstract>
</article-meta>
</front>
<body>
<sec>
<title>1 Introduction</title>
<p>This squib experimentally investigates lexical semantic differences between the English exclusive modifiers <italic>only, just</italic>, and <italic>merely</italic>. Exclusives, as shown in (1), form a lexical class.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(1)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Mary <bold>only</bold> ate [the cookies]<sub><italic>F</italic></sub>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Mary <bold>just</bold> ate [the cookies]<sub><italic>F</italic></sub>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>c.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Mary <bold>merely</bold> ate [the cookies]<sub><italic>F</italic></sub>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Sentences with exclusives typically convey that some proposition is true&#8212;the <italic>prejacent</italic>, usually treated as a presupposition since Horn (<xref ref-type="bibr" rid="B12">1969</xref>)&#8212;as in (2a), and that alternatives to the prejacent are false, as in (2b).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(2)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Mary ate the cookies.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Mary did not eat alternatives to the cookies.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Beyond their common core meaning, exclusives vary considerably across different parameters (<xref ref-type="bibr" rid="B6">Coppock &amp; Beaver 2014</xref>). For example, individual exclusives exhibit significant flexibility in the syntactic category and semantic type of what they modify. Exclusives also impose different restrictions on the alternatives to the prejacent and how they can be ordered along differently structured scales (<xref ref-type="bibr" rid="B14">Horn 2000</xref>; <xref ref-type="bibr" rid="B17">Klinedinst 2005</xref>; <xref ref-type="bibr" rid="B2">Beaver &amp; Clark 2008</xref>; <xref ref-type="bibr" rid="B7">Fagen 2025</xref>). Additionally, some exclusives vary more widely than others. <italic>Just</italic> especially has a range of &#8220;noncanonical&#8221; (<xref ref-type="bibr" rid="B26">Warstadt 2020: p. 374</xref>) readings that <italic>only</italic> and <italic>merely</italic> lack, leading some researchers (<xref ref-type="bibr" rid="B26">Warstadt 2020</xref>; <xref ref-type="bibr" rid="B3">Beltrama 2021</xref>) to posit variation in the strength of the exclusive operation.</p>
<p>At the same time, variation between exclusives can be quite subtle and difficult to pin down. Such effects may emerge more starkly across items in an experimental setting than via intuition alone, and are therefore worth testing directly. Here, we present (to our knowledge) the first experimental assessment of this domain, using the scalar diversity phenomenon (<xref ref-type="bibr" rid="B25">van Tiel et al. 2016</xref>) as a testing ground. To preview our results, we find that 1) the likelihood of an exclusive inference is lower with <italic>just</italic> than with <italic>only</italic> and <italic>merely</italic>, and 2) the three exclusives vary in the extent to which they allow the exclusion of different kinds of alternatives: <italic>merely</italic> strongly prefers &#8220;rank-order&#8221; scales, <italic>just</italic> has a similar but weaker preference, while <italic>only</italic> freely allows both &#8220;rank-order&#8221; and &#8220;complement-exclusion&#8221; scales. We argue that the methodological success of our experiments also opens up avenues for further examination of the influence of semantic and pragmatic factors on the different parameters of variation.</p>
<p>The paper is structured as follows. We first review existing theoretical proposals about the semantics of exclusives as well as work on scalar diversity. We then present two experiments that compare how robustly different exclusives exclude alternatives, what type of alternatives they tend to exclude, and how this interacts with scalar diversity. Finally, we discuss the theoretical and methodological implications of our results.</p>
<sec>
<title>1.1 Scale structure</title>
<p>Exclusives can impose different orderings on the alternatives they exclude. (3) can be interpreted exhaustively, as conveying that the student possesses absolutely no relevant properties other than <italic>intelligent</italic> (3a). Following Coppock &amp; Beaver (<xref ref-type="bibr" rid="B6">2014</xref>), we&#8217;ll call this the <italic>complement-exclusion</italic> reading. (3) can also be interpreted as only excluding alternatives ordered higher than the prejacent along some pragmatically determined dimension (3b). We&#8217;ll call this the <italic>rank-order</italic> reading.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(3)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The student is <bold>only</bold> [intelligent]<sub><italic>F</italic></sub>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>&#8594; The student has no other relevant properties (not curious, charming, etc).</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>&#8594; The student is not brilliant.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>These readings can be given a uniform semantic analysis (<xref ref-type="bibr" rid="B5">Bonomi &amp; Casalegno 1993</xref>; <xref ref-type="bibr" rid="B24">van Rooij 2002</xref>; <xref ref-type="bibr" rid="B17">Klinedinst 2005</xref>; <xref ref-type="bibr" rid="B2">Beaver &amp; Clark 2008</xref>; <xref ref-type="bibr" rid="B6">Coppock &amp; Beaver 2014</xref>). For example, Coppock &amp; Beaver (<xref ref-type="bibr" rid="B6">2014</xref>) propose a unified typology of exclusives using two operators <sc>min</sc> and <sc>max</sc>. The presuppositional content of exclusives is specified in terms of <sc>min</sc>, and the assertive content is specified in terms of <sc>max</sc>. <sc>min</sc> and <sc>max</sc> apply to the prejacent and an ordered set of alternatives, which are modeled as answers to the current Question Under Discussion (CQ). Following Beaver &amp; Clark&#8217;s (<xref ref-type="bibr" rid="B2">2008</xref>) implementation of QUDs as ordered sets of propositions, or &#8220;scales&#8221;, the ordering relation &#8805; is retrievable from the CQ. <sc>min</sc> contributes existential quantification over the alternatives: some alternative ranked at least as high as the prejacent is true (4a). <sc>max</sc> contributes universal quantification: the prejacent is ranked at least as high as all true alternatives. The inference that the prejacent itself is true results from the combined effect of <sc>min</sc> and <sc>max</sc>.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(4)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p><inline-formula><alternatives><mml:math id="Eq001-mml"><mml:mrow><mml:mtext>MIN</mml:mtext><mml:mo stretchy='false'>(</mml:mo><mml:mi>p</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x03BB;</mml:mo><mml:mi>w</mml:mi><mml:mo>.</mml:mo><mml:mo>&#x2203;</mml:mo><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mo>&#x2208;</mml:mo><mml:mtext>CQ</mml:mtext></mml:mrow></mml:msub><mml:mo stretchy='false'>[</mml:mo><mml:mi>q</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>w</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x2227;</mml:mo><mml:mi>q</mml:mi><mml:mo>&#x2265;</mml:mo><mml:mi>p</mml:mi><mml:mo stretchy='false'>]</mml:mo></mml:mrow></mml:math>
<tex-math id="M1">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
{\rm MIN}(p)=\lambda w.\exists q_{\in{\rm CQ}}[q(w)\land q\geq p]
\]
\end{document}
</tex-math>
<graphic xlink:href="glossa-10-16797-e1.gif"/></alternatives></inline-formula></p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p><inline-formula><alternatives><mml:math id="Eq002-mml"><mml:mtext>MAX</mml:mtext><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mi>p</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>&#x03BB;</mml:mo><mml:mi>w</mml:mi></mml:mrow><mml:mo lspace="0em">.</mml:mo><mml:mo>&#x2200;</mml:mo><mml:mrow><mml:msub><mml:mi>q</mml:mi><mml:mrow><mml:mo>&#x2208;</mml:mo><mml:mtext>CQ</mml:mtext></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo stretchy='false'>[</mml:mo><mml:mrow><mml:mi>q</mml:mi><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mi>w</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mo stretchy='false'>&#x2192;</mml:mo><mml:mi>p</mml:mi><mml:mo>&#x2265;</mml:mo><mml:mi>q</mml:mi></mml:mrow><mml:mo stretchy='false'>]</mml:mo></mml:mrow></mml:math>
<tex-math id="M2">
\documentclass[10pt]{article}
\usepackage{wasysym}
\usepackage[substack]{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage[mathscr]{eucal}
\usepackage{mathrsfs}
\usepackage{pmc}
\usepackage[Euler]{upgreek}
\pagestyle{empty}
\oddsidemargin -1.0in
\begin{document}
\[
{\rm MAX}(p)=\lambda w.\forall q_{\in{\rm CQ}}[q(w)\rightarrow p\geq q]
\]
\end{document}
</tex-math><graphic xlink:href="glossa-10-16797-e2.gif"/></alternatives></inline-formula></p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Different readings (e.g., (3a) vs. (3b)) result from variation in the ordering relation and the structure of the alternative set (<xref ref-type="bibr" rid="B6">Coppock &amp; Beaver 2014</xref>). Complement-exclusion scales can be represented as a semilattice closed under conjunction (5), in which the higher-ranked alternatives entail the lower ones. In (5), the prejacent is <italic>The student is intelligent</italic> (abbreviated as <italic>intelligent</italic>). Application of <sc>max</sc> excludes all alternatives that are not ranked lower than the prejacent, resulting in the reading that the student is intelligent and has no other relevant properties. We indicate this in (5) by <bold>bolding</bold> the prejacent and <strike>crossing out</strike> the excluded alternatives.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(5)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-10-16797-g7.png"/></p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Rank-order scales impose an ordering on atomic alternatives (6). We consider Horn scales like &lt;<italic>clever, intelligent, brilliant</italic>&gt; paradigmatic examples of rank-order scales, as they involve atomic alternatives that stand in an entailment relation. However, since rank-order scales are not closed under conjunction and lack the semilattice structure of (5), the alternatives can also be logically independent or mutually exclusive (e.g., &lt;<italic>assistant professor, associate professor, full professor</italic>&gt;), and can in principle be ordered by a wider range of relations. In this paper we will mostly focus on Horn scales.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(6)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-10-16797-g8.png"/></p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Different exclusives are compatible with differently structured scales, but there is debate as to how freely exclusives actually vary in this regard. Horn (<xref ref-type="bibr" rid="B14">2000</xref>) argues from data involving projection (7) that <italic>only</italic> sentences presuppose the prejacent itself, while <italic>just</italic> sentences presuppose a lower bound on the prejacent. Since the <italic>only</italic> sentences presuppose the prejacent, it projects under negation, which leads to oddness when the alternatives are mutually exclusive. By contrast, no oddness is perceived with the <italic>just</italic> sentences.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(7)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>(<xref ref-type="bibr" rid="B14">Horn 2000: p. 150&#8211;151, ex. 7c, 9b</xref>)</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>They&#8217;re not just/?only engaged, they&#8217;re married.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>I didn&#8217;t just/?only get a B on the test, I got an A.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Translated into Coppock &amp; Beaver&#8217;s (<xref ref-type="bibr" rid="B6">2014</xref>) scalar framework, Horn&#8217;s argument amounts to the claim that <italic>only</italic> selects for complement-exclusion scales and <italic>just</italic> selects for rank-order scales. Since stronger alternatives on a complement-exclusion scale entail the prejacent, negating the exclusive content preserves the inference to the prejacent. With rank-order scales, the alternatives do not need to stand in an entailment relation, so the inference to the prejacent can disappear under negation.</p>
<p>The generalization that <italic>only</italic> selects for complement-exclusion and <italic>just</italic> for rank-order scales might be too strong: some speakers can access the rank-order reading with <italic>only</italic> in examples like (7). Coppock &amp; Beaver reinterpret the contrast between <italic>just</italic> and <italic>only</italic> in (7) as indicating that exclusives have weaker pragmatic preferences for different scales: &#8220;<italic>only</italic> prefers entailment scales [i.e., complement-exclusion scales] and <italic>just</italic> has a slight preference for non-entailment scales [i.e., rank-order scales]&#8221; (<xref ref-type="bibr" rid="B6">Coppock &amp; Beaver 2014: p. 425</xref>), without committing to a pragmatic analysis of what these preferences are or what factors influence them. The authors analyze <italic>merely</italic> as selecting an &#8220;evaluative&#8221; rank-order scale whose alternatives are ordered according to what the speaker considers good or bad. This is motivated by examples like (8), in which the higher-ranked alternatives are considered &#8220;better&#8221; (for further discussion of evaluativity effects with <italic>merely</italic>, see Wiegand, <xref ref-type="bibr" rid="B29">2018</xref>; <xref ref-type="bibr" rid="B30">Windhearn, 2021</xref>; <xref ref-type="bibr" rid="B20">as well as Orenstein &amp; Greenberg, 2010</xref> on the Hebrew exclusive <italic>stam</italic>).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(8)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>How can people be happy or satisfied with merely the norm?</p></list-item>
<list-item><p><styled-content style="float:right;">(<xref ref-type="bibr" rid="B6">Coppock &amp; Beaver 2014: p. 424, ex. 180</xref>)</styled-content></p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>The goal of both experiments in this paper is to provide a quantitative experimental assessment of these claimed pragmatic preferences in scale structure.</p>
</sec>
<sec>
<title>1.2 Strength of exclusion</title>
<p>No theory of exclusives modeled after Horn&#8217;s (<xref ref-type="bibr" rid="B12">1969</xref>) analysis of <italic>only</italic>, Coppock &amp; Beaver&#8217;s <sc>min/max</sc> entry schema included, can straightforwardly account for the <italic>just</italic> examples in (9). As the paraphrases show, these examples do not obviously communicate the exclusion of alternatives in the same way as the examples in (1).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(9)</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>a.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>I was sitting there and the lamp <bold>just</bold> broke! <styled-content style="float:right;">(<xref ref-type="bibr" rid="B29">Wiegand 2018: ex. 1b</xref>)</styled-content></p></list-item>
<list-item><p><italic>Paraphrase: Nothing caused the lamp to break</italic>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>b.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The lights in this place <bold>just</bold> turn off and on. <styled-content style="float:right;">(<xref ref-type="bibr" rid="B26">Warstadt 2020: ex. 1a</xref>)</styled-content></p></list-item>
<list-item><p><italic>Paraphrase: The lights turn off and on for no reason</italic>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>c.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>I&#8217;m not mad at you. I&#8217;m <bold>just</bold> mad. <styled-content style="float:right;">(<xref ref-type="bibr" rid="B26">Warstadt 2020: ex. 11a</xref>)</styled-content></p></list-item>
<list-item><p><italic>Paraphrase: I&#8217;m not mad at anyone in particular</italic>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>&#160;</p></list-item>
</list>
<list list-type="wordfirst">
<list-item><p>d.</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The essay is <bold>just</bold> perfect. <styled-content style="float:right;">(<xref ref-type="bibr" rid="B3">Beltrama 2021: ex. 1a</xref>)</styled-content></p></list-item>
<list-item><p><italic>Paraphrase: The essay is perfect and nothing more needs to be said</italic>.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>One way that prior literature has accounted for such uses of <italic>just</italic> is by positing that it differs from other exclusives in (what we refer to as) its strength of exclusion. For example, Warstadt (<xref ref-type="bibr" rid="B26">2020</xref>) proposes an entry for <italic>just</italic> that differs from <italic>only</italic> and other exclusive modifiers in two places. First, <italic>just</italic> excludes answers to &#8220;potential&#8221; questions, or &#8220;intuitively possible future QUDs&#8221; (p. 373, see also <xref ref-type="bibr" rid="B19">Onea 2016</xref>), rather than the current QUD. Second and crucially for our purposes, Warstadt argues that a unified account of <italic>just</italic>&#8217;s various readings requires relaxing the truth-conditional status of the exclusive operation. He proposes a distinction between &#8220;strong&#8221; exclusives which declare alternative propositions false, and &#8220;weak&#8221; exclusives which declare them unassertable.<xref ref-type="fn" rid="n1">1</xref></p>
<p>On Warstadt&#8217;s analysis, the semantic contribution of <italic>just</italic> in, e.g., (9b) is as follows. A speaker who asserts that the lights turn off and on might anticipate the addressee asking why this is. <italic>Just</italic> marks the answers to this potential question as unassertable, before the addressee can ask why, &#8220;thus preventing the addressee from asking a useless question&#8221; (<xref ref-type="bibr" rid="B26">Warstadt 2020: p. 373</xref>). However, <italic>just</italic> can also target the current QUD. In this case, the claim that other alternative answers are unassertable, together with the presupposed truth of the prejacent, leads to a meaning very similar to what a canonical exclusive entry would have delivered, the primary difference being one of strength: a speaker who uses <italic>just</italic> is refusing to commit to either the truth or falsity of alternatives, unlike a speaker who uses <italic>only</italic> to mark alternatives as false. Warstadt analyzes the full range of <italic>just</italic>&#8217;s readings as involving this weaker exclusive operation&#8212;in contrast with <italic>only</italic>, which Warstadt analyzes as a strong exclusive.</p>
<p>Other accounts of non-canonical readings with <italic>just</italic> have posited other sources of variation in the set of alternatives <italic>just</italic> can operate on, suggesting that it can target: covert modifiers with trivial semantic content (<xref ref-type="bibr" rid="B29">Wiegand 2018</xref>; <xref ref-type="bibr" rid="B30">Windhearn 2021</xref>), alternative scale granularities (<xref ref-type="bibr" rid="B23">Thomas &amp; Deo 2020</xref>), or metalinguistic alternatives at the speech act level (<xref ref-type="bibr" rid="B3">Beltrama 2021</xref>).</p>
<p>Altogether, whether an exclusive is weak or strong arises as a subtle pragmatic effect. Experiment 1 therefore tests <italic>just</italic>&#8217;s claimed &#8220;weakness&#8221;, or more generally, the possibility that it allows for non-canonically exclusive readings.</p>
</sec>
<sec>
<title>1.3 Scalar diversity</title>
<p>Our testing ground in this paper is pairs of lexical items that form a scale. More specifically, we turn to the scalar diversity phenomenon: the observation that scalar expressions vary in how likely they are to lead to scalar implicature (SI) (i.a. <xref ref-type="bibr" rid="B25">van Tiel et al. 2016</xref>; <xref ref-type="bibr" rid="B9">Gotzner et al. 2018</xref>; <xref ref-type="bibr" rid="B22">Sun et al. 2018</xref>). A classic example of SI is (10): upon encountering an utterance containing <italic>some</italic>, hearers compute the negation of its stronger scalar alternative <italic>all</italic> (<xref ref-type="bibr" rid="B10">Grice 1967</xref>; <xref ref-type="bibr" rid="B13">Horn 1972</xref>).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(10)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>Mary ate some of the cookies.</p></list-item>
<list-item><p>&#8594; SI: Mary ate some, but not all, of the cookies.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Similarly to &lt;<italic>some, all</italic>&gt;, e.g. &lt;<italic>intelligent, brilliant</italic>&gt; also forms a scale: an utterance containing <italic>intelligent</italic> can lead to the SI <italic>not brilliant</italic> (11). But variation exists across different scales: the SI in (10) arises much more robustly than the one in (11).</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(11)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The student is intelligent.</p></list-item>
<list-item><p>&#8594; SI: The student is intelligent, but not brilliant.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Scalar diversity persists even in the presence of exclusives. Ronai &amp; Xiang (<xref ref-type="bibr" rid="B21">2024</xref>) found that even when sentences such as (10)&#8211;(11) contain <italic>only</italic>, variation still remains in the likelihood of deriving a <italic>not all</italic> or <italic>not brilliant</italic> inference. This is puzzling, since while SI is a cancellable pragmatic inference, <italic>only</italic> encodes alternative exclusion in the semantics&#8212;which would predict uniformly ceiling-level inference rates. Ronai &amp; Xiang hypothesized that interpretations of <italic>only</italic> were split between rank-order and complement-exclusion readings, leading to variation in whether the stronger scalar term was included in the alternative set. Given <italic>The student is only intelligent</italic>, the <italic>not brilliant</italic> inference would arise with rank-order <italic>only</italic>, but not necessarily with complement-exclusion <italic>only</italic>, which could be understood as excluding other unrelated properties (<italic>not curious, not charming</italic>, etc). By experimentally comparing <italic>only</italic> to other exclusives, potentially less flexible in scale structure preference, we will also be able to test Ronai &amp; Xiang&#8217;s hypothesis.</p>
<p>As mentioned, although Horn scales like &lt;<italic>some, all</italic>&gt; and &lt;<italic>intelligent, brilliant</italic>&gt; stand in an entailment relation, we class these as rank-order scales because the structure of the alternative set does not bottom out in logically independent &#8220;atomic&#8221; propositions; that is, the scale structure is as in (6), not (5). Throughout this paper we use the terms &#8220;complement-exclusion&#8221; and &#8220;rank-order&#8221; to characterize scale structure rather than the precise relations used to order alternatives.</p>
</sec>
</sec>
<sec>
<title>2 Experiment 1: Inference task</title>
<p>To investigate the lexical semantics of exclusive modifiers, we compare how they affect scalar diversity. We replicate Ronai &amp; Xiang&#8217;s (<xref ref-type="bibr" rid="B21">2024</xref>) experiment testing <italic>only</italic>, and add as comparison manipulations with <italic>just</italic> and <italic>merely</italic>. This will allow us to test two predictions made by the theoretical literature on exclusives&#8212;reviewed above in Sections 1.1&#8211;1.2. First, since our experiment will specifically test rank-order alternatives, Coppock &amp; Beaver&#8217;s (<xref ref-type="bibr" rid="B6">2014</xref>) account predicts that alternative exclusion will be more robust with <italic>merely</italic> than with <italic>only</italic>, since these authors analyze the former as preferring rank-order readings. Second, the claims by Warstadt (<xref ref-type="bibr" rid="B26">2020</xref>) about strength of exclusion predict that participants will be less likely to exclude alternatives with <italic>just</italic> than with <italic>only</italic>. If <italic>just</italic> marks alternatives as unassertable rather than false, participants should be more reluctant to infer that alternatives are false. It is possible that Warstadt&#8217;s unified analysis is on the wrong track and <italic>just</italic> is instead lexically ambiguous between exclusive and nonexclusive readings. In that case, we still predict lower rates of exclusion, on the assumption that participants who interpret <italic>just</italic> non-exclusively on some trials will be less likely to commit to excluding the stronger alternative.</p>
<sec>
<title>2.1 Participants</title>
<p>120 monolingual native speakers of American English participated in the experiment (40 in each of the between-participants conditions), which was administered on the web-based PCIbex platform (<xref ref-type="bibr" rid="B31">Zehr &amp; Schwarz 2018</xref>). Participants were recruited on Prolific and compensated $2. Native speaker status was established via a language background questionnaire, where payment was not conditioned on the participant&#8217;s response. Participants were excluded for making more than 3 mistakes on the 7 attention check items. After exclusions, data from 111 participants was analyzed (37 for <italic>only</italic>, 39 for <italic>just</italic>, and 35 for <italic>merely</italic>.)</p>
</sec>
<sec>
<title>2.2 Materials and procedure</title>
<p>The experiment used a two-alternative forced choice inference task, identical to Ronai &amp; Xiang&#8217;s experiments (see also i.a. <xref ref-type="bibr" rid="B25">van Tiel et al. 2016</xref>). On each trial, participants saw a target sentence uttered by Mary, which contained an exclusive (e.g., <italic>The student is <roman>{</roman>just/only/merely<roman>}</roman> intelligent</italic>), and they were asked whether Mary thinks that the rank-order alternative is not true (e.g., the student is not brilliant). <xref ref-type="fig" rid="F1">Figure 1</xref> shows an example with <italic>just</italic>. Here, a &#8220;Yes&#8221; response indexes that the participant has calculated the exclusionary (<italic>not brilliant</italic>) inference, while selecting &#8220;No&#8221; suggests that the inference was not calculated. The experiment tested three different exclusives&#8212;<italic>only, just</italic>, and <italic>merely</italic>&#8212;as a between-participants manipulation.</p>
<fig id="F1">
<label>Figure 1</label>
<caption>
<p>Example experimental trial from Experiment 1 (<italic>just</italic> condition).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-10-16797-g1.png"/>
</fig>
<p>We tested 51 lexical scales as critical items, a subset of those used by Ronai &amp; Xiang (<xref ref-type="bibr" rid="B21">2024</xref>). Of their 60 items, we removed those incompatible with <italic>just</italic> or <italic>merely</italic>. This amounted to removing a subset of the adverbial scales, e.g., &lt;<italic>mostly, entirely</italic>&gt; (<italic>#Peter&#8217;s answers were merely [mostly]<sub>F</sub> wrong</italic>.); &lt;<italic>primarily, exclusively</italic>&gt; (<italic>#The residents are merely [primarily]<sub>F</sub> Greek</italic>.); or &lt;<italic>probably, necessarily</italic>&gt; (<italic>#A delay will merely [probably]<sub>F</sub> occur</italic>.). An additional modification was made to the verbal scales in the <italic>just</italic> condition: wherever a temporal interpretation of <italic>just</italic> would have been possible, we modified the tense of the carrier sentence to rule this out. For example, for &lt;<italic>survive, thrive</italic>&gt;, <italic>The plant only/merely survived</italic> was changed to <italic>The plant is just surviving</italic>. Similarly, for &lt;<italic>reduce, eliminate</italic>&gt;, <italic>The city only/merely reduced waste</italic> was modified to <italic>The city will just reduce waste</italic>.<xref ref-type="fn" rid="n2">2</xref></p>
<p>In addition to the critical items, the experiment tested 7 fillers that served as attention checks. The fillers, adapted from Ronai &amp; Xiang (<xref ref-type="bibr" rid="B21">2024</xref>) and van Tiel et al. (<xref ref-type="bibr" rid="B25">2016</xref>), contained two antonyms (<italic>wide</italic> &#8594; <italic>not narrow</italic>) and therefore had an unambiguous &#8220;Yes&#8221; answer. The experiment began with 2 practice items.</p>
</sec>
<sec>
<title>2.3 Results</title>
<p><xref ref-type="fig" rid="F2">Figure 2</xref> shows the results of Experiment 1. For the statistical analysis, we fit a logistic mixed effects regression model using the lme4 package in R (<xref ref-type="bibr" rid="B1">Bates et al. 2015</xref>). The model predicted Response (&#8220;Yes&#8221; vs. &#8220;No&#8221;) as a function of Exclusive (<italic>just</italic> vs. <italic>only</italic> vs. <italic>merely</italic>). Random intercepts for participants and random slopes and intercepts for items were included. Since predictions were made for <italic>just</italic> vs. <italic>only</italic> (strength of exclusion) and <italic>merely</italic> vs. <italic>only</italic> (scale structure bias), the predictor Exclusive was treatment coded, with <italic>only</italic> coded as the reference level. The model revealed significantly lower rates of inference calculation with <italic>just</italic> compared to <italic>only</italic> (Estimate = &#8211;0.63, SE = 0.27, <italic>z</italic> = &#8211;2.33, <italic>p</italic> &lt; 0.05), as well as significantly higher rates with <italic>merely</italic> than with <italic>only</italic> (Estimate = 1, SE = 0.3, <italic>z</italic> = 3.4, <italic>p</italic> &lt; 0.001). Averaged over the 51 different scales, the target exclusionary inference was calculated at the rate of 52.9% with <italic>just</italic>, 63.2% with <italic>only</italic>, and 80.2% with <italic>merely</italic>.</p>
<fig id="F2">
<label>Figure 2</label>
<caption>
<p>Results of Experiment 1: <italic>just, only, merely</italic>. The y axis shows the percent of calculating the target exclusionary inference, i.e., the mean percent of &#8220;Yes&#8221; responses in the inference task. Error bars represent standard error.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-10-16797-g2.png"/>
</fig>
<p>Since inference rates were lowest with <italic>just</italic>, the question may arise whether it can be maintained that <italic>just</italic> excludes alternatives semantically. To test this, we also conducted a replication of Ronai &amp; Xiang&#8217;s (<xref ref-type="bibr" rid="B21">2024</xref>) SI experiment.<xref ref-type="fn" rid="n3">3</xref> We found an average rate of 34.3% SI calculation; <xref ref-type="fig" rid="F3">Figure 3</xref> shows the SI results together with the three exclusives, also visualizing the by-item variation. An additional statistical model compared the findings of this experiment to <italic>just</italic>. The fixed effects predictor was sum-coded (SI: &#8211;0.5 and <italic>just</italic>: 0.5). Inference rates were found to be significantly higher with <italic>just</italic> than in the case of SI (Estimate = 1.29, SE = 0.27, <italic>z</italic> = 4.8, <italic>p</italic> &lt; 0.001). This confirms that alternative exclusion with all three exclusive modifiers is stronger than alternative exclusion via SI.</p>
<fig id="F3">
<label>Figure 3</label>
<caption>
<p>Results of Experiment 1: SI, <italic>just, only, merely</italic>. The percent of exclusionary inference calculation corresponds to the percent of &#8220;Yes&#8221; responses in the inference task.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-10-16797-g3.png"/>
</fig>
<p>One may wonder whether different lexical scales interact differently with the two tested parameters of variation, scale structure bias and strength of exclusion. In order to check whether the relative order of the 51 items remained consistent across manipulations, we calculated rank-order correlations using Kendall&#8217;s <italic>&#964;</italic><sub><italic>B</italic></sub>, and found significant by-item correlations between conditions. As SI rates increase, so do inference rates with <italic>just</italic> (<italic>&#964;</italic> <sub><italic>B</italic></sub> = 0.67, <italic>p</italic> &lt; 0.001); as rates with <italic>just</italic> increase, so do rates with <italic>only</italic> (<italic>&#964;</italic><sub><italic>B</italic></sub> = 0.64, <italic>p</italic> &lt; 0.001); and rates with <italic>merely</italic> are also correlated with <italic>only</italic> (<italic>&#964;</italic><sub><italic>B</italic></sub> = 0.39, <italic>p</italic> &lt; 0.001).<xref ref-type="fn" rid="n4">4</xref> In other words, we found that items that lead to low SI rates also lead to relatively low rates of inference calculation with exclusives, even as the overall rate of inference calculation goes up with an exclusive as compared to SI. To give an example, &lt;<italic>small, tiny</italic>&gt; led to SIs at a rate of 7.9%, and with exclusives, it also continued to trigger inference calculation at a rate lower than the average for that exclusive: 23.1% with <italic>just</italic>, 43.2% with <italic>only</italic> and 61.1% with <italic>merely</italic>.</p>
</sec>
<sec>
<title>2.4 Discussion</title>
<p>Both predictions made by the relevant theoretical accounts regarding strength of exclusion and scale structure bias were borne out by the results. First, Experiment 1 found lower rates of exclusionary inference calculation&#8212;that is, lower rates of &#8220;Yes&#8221; responses in the inference task&#8212;with <italic>just</italic> than with <italic>only</italic>. This is consistent with Warstadt&#8217;s (<xref ref-type="bibr" rid="B26">2020</xref>) proposal that <italic>just</italic> is a weak exclusive, while <italic>only</italic> is a strong exclusive. Or alternatively and more generally, this finding confirms that <italic>just</italic> is not always canonically exclusive; if (some) participants accessed a non-exclusive reading, that could have lowered the overall observed rate of exclusion. Second, Experiment 1 found higher inference rates with <italic>merely</italic> than with <italic>only</italic>. Since all our items tested rank-order alternatives, this strongly supports Coppock &amp; Beaver&#8217;s (<xref ref-type="bibr" rid="B6">2014</xref>) claim that while <italic>only</italic> allows both complement-exclusion and rank-order readings, <italic>merely</italic> prefers rank-order ones. These results also support Ronai &amp; Xiang&#8217;s (<xref ref-type="bibr" rid="B21">2024</xref>) hypothesis that when interpreting target sentences with <italic>only</italic>, participants are split between rank-order and complement-exclusion readings. When a rank-order bias is introduced by <italic>merely</italic>, the stronger scalar term in the &#8220;Would you conclude from this&#8230;?&#8221; task question is more unambiguously understood as one of the salient alternatives, which leads to an increase in calculating the target inference.</p>
<p>While we have found evidence that <italic>only</italic> and <italic>merely</italic> differ in scale structure, with the latter more strongly requiring rank-order alternatives, Experiment 1 does not allow us to straightforwardly infer anything about the (potential) scale structure bias of <italic>just</italic>. Coppock &amp; Beaver (<xref ref-type="bibr" rid="B6">2014</xref>) argue that, like <italic>merely, just</italic> also has a(n albeit weaker) preference for rank-order readings (p. 425, see also <xref ref-type="bibr" rid="B14">Horn 2000</xref>). This would predict a higher rate of inference calculation with <italic>just</italic>, as compared to <italic>only</italic>. However, <italic>just</italic>&#8217;s comparatively weak strength of exclusion pushes in the other direction, which as we have seen has resulted in overall lower inference rates. Therefore, the current results leave open two possibilities: either <italic>just</italic> does not favor rank-order scales, contra the theoretical literature, or it does, but that preference in the Experiment 1 design was counteracted by its weak exclusive status. Experiment 2 was designed to adjudicate between these two possibilities.</p>
</sec>
</sec>
<sec>
<title>3 Experiment 2: Scale choice</title>
<p>In Experiment 1, we were not able to draw conclusions about <italic>just</italic>&#8217;s scale structure bias, since, as our findings confirmed, it excludes relatively less robustly than the other two modifiers. To address this, in Experiment 2, we employ a novel implementation of the two-alternative forced choice task, which allows us to probe what kind of alternative hearers prefer to exclude in a setting where one must be excluded. We test this preference for all three exclusives. As before, Coppock &amp; Beaver&#8217;s account predicts that the choice of rank-order alternatives will be more robust with <italic>merely</italic> than with <italic>only</italic>. As for <italic>just</italic>, both Horn (<xref ref-type="bibr" rid="B14">2000</xref>) and Coppock &amp; Beaver (<xref ref-type="bibr" rid="B6">2014</xref>) predict that <italic>just</italic> will prefer rank-order scales, albeit to different degrees: for Horn, <italic>just</italic> is completely restricted to rank-order scales, whereas for Coppock &amp; Beaver this preference is a weaker pragmatic effect.</p>
<sec>
<title>3.1 Participants</title>
<p>120 monolingual native speakers of American English participated in the experiment (40 in each of the between-participants conditions), administered on PCIbex. Participant recruitment and screening was identical to Experiment 1; compensation was $2 or $2.40 depending on time. 5 participants were removed from the <italic>merely</italic> condition and 1 participant from the <italic>just</italic> condition for making 3 or more mistakes on the 5 attention check items. Data from the remaining 114 participants is reported below.</p>
</sec>
<sec>
<title>3.2 Materials and procedure</title>
<p>Experiment 2 ruled out a non-exclusive interpretation by employing a two-alternative forced choice task where participants had to choose between two alternatives as the target for exclusion. Specifically, they were presented with a target utterance by Mary, e.g., <italic>That student was just intelligent</italic>, which was followed by &#8220;Mary meant that the student was not BLANK&#8221;. Participants had to choose between a rank-order and a complement-exclusion alternative to fill in the blank.<xref ref-type="fn" rid="n5">5</xref>&#160;<xref ref-type="fig" rid="F4">Figure 4</xref> shows an example trial with <italic>just</italic>.</p>
<fig id="F4">
<label>Figure 4</label>
<caption>
<p>Example experimental trial from Experiment 2 (<italic>just</italic> condition).</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-10-16797-g4.png"/>
</fig>
<p>Target inference-triggering sentences occurred in the context of Sue&#8217;s preceding utterance. The context provided by Sue always explicitly introduced alternatives, including the focus associate (here, <italic>intelligent</italic>), the rank-order alternative (<italic>brilliant)</italic>, the complement-exclusion alternative (<italic>ambitious</italic>), and a fourth alternative (<italic>hardworking</italic>) that was not probed in the forced choice task. Our task was designed to make both the rank-order and complement-exclusion alternatives salient and relevant, with the expectation that with salience and relevance controlled for, participants&#8217; responses would directly reflect the scale structure bias contributed by different exclusives.</p>
<p>This design choice was inspired by previous experimental studies on the processing of focus with <italic>only</italic>, which have successfully used discourse contexts to introduce relevant alternatives. (12) illustrates this with an example from Hoeks (<xref ref-type="bibr" rid="B11">2023</xref>)&#8212;see also Washburn et al. (<xref ref-type="bibr" rid="B27">2011</xref>); Kim et al. (<xref ref-type="bibr" rid="B16">2015</xref>); Fraundorf et al. (<xref ref-type="bibr" rid="B8">2010</xref>) for similar designs.</p>
<list list-type="gloss">
<list-item>
<list list-type="wordfirst">
<list-item><p>(12)</p></list-item>
</list>
</list-item>
<list-item>
<list list-type="sentence-gloss">
<list-item>
<list list-type="final-sentence">
<list-item><p>The tourist asked for a variety of items, like some cheese and milk. There was already an ashtray on the table. When the waiter returned, he remembered to bring only milk but no cheese to the table where the tourist was seated.</p></list-item>
</list>
</list-item>
</list>
</list-item>
</list>
<p>Inherent to the goals of our experiment is that the context sentence uttered by Sue would have to introduce two alternatives that are ranked relative to one another (<italic>intelligent</italic> and <italic>brilliant</italic> in <xref ref-type="fig" rid="F4">Figure 4</xref>), in addition to an alternative that is not inherently related to either (i.e., the complement-exclusion one, <italic>ambitious</italic> in <xref ref-type="fig" rid="F4">Figure 4</xref>). In order to mitigate against potential unnaturalness arising from this design, we took the following steps. First, as mentioned, a fourth alternative which can be classified as complement-exclusion (<italic>hardworking</italic> in <xref ref-type="fig" rid="F4">Figure 4</xref>) was also included, in order to make sure that the target complement-exclusion alternative (<italic>ambitious</italic>) was not overly salient as being the exception. Second, to ensure a natural reading experience throughout the experiment and to prevent participants from noticing ordering patterns, the order of the four alternatives was randomized on every trial. Analysis of the Experiment 2 data suggests that the design did not introduce a bias towards one kind of alternative as compared to Experiment 1 (see footnote 6).</p>
<p>The basic frame of the inference-triggering sentences was kept similar to Experiment 1 (and <xref ref-type="bibr" rid="B21">Ronai &amp; Xiang 2024</xref>). The focus associates (i.e., weaker scalar terms embedded under an exclusive) and rank-order alternatives were identical to Experiment 1. Complement-exclusion alternatives were generated for each item, using corpus searching (for strings such as <italic>only X but not Y</italic>, with <italic>X</italic> as the focus associate) and the intuitions of the authors. The fourth alternative was similarly generated by author intuition.</p>
<p>Experiment 2 tested 48 items. 3 items were removed from the item set of Experiment 1 because we were unable to generate a suitable complement-exclusion alternative (&lt;<italic>some, all</italic>&gt;, &lt;<italic>or, and</italic>&gt;, &lt;<italic>once, twice</italic>&gt;). We also included 3 practice items and 5 fillers to serve as catch trials. In filler items, one of the two forced-choice alternatives was the focus associate itself (e.g., <italic>The street is just wide</italic>&#8212;Mary meant that the street is not: wide vs. tree-lined) and hence unambiguously the incorrect choice.</p>
</sec>
<sec>
<title>3.3 Results</title>
<p><xref ref-type="fig" rid="F5">Figure 5</xref> shows the results of Experiment 2. Averaged over the 48 different scales, the rank-order alternative (as opposed to the complement-exclusion alternative) was chosen 56.8% of the time with <italic>only</italic>,<xref ref-type="fn" rid="n6">6</xref> 65.6% with <italic>just</italic>, and 80.8% with <italic>merely</italic>. For the statistical analysis, a logistic mixed effects regression model was fit (lme4). We compared Response (rank-order vs. complement-exclusion choice) for each exclusive to chance, i.e., to 50% rank-order/complement-exclusion choice. Chance level reflects the hypothetical whereby an exclusive has no constraints on its scale structure, and freely allows either type of alternative to be excluded; in this case, we assume participants would choose one over the other with 50% likelihood. The model included random intercepts for participants and random slopes and intercepts for items. We found that <italic>just</italic> (Estimate = 0.98, SE = 0.26, <italic>z</italic> = 3.78, <italic>p</italic> &lt; 0.001) and <italic>merely</italic> (Estimate = 2.01, SE = 0.26, <italic>z</italic> = 7.88, <italic>p</italic> &lt; 0.001) differed significantly from chance level by producing a higher rate of rank-order choice. On the other hand, results with <italic>only</italic> did not significantly differ from chance (Estimate = 0.44, SE = 0.27, <italic>z</italic> = 1.65, <italic>p</italic> = 0.1).</p>
<fig id="F5">
<label>Figure 5</label>
<caption>
<p>Results of Experiment 2: Mean percent of choosing rank-order (as opposed to complement-exclusion) alternative with <italic>only</italic> vs. <italic>just</italic> vs. <italic>merely</italic>. Error bars represent standard error.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-10-16797-g5.png"/>
</fig>
<p>In order to directly compare the behavior of the three different exclusives to one another, we fit an additional model that predicted Response (rank-order vs. complement-exclusion choice) as a function of Exclusive (<italic>just</italic> vs. <italic>only</italic> vs. <italic>merely</italic>). Here, the Exclusive predictor was treatment-coded, with <italic>just</italic> serving as the reference level. We found that <italic>merely</italic> produced significantly higher rates of rank-order choice than <italic>just</italic> (Estimate = 1.03, SE = 0.32, <italic>z</italic> = 3.21, <italic>p</italic> &lt; 0.01). The <italic>only</italic> vs. <italic>just</italic> difference (i.e., <italic>only</italic> producing fewer rank-order choices) was smaller and failed to reach significance (Estimate = &#8211;0.54, SE = 0.3, <italic>z</italic> = &#8211;1.81, <italic>p</italic> = 0.07). As visual inspection of <xref ref-type="fig" rid="F6">Figure 6</xref> shows, this was likely due to by-item variation.</p>
<fig id="F6">
<label>Figure 6</label>
<caption>
<p>Results of Experiment 2: By-scale variation in choosing rank-order alternative.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="glossa-10-16797-g6.png"/>
</fig>
</sec>
<sec>
<title>3.4 Discussion</title>
<p>By probing whether participants took the target sentence to exclude a rank-order or a complement-exclusion alternative, instead of probing whether they calculated the target exclusionary inference, we were able to more clearly isolate scale structure bias in Experiment 2. We found that <italic>merely</italic> and <italic>just</italic> are clearly different from chance level in showing a preference for the exclusion of rank-order alternatives. <italic>Merely</italic> strongly preferring rank-order exclusion successfully replicates Experiment 1, while the <italic>just</italic> finding goes beyond the previous experiment with the confound of a non-exclusive reading now ruled out. Unlike the other two exclusives, <italic>only</italic> was found not to be different from chance level, suggesting that it allows both types of alternatives (more) freely.</p>
<p>Experiment 2&#8217;s findings are in line with Coppock &amp; Beaver&#8217;s (<xref ref-type="bibr" rid="B6">2014</xref>) observations about the scale structure bias of <italic>merely</italic> and <italic>just</italic>. These authors argue that <italic>merely</italic> requires rank-order alternatives, while <italic>just</italic>&#8217;s preference for them is only slight. Our finding that <italic>merely</italic> and <italic>just</italic> patterned significantly differently from each other constitutes strong support for this claim. The results of Experiment 2, on the other hand, are less compatible with Horn (<xref ref-type="bibr" rid="B14">2000</xref>), whose proposal leads to the expectation that <italic>just</italic> selects for rank-order scales&#8212;our data suggests that <italic>just</italic>&#8217;s preference is more subtle than that.</p>
</sec>
</sec>
<sec>
<title>4 General discussion</title>
<p>This squib presented novel experimental evidence testing variation across exclusive modifiers in English. In this domain, previous claims about available readings and speaker preferences between readings were based on particular, often idiosyncratic examples (e.g., (9)). We sought evidence for graded pragmatic preferences between readings in a controlled experimental setting. Our results provided support for two claims from the theoretical literature: that exclusives vary in scale structure bias (with <italic>merely</italic> and <italic>just</italic> both preferring rank-order readings, the former more so than the latter) and in strength of exclusion (with <italic>just</italic> excluding less robustly).</p>
<p>Our <italic>merely</italic> results straightforwardly support Coppock &amp; Beaver&#8217;s (<xref ref-type="bibr" rid="B6">2014</xref>) claim that <italic>merely</italic> is restricted to (&#8220;evaluative&#8221;) rank-order scales. Since all our Experiment 1 items tested rank-order alternatives, the stronger scalar alternative was more frequently included in the alternative set given an exclusive restricted to rank-order readings, leading to higher rates of target inference calculation compared to <italic>only</italic>. Experiment 1&#8217;s results also support Ronai &amp; Xiang&#8217;s (<xref ref-type="bibr" rid="B21">2024</xref>) hypothesis that participants in their <italic>only</italic> experiment&#8212;which we successfully replicated&#8212;were split between rank-order and complement-exclusion readings. This additionally confirms Coppock &amp; Beaver&#8217;s claim that both readings are indeed available for <italic>only</italic>, a claim at odds with Horn (<xref ref-type="bibr" rid="B14">2000</xref>), who instead analyzes <italic>only</italic> as selecting complement-exclusion scales. The <italic>merely</italic> findings were replicated in Experiment 2. The design of Experiment 2, which ruled out the possibility of non-exclusion, additionally allowed us to draw conclusions about the scale structure bias of <italic>just</italic>. We found that it indeed favors rank-order scales, although not to the extent that <italic>merely</italic> does&#8212;a pattern more in line with Coppock &amp; Beaver (<xref ref-type="bibr" rid="B6">2014</xref>) than Horn (<xref ref-type="bibr" rid="B14">2000</xref>). Our overall results support Coppock &amp; Beaver&#8217;s view of scale structure bias as a matter of pragmatic preference, as well as their specific claims about which exclusives favor which scales.</p>
<p>Our <italic>just</italic> results in Experiment 1 verify the recurrent intuition in the literature that <italic>just</italic> excludes less robustly than <italic>only</italic>. Different analyses of the apparent strength difference have been proposed, several of which are compatible with our results. If <italic>just</italic> excludes alternatives on the basis of pragmatic assertability rather than truth (i.e., &#8220;weak&#8221; exclusion), as argued by Warstadt (<xref ref-type="bibr" rid="B26">2020</xref>) and Beltrama (<xref ref-type="bibr" rid="B3">2021</xref>), participants could have been more reluctant to conclusively reject alternatives with <italic>just</italic> compared to <italic>only</italic>. An equally plausible, if less interesting interpretation would attribute the lower inference rates to lexical ambiguity: participants excluded the stronger scalar alternative when <italic>just</italic> was interpreted exclusively and not otherwise.</p>
<p>To our knowledge, this squib reports on the first experimental investigation of variation across exclusive modifiers. We hope that the success of the experimental methods employed here, including the novel design of Experiment 2, opens up possibilities for future research which can address this variation. We found strong evidence that different exclusives favor different sorts of scales, and we succeeded in precisely quantifying the strength of these preferences by exclusive. Prior theoretical literature (e.g., <xref ref-type="bibr" rid="B14">Horn 2000</xref>; <xref ref-type="bibr" rid="B6">Coppock &amp; Beaver 2014</xref>) has observed these preferences on a descriptive level, but to our knowledge there has been little work aimed at uncovering their source. If scale structure bias is a pragmatic phenomenon that is not fully determined by lexical semantics, it is possible that exclusives compete with each other: the use of one exclusive over another could trigger defeasible pragmatic inferences about what sorts of alternatives the speaker intended to exclude, and how these alternatives were ordered. For example, the use of <italic>only</italic>, which is lexically unrestricted with respect to scale structure, over an exclusive like <italic>just</italic> or <italic>merely</italic>, which prefer rank-order scales, could lead to a preference for the complement-exclusion reading with <italic>only</italic> in some contexts. Since the speaker did not use an exclusive that would have explicitly enforced the rank-order reading, a pragmatic listener might reason that a complement-exclusion reading must have been intended. Independent lexical semantic differences between exclusives could also make one or another exclusive a better &#8220;fit&#8221; for certain scales over others; this is arguably the case with <italic>merely</italic>, whose evaluative nature&#8212;which is in principle independent of scale structure&#8212;could underlie its preference for rank-order scales. These questions are difficult to pursue on an intuitionistic level. We are optimistic that the experimental methods employed in this paper&#8212;which make it possible to systematically manipulate relevance, salience, and other pragmatic factors&#8212;can therefore provide a basis for such investigations.</p>
<p>Lastly, though in our experiments scalar diversity was a testing ground, not the main object of study, the results are nonetheless informative with respect to this phenomenon. In particular, we found significant by-item rank-order correlations across all conditions in Experiment 1. That is, if we order different lexical scales (e.g., &lt;<italic>intelligent, brilliant</italic>&gt; vs. &lt;<italic>allowed, obligatory</italic>&gt;) based on their likelihood of leading to an exclusionary inference, this order remains consistent across experiments: from SI to different exclusive modifiers. This is despite the fact that as inference rates increase across the board (reaching on average 80.2% with <italic>merely</italic>), there is necessarily less variation across lexical scales, and their relative ranking becomes less meaningful. This finding points to the importance of lexico-semantic factors in the scalar diversity phenomenon. Specifically, prior work has shown that properties of lexical scales such as the distinctness of the weaker and stronger scale-mates (<xref ref-type="bibr" rid="B25">van Tiel et al. 2016</xref>), their semantic relatedness (<xref ref-type="bibr" rid="B28">Westera &amp; Boleda 2020</xref>), the expectedness of the stronger alternative (<xref ref-type="bibr" rid="B15">Hu et al. 2023</xref>), or adjectival polarity and extremeness (<xref ref-type="bibr" rid="B9">Gotzner et al. 2018</xref>; <xref ref-type="bibr" rid="B4">Beltrama &amp; Xiang 2013</xref>) can predict a scale&#8217;s likelihood of leading to SIs. In this paper, we did not directly test whether these properties are also significant predictors of the variation found in our <italic>just, only</italic>, and <italic>merely</italic> results. Nonetheless, it seems likely that since such properties reliably predict variation in SI rates, and&#8212;as evidenced by the significant by-item correlations&#8212;overall the same variation arises with exclusives, factors like distinctness, etc. could underlie the variation in the exclusives data as well. Additionally, exclusives can be thought of as a probe for the alternative expectedness predictor in particular. As Ronai &amp; Xiang (<xref ref-type="bibr" rid="B21">2024</xref>) have argued based on their <italic>only</italic> findings, scales that do not robustly trigger the calculation of the target exclusionary inference with an exclusive are likely those where the stronger alternative is not expected (<xref ref-type="bibr" rid="B15">Hu et al. 2023</xref>) or available (<xref ref-type="bibr" rid="B25">van Tiel et al. 2016</xref>). This is because exclusives like <italic>only</italic> semantically encode the exclusion of an alternative; therefore, if a participant nonetheless responds &#8220;No&#8221; in the inference task, that can be attributed to their inability to take the provided stronger alternative (e.g., <italic>brilliant</italic> given <italic>only intelligent</italic>) to be a relevant alternative to the weaker term. The argument also applies to our Experiment 1, and our rank-order correlation findings suggest that it can be extended to the exclusives <italic>just</italic> and <italic>merely</italic> as well.</p>
<p>As such, the findings reported in this paper shed light not only on parameters of variation across exclusives, but also on the variation across lexical scales in their propensity to lead to exclusionary inferences.</p>
</sec>
</body>
<back>
<sec>
<title>Data availability</title>
<p>Stimuli, data, and the scripts used for data visualization and analysis can be found in the following OSF repository: <ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://osf.io/ktrzm/?view_only=36c09d6b218d4f73b00cc01c860184e4">https://osf.io/ktrzm/?view_only=36c09d6b218d4f73b00cc01c860184e4</ext-link>.</p>
</sec>
<sec>
<title>Ethics and consent</title>
<p>The study reported as Experiment 1 in this paper was approved by the University of Chicago Institutional Review Board (#IRB20-2038-CR003). The study reported as Experiment 2 was granted exemption by the Northwestern University Institutional Review Board (#STU00220908) due to being low risk and involving only tests, surveys, interviews, observation, or benign behavioral interventions.</p>
</sec>
<sec>
<title>Funding information</title>
<p>This material is partially based upon work supported by the National Science Foundation under Grant No. #BCS-2041312.</p>
</sec>
<sec>
<title>Acknowledgements</title>
<p>We thank editor Lyn Tieu, three anonymous reviewers, Itamar Francez, Chris Kennedy, and Michael Tabatowski as well as audiences at the 23rd Amsterdam Colloquium, the 97th Annual Meeting of the LSA, the UChicago LEAP workshop and the Northwestern Experimental Meaning Group for helpful feedback and discussion. All remaining errors are our own.</p>
</sec>
<sec>
<title>Competing interests</title>
<p>The authors have no competing interests to declare.</p>
</sec>
<sec>
<title>Authors&#8217; contributions</title>
<p>The authors contributed equally to this work and are listed in reverse alphabetical order.</p>
</sec>
<fn-group>
<fn id="n1"><p>See also Beltrama (<xref ref-type="bibr" rid="B3">2021</xref>), who proposes that emphatic <italic>just</italic> as in (9d) declares alternatives not &#8220;worthy of assertion&#8221; (p. 347).</p></fn>
<fn id="n2"><p>An additional concern might be whether <italic>just</italic> can be interpreted emphatically. We think this is unlikely, since in our experiment <italic>just</italic> always modified a weaker scale-mate (<italic>just intelligent</italic>), and for an emphatic use it would typically need to modify an extreme adjective (<xref ref-type="bibr" rid="B18">Morzycki 2012</xref>; <xref ref-type="bibr" rid="B3">Beltrama 2021</xref>). To the extent that extreme adjectives occurred in our experiment, they would have been in the task question (&#8220;Would you conclude from&#8230; not brilliant?&#8221;), not modified by <italic>just</italic>.</p></fn>
<fn id="n3"><p>The SI replication was identical to the main Experiment 1, with the exception that target sentences (i.e., Mary&#8217;s utterances) did not contain an exclusive. 40 participants were recruited; 1 was excluded for failing attention checks and 1 for being bilingual.</p></fn>
<fn id="n4"><p>Though still highly significant, the <italic>only-merely</italic> correlation is less strong than the other two pairs, i.e., the coefficient is lower. This is expected, since with <italic>merely</italic>, inference rates are more uniform, closer to ceiling, and therefore the relative difference between any two lexical scales is much smaller.</p></fn>
<fn id="n5"><p>Given the assumptions about scale structure we make in (5), one might object that alternatives based on single properties like <italic>ambitious</italic> are not really complement-exclusion alternatives, and that the actual complement-exclusion alternative in a trial like that shown in <xref ref-type="fig" rid="F4">Figure 4</xref> would be the conjunction of an independent property with the focus associate, in this case <italic>intelligent &amp; ambitious</italic>. Since the exclusion of the stronger conjunction entails the negation of the atomic proposition in context, we take it that participants who select <italic>ambitious</italic> in this task have still accessed the complement-exclusion reading.</p></fn>
<fn id="n6"><p>A reviewer notes that the likelihood of rank-order choices with <italic>only</italic> was lower in Experiment 2 (56.8%) than the corresponding likelihood of &#8220;Yes&#8221; responses in Experiment 1 (63.2%) and wonders whether the contexts provided in Experiment 2 made participants less able to access rank-order interpretations. To explore this possibility, we fit a logistic mixed effects regression model to the combined <italic>only</italic> data from Experiments 1 and 2 (restricted to the 48 scales tested in both). The model included random intercepts for participants and random slopes and intercepts for items and predicted Response by Experiment (sum-coded, Experiment 1: &#8211;0.5 and Experiment 2: 0.5). The model revealed that difference between the likelihood of &#8220;Yes&#8221; responses in Experiment 1 and of rank-order choices in Experiment 2 was not significant (Estimate = &#8211;0.33, SE = 0.31, <italic>z</italic> = &#8211;1.08, <italic>p</italic> = 0.28). This suggests that the two different experimental paradigms did not affect participants&#8217; ability to assign rank-order interpretations.</p></fn>
</fn-group>
<ref-list>
<ref id="B1"><mixed-citation publication-type="journal"><string-name><surname>Bates</surname>, <given-names>Douglas</given-names></string-name> &amp; <string-name><surname>M&#228;chler</surname>, <given-names>Martin</given-names></string-name> &amp; <string-name><surname>Bolker</surname>, <given-names>Ben</given-names></string-name> &amp; <string-name><surname>Walker</surname>, <given-names>Steve</given-names></string-name>. <year>2015</year>. <article-title>Fitting linear mixed-effects models using lme4</article-title>. <source>Journal of Statistical Software</source> <volume>67</volume>(<issue>1</issue>). <fpage>1</fpage>&#8211;<lpage>48</lpage>. DOI: <pub-id pub-id-type="doi">10.18637/jss.v067.i01</pub-id></mixed-citation></ref>
<ref id="B2"><mixed-citation publication-type="book"><string-name><surname>Beaver</surname>, <given-names>David</given-names></string-name> &amp; <string-name><surname>Clark</surname>, <given-names>Brady</given-names></string-name>. <year>2008</year>. <source>Sense and sensitivity: how focus determines meaning</source>. <publisher-loc>Chichester</publisher-loc>: <publisher-name>Wiley-Blackwell</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1002/9781444304176</pub-id></mixed-citation></ref>
<ref id="B3"><mixed-citation publication-type="journal"><string-name><surname>Beltrama</surname>, <given-names>Andrea</given-names></string-name>. <year>2021</year>. <article-title>Just perfect, simply the best: an analysis of emphatic exclusion</article-title>. <source>Linguistics and Philosophy</source> <volume>45</volume>. <fpage>321</fpage>&#8211;<lpage>364</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/s10988-021-09326-x</pub-id></mixed-citation></ref>
<ref id="B4"><mixed-citation publication-type="book"><string-name><surname>Beltrama</surname>, <given-names>Andrea</given-names></string-name> &amp; <string-name><surname>Xiang</surname>, <given-names>Ming</given-names></string-name>. <year>2013</year>. <chapter-title>Is &#8216;good&#8217; better than &#8216;excellent&#8217;? An experimental investigation on scalar implicatures and gradable adjectives</chapter-title>. In <string-name><surname>Chemla</surname>, <given-names>Emmanuel</given-names></string-name> &amp; <string-name><surname>Homer</surname>, <given-names>Vincent</given-names></string-name> &amp; <string-name><surname>Winterstein</surname>, <given-names>Gr&#233;goire</given-names></string-name> (eds.), <source>Proceedings of Sinn und Bedeutung 17</source>, <fpage>81</fpage>&#8211;<lpage>98</lpage>.</mixed-citation></ref>
<ref id="B5"><mixed-citation publication-type="journal"><string-name><surname>Bonomi</surname>, <given-names>Andrea</given-names></string-name> &amp; <string-name><surname>Casalegno</surname>, <given-names>Paolo</given-names></string-name>. <year>1993</year>. <article-title><italic>Only</italic>: association with focus in event semantics</article-title>. <source>Natural Language Semantics</source> <volume>2</volume>. <fpage>1</fpage>&#8211;<lpage>45</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/BF01255430</pub-id></mixed-citation></ref>
<ref id="B6"><mixed-citation publication-type="journal"><string-name><surname>Coppock</surname>, <given-names>Elizabeth</given-names></string-name> &amp; <string-name><surname>Beaver</surname>, <given-names>David</given-names></string-name>. <year>2014</year>. <article-title>Principles of the exclusive muddle</article-title>. <source>Journal of Semantics</source> <volume>31</volume>(<issue>3</issue>). <fpage>371</fpage>&#8211;<lpage>432</lpage>. DOI: <pub-id pub-id-type="doi">10.1093/jos/fft007</pub-id></mixed-citation></ref>
<ref id="B7"><mixed-citation publication-type="book"><string-name><surname>Fagen</surname>, <given-names>Lucas</given-names></string-name>. <year>2025</year>. <chapter-title>Exclusives and scale structure</chapter-title>. Ms., <publisher-name>The University of Chicago</publisher-name>.</mixed-citation></ref>
<ref id="B8"><mixed-citation publication-type="journal"><string-name><surname>Fraundorf</surname>, <given-names>Scott H.</given-names></string-name> &amp; <string-name><surname>Watson</surname>, <given-names>Duane G.</given-names></string-name> &amp; <string-name><surname>Benjamin</surname>, <given-names>Aaron S.</given-names></string-name> <year>2010</year>. <article-title>Recognition memory reveals just how contrastive contrastive accenting really is</article-title>. <source>Journal of Memory and Language</source> <volume>63</volume>(<issue>3</issue>). <fpage>367</fpage>&#8211;<lpage>386</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2010.06.004</pub-id></mixed-citation></ref>
<ref id="B9"><mixed-citation publication-type="journal"><string-name><surname>Gotzner</surname>, <given-names>Nicole</given-names></string-name> &amp; <string-name><surname>Solt</surname>, <given-names>Stephanie</given-names></string-name> &amp; <string-name><surname>Benz</surname>, <given-names>Anton</given-names></string-name>. <year>2018</year>. <article-title>Scalar diversity, negative strengthening, and adjectival semantics</article-title>. <source>Frontiers in Psychology</source> <volume>9</volume>. <elocation-id>1659</elocation-id>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2018.01659</pub-id></mixed-citation></ref>
<ref id="B10"><mixed-citation publication-type="book"><string-name><surname>Grice</surname>, <given-names>Herbert Paul</given-names></string-name>. <year>1967</year>. <chapter-title>Logic and Conversation</chapter-title>. In <string-name><surname>Grice</surname>, <given-names>Paul</given-names></string-name> (ed.), <source>Studies in the Way of Words</source>, <fpage>41</fpage>&#8211;<lpage>58</lpage>. <publisher-name>Harvard University Press</publisher-name>.</mixed-citation></ref>
<ref id="B11"><mixed-citation publication-type="thesis"><string-name><surname>Hoeks</surname>, <given-names>Morwenna</given-names></string-name>. <year>2023</year>. <source>Comprehending focus/representing contrast</source>: <publisher-name>University of California, Santa Cruz</publisher-name> dissertation.</mixed-citation></ref>
<ref id="B12"><mixed-citation publication-type="journal"><string-name><surname>Horn</surname>, <given-names>Laurence</given-names></string-name>. <year>1969</year>. <article-title>A presuppositional analysis of only and even</article-title>. In <source>Proceedings of CLS 5</source>.</mixed-citation></ref>
<ref id="B13"><mixed-citation publication-type="thesis"><string-name><surname>Horn</surname>, <given-names>Laurence</given-names></string-name>. <year>1972</year>. <source>On the semantic properties of logical operators in English</source>: <publisher-name>UCLA</publisher-name> dissertation.</mixed-citation></ref>
<ref id="B14"><mixed-citation publication-type="book"><string-name><surname>Horn</surname>, <given-names>Laurence</given-names></string-name>. <year>2000</year>. <chapter-title>Pick a theory (not just any theory)</chapter-title>. In <string-name><surname>Horn</surname>, <given-names>Laurence</given-names></string-name> &amp; <string-name><surname>Kato</surname>, <given-names>Yasuhiko</given-names></string-name> (eds.), <source>Negation and polarity: syntactic and semantic perspectives</source>, <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1093/oso/9780198238744.001.0001</pub-id></mixed-citation></ref>
<ref id="B15"><mixed-citation publication-type="journal"><string-name><surname>Hu</surname>, <given-names>Jennifer</given-names></string-name> &amp; <string-name><surname>Levy</surname>, <given-names>Roger</given-names></string-name> &amp; <string-name><surname>Degen</surname>, <given-names>Judith</given-names></string-name> &amp; <string-name><surname>Schuster</surname>, <given-names>Sebastian</given-names></string-name>. <year>2023</year>. <article-title>Expectations over unspoken alternatives predict pragmatic inferences</article-title>. <source>Transactions of the Association for Computational Linguistics</source>. To appear. DOI: <pub-id pub-id-type="doi">10.1162/tacl_a_00579</pub-id></mixed-citation></ref>
<ref id="B16"><mixed-citation publication-type="journal"><string-name><surname>Kim</surname>, <given-names>Christina S.</given-names></string-name> &amp; <string-name><surname>Gunlogson</surname>, <given-names>Christine</given-names></string-name> &amp; <string-name><surname>Tanenhaus</surname>, <given-names>Michael K.</given-names></string-name> &amp; <string-name><surname>Runner</surname>, <given-names>Jeffrey T.</given-names></string-name> <year>2015</year>. <article-title>Context-driven expectations about focus alternatives</article-title>. <source>Cognition</source> <volume>139</volume>. <fpage>28</fpage>&#8211;<lpage>49</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.cognition.2015.02.009</pub-id></mixed-citation></ref>
<ref id="B17"><mixed-citation publication-type="thesis"><string-name><surname>Klinedinst</surname>, <given-names>Nathan</given-names></string-name>. <year>2005</year>. <source>Scales and <italic>only</italic></source>: <publisher-name>UCLA</publisher-name> MA thesis.</mixed-citation></ref>
<ref id="B18"><mixed-citation publication-type="journal"><string-name><surname>Morzycki</surname>, <given-names>Marcin</given-names></string-name>. <year>2012</year>. <article-title>Adjectival extremeness: Degree modification and contextually restricted scales</article-title>. <source>Natural Language &amp; Linguistic Theory</source> <volume>30</volume>(<issue>2</issue>). <fpage>567</fpage>&#8211;<lpage>609</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/s11049-011-9162-0</pub-id></mixed-citation></ref>
<ref id="B19"><mixed-citation publication-type="book"><string-name><surname>Onea</surname>, <given-names>Edgar</given-names></string-name>. <year>2016</year>. <source>Potential questions at the semantics-pragmatics interface</source>. <publisher-name>Brill</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1163/9789004217935</pub-id></mixed-citation></ref>
<ref id="B20"><mixed-citation publication-type="journal"><string-name><surname>Orenstein</surname>, <given-names>Dina</given-names></string-name> &amp; <string-name><surname>Greenberg</surname>, <given-names>Yael</given-names></string-name>. <year>2010</year>. <article-title>The semantics and focus sensitivity of the Hebrew (unstressed) <italic>stam</italic></article-title>. In <source>Proceedings of IATL 26</source>.</mixed-citation></ref>
<ref id="B21"><mixed-citation publication-type="journal"><string-name><surname>Ronai</surname>, <given-names>Eszter</given-names></string-name> &amp; <string-name><surname>Xiang</surname>, <given-names>Ming</given-names></string-name>. <year>2024</year>. <article-title>What could have been said? Alternatives and variability in pragmatic inferences</article-title>. <source>Journal of Memory and Language</source> <volume>136</volume>. <elocation-id>104507</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2024.104507</pub-id></mixed-citation></ref>
<ref id="B22"><mixed-citation publication-type="journal"><string-name><surname>Sun</surname>, <given-names>Chao</given-names></string-name> &amp; <string-name><surname>Tian</surname>, <given-names>Ye</given-names></string-name> &amp; <string-name><surname>Breheny</surname>, <given-names>Richard</given-names></string-name>. <year>2018</year>. <article-title>A link between local enrichment and scalar diversity</article-title>. <source>Frontiers in Psychology</source> <volume>9</volume>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2018.02092</pub-id></mixed-citation></ref>
<ref id="B23"><mixed-citation publication-type="journal"><string-name><surname>Thomas</surname>, <given-names>William</given-names></string-name> &amp; <string-name><surname>Deo</surname>, <given-names>Ashwini</given-names></string-name>. <year>2020</year>. <article-title>The interaction of <italic>just</italic> with modified scalar predicates</article-title>. In <source>Proceedings of Sinn und Bedeutung 24</source>.</mixed-citation></ref>
<ref id="B24"><mixed-citation publication-type="journal"><string-name><surname>van Rooij</surname>, <given-names>Robert</given-names></string-name>. <year>2002</year>. <article-title>Relevance only</article-title>. In <source>Proceedings of the Sixth Workshop on the Semantics and Pragmatics of Dialogue</source>.</mixed-citation></ref>
<ref id="B25"><mixed-citation publication-type="journal"><string-name><surname>van Tiel</surname>, <given-names>Bob</given-names></string-name> &amp; <string-name><surname>Miltenburg</surname>, <given-names>Emiel Van</given-names></string-name> &amp; <string-name><surname>Zevakhina</surname>, <given-names>Natalia</given-names></string-name> &amp; <string-name><surname>Geurts</surname>, <given-names>Bart</given-names></string-name>. <year>2016</year>. <article-title>Scalar diversity</article-title>. <source>Journal of Semantics</source> <volume>33</volume>(<issue>1</issue>). <fpage>137</fpage>&#8211;<lpage>175</lpage>.</mixed-citation></ref>
<ref id="B26"><mixed-citation publication-type="book"><string-name><surname>Warstadt</surname>, <given-names>Alex</given-names></string-name>. <year>2020</year>. <chapter-title>&#8220;Just&#8221; don&#8217;t ask: exclusives and potential questions</chapter-title>. In <string-name><surname>Franke</surname>, <given-names>Michael</given-names></string-name> &amp; <string-name><surname>Kompa</surname>, <given-names>Nikola</given-names></string-name> &amp; <string-name><surname>Liu</surname>, <given-names>Mingya</given-names></string-name> &amp; <string-name><surname>Mueller</surname>, <given-names>Jutta L.</given-names></string-name> &amp; <string-name><surname>Schwab</surname>, <given-names>Juliane</given-names></string-name> (eds.), <source>Proceedings of Sinn und Bedeutung 24</source>, vol. <volume>2</volume>. <fpage>373</fpage>&#8211;<lpage>390</lpage>.</mixed-citation></ref>
<ref id="B27"><mixed-citation publication-type="journal"><string-name><surname>Washburn</surname>, <given-names>Mary Byram</given-names></string-name> &amp; <string-name><surname>Kaiser</surname>, <given-names>Elsi</given-names></string-name> &amp; <string-name><surname>Zubizarreta</surname>, <given-names>Maria Luisa</given-names></string-name>. <year>2011</year>. <article-title>Focus facilitation and non-associative sets</article-title>. <source>Proceedings of the 15th Workshop on the Semantics and Pragmatics of Dialogue</source> <fpage>94</fpage>&#8211;<lpage>102</lpage>.</mixed-citation></ref>
<ref id="B28"><mixed-citation publication-type="journal"><string-name><surname>Westera</surname>, <given-names>Matthijs</given-names></string-name> &amp; <string-name><surname>Boleda</surname>, <given-names>Gemma</given-names></string-name>. <year>2020</year>. <article-title>A closer look at scalar diversity using contextualized semantic similarity</article-title>. <source>Proceedings of Sinn und Bedeutung</source> <volume>24</volume>(<issue>2</issue>). <fpage>439</fpage>&#8211;<lpage>454</lpage>. DOI: <pub-id pub-id-type="doi">10.18148/sub/2020.v24i2.908</pub-id></mixed-citation></ref>
<ref id="B29"><mixed-citation publication-type="book"><string-name><surname>Wiegand</surname>, <given-names>Mia</given-names></string-name>. <year>2018</year>. <chapter-title>Exclusive morphosemantics: <italic>just</italic> and covert quantification</chapter-title>. In <string-name><surname>Bennett</surname>, <given-names>Wm. G.</given-names></string-name> &amp; <string-name><surname>Hracs</surname>, <given-names>Lindsay</given-names></string-name> &amp; <string-name><surname>Storoshenko</surname>, <given-names>Dennis Ryan</given-names></string-name> (eds.), <source>Proceedings of the 35th West Coast Conference on Formal Linguistics</source>. <fpage>419</fpage>&#8211;<lpage>429</lpage>. <publisher-loc>Somerville, MA</publisher-loc>.</mixed-citation></ref>
<ref id="B30"><mixed-citation publication-type="thesis"><string-name><surname>Windhearn</surname>, <given-names>Mia</given-names></string-name>. <year>2021</year>. <source>Alternatives, exclusivity, and underspecification</source>. <publisher-name>Cornell University</publisher-name> dissertation.</mixed-citation></ref>
<ref id="B31"><mixed-citation publication-type="journal"><string-name><surname>Zehr</surname>, <given-names>Jeremy</given-names></string-name> &amp; <string-name><surname>Schwarz</surname>, <given-names>Florian</given-names></string-name>. <year>2018</year>. <article-title>PennController for Internet Based Experiments (IBEX)</article-title>. DOI: <pub-id pub-id-type="doi">10.17605/OSF.IO/MD832</pub-id></mixed-citation></ref>
</ref-list>
</back>
</article>