<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20120330//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<!--<?xml-stylesheet type="text/xsl" href="article.xsl"?>-->
<article article-type="research-article" dtd-version="1.2" xml:lang="en" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id journal-id-type="issn">2397-1835</journal-id>
<journal-title-group>
<journal-title>Glossa: a journal of general linguistics</journal-title>
</journal-title-group>
<issn pub-type="epub">2397-1835</issn>
<publisher>
<publisher-name>Ubiquity Press</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5334/gjgl.1457</article-id>
<article-categories>
<subj-group>
<subject>Research</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Adjectival polarity and the processing of scalar inferences</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-4169-3179</contrib-id>
<name>
<surname>van Tiel</surname>
<given-names>Bob</given-names>
</name>
<email>bobvantiel@gmail.com</email>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0001-8453-1105</contrib-id>
<name>
<surname>Pankratz</surname>
<given-names>Elizabeth</given-names>
</name>
<xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
</contrib-group>
<aff id="aff-1"><label>1</label>Donders Institute for Brain, Cognition and Behaviour, Postbus 9010, 6500 GL Nijmegen, NL</aff>
<aff id="aff-2"><label>2</label>Leibniz-Zentrum f&#252;r Allgemeine Sprachwissenschaft, Sch&#252;tzenstra&#223;e 18, 10117 Berlin, DE</aff>
<pub-date publication-format="electronic" date-type="pub" iso-8601-date="2021-03-31">
<day>31</day>
<month>03</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>6</volume>
<issue>1</issue>
<elocation-id>32</elocation-id>
<history>
<date date-type="received" iso-8601-date="2020-10-02">
<day>02</day>
<month>10</month>
<year>2020</year>
</date>
<date date-type="accepted" iso-8601-date="2021-02-07">
<day>07</day>
<month>02</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright: &#x00A9; 2021 The Author(s)</copyright-statement>
<copyright-year>2021</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See <uri xlink:href="http://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</uri>.</license-p>
</license>
</permissions>
<self-uri xlink:href="http://www.glossa-journal.org/articles/10.5334/gjgl.1457/"/>
<abstract>
<p>In a seminal study, Bott &amp; Noveck (<xref ref-type="bibr" rid="B3">2004</xref>) found that the computation of the scalar inference of &#8216;some&#8217; implying &#8216;not all&#8217; was associated with increased sentence verification times, suggesting a processing cost. Recently, van Tiel and colleagues (<xref ref-type="bibr" rid="B75">2019b</xref>) hypothesised that the presence of this processing cost critically depends on the polarity of the scalar word. We comprehensively evaluated this polarity hypothesis on the basis of a sentence-picture verification task in which we tested the processing of 16 types of adjectival scalar inferences. We develop a quantitative measure of adjectival polarity which combines insights from linguistics and psychology. In line with the polarity hypothesis, our measure of polarity reliably predicted the presence or absence of a processing cost (i.e., an increase in sentence verification times). We conclude that the alleged processing cost for scalar inferencing in verification tasks is not due to the process of drawing a scalar inference, but rather to the cognitive difficulty of verifying negative information.</p>
</abstract>
<kwd-group>
<kwd>scalar inference</kwd>
<kwd>adjective</kwd>
<kwd>polarity</kwd>
<kwd>sentence processing</kwd>
<kwd>implicature</kwd>
</kwd-group>
<funding-group specific-use="crossref">
<award-group>
<funding-source id="gs1" country="DEU">
<institution-wrap>
<institution>German Research Council</institution>
<institution-id institution-id-type="doi" vocab="open-funder-registry" vocab-identifier="10.13039/open_funder_registry">10.13039/501100001659</institution-id>
</institution-wrap>
</funding-source>
<award-id>FR 3482/2-1</award-id>
<award-id>KR951/14-1</award-id>
<award-id>SA 925/17-1</award-id>
</award-group>
<award-group>
<funding-source id="gs2" country="NLD">
<institution-wrap>
<institution>Dutch Science Organisation</institution>
<institution-id institution-id-type="doi" vocab="open-funder-registry" vocab-identifier="10.13039/open_funder_registry">10.13039/501100003246</institution-id>
</institution-wrap>
</funding-source>
<award-id>024.001.006</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body>
<sec>
<title>1 Introduction</title>
<p>An utterance of (1a) can be interpreted in (at least) two ways.</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(1)</td>
<td>a.</td>
<td>It is warm outside.</td>
</tr>
<tr>
<td>&#160;</td>
<td>b.</td>
<td>It is hot outside.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>On its <italic>one-sided</italic> interpretation, the utterance conveys that that the temperature outside exceeds some contextually determined value, e.g., 20 degrees Celsius. On its <italic>two-sided</italic> interpretation, the utterance conveys, in addition, that the temperature lies below another contextually determined value, e.g., 30 degrees Celsius. In other words, on its two-sided interpretation, an utterance of (1a) conveys that (1b) is false.</p>
<p>Most current theories assume that the one-sided interpretation corresponds to the <italic>literal</italic> interpretation of (1a). To explain how the two-sided interpretation emerges from this literal interpretation, it is generally assumed that words like &#8216;warm&#8217; evoke lexical scales consisting of words that are ordered in terms of logical strength, e.g., &#10216;warm, hot&#10217;. Here, &#8216;hot&#8217; is assumed to be logically stronger than &#8216;warm&#8217; (at least at the level of literal meaning) since it refers to a more restrictive range of situations. For example, 20 degrees Celsius counts as warm but not hot, but there are no situations that count as hot but not warm (again: at the level of literal meaning). Given a lexical scale, uttering a sentence containing the weaker scalar word may imply that the corresponding sentence containing the stronger scalar word is false. Hence, these inferences have become known as <italic>scalar inferences</italic> (e.g., <xref ref-type="bibr" rid="B34">Horn 1972</xref>; <xref ref-type="bibr" rid="B23">Gazdar 1979</xref>; <xref ref-type="bibr" rid="B68">Soames 1982</xref>; <xref ref-type="bibr" rid="B24">Geurts 2010</xref>; <xref ref-type="bibr" rid="B36">Huang 2014</xref>).</p>
<p>Scalar inferences are commonly explained as a variety of <italic>conversational implicature</italic>, i.e., as a type of inference that can be calculated on the basis of the literal interpretation of an utterance and the assumption that the speaker is cooperative (<xref ref-type="bibr" rid="B31">Grice 1975</xref>). In the case at hand, someone who utters (1a) could have been more informative&#8212;and therefore cooperative&#8212;by saying (1b). Why didn&#8217;t she? Presumably because she believes that it is not hot outside, i.e., she believes that (1b) is false.</p>
<p>According to this implicature-based explanation, the one-sided interpretation is theoretically prior to the two-sided interpretation, since the one-sided interpretation serves as a premise in the reasoning process that ultimately leads to the scalar inference (and, consequently, the two-sided interpretation). An important question is whether the theoretical priority of the literal interpretation is reflected in listeners&#8217; cognitive processing, i.e., whether the computation of scalar inferences is associated with a <italic>processing cost</italic> vis-&#224;-vis the literal interpretation, in line with the latter&#8217;s theoretical priority (<xref ref-type="bibr" rid="B59">R&#233;canati 1995</xref>).</p>
<p>Levinson (<xref ref-type="bibr" rid="B44">2000</xref>) explicitly rejects such an isomorphism between derivational complexity and processing difficulty. Levinson&#8217;s point of departure is the observation that human communication has a comparatively slow information transmission rate because of the time needed for phonetic articulation (i.e., we can only talk so fast). One way of reducing this articulatory bottleneck is by incorporating certain pragmatic inferences&#8212;including scalar inferences&#8212;into the lexical meaning. Levinson argues that this process of lexical integration is pragmatic, but occurs automatically during the construction of the initial interpretation of the utterance. Thus, according to Levinson, an utterance of (1a) receives a scalar inference by default, though this inference can be overridden in certain special situations (e.g., when the speaker continues with &#8216;In fact, it is hot outside&#8217;).</p>
<p>Proponents of <italic>relevance theory</italic> take a more nuanced stance on the cognitive cost of scalar inferencing (e.g., <xref ref-type="bibr" rid="B70">Sperber &amp; Wilson 1987</xref>; <xref ref-type="bibr" rid="B71">1995</xref>; <xref ref-type="bibr" rid="B54">Noveck &amp; Sperber 2007</xref>; <xref ref-type="bibr" rid="B10">Chevallier et al. 2008</xref>). According to relevance theory, listeners try to piece together the speaker&#8217;s intention based on the literal interpretation of an utterance, the surrounding context, and the expectation that the utterance is optimally relevant. Relevance theorists argue that, if the context makes the two-sided interpretation sufficiently relevant (e.g., when (1a) is said to someone who wants to know what to wear today), scalar inferences may be computed without any processing cost, and the two-sided interpretation may even be easier to retrieve than the literal interpretation. However, if there is no such facilitating context&#8212;as will generally be the case in the experiments that we describe below&#8212;the literal interpretation of an utterance is a good first guess as to the speaker&#8217;s intention, and deriving the scalar inference involves an inferential process of meaning construction that is cognitively taxing and time-consuming.</p>
<p>Several more recent proposals side with relevance theory in assuming that the presence of a processing cost for scalar inferencing varies with certain methodological and contextual factors. However, they do not necessarily commit to the relevance-theoretic assumption that relevance is paramount in deciding whether or not a processing cost will be observed. Thus, e.g., these proposals have argued that the presence of a processing cost depends on the question under discussion (<xref ref-type="bibr" rid="B83">Westera 2017</xref>; <xref ref-type="bibr" rid="B62">Ronai &amp; Xiang 2020</xref>), the structural characteristics of the alternatives (<xref ref-type="bibr" rid="B7">Chemla &amp; Bott 2014</xref>; <xref ref-type="bibr" rid="B79">van Tiel &amp; Schaeken 2016</xref>), the naturalness of the utterance (<xref ref-type="bibr" rid="B18">Degen &amp; Tanenhaus 2016</xref>), and, as we will discuss in much more detail later, the <italic>polarity</italic> of the scalar inference (<xref ref-type="bibr" rid="B75">van Tiel et al. 2019b</xref>).</p>
<p>Testing these different theories about the processing of scalar inferences requires operationalising the notion of a processing cost. Various proposals have been made in this respect, focusing on participants&#8217; eye movements (e.g., <xref ref-type="bibr" rid="B32">Grodner et al. 2010</xref>; <xref ref-type="bibr" rid="B37">Huang &amp; Snedeker 2018</xref>), brain signals (e.g., <xref ref-type="bibr" rid="B53">Noveck &amp; Posada 2003</xref>; <xref ref-type="bibr" rid="B1">Barbet &amp; Thierry 2018</xref>), reading times (e.g., <xref ref-type="bibr" rid="B5">Breheny et al. 2006</xref>; <xref ref-type="bibr" rid="B57">Politzer-Ahles &amp; Husband 2018</xref>), and working memory capacity (e.g., <xref ref-type="bibr" rid="B17">De Neys &amp; Schaeken 2007</xref>; <xref ref-type="bibr" rid="B46">Marty &amp; Chemla 2013</xref>). In this study, we focus on the idea that processing costs can be measured by looking at sentence verification times. In the next section, we briefly discuss previous studies using this measure. We show that these studies have given rise to conflicting results, and describe a recent proposal that aims to make sense of these conflicting data in terms of the polarity of scalar words. Afterwards, we turn to our own study in which we systematically and extensively tested the polarity-based explanation.</p>
<sec>
<title>1.1 Previous sentence verification studies</title>
<p>In sentence verification studies, participants are presented with a sentence and have to decide whether that sentence is true or false in a given situation. This situation can be presented pictorially or correspond to participants&#8217; world knowledge. To carry out the verification process, it is often assumed that participants represent both the sentence and the situation in a common format, e.g., a proposition. In addition, participants initialise a truth index that tracks the truth value of the sentence. Sentence verification then consists in systematically manipulating and comparing the representations associated with the sentence and the situation, and carrying out operations on the truth index (cf. <xref ref-type="bibr" rid="B14">Clark &amp; Chase 1972</xref>; <xref ref-type="bibr" rid="B6">Carpenter &amp; Just 1975</xref>).</p>
<p>To examine whether the computation of scalar inferences is associated with a processing cost, Bott &amp; Noveck (<xref ref-type="bibr" rid="B3">2004</xref>) tested the &#10216;some, all&#10217; scale in a series of sentence verification tasks. Participants in their experiments had to indicate the truth value of underinformative sentences like (2).</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(2)</td>
<td>a.</td>
<td>Some dogs are mammals.</td>
</tr>
<tr>
<td>&#160;</td>
<td>b.</td>
<td>Some parrots are birds.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>These sentences are true when interpreted literally, since, e.g., there are dogs that are mammals, but they are false when the scalar inference is computed and &#8216;some&#8217; is interpreted as &#8216;some but not all&#8217;, since, in fact, all dogs are mammals. Hence, participants&#8217; truth judgements to these underinformative sentences are indicative of whether or not they computed a scalar inference.</p>
<p>In Bott and Noveck&#8217;s Exp. 3, participants gave intuitive truth judgements to sentences such as (2). Many participants were ambivalent about the truth of underinformative sentences like these, varying their responses across structurally similar trials. Comparing the verification times of these ambivalent participants, Bott and Noveck found that it took participants significantly longer to answer &#8216;false&#8217; (i.e., the answer suggesting a two-sided interpretation) than &#8216;true&#8217; (i.e., the answer suggesting a literal interpretation). This difference in verification times was absent in a control condition with sentences that were unambiguously true or false, as in (3).</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(3)</td>
<td>a.</td>
<td>Some mammals are dogs.</td>
</tr>
<tr>
<td>&#160;</td>
<td>b.</td>
<td>Some dogs are birds.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The pattern of results that Bott and Noveck observed suggests that the computation of scalar inferences is associated with a processing cost, at least in out-of-the-blue contexts. This conclusion is in line with relevance theory and several more recent approaches (e.g., <xref ref-type="bibr" rid="B7">Chemla &amp; Bott 2014</xref>; <xref ref-type="bibr" rid="B79">van Tiel &amp; Schaeken 2016</xref>; <xref ref-type="bibr" rid="B19">Degen &amp; Tanenhaus 2019</xref>), but speaks against Levinson&#8217;s proposal that the default interpretation of &#8216;some&#8217; is two-sided.</p>
<p>In what follows, we refer to Bott and Noveck&#8217;s finding that participants take significantly longer to reject underinformative sentences like (2) than to accept them as the <italic>B&amp;N effect</italic>. The B&amp;N effect for the &#10216;some, all&#10217; scale has been replicated in numerous studies (e.g., <xref ref-type="bibr" rid="B53">Noveck &amp; Posada 2003</xref>; <xref ref-type="bibr" rid="B74">Tomlinson Jr. et al. 2013</xref>; <xref ref-type="bibr" rid="B7">Chemla &amp; Bott 2014</xref>; <xref ref-type="bibr" rid="B15">Cremers &amp; Chemla 2014</xref>; <xref ref-type="bibr" rid="B79">van Tiel &amp; Schaeken 2016</xref>; <xref ref-type="bibr" rid="B62">Ronai &amp; Xiang 2020</xref>). At the same time, however, several studies have shown that the B&amp;N effect does not always generalise beyond the specific case of &#8216;some&#8217;.</p>
<p>For example, Chevallier and colleagues (<xref ref-type="bibr" rid="B9">2010</xref>) tested the &#10216;or, and&#10217; scale in a sentence-picture verification task. Participants in their experiment had to judge the truth value of sentences like (4) in displays showing different types of objects.</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(4)</td>
<td>There is a sun or a train.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In the target condition, the display for (4) showed both a sun and a train. Here, the sentence is literally true but false if the scalar inference is computed and &#8216;or&#8217; is interpreted as excluding &#8216;and&#8217;. As in Bott and Noveck&#8217;s study, many participants vacillated between responding with &#8216;true&#8217; or &#8216;false&#8217; in the target condition. Unlike Bott and Noveck&#8217;s study, however, Chevallier and colleagues did not observe a significant difference in verification times between &#8216;true&#8217; and &#8216;false&#8217; answers.</p>
<p>Even more challenging data comes from studies testing the processing of scalar words in negative sentences, as in (5) (<xref ref-type="bibr" rid="B15">Cremers &amp; Chemla 2014</xref>; <xref ref-type="bibr" rid="B61">Romoli &amp; Schwarz 2015</xref>; <xref ref-type="bibr" rid="B47">Marty et al. 2020</xref>).</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(5)</td>
<td>a.</td>
<td>Not all dogs are insects.</td>
</tr>
<tr>
<td>&#160;</td>
<td>b.</td>
<td>Not all parrots are mammals.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>On their literal interpretation, the sentences in (5) merely convey that there are dogs that are not insects, and that there are parrots that are not mammals. So on their literal interpretation, these sentences are true. However, the sentences in (5) may give rise to the <italic>indirect</italic> scalar inference that the corresponding sentences with &#8216;some&#8217; are false, i.e., that at least some dogs are insects, and that at least some parrots are mammals. Clearly, these scalar inferences are false.</p>
<p>Cremers &amp; Chemla (<xref ref-type="bibr" rid="B15">2014: Exp. 1</xref>) asked participants to give their intuitive truth judgements to sentences such as (5). As in Bott and Noveck&#8217;s study, participants were ambivalent about the truth of these underinformative sentences, responding differently across structurally similar trials. However, unlike Bott and Noveck&#8217;s study, Cremers and Chemla found that participants were <italic>faster</italic> when responding &#8216;false&#8217; than when responding &#8216;true&#8217; (recall that the B&amp;N effect consists in <italic>slower</italic> verification times when responding &#8216;false&#8217;). This difference in response times was absent in a control condition involving sentences that were unambiguously true or false, as in (6).</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(6)</td>
<td>a.</td>
<td>Not all mammals are dogs.</td>
</tr>
<tr>
<td>&#160;</td>
<td>b.</td>
<td>Not all dogs are mammals.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Romoli &amp; Schwarz (<xref ref-type="bibr" rid="B61">2015</xref>) and Marty et al. (<xref ref-type="bibr" rid="B47">2020</xref>) found the same pattern of response times for various other types of indirect scalar inferences. These findings are noteworthy because they suggest that scalar inferences are processed differently when negation is involved. We will return to this point below.</p>
<p>To obtain a more comprehensive picture of the generalisability of the B&amp;N effect, van Tiel and colleagues (<xref ref-type="bibr" rid="B75">2019b</xref>) tested the processing of seven lexical scales: &#10216;some, all&#10217;, &#10216;or, and&#10217;, &#10216;most, all&#10217;, &#10216;might, must&#10217;, &#10216;try, succeed&#10217;, &#10216;low, empty&#10217;, and &#10216;scarce, absent&#10217;. In their Exp. 1, participants gave intuitive truth judgements to sentences containing the weaker scalar word. These sentences were presented in two types of displays. In control displays, the sentence was unambiguously true or false. In target displays, the sentence was literally true but false if the corresponding scalar inference was computed. To illustrate, <bold><italic><xref ref-type="table" rid="T1">Table 1</xref></italic></bold> shows the materials for &#10216;some, all&#10217; and &#10216;low, empty&#10217;.</p>
<table-wrap id="T1">
<label>Table 1</label>
<caption>
<p>Materials used by van Tiel et al. (<xref ref-type="bibr" rid="B75">2019b: Exp. 1</xref>) for the lexical scales &#10216;some, all&#10217; and &#10216;low, empty&#10217;.</p>
</caption>
<table>
<tr>
<th colspan="4"><hr/></th>
</tr>
<tr>
<th align="left"><italic>Sentence</italic></th>
<th align="center"><italic>Control-True</italic></th>
<th align="center"><italic>Control-False</italic></th>
<th align="center"><italic>Target</italic></th>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">Some of the socks are pink.</td>
<td align="center" valign="top"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="/article/id/5412/file/61799/"/></td>
<td align="center" valign="top"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="/article/id/5412/file/61800/"/></td>
<td align="center" valign="top"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="/article/id/5412/file/61801/"/></td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">The battery is low.</td>
<td align="center" valign="top"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="/article/id/5412/file/61802/"/></td>
<td align="center" valign="top"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="/article/id/5412/file/61803/"/></td>
<td align="center" valign="top"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="/article/id/5412/file/61804/"/></td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
</table>
</table-wrap>
<p>In line with Bott and Noveck&#8217;s study, van Tiel and colleagues found that, in the case of &#8216;some&#8217;, participants were significantly slower to answer &#8216;false&#8217; than &#8216;true&#8217; in the target condition, whereas no difference in verification times was observed in the control condition. Van Tiel and colleagues also observed a B&amp;N effect for &#8216;or&#8217; (in contrast with the aforementioned study by <xref ref-type="bibr" rid="B9">Chevallier et al. 2010</xref>), &#8216;might&#8217;, &#8216;most&#8217;, and &#8216;try&#8217;. In the case of &#8216;low&#8217; and &#8216;scarce&#8217;, however, no significant difference in verification times between &#8216;true&#8217; and &#8216;false&#8217; responses was observed.</p>
<p>To explain this pattern of results, van Tiel and colleagues rely on the notion of <italic>polarity</italic>. In particular, van Tiel and colleagues argue that only the scalar inferences associated with <italic>positive</italic> scalar words are associated with a B&amp;N effect, and that this effect is the result of the cognitive difficulty of processing the corresponding <italic>negative</italic> scalar inference. In the next section, we first introduce the notion of polarity. Afterwards, we discuss in more detail the polarity-based explanation proposed by van Tiel and colleagues.</p>
</sec>
<sec>
<title>1.2 Polarity</title>
<p>Polarity is a fundamental but multifarious construct that refers to the fact that some words in natural language are positive while others are negative (cf. <xref ref-type="bibr" rid="B35">Horn 1989: Ch. 1&#8211;3 for an excellent overview</xref>). For example, &#8216;warm&#8217; is usually assumed to be positive, whereas &#8216;cold&#8217; is assumed to be negative. As this example already shows, negative words are not always explicitly marked for negativity. When negative marking is absent, these words are assumed to have an implicit negative element in their underlying semantic representation (<xref ref-type="bibr" rid="B12">Clark 1974</xref>; <xref ref-type="bibr" rid="B33">Heim 2008</xref>; <xref ref-type="bibr" rid="B49">Moracchini 2019</xref>). The notion of polarity has been prominently studied in linguistics and psychology; mostly disparately, but cf. Ingram et al. (<xref ref-type="bibr" rid="B38">2016</xref>) and Nouwen (<xref ref-type="bibr" rid="B52">2020</xref>) for more integrative approaches. However, these two fields have operationalised polarity in importantly different ways.</p>
<p>In linguistics, polarity is usually operationalised in terms of <italic>markedness</italic>, i.e., negative words tend to be marked compared to their positive counterparts (e.g., <xref ref-type="bibr" rid="B30">Greenberg 1966</xref>; <xref ref-type="bibr" rid="B45">Lyons 1968</xref>; <xref ref-type="bibr" rid="B13">Clark &amp; Clark 1977</xref>; <xref ref-type="bibr" rid="B27">Giv&#243;n 1979</xref>; <xref ref-type="bibr" rid="B43">Lehrer &amp; Lehrer 1982</xref>; <xref ref-type="bibr" rid="B42">Lehrer 1985</xref>; <xref ref-type="bibr" rid="B64">Sassoon 2010</xref>; <xref ref-type="bibr" rid="B50">Morzycki 2015</xref>). There are various ways of determining whether or not a word is marked. One such way relies on the fact that certain words make reference to a measurement scale in their semantics (e.g., <xref ref-type="bibr" rid="B39">Kennedy &amp; McNally 2005</xref>; <xref ref-type="bibr" rid="B69">Solt 2015</xref>). To illustrate, compare &#8216;many&#8217; and &#8216;few&#8217;. Both of these words operate on the quantity scale. However, whereas &#8216;many&#8217; denotes a <italic>lower</italic> bound on the quantity scale (e.g., &#8216;Many flowers are red&#8217; implies that the number of red flower is <italic>greater</italic> than a contextually determined threshold), &#8216;few&#8217; denotes an <italic>upper</italic> bound (<xref ref-type="bibr" rid="B78">van Tiel et al. 2021</xref>). To put it differently, &#8220;many-ness&#8221; and quantity are positively related, whereas &#8220;few-ness&#8221; and quantity are negatively related. As a consequence, &#8216;many&#8217; is usually characterised as positive and &#8216;few&#8217; as negative.</p>
<p>Van Tiel and colleagues rely on this characterisation, which they call the <italic>scalarity criterion</italic>, to intuitively classify the scalar words in their sample as positive or negative. Based on this criterion, they labelled &#8216;low&#8217; and &#8216;scarce&#8217; as negative, and all other words as positive. Recall that &#8216;low&#8217; and &#8216;scarce&#8217; were also the only two scalar words that failed to give rise to the B&amp;N effect. This concurrence between polarity and processing led van Tiel and colleagues to hypothesise that polarity is the key feature in determining whether or not a B&amp;N effect will be observed.</p>
<p>However, in addition to the scalarity criterion, there are various other diagnostics of linguistic polarity. Since adjectives are of particular interest for the current study, we focus here on two ways of diagnosing the linguistic polarity of adjectives. Both of these diagnostics build on a standard assumption in linguistics that many adjectives are members of antonym pairs where one member is positive and the other is negative (e.g., <xref ref-type="bibr" rid="B43">Lehrer &amp; Lehrer 1982</xref>).</p>
<p>A first way of diagnosing adjectival polarity involves the interpretation of &#8216;how&#8217; questions. In particular, &#8216;how&#8217; questions involving positive adjectives tend to be neutral, whereas those involving negative adjectives tend to presuppose that the adjective holds (e.g., <xref ref-type="bibr" rid="B60">Rett 2008</xref>). To illustrate, consider the questions in (7).</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(7)</td>
<td>a.</td>
<td>How long is a day on Venus?</td>
</tr>
<tr>
<td>&#160;</td>
<td>b.</td>
<td>How short is a day on Venus?</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Whereas (7a) is neutral about whether days on Venus are long or short, (7b) intuitively suggests that the speaker believes they are short. This observation suggests that &#8216;long&#8217; is positive, while &#8216;short&#8217; is negative. A direct consequence of the fact that negative adjectives are biasing in &#8216;how&#8217; questions is that they are less likely to occur in such questions than positive adjectives&#8212;e.g., the phrase &#8216;how long&#8217; is much more frequent in the ENCOW16A corpus (a web corpus consisting of almost 17 billion tokens, cf. <xref ref-type="bibr" rid="B66">Sch&#228;fer &amp; Bildhauer 2012</xref>; <xref ref-type="bibr" rid="B65">Sch&#228;fer 2015</xref>) than the phrase &#8216;how short&#8217; (199,033 vs. 2,456 occurrences).</p>
<p>A second way of linguistically delineating positive and negative adjectives looks at ratio phrases, such as &#8216;twice as&#8217; and &#8216;half as&#8217; (cf. <xref ref-type="bibr" rid="B64">Sassoon 2010</xref>). Ratio phrases presuppose a natural zero point. For many positive adjectives, such as &#8216;tall&#8217; and &#8216;old&#8217;, such a natural zero point is intuitively available. For example, conceptually, there is such a thing as zero tallness (i.e., being 0 centimeters tall) or zero oldness (i.e., being 0 days old). For many negative adjectives, however, there is no natural zero point. Thus, there is no such thing as zero shortness (which would correspond to infinite length) or zero youngness (infinite age). As a consequence, the positive adjective &#8216;old&#8217; is felicitous in ratio phrases, while its negative counterpart &#8216;young&#8217; is slightly odd, as shown by the minimal pair in (8).</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(8)</td>
<td>a.</td>
<td>&#160;&#160;She is twice as old as him.</td>
</tr>
<tr>
<td>&#160;</td>
<td>b.</td>
<td>?He is twice as young as her.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In line with this observation, Sassoon (<xref ref-type="bibr" rid="B64">2010</xref>) provides corpus data showing that positive adjectives are&#8212;as a rule&#8212;significantly more frequent than negative adjectives in ratio constructions such as &#8216;twice as&#8217;. In line with these data, &#8216;twice as old&#8217; was substantially more frequent than &#8216;twice as young&#8217; in the ENCOW16A corpus (258 vs. 3 occurrences).</p>
<p>In psychology, polarity is usually defined in terms of <italic>subjective valence</italic>, i.e., in terms of the positive or negative connotations that people have with a particular word (e.g., <xref ref-type="bibr" rid="B80">Wason 1959</xref>; <xref ref-type="bibr" rid="B55">Osgood &amp; Richards 1973</xref>). To illustrate, Mohammad (<xref ref-type="bibr" rid="B48">2018</xref>) presented participants with short lists of words and asked them to rank these based on their valence. These rankings were then converted to numeric values between 0 (indicating that the word was always ranked at the bottom of the list) and 1 (always ranked at the top). Thus, e.g., &#8216;good&#8217; was associated with a value of 0.938; &#8216;bad&#8217; with a value of 0.125, which reflects the intuition that &#8216;good&#8217; is positive and &#8216;bad&#8217; is negative.</p>
<p>The psychological notion of polarity as subjective valence also reverberates in natural language in several ways (e.g., <xref ref-type="bibr" rid="B4">Boucher &amp; Osgood 1969</xref>; <xref ref-type="bibr" rid="B2">Benjafield &amp; Adams-Webber 1976</xref>; <xref ref-type="bibr" rid="B56">Paradis et al. 2012</xref>). For example, it has been found that psychologically positive words are more frequently attested than negative ones (i.e., the <italic>Polyanna hypothesis</italic> formulated and tested by <xref ref-type="bibr" rid="B4">Boucher &amp; Osgood 1969</xref>). In line with this idea, a search in the ENCOW16A corpus shows that &#8216;good&#8217; is more than four times as frequent as &#8216;bad&#8217; (10,869,258 vs. 2,289,838 occurrences).</p>
<p>In most cases, the linguistic and psychological notions of polarity go hand in hand, but not always. For example, as we just saw, from a linguistic perspective, &#8216;old&#8217; is positive while &#8216;young&#8217; is negative. From a psychological perspective, the converse holds: &#8216;young&#8217; is positive and &#8216;old&#8217; is negative (e.g., in the study by <xref ref-type="bibr" rid="B48">Mohammad 2018, participants gave &#8216;old&#8217; a valence rating of 0.41 and &#8216;young&#8217; a rating of 0.81</xref>). Therefore one of the main contributions of this paper is the synthesis of several (potentially conflicting) polarity diagnostics into a single continuous polarity measure. We use this polarity measure to systematically test the speculation from van Tiel and colleagues that only positive scalar words are associated with a B&amp;N effect (recall that they relied on the scalarity criterion, a rather intuitive measure, to classify scalar words as positive or negative). However, before going into more detail about the present study, we explore why it is plausible that polarity affects the processing of scalar inferences.</p>
</sec>
<sec>
<title>1.3 A polarity-based explanation</title>
<p>In order to explain why only positive scalar words are associated with a B&amp;N effect, van Tiel and colleagues rely on the observation that verification times are systematically affected by the polarity of the sentence. To illustrate, consider the three sentences in (9). These sentences vary in their polarity: (9a) is positive, (9b) contains the implicitly negative word &#8216;below&#8217;, and (9c) contains the explicit sentential negation &#8216;not&#8217;. In what follows, we will conveniently refer to these three types of sentences as <italic>positives, implicit negatives</italic>, and <italic>explicit negatives</italic>.</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(9)</td>
<td>a.</td>
<td>The star is above the cross.</td>
</tr>
<tr>
<td>&#160;</td>
<td>b.</td>
<td>The cross is below the star.</td>
</tr>
<tr>
<td>&#160;</td>
<td>c.</td>
<td>The cross is not above the star.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Clark &amp; Chase (<xref ref-type="bibr" rid="B14">1972</xref>) measured the verification times for these three types of sentences in displays that always showed two vertically juxtaposed images. Crucially, the three sentences in (9) are all equivalent in these displays. Nonetheless, Clark and Chase found that participants were significantly faster to verify positives like (9a) than implicit negatives like (9b), which, in turn, were verified significantly more rapidly than explicit negatives like (9c). In other words, Clark and Chase found evidence for the generalisation in (10), where &#8216;&lt;&#8217; denotes <italic>faster</italic> verification times.</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(10)</td>
<td>positive &lt; implicit negative &lt; explicit negative</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Clark and Chase&#8217;s findings have been replicated in numerous studies (e.g., <xref ref-type="bibr" rid="B82">Wason 1972</xref>; <xref ref-type="bibr" rid="B6">Carpenter &amp; Just 1975</xref>; <xref ref-type="bibr" rid="B22">Fodor et al. 1975</xref>; <xref ref-type="bibr" rid="B8">Cheng &amp; Huang 1980</xref>; <xref ref-type="bibr" rid="B58">Proctor &amp; Cho 2006</xref>). However, it should be noted that all of these studies tested sentences that were presented without any relevant context. Indeed, later work has convincingly shown that these findings do not always generalise to more contextualised settings. For example, several studies found that explicit negatives can be verified as fast as positives if the context is right (cf. <xref ref-type="bibr" rid="B81">Wason 1965</xref>; <xref ref-type="bibr" rid="B51">Nieuwland &amp; Kuperberg 2008</xref>; <xref ref-type="bibr" rid="B73">Tian et al. 2010</xref>). In what follows, we will largely ignore this important qualification, since almost all verification studies on the processing of scalar inferences have made use of out-of-the-blue contexts (but cf. <xref ref-type="bibr" rid="B62">Ronai &amp; Xiang 2020 for a recent exception</xref>).</p>
<p>To see how the generalisation in (10) may explain van Tiel and colleagues&#8217; observation that only positive scalar words are associated with a B&amp;N effect, consider their target sentence for the positive scalar word &#8216;some&#8217; and its scalar inference in (11).</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(11)</td>
<td>Some of the socks are pink.</td>
</tr>
<tr>
<td>&#160;</td>
<td>&#10239; Not all of the socks are pink.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Participants who interpreted the target sentence literally only had to verify a positive sentence, whereas participants who arrived at a two-sided interpretation also had to verify the explicitly negative scalar inference. Given the generalisation in (11), we may expect that verifying the explicitly negative scalar inference leads to elevated response times compared to the literal interpretation. So, even if we assume that participants who arrived at a two-sided interpretation of the target sentence verified the literal interpretation and the scalar inference in parallel, it follows that they are expected to be significantly slowed down compared to participants who interpreted the sentence literally.</p>
<p>Now consider the target sentence for the implicitly negative scalar word &#8216;scarce&#8217; along with its scalar inference in (12).</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(12)</td>
<td>Red flowers are scarce.</td>
</tr>
<tr>
<td>&#160;</td>
<td>&#10239; Red flowers are not absent.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Participants who arrived at a literal interpretation of the target sentence had to verify an implicitly negative sentence, since &#8216;scarce&#8217; is implicitly negative. What about the scalar inference? Superficially, the scalar inference appears to involve a double negation in &#8216;not absent&#8217;. Hence, intuitively, one might suppose that the verification of the scalar inference should take longer than verifying the literal interpretation. However, van Tiel and colleagues contend that the scalar inference in this case is verified at least as fast as the literal interpretation. There are various arguments that may support this proposal.</p>
<p>First, it has been found that, in at least some cases, sentences containing two negative elements are processed more rapidly than the corresponding sentences with a single negative element. For example, Sherman (<xref ref-type="bibr" rid="B67">1976</xref>) found that participants were faster to verify sentences containing the double negation &#8216;no one doubted&#8217; (&#8216;doubt&#8217; is implicitly negative) than sentences containing just the single negative word &#8216;doubted&#8217; in an otherwise positive sentence. Hence, it could be that &#8216;not absent&#8217; is easier to verify than &#8216;scarce&#8217;. Second, it could be the case that the scalar inference is cognitively represented as a positive, i.e., as &#8216;Red flowers are present&#8217;. The double negation could be eliminated on the fly, or, perhaps more plausibly from a psychological standpoint, it could be that the positive form of the scalar inference is directly associated with the scalar word, similarly to Levinson&#8217;s (<xref ref-type="bibr" rid="B44">2000</xref>) defaultist proposal, so that participants who derive the scalar inference interpret the target sentence in (12) as &#8216;Red flowers are scarce but present&#8217;.</p>
<p>In any case, if we assume that the scalar inference in (12) can be verified at least as rapidly as the literal interpretation, and if we furthermore assume that participants who compute the scalar inference verify both the literal interpretation and the scalar inference in parallel, it follows that participants who computed the scalar inference should be equally fast as participants who arrived at a literal interpretation. One might even expect participants who derived the scalar inference to be faster than participants who arrived at the literal interpretation, since the positivity of the scalar inference seems to entail that it should be verified faster than the implicitly negative literal interpretation, and since the sentence may be judged false as soon as the scalar inference is verified. However, psycholinguistic studies have consistently shown that &#8216;false&#8217; responses to positive sentences are generally slightly delayed compared to &#8216;true&#8217; responses, which might mitigate that verification time advantage for positives relative to implicit negatives, and thus lead to roughly equal verification times for literal and two-sided interpretations (e.g., <xref ref-type="bibr" rid="B14">Clark &amp; Chase 1972</xref>).</p>
<p>This polarity-based explanation also harmonises with the previously discussed findings on indirect scalar inferences. To illustrate, (13) shows a target sentence from Cremers and Chemla&#8217;s (<xref ref-type="bibr" rid="B15">2014</xref>) study, as well as its scalar inference.</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(13)</td>
<td>Not all dogs are insects.</td>
</tr>
<tr>
<td>&#160;</td>
<td>&#10239; It&#8217;s not the case that not some dogs are insects.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Again, the proposal is that the scalar inference is verified more rapidly than the literal interpretation, either because the double negation is eliminated or because &#8216;not all&#8217; is statistically associated with &#8216;some&#8217; (rather than the equivalent &#8216;not not some&#8217;). The reason that (13) gave rise to the reverse B&amp;N effect&#8212;rather than the absence of any effect, as van Tiel and colleagues found for cases like &#8216;scarce&#8217;&#8212;is that the target sentence contains an explicit negation. As noted before, explicit negatives take longer to verify than implicit negatives. Hence, participants who accepted the target sentence had to verify the more time-consuming literal interpretation, whereas participants who arrived at the two-sided interpretation verified both the literal interpretation and the (positive) scalar inference in parallel. In the latter case, participants could respond with &#8216;false&#8217; as soon as they realised that the scalar inference was false, which took less time than verifying the literal interpretation and responding with &#8216;true&#8217;.</p>
<p><bold><italic><xref ref-type="table" rid="T2">Table 2</xref></italic></bold> succinctly summarises the predictions of the polarity-based explanation. If this explanation is on the right track, it would mean that the B&amp;N effect does not reflect a processing cost for scalar inferencing, since the scalar inferences of negative scalar words are not associated with a B&amp;N effect. Indeed, if correct, it would mean that the B&amp;N effect is only reflective of more general processing difficulties associated with the verification of negative information relative to positive information.</p>
<table-wrap id="T2">
<label>Table 2</label>
<caption>
<p>Predictions of the polarity-based explanation about the polarity properties of the literal interpretation and the scalar inference, and about the B&amp;N effect.</p>
</caption>
<table>
<tr>
<th colspan="4"><hr/></th>
</tr>
<tr>
<th align="left" valign="top"><italic>Scalar word</italic></th>
<th align="left" valign="top"><italic>Literal interpretation</italic></th>
<th align="left" valign="top"><italic>Scalar inference</italic></th>
<th align="left" valign="top"><italic>B&amp;N effect</italic></th>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">positive (e.g., &#8216;some&#8217;)</td>
<td align="left" valign="top">positive</td>
<td align="left" valign="top">expl. negative</td>
<td align="left" valign="top">present</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">impl. negative (e.g., &#8216;scarce&#8217;)</td>
<td align="left" valign="top">impl. negative</td>
<td align="left" valign="top">positive</td>
<td align="left" valign="top">absent</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
<tr>
<td align="left" valign="top">expl. negative (e.g., &#8216;not all&#8217;)</td>
<td align="left" valign="top">expl. negative</td>
<td align="left" valign="top">positive</td>
<td align="left" valign="top">reversed</td>
</tr>
<tr>
<td colspan="4"><hr/></td>
</tr>
</table>
</table-wrap>
<p>However, the current support for the polarity hypothesis is comparatively thin, consisting solely of the data for &#8216;low&#8217; and &#8216;scarce&#8217; (as well as perhaps earlier data on indirect scalar inferences). Moreover, in addition to being the only negative scalar words tested by van Tiel and colleagues, &#8216;low&#8217; and &#8216;scarce&#8217; were also the only <italic>adjectival</italic> scalar words they tested. This may have influenced the results in various ways, e.g., one can imagine that adjectival scales are less salient given the openness of the grammatical class (cf. <xref ref-type="bibr" rid="B77">van Tiel et al. 2016</xref>).</p>
<p>In this study, we test the hypothesis that only positive scalar words give rise to the B&amp;N effect in a more comprehensive and systematic way by investigating the processing of 16 adjectival scalar words of both positive and negative polarity. Rather than relying on one subjective diagnostic to classify scalar words in a binary way as either positive or negative, we combined the outcomes of four objectively measurable diagnostics for polarity to obtain a gradient measure of polarity. Consequently, we tested whether this gradient measure of polarity predicted the presence or absence of a B&amp;N effect. In the next section, we describe our study in more detail.</p>
</sec>
<sec>
<title>1.4 Our study</title>
<p>Our study tested 16 adjectival scales: &#10216;ajar, open&#10217;, &#10216;breezy, windy&#10217;, &#10216;chubby, fat&#10217;, &#10216;content, happy&#10217;, &#10216;cool, cold&#10217;, &#10216;drizzly, rainy&#10217;, &#10216;fair, good&#10217;, &#10216;low, empty&#10217;, &#10216;mediocre, bad&#10217;, &#10216;passable, good&#10217;, &#10216;ripe, overripe&#10217;, &#10216;scarce, absent&#10217;, &#10216;sleepy, asleep&#10217;, &#10216;unlikely, impossible&#10217;, &#10216;warm, hot&#10217;, and &#10216;youthful, young&#10217;. For each scale, we constructed a simple sentence containing the weaker scalar word, and, for each sentence, we created three images: a target image where the sentence was literally true but where its scalar inference was false, and two control images where the sentence was unambiguously true or false. See the Appendix for an overview of the sentences and images that we tested.</p>
<p>Participants in the experiment first saw the sentence. Once they finished reading the sentence, they pressed the space bar to see the image. At that point, they had to indicate whether they felt the sentence they had just read was a good or bad description of the corresponding image. We measured their verification times (i.e., the time between image onset and the point at which one of the response buttons was pressed) to establish the presence or absence of a B&amp;N effect, i.e., to determine whether or not verification times were slower for &#8216;false&#8217; than for &#8216;true&#8217; answers in the target condition, vis-&#224;-vis the control condition.</p>
<p>To test the polarity hypothesis&#8212;i.e., the idea that only positive scalar words give rise to the B&amp;N effect&#8212;we had to determine the polarity of the scalar words in our study. Here, we focus on the stronger word on the scale, since the polarity-based explanation crucially makes reference to the polarity of the negated alternative, though if the literature is right, all words on a scale should share the same polarity (i.e., the <italic>scalarity</italic> constraint, cf. <xref ref-type="bibr" rid="B21">Fauconnier 1975</xref>; <xref ref-type="bibr" rid="B35">Horn 1989</xref>). In the previous section, we discussed five diagnostics that can be used to determine the polarity of adjectives:</p>
<list list-type="roman-lower">
<list-item><p><italic>Scalarity</italic>: Positive words denote a lower bound on their measurement scale.</p></list-item>
<list-item><p><italic>Questions</italic>: Positive adjectives are neutral in &#8216;how&#8217; questions.</p></list-item>
<list-item><p><italic>Ratio</italic>: Positive adjectives are more felicitous in ratio phrases like &#8216;twice as&#8217;.</p></list-item>
<list-item><p><italic>Valence</italic>: Positive words are judged as having more positive connotations.</p></list-item>
<list-item><p><italic>Frequency</italic>: Positive words are more frequent than negative ones.</p></list-item>
</list>
<p>The first three diagnostics reflect the linguistic notion of polarity as markedness; the last two diagnostics reflect the psychological notion of polarity as subjective valence.</p>
<p>Van Tiel and colleagues focused on the scalarity criterion in their study. However, in our study, we wanted to avoid using this criterion for two reasons. First, not all of the scalar words that we tested make reference to a clearly identifiable measurement scale. For example, in the case of &#8216;open&#8217;, it is unclear whether the underlying measurement scale is about openness or closedness. Second, and relatedly, the scalarity criterion crucially relies on researchers&#8217; intuitions, which are not always reliable.</p>
<p>Rather than relying on one specific construal of polarity, or even one specific diagnostic measure, we made use of each of the remaining four diagnostics in the list. Unlike the scalarity criterion, these four diagnostics can be operationalised using objective data. We assume here that each of the four diagnostics offers an approximation of a fundamental latent construct of polarity, and that, by combining these diagnostics, we are able to obtain a relatively reliable estimate of that construct. Crucially, we assume that polarity is gradient rather than binary; that is, words can be positive or negative to varying degrees. In making this decision, we do not want to question the value of focusing on one specific construal of polarity, e.g., in terms of markedness. However, even in the extensive line of work focusing on linguistic polarity, it has been observed that there is no fail-proof way of establishing the polarity of a word that consistently accords with linguists&#8217; intuitions (e.g., <xref ref-type="bibr" rid="B60">Rett 2008</xref>; <xref ref-type="bibr" rid="B64">Sassoon 2010</xref>; <xref ref-type="bibr" rid="B29">Gotzner et al. 2018</xref>). By combining data from different diagnostics, we may mitigate potentially counterintuitive outcomes from any single diagnostic.</p>
<p>Hence, for each of the stronger scalar words in our experiment&#8212;as well as their corresponding antonyms&#8212;we obtained four measures, corresponding to the last four diagnostics in the list above: (i) their frequency in the phrase &#8216;how [adjective]&#8217;, (ii) their frequency in the phrases &#8216;twice as [adjective]&#8217; and &#8216;half as [adjective]&#8217;, (iii) their valence ratings as reported by Mohammad (<xref ref-type="bibr" rid="B48">2018</xref>), and (iv) their overall frequency. The corpus counts for (i), (ii), and (iv) were taken from the ENCOW16A corpus (<xref ref-type="bibr" rid="B66">Sch&#228;fer &amp; Bildhauer 2012</xref>; <xref ref-type="bibr" rid="B65">Sch&#228;fer 2015</xref>), and the counts for (i) and (ii) were relativised to the adjectives&#8217; overall frequency. The corpus frequencies were always logarithmised as a way of reducing skewness. The outcome of each measure for the stronger scalar word was divided by the outcome of that measure for its antonym. Thus, values greater than 1 indicate that the stronger scalar word was positive relative to its antonym; values between 0 and 1 indicate the reverse. These resulting ratio values are provided in <bold><italic><xref ref-type="table" rid="T3">Table 3</xref></italic></bold>.</p>
<table-wrap id="T3">
<label>Table 3</label>
<caption>
<p>Lexical scales tested in the experiment and antonyms of the stronger scalemate. <italic>Question</italic>: Relative frequency of &#8216;how [adjective]&#8217; in the ENCOW16A corpus (<xref ref-type="bibr" rid="B66">Sch&#228;fer &amp; Bildhauer 2012</xref>; <xref ref-type="bibr" rid="B65">Sch&#228;fer 2015</xref>). <italic>Ratio</italic>: Relative frequency of &#8216;twice as [adjective]&#8217; and &#8216;half as [adjective]&#8217; in the ENCOW16A corpus. <italic>Valence</italic>: Relative subjective valence rating (<xref ref-type="bibr" rid="B48">Mohammad 2018</xref>). <italic>Frequency</italic>: Relative overall frequency in the ENCOW16A corpus. <italic>Polarity</italic>: Polarity value based on the first principal component of a principal component analysis on the basis of the values in Question, Ratio, Valence, and Frequency. Missing values due to zero or singular counts were set to 1 and are italicised in the table. *Neither &#8216;overripe&#8217; nor &#8216;unripe&#8217; was tested by Mohammad (<xref ref-type="bibr" rid="B48">2018</xref>); we used the valence ratings for the words &#8216;rotten&#8217; and &#8216;raw&#8217; instead.</p>
</caption>
<table>
<tr>
<th colspan="7"><hr/></th>
</tr>
<tr>
<th align="left"><italic>Scale</italic></th>
<th align="left"><italic>Antonym</italic></th>
<th align="left"><italic>Question</italic></th>
<th align="left"><italic>Ratio</italic></th>
<th align="left"><italic>Valence</italic></th>
<th align="left"><italic>Frequency</italic></th>
<th align="left"><italic>Polarity</italic></th>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;ajar, open&#10217;</td>
<td align="left">closed</td>
<td align="left">1.52</td>
<td align="left"><italic>1.00</italic></td>
<td align="left">2.58</td>
<td align="left">1.09</td>
<td align="left">1.18</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;breezy, windy&#10217;</td>
<td align="left">calm</td>
<td align="left"><italic>1.00</italic></td>
<td align="left">0.38</td>
<td align="left">0.86</td>
<td align="left">0.74</td>
<td align="left">&#8211;0.99</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;chubby, fat&#10217;</td>
<td align="left">skinny</td>
<td align="left"><italic>1.00</italic></td>
<td align="left">0.59</td>
<td align="left">1.21</td>
<td align="left">1.28</td>
<td align="left">1.06</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;content, happy&#10217;</td>
<td align="left">sad</td>
<td align="left">1.10</td>
<td align="left">2.11</td>
<td align="left">4.44</td>
<td align="left">1.11</td>
<td align="left">2.18</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;cool, cold&#10217;</td>
<td align="left">hot</td>
<td align="left">0.99</td>
<td align="left">0.79</td>
<td align="left">0.81</td>
<td align="left">0.99</td>
<td align="left">&#8211;0.36</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;drizzly, rainy&#10217;</td>
<td align="left">dry</td>
<td align="left">0.35</td>
<td align="left"><italic>1.00</italic></td>
<td align="left">1.68</td>
<td align="left">0.81</td>
<td align="left">&#8211;1.27</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;fair, good&#10217;</td>
<td align="left">bad</td>
<td align="left">1.06</td>
<td align="left">1.13</td>
<td align="left">7.50</td>
<td align="left">1.11</td>
<td align="left">2.12</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;low, empty&#10217;</td>
<td align="left">full</td>
<td align="left">0.89</td>
<td align="left">1.00</td>
<td align="left">0.31</td>
<td align="left">0.87</td>
<td align="left">&#8211;1.57</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;mediocre, bad&#10217;</td>
<td align="left">good</td>
<td align="left">0.94</td>
<td align="left">0.88</td>
<td align="left">0.13</td>
<td align="left">0.90</td>
<td align="left">&#8211;0.69</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;passable, good&#10217;</td>
<td align="left">bad</td>
<td align="left">1.06</td>
<td align="left">1.13</td>
<td align="left">7.50</td>
<td align="left">1.11</td>
<td align="left">2.12</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;ripe, overripe&#10217;</td>
<td align="left">unripe</td>
<td align="left"><italic>1.00</italic></td>
<td align="left"><italic>1.00</italic></td>
<td align="left">0.72*</td>
<td align="left">0.96</td>
<td align="left">&#8211;0.28</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;scarce, absent&#10217;</td>
<td align="left">present</td>
<td align="left">0.69</td>
<td align="left"><italic>1.00</italic></td>
<td align="left">0.24</td>
<td align="left">0.80</td>
<td align="left">&#8211;1.31</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;sleepy, asleep&#10217;</td>
<td align="left">awake</td>
<td align="left">0.82</td>
<td align="left"><italic>1.00</italic></td>
<td align="left">0.91</td>
<td align="left">1.04</td>
<td align="left">&#8211;0.03</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;unlikely, impossible&#10217;</td>
<td align="left">possible</td>
<td align="left">1.12</td>
<td align="left">0.00</td>
<td align="left">0.22</td>
<td align="left">0.88</td>
<td align="left">&#8211;1.83</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;warm, hot&#10217;</td>
<td align="left">cold</td>
<td align="left">1.00</td>
<td align="left">1.27</td>
<td align="left">1.23</td>
<td align="left">1.01</td>
<td align="left">0.46</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;youthful, young&#10217;</td>
<td align="left">old</td>
<td align="left">0.85</td>
<td align="left">0.20</td>
<td align="left">1.98</td>
<td align="left">0.97</td>
<td align="left">&#8211;0.85</td>
</tr>
<tr>
<td colspan="7"><hr/></td>
</tr>
</table>
</table-wrap>
<p>Next, we carried out a principal component analysis based on these ratio values.<xref ref-type="fn" rid="n1">1</xref> Principal component analyses are commonly used when trying to extract values from a latent parameter (in the case at hand: polarity) based on values from observable parameters that are assumed to approximate the latent parameter (in the case at hand: the values from the four diagnostics). A principal component analysis allows us to reduce the values from the observable parameters to a single value (i.e., the first principal component) in such a way that as much of the variance in the observable parameters is accounted for as possible. In our case, the first principal component explained 47% of the variance.</p>
<p>The values on the first principal component are shown in <bold><italic><xref ref-type="table" rid="T3">Table 3</xref></italic></bold>. Positive values stand for positive polarity; negative values for negative polarity. Encouragingly, values from the first principal component generally accord with our intuitions. &#8216;Happy&#8217;, &#8216;good&#8217;, and &#8216;open&#8217; were assigned positive polarity values, since they received positive values on all four diagnostics; &#8216;rainy&#8217;, &#8216;windy&#8217;, and &#8216;absent&#8217; received negative polarity values. Less clear-cut cases such as &#8216;overripe&#8217;, &#8216;asleep&#8217;, and &#8216;hot&#8217; were somewhere in between.</p>
<p>We used the values from the first principal component to test the polarity hypothesis, which predicts that the B&amp;N effect should interact with polarity. Recall that the B&amp;N effect consists of slower verification times when responding &#8216;false&#8217; than &#8216;true&#8217; in the target condition vis-&#224;-vis the control condition; i.e., the B&amp;N effect consists in an interaction effect on verification times between condition (target vs. control) and response (&#8216;true&#8217; vs. &#8216;false&#8217;). Hence, the polarity hypothesis predicts a significant three-way interaction between condition (target vs. control), response (&#8216;true&#8217; vs. &#8216;false&#8217;), and polarity, such that the relative increase in verification times for &#8216;false&#8217; compared to &#8216;true&#8217; responses in the target condition increases with increasing polarity of the adjective.</p>
<p>The next section describes our experiment in more detail, followed by the results. All data and analysis files can be accessed at <italic><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://osf.io/wxmeq/">https://osf.io/wxmeq/</ext-link></italic>.</p>
</sec>
</sec>
<sec>
<title>2 The experiment</title>
<sec>
<title>2.1 Participants</title>
<p>50 participants were recruited on Amazon&#8217;s Mechanical Turk. 20 participants were female, the remaining 30 male. Participants&#8217; mean age was 39 (standard deviation: 11, range: 22&#8211;69). All participants indicated that they were native speakers of English. Participants were paid $1.50 for their participation.</p>
</sec>
<sec>
<title>2.2 Materials</title>
<p>As mentioned above, the materials consisted of 16 adjectival scales. For each scale, we created a simple sentence containing the weaker scalar word. For each sentence, we created three types of images: one image where the sentence was unambiguously true, one image where it was unambiguously false, and one image where the sentence was true on its literal interpretation but false if its scalar inference was derived. <bold><italic><xref ref-type="table" rid="T3">Table 3</xref></italic></bold> shows the lexical scales that were tested; the Appendix provides the sentences and images used in the experiment.</p>
<p>The materials were pretested in two experiments with 25 participants each. Based on these pretests, we made several adjustments to the sentences and images to ensure that participants responded as expected, i.e., rejected the sentence in the false control condition, accepted it in the true control condition, and vacillated between accepting and rejecting the sentence in the target condition. (Here, vacillation was defined as significantly fewer &#8216;true&#8217; responses than in the true control condition and significantly fewer &#8216;false&#8217; responses than in the false control condition.)</p>
<p>The experiment presented each sentence-image pair three times, and thus comprised 16&#215;3&#215;3 = 144 trials in total. The order of presentation was randomised for each participant.</p>
</sec>
<sec>
<title>2.3 Procedure</title>
<p>Participants were instructed to indicate whether or not the sentence was a good description of the image. They could register their judgement by pressing either &#8216;1&#8217; (to answer in the positive) or &#8216;0&#8217; (to answer in the negative) on their keyboard. Trials started with the presentation of the sentence. Upon pressing the space bar, the sentence was replaced by the image, whereupon participants could give their truth judgements. We measured the time from image onset to button press.</p>
</sec>
<sec>
<title>2.4 Data treatment</title>
<p>3 participants were removed from the analyses because their accuracy on control items was below 80%. In addition, we removed trials with a verification time faster than 200 milliseconds or slower than 10 seconds, assuming that these correspond to accidental button presses or inattentiveness to the task at hand. This resulted in the removal of 14 trials (less than 0.1% of the data).</p>
</sec>
<sec>
<title>2.5 Results</title>
<p><bold><italic><xref ref-type="fig" rid="F1">Figure 1</xref></italic></bold> shows the percentages of &#8216;true&#8217; responses for each scalar word and condition. Performance in the control condition was close to ceiling. In the &#8216;true&#8217; control condition, performance ranged from 87% for &#8216;warm&#8217; to 100% for &#8216;sleepy&#8217; and &#8216;content&#8217;. In the &#8216;false&#8217; control condition, performance ranged from 86% for &#8216;unlikely&#8217; to 100% for &#8216;ajar&#8217; and &#8216;youthful&#8217;. In the target condition, the percentage of &#8216;true&#8217; responses ranged from 16% for &#8216;mediocre&#8217; to 80% for &#8216;youthful&#8217;.</p>
<fig id="F1">
<label>Figure 1</label>
<caption><p>Percentage of &#8216;true&#8217; responses for each scalar word and condition. Error bars represent standard errors of the mean.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="/article/id/5412/file/61796/"/>
</fig>
<p>One of our reviewers rightly observed that the percentage of &#8216;true&#8217; responses in the target condition for &#8216;mediocre&#8217; (16%) was so close to the percentage of errors in the &#8216;false&#8217; control condition (11%) for that scalar word that one might call into question our assumption that the &#8216;not bad&#8217; inference is a bona fide scalar inference rather than being an aspect of the lexical meaning of &#8216;mediocre&#8217;. However, given that &#8216;mediocre&#8217; behaved in line with our assumption in the pretest (26% &#8216;true&#8217; responses in the target condition vs. 7% errors in the &#8216;false&#8217; control condition), we decided to retain this item in our analyses. We want to emphasise that removing &#8216;mediocre&#8217; does not have any noteworthy consequences for the analyses that we report below.</p>
<p>Next, we considered participants&#8217; verification times. <bold><italic><xref ref-type="fig" rid="F2">Figure 2</xref></italic></bold> shows the mean logarithmised verification times for aggregated positive and negative scalar words. Here, we classified a scalar word as positive if its sign on the first principal component was positive; otherwise as negative (cf. <bold><italic><xref ref-type="table" rid="T3">Table 3</xref></italic></bold>). <bold><italic><xref ref-type="fig" rid="F2">Figure 2</xref></italic></bold> shows the B&amp;N effect for positive scalar words, but not for negative ones. Note that, as discussed previously, in the statistical analyses, we treated polarity as a continuous rather than binary factor; this visualisation is only intended to summarise our continuous polarity measure. <bold><italic><xref ref-type="fig" rid="F3">Figure 3</xref></italic></bold> shows the mean logarithmised verification times for each scalar word separately.</p>
<fig id="F2">
<label>Figure 2</label>
<caption><p>Mean logarithmised verification times for positive and negative scalar words. Error bars represent standard errors of the mean.</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="/article/id/5412/file/61797/"/>
</fig>
<fig id="F3">
<label>Figure 3</label>
<caption><p>Mean logarithmised verification times for each scalar word. Scalar words are ordered from most positive (top left) to most negative (bottom right).</p></caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="/article/id/5412/file/61798/"/>
</fig>
<p>To test the polarity hypothesis, we constructed a linear regression mixed effects model predicting logarithmised response times based on condition (target vs. control), response (&#8216;true&#8217; vs. &#8216;false&#8217;), polarity, and their interactions, including random intercepts for participants and scalar words. For all analyses, degrees of freedom and corresponding <italic>p</italic>-values were estimated using the Satterthwaite procedure, as implemented in the &#8216;lmerTest&#8217; package (<xref ref-type="bibr" rid="B41">Kuznetsova et al. 2013</xref>). The model also included response bias (i.e., the proportion of &#8216;true&#8217; responses in the target condition) and trial number (linearly rescaled to the [0, 1] interval) as main effects. A response bias could lead to slower non-modal responses; trial number was included to capture learning effects. The model showed a highly significant interaction between condition, response, and polarity (<italic>b</italic> = 0.09, <italic>SE</italic> = 0.01, <italic>t</italic> = 6.00, <italic>p</italic> &lt; .001). This interaction confirmed that polarity modulated the presence or absence of the B&amp;N effect. There was no significant effect of response bias (<italic>b</italic> = 0.02, <italic>SE</italic> = 0.02, <italic>t</italic> &lt; 1); however, there was a significant effect of trial number (<italic>b</italic> = &#8211;0.50, <italic>SE</italic> = 0.02, <italic>t</italic> = 32.46, <italic>p</italic> &lt; .001) indicating that participants responded increasingly more rapidly throughout the experiment.</p>
<p>To also obtain a more fine-grained picture of the scope of the polarity hypothesis, we checked, for each scalar word separately, whether a B&amp;N effect was present or not. To this end, for each scalar word, we constructed a linear regression mixed effects model predicting logarithmised response times on the basis of condition, response, their interaction, and trial number, including random intercepts for participants with random slopes for condition and response (but not their interaction). We observed significant interaction effects for &#8216;content&#8217;, &#8216;fair&#8217;, &#8216;passable&#8217;, &#8216;ajar&#8217;, &#8216;chubby&#8217;, &#8216;warm&#8217;, and &#8216;youthful&#8217; (see <bold><italic><xref ref-type="table" rid="T4">Table 4</xref></italic></bold> for the full model parameters).</p>
<table-wrap id="T4">
<label>Table 4</label>
<caption>
<p>Parameters of the interaction effect between condition (target vs. control) and response (&#8216;true&#8217; vs. &#8216;false&#8217;) for each lexical scale. The scales are ordered based on their estimated polarity value (<italic>Polarity</italic>, cf. <bold><italic><xref ref-type="table" rid="T3">Table 3</xref></italic></bold>).</p>
</caption>
<table>
<tr>
<th colspan="6"><hr/></th>
</tr>
<tr>
<th align="left" valign="top" rowspan="3"><italic>Scale</italic></th>
<th align="center" valign="top" rowspan="3"><italic>Polarity</italic></th>
<th align="center" valign="top" colspan="4"><italic>Interaction effect</italic></th>
</tr>
<tr>
<th colspan="4"><hr/></th>
</tr>
<tr>
<th align="center"><italic>b</italic></th>
<th align="center"><italic>SE</italic></th>
<th align="center"><italic>t</italic></th>
<th align="center"><italic>p</italic></th>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;content, happy&#10217;</td>
<td align="right">2.18</td>
<td align="right">&#8211;0.35</td>
<td align="right">0.10</td>
<td align="right">&#8211;3.39</td>
<td align="left">.001**</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;fair, good&#10217;</td>
<td align="right">2.12</td>
<td align="right">&#8211;0.43</td>
<td align="right">0.08</td>
<td align="right">&#8211;5.14</td>
<td align="left">.000***</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;passable, good&#10217;</td>
<td align="right">2.12</td>
<td align="right">&#8211;0.30</td>
<td align="right">0.09</td>
<td align="right">&#8211;3.28</td>
<td align="left">.001**</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;ajar, open&#10217;</td>
<td align="right">1.18</td>
<td align="right">&#8211;0.19</td>
<td align="right">0.08</td>
<td align="right">&#8211;2.48</td>
<td align="left">.014*</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;chubby, fat&#10217;</td>
<td align="right">1.06</td>
<td align="right">&#8211;0.33</td>
<td align="right">0.09</td>
<td align="right">&#8211;3.85</td>
<td align="left">.000***</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;warm, hot&#10217;</td>
<td align="right">0.46</td>
<td align="right">&#8211;0.28</td>
<td align="right">0.09</td>
<td align="right">&#8211;3.14</td>
<td align="left">.002**</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;sleepy, asleep&#10217;</td>
<td align="right">&#8211;0.03</td>
<td align="right">&#8211;0.11</td>
<td align="right">0.10</td>
<td align="right">&#8211;1.20</td>
<td align="left">.235</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;ripe, overripe&#10217;</td>
<td align="right">&#8211;0.28</td>
<td align="right">0.19</td>
<td align="right">0.10</td>
<td align="right">1.96</td>
<td align="left">.054</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;cool, cold&#10217;</td>
<td align="right">&#8211;0.36</td>
<td align="right">&#8211;0.07</td>
<td align="right">0.10</td>
<td align="right">&#8211;0.73</td>
<td align="left">.465</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;mediocre, bad&#10217;</td>
<td align="right">&#8211;0.69</td>
<td align="right">&#8211;0.05</td>
<td align="right">0.09</td>
<td align="right">&#8211;0.57</td>
<td align="left">.567</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;youthful, young&#10217;</td>
<td align="right">&#8211;0.85</td>
<td align="right">&#8211;0.24</td>
<td align="right">0.10</td>
<td align="right">&#8211;2.33</td>
<td align="left">.025*</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;breezy, windy&#10217;</td>
<td align="right">&#8211;0.99</td>
<td align="right">&#8211;0.04</td>
<td align="right">0.08</td>
<td align="right">&#8211;0.51</td>
<td align="left">.613</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;drizzly, rainy&#10217;</td>
<td align="right">&#8211;1.27</td>
<td align="right">&#8211;0.10</td>
<td align="right">0.08</td>
<td align="right">&#8211;1.24</td>
<td align="left">.216</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;scarce, absent&#10217;</td>
<td align="right">&#8211;1.31</td>
<td align="right">0.08</td>
<td align="right">0.08</td>
<td align="right">0.94</td>
<td align="left">.349</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;low, empty&#10217;</td>
<td align="right">&#8211;1.57</td>
<td align="right">&#8211;0.13</td>
<td align="right">0.08</td>
<td align="right">&#8211;1.71</td>
<td align="left">.089</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
<tr>
<td align="left">&#10216;unlikely, impossible&#10217;</td>
<td align="right">&#8211;1.83</td>
<td align="right">0.05</td>
<td align="right">0.09</td>
<td align="right">0.56</td>
<td align="left">.577</td>
</tr>
<tr>
<td colspan="6"><hr/></td>
</tr>
</table>
</table-wrap>
<p>Taken together, these results confirm the polarity hypothesis: the presence or absence of a B&amp;N effect was modulated by the polarity of the scalar words. More specifically, all of the positive scalar words gave rise to a B&amp;N effect, while almost none of the negative scalar words did. The only exception to this rule was &#8216;youthful&#8217;, which was associated with a B&amp;N effect despite being assigned a (slightly) negative polarity value (&#8211;0.85).</p>
</sec>
</sec>
<sec>
<title>3 General discussion</title>
<sec>
<title>3.1 Summary</title>
<p>Pragmatic theories make conflicting predictions about the processing of scalar inferences in out-of-the-blue contexts. Relevance theory predicts that, in such contexts, the literal interpretation should be easier to retrieve than an interpretation that is enriched with a scalar inference. By contrast, Levinson (<xref ref-type="bibr" rid="B44">2000</xref>) predicts that it is the literal interpretation that should incur a processing cost, since it involves overturning the default enriched interpretation.</p>
<p>In a seminal study, Bott &amp; Noveck (<xref ref-type="bibr" rid="B3">2004</xref>) found that the computation of the scalar inference of &#8216;some&#8217; implying &#8216;not all&#8217; was associated with increased sentence verification times. This <italic>B&amp;N effect</italic> seems to provide strong evidence for the relevance-theoretic idea that the derivation of scalar inferences without a facilitating context is cognitively costly. However, more recent studies observed that the B&amp;N effect does not consistently generalise beyond the &#10216;some, all&#10217; scale, which begs the question whether the B&amp;N effect is really caused by the processing of the scalar inference (e.g., <xref ref-type="bibr" rid="B9">Chevallier et al. 2010</xref>; <xref ref-type="bibr" rid="B61">Romoli &amp; Schwarz 2015</xref>; <xref ref-type="bibr" rid="B75">van Tiel et al. 2019b</xref>). To explain these findings, van Tiel and colleagues (<xref ref-type="bibr" rid="B75">2019b</xref>) hypothesised that the presence or absence of a B&amp;N effect depends on the <italic>polarity</italic> of the scalar word.</p>
<p>The polarity-based explanation proceeds from the observation that verification times vary with the polarity of the sentence (e.g., <xref ref-type="bibr" rid="B14">Clark &amp; Chase 1972</xref>; <xref ref-type="bibr" rid="B6">Carpenter &amp; Just 1975</xref>). In particular, positive sentences are verified faster than sentences containing an implicitly negative word (e.g., &#8216;low&#8217;), and the latter are verified faster than sentences containing an explicit negation (e.g., &#8216;not all&#8217;). Correspondingly, given that all words on a scale have the same polarity, there are essentially three options: the words on a lexical scale may be positive, inherently negative, or explicitly negative. Hence, given that scalar inferences consist in the negation of the stronger scalar word, the polarity of the scalar inference is either negative (for positive scalar words) or doubly negative (for inherently or explicitly negative scalar expressions). Crucially, the polarity-based explanation argues that the latter are easier to verify than the former.</p>
<p>Various explanations for this potentially controversial assumption may be given. First, it could be that certain propositions containing a double negation are in fact easier to process than propositions with a single negation. Second, it could be that participants eliminate the double negation on the fly as they encounter it. Both of these explanations can be empirically tested by asking participants to verify the doubly negated scalar inferences (e.g., &#8216;The battery is not empty&#8217;) and comparing the verification times with those for the literal interpretation of the target sentence (i.e., &#8216;The battery is low&#8217;). If either of the foregoing explanations is on the right track, &#8216;false&#8217; responses to the negated scalar inference should be at least as fast as &#8216;true&#8217; responses to the target sentence.</p>
<p>However, concerning the former explanation, prior research has shown that, in many cases, verification times increase with the number of negations (<xref ref-type="bibr" rid="B67">Sherman 1976</xref>). Hence, from a psychological perspective, perhaps the most plausible explanation is the latter: the positive form of the scalar inference is directly associated with its triggering expressions. According to this explanation, from a cognitive standpoint, the derivation of scalar inferences does not involve nonce construction and negation of alternatives, but rather resembles a form of disambiguation (cf. also <xref ref-type="bibr" rid="B46">Marty &amp; Chemla 2013 for relevant comments about the parallelism between scalar inferencing and disambiguation</xref>).</p>
<p>If, finally, we assume that participants who arrive at a two-sided interpretation verify the literal interpretation and the scalar inference in parallel, the correct predictions follow straightforwardly: positive scalar words give rise to the B&amp;N effect, inherently negative scalar words do not, and explicitly negated scalar words lead to the reverse B&amp;N effect.</p>
<p>We extensively and systematically tested this polarity-based explanation by comparing the processing of 16 adjectival scalar inferences using a sentence-picture verification task. We estimated the polarity of the scalar words in our sample (and, hence, of the corresponding scalar inferences) on the basis of four diagnostics measuring their linguistic markedness and psychological valence. We found that the presence or absence of a B&amp;N effect was strongly dependent on the polarity of the lexical scale. Indeed, of the 7 lexical scales whose inferences led to increased verification times, 6 were estimated to be positive. The sole exception was &#10216;youthful, young&#10217;, which was associated with a B&amp;N effect despite being classified as (somewhat) negative.</p>
<p>One interesting observation for this particular scale is that the valence criterion &#8220;correctly&#8221; classified this scale as positive rather than negative (i.e., in accordance with the behaviour of &#8216;youthful&#8217; in the experiment, and in contrast to its estimated polarity). Hence, one may jump to the conclusion that the valence criterion generally offers a better measure of polarity than the other diagnostics. However, this does not hold true across the board. For example, &#8216;rainy&#8217; also had positive valence ratings relative to &#8216;dry&#8217;, but the &#10216;drizzly, rainy&#10217; scale was not associated with a processing cost.</p>
<p>Another scalar word that merits some further discussion is &#8216;unlikely&#8217;. &#8216;Unlikely&#8217; was the only scalar word in our sample that was explicitly marked for negativity by means of the negative prefix &#8216;un-&#8217;. Hence, one might expect to find a reverse B&amp;N effect for this particular scalar word, i.e., one might expect that it patterns with explicitly negative scalar constructions like &#8216;not all&#8217; rather than with implicitly negative scalar words like &#8216;low&#8217;. This prediction was not borne out. However, on closer inspection, this finding is not so surprising. To explain, consider the hierarchy of negation proposed by Fodor and colleagues (<xref ref-type="bibr" rid="B22">1975</xref>). According to Fodor and colleagues, negativity may have various sources. Ranging from &#8220;most negative&#8221; to &#8220;least negative&#8221;, these are as follows:</p>
<list list-type="roman-lower">
<list-item><p>Explicitly negative free morpheme (e.g., &#8216;not&#8217;).</p></list-item>
<list-item><p>Explicitly negative bound morpheme (e.g., &#8216;un-&#8217;).</p></list-item>
<list-item><p>Implicitly negative free morpheme (e.g., &#8216;low&#8217;).</p></list-item>
<list-item><p>Free morphemes that are defined in negative terms (e.g., &#8216;bachelor&#8217; meaning someone who is <italic>not</italic> married, or &#8216;kill&#8217; meaning causing someone to <italic>not</italic> be alive).</p></list-item>
</list>
<p>&#8216;Unlikely&#8217; is of class <italic>ii</italic>, whereas the other negative scalar words that we tested are all of class <italic>iii</italic>. Crucially, however, in terms of cognitive processing, Fodor and colleagues report that negative words of class <italic>ii</italic> pattern with implicitly negative words from class <italic>iii</italic>, rather than with the explicitly negative ones from class <italic>i</italic>.</p>
<p>Taken together, then, it is clear that there is a strong connection between polarity and the B&amp;N effect. Perhaps most forcefully, it seems difficult to explain why the scalar inference of &#8216;warm&#8217; but not &#8216;cool&#8217; was associated with a processing cost without appealing to the notion of polarity, especially given that the sentences and images used for these scalar words were so similar (the images showed transparent drinking glasses containing water at different temperatures, from a block of ice to vigorously boiling, cf. Appendix). Hence, we view our results as strong support for the polarity-based explanation.</p>
</sec>
<sec>
<title>3.2 Processing scalar inferences</title>
<p>Crucially, if the polarity-based explanation is correct, the classic observation that certain scalar inferences lead to increased verification times is not reflective of any processing cost for scalar inferencing, but rather reflects the psychological difficulty of verifying negative information. Indeed, the polarity-based explanation leads us to conclude that scalar inferencing itself is not associated with a processing cost, even in the absence of a facilitating context, contra, e.g., relevance theory. At the same time, however, our results also fail to support the defaultist idea that scalar inferences arise temporally prior to the literal interpretation; nowhere did we observe faster processing times when people computed the scalar inference.</p>
<p>Rather, it seems that scalar inferences are conventionally or statistically associated with their triggering expressions. That is, when hearing an utterance containing the weaker scalar word, people may immediately activate the corresponding scalar inference at no processing cost. However, when verifying this scalar inference, a processing cost may ensue if the scalar inference is negative, since negative information takes longer to be verified (e.g., <xref ref-type="bibr" rid="B14">Clark &amp; Chase 1972</xref>; <xref ref-type="bibr" rid="B6">Carpenter &amp; Just 1975</xref>).</p>
<p>One might suppose that this explanation is not very &#8220;Gricean&#8221; in spirit. Note, however, that Grice (<xref ref-type="bibr" rid="B31">1975</xref>) himself acknowledged the possibility that conversational implicatures are &#8220;intuitively grasped&#8221; (p. 50). What is crucial for Grice is whether or not this intuition is in principle &#8220;replaceable by an argument&#8221; (ibid.) that takes the literal meaning of the utterance and the assumption of cooperativity as its premises (see also <xref ref-type="bibr" rid="B26">Geurts &amp; Rubio-Fern&#225;ndez 2015</xref>). However, such arguments should not necessarily be construed as psychologically real, i.e., hearers presumably do not actually construct such an argument every time they encounter a scalar word. Rather, the Gricean calculations provide a rational grounding of the inferences that hearers are entitled to derive (<xref ref-type="bibr" rid="B40">Kissine 2016</xref>; <xref ref-type="bibr" rid="B25">Geurts 2019</xref>; <xref ref-type="bibr" rid="B16">D&#228;nzer 2020</xref>).</p>
<p>Our proposal makes some empirically testable predictions. In particular, it is predicted that other experimental paradigms that make use of sentence verification should also be susceptible to the polarity effect: if the proposition to be verified contains negative information, its verification should be cognitively costly. One such paradigm that relies on sentence verification was introduced by De Neys &amp; Schaeken (<xref ref-type="bibr" rid="B17">2007</xref>). Their study essentially mirrored Bott and Noveck&#8217;s (<xref ref-type="bibr" rid="B3">2004: Exp. 3</xref>) in that participants gave intuitive truth judgements to underinformative sentences like (14). Crucially, however, De Neys and Schaeken required participants to memorise dot patterns of varying complexity during the process of sentence verification.</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(14)</td>
<td>Some dogs are mammals.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>De Neys and Schaeken found that participants were less likely to respond &#8216;false&#8217;, i.e., to derive the scalar inference, when they had to memorise complex dot patterns compared to simple ones (cf. also <xref ref-type="bibr" rid="B17">De Neys &amp; Schaeken 2007</xref>; <xref ref-type="bibr" rid="B20">Dieussaert et al. 2011</xref>; <xref ref-type="bibr" rid="B46">Marty &amp; Chemla 2013</xref>; <xref ref-type="bibr" rid="B76">van Tiel et al. 2019a</xref>; <xref ref-type="bibr" rid="B11">Cho 2020</xref>; <xref ref-type="bibr" rid="B47">Marty et al. 2020</xref>). We refer to this finding as the <italic>D&amp;S effect</italic>.</p>
<p>De Neys and Schaeken explain the D&amp;S effect based on the premise that the derivation of scalar inferences is associated with a processing cost. According to their explanation, participants who had to memorise complex dot patterns had fewer cognitive resources available to derive the scalar inference, and consequently were less likely to carry out the derivation process. However, if the polarity-based explanation is on the right track, the D&amp;S effect could also be modulated by the polarity of scalar words.</p>
<p>Interestingly, van Tiel and colleagues (<xref ref-type="bibr" rid="B75">2019b</xref>) also carried out a working memory load experiment for the same seven lexical scales that they tested in the sentence picture verification task that we discussed in the introduction. They found that, whereas all positive scalar words were associated with a D&amp;S effect, the two negative scalar words in their sample&#8212;i.e., &#8216;low&#8217; and &#8216;scarce&#8217;&#8212;were not. This observation suggests that the D&amp;S effect is also susceptible to polarity in the same way as the B&amp;N effect.</p>
<p>However, in a more recent study, Marty and colleagues (<xref ref-type="bibr" rid="B47">2020</xref>) provide provocative evidence against this conclusion by showing that the D&amp;S effect is attested for indirect scalar inferences. For example, they found that, in displays showing only green apples, participants were less likely to reject sentences such as (15) when they had to memorise complex patterns than when they had to memorise simple ones.</p>
<table-wrap>
<table content-type="example">
<tbody>
<tr>
<td>(15)</td>
<td>Not all of the apples are red.</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>To explain this pattern of results, Marty and colleagues (following <xref ref-type="bibr" rid="B46">Marty &amp; Chemla 2013</xref>) argue that a processing cost emerges when participants make the cognitively costly decision to go beyond the literal interpretation. However, in line with the polarity-based explanation, they also hold that the process of deriving the scalar inference (i.e., the construction and rejection of alternatives) proceeds without any processing cost.</p>
<p>These findings paint a complex picture that obviously calls for a more detailed discussion than we can offer here, but they clearly show that, for other measures, too, there has been debate about whether the locus of the alleged processing cost is in the process of scalar inferencing or in some other relevant cognitive process (e.g., <xref ref-type="bibr" rid="B46">Marty &amp; Chemla 2013</xref>; <xref ref-type="bibr" rid="B57">Politzer-Ahles &amp; Husband 2018</xref>; <xref ref-type="bibr" rid="B72">Sun &amp; Breheny 2019</xref>). This contribution fits into that line of work in showing that polarity is one of the factors influencing verification times in sentence verification tasks.</p>
<p>It is an open question whether polarity also influences other measures of processing cost, e.g., those involving reading times and eye movements. There is at least some evidence indicating that these measures, too, are influenced by the polarity of a sentence. For example, Glenberg et al. (<xref ref-type="bibr" rid="B28">1999</xref>) report longer reading times for negative sentences. Similarly, Tian et al. (<xref ref-type="bibr" rid="B73">2010</xref>) report that eye fixations to the correct image in a visual world paradigm are delayed for negative sentences compared to positive ones. However, both of these studies focus on sentences containing the explicit sentential negation &#8216;not&#8217;, rather than the implicitly negative words that we studied here. Hence, as it stands, it is unclear whether polarity has any explanatory role for experimental studies on the processing of scalar inferences using reading times and eye-tracking. Future work will have to determine whether polarity has pervasively influenced the experimental literature on scalar inference processing, or whether the effect is restricted to sentence verification as we have tacitly assumed throughout this paper.</p>
</sec>
<sec>
<title>3.3 Polarity</title>
<p>To estimate the polarity of the scalar words in our sample, we combined insights from linguistics and psychology. In linguistics, polarity is usually construed in terms of markedness; in psychology, in terms of subjective valence. We obtained measures of both construals, and used those to estimate a hypothesised latent construct of polarity. Here, we depart from (and hopefully improve on) prior research, particularly in linguistics.</p>
<p>Much linguistic research is premised on the idea that polarity is a binary notion, i.e., in any antonym pair, one is positive and the other one negative (e.g., <xref ref-type="bibr" rid="B63">Ruytenbeek et al. 2017</xref>; <xref ref-type="bibr" rid="B29">Gotzner et al. 2018</xref>). This approach regularly leads to an aporia. For example, Sassoon (<xref ref-type="bibr" rid="B64">2010</xref>) sought to determine polarity by looking at the frequency of antonym pairs in the &#8216;twice as [adjective]&#8217; frame. In line with her hypothesis, intuitively positive adjectives tended to be more frequent in such frames than negative ones. However, there were ample exceptions. Thus, Sassoon found that &#8216;twice as bad&#8217; was more frequent than &#8216;twice as good&#8217;, although she intuited that &#8216;good&#8217; is positive and &#8216;bad&#8217; is negative.</p>
<p>We observed many similar conflicts between diagnostics (cf. <bold><italic><xref ref-type="table" rid="T3">Table 3</xref></italic></bold>). For example, in Mohammad&#8217;s (<xref ref-type="bibr" rid="B48">2018</xref>) subjective valence study, &#8216;rainy&#8217; was rated as more positive than &#8216;dry&#8217;, suggesting that &#8216;rainy&#8217; is positive and &#8216;dry&#8217; negative. However, the other diagnostics suggested the opposite conclusion. As one of our reviewers suggested, the &#8220;aberrant&#8221; outcome in terms of subjective valence could be due to the fact that &#8216;dry&#8217; is highly polysemous, and that its subjective valence depends on which meaning is selected. For example, dry weather is generally considered positive, while dry bread is clearly negative. Hence, it could be that participants in Mohammad&#8217;s study tended to construe &#8216;dry&#8217; in the latter negative sense rather than the former.</p>
<p>One way of resolving such clashes between diagnostics is by incorporating the results of multiple diagnostics into the estimation of a gradient measure of polarity. We hope this approach finds a following in linguistic and psychological research on polarity.</p>
</sec>
<sec>
<title>3.4 Conclusion</title>
<p>Perhaps above all, our results emphasise the importance of testing broader samples of scalar words in research on scalar inferences&#8212;not just to determine whether psychological effects generalise across the entire family of scalar words, but also because it offers an insight into the structural linguistic constructs that underlie language processing. Thus, our study has shown the psychological relevance of the notion of polarity. We hope our research will inspire others to revisit the many interesting findings that have been reported on the scalar inference of &#8216;some&#8217; to see if they generalise and, if not, what factors may explain the observed scalar diversity, so that we may come to a better understanding of the cognitive processes that underlie the derivation of scalar inferences&#8212;and ultimately pragmatic inferences more generally.</p>
</sec>
</sec>
<sec>
<title>Data Accessibility Statement</title>
<p>All data and analysis files can found at <italic><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://osf.io/wxmeq/">https://osf.io/wxmeq/</ext-link></italic>.</p>
</sec>
<sec sec-type="supplementary-material">
<title>Additional File</title>
<p>The additional file for this article can be found as follows:</p>
<supplementary-material id="S1" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="https://doi.org/10.5334/gjgl.1457.s1">
<!--[<inline-supplementary-material xlink:title="local_file" xlink:href="gjgl-6-1457-s1.pdf">gjgl-6-1457-s1.pdf</inline-supplementary-material>]-->
<label>Appendix</label>
<caption>
<p>The Appendix shows the sentences and images used in the experiment. DOI: <italic><uri>https://doi.org/10.5334/gjgl.1457.s1</uri></italic></p>
</caption>
</supplementary-material>
</sec>
</body>
<back>
<fn-group>
<fn id="n1"><p>This excellent idea was suggested to us by one of our anonymous reviewers.</p></fn>
</fn-group>
<ack>
<title>Acknowledgements</title>
<p>This research was presented at the workshop on degree expressions and polarity effects (DegPol2020) that was held at the Leibniz-Zentrum f&#252;r Allgemeine Sprachwissenschaft. We thank the audience there for raising important questions and issues. We also thank Min-Joo Kim and our three anonymous reviewers at &#8216;Glossa&#8217; for extremely valuable feedback on an earlier version of this article.</p>
</ack>
<sec>
<title>Funding information</title>
<p>This research was funded by the German Research Council (grant DFG FR 3482/2-1, KR951/14-1, SA 925/17-1) within SPP 1727 (<italic><ext-link ext-link-type="uri" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://xprag.de/">Xprag.de</ext-link></italic>) and by the Dutch Science Organisation (Gravitation grant &#8216;Language in Interaction&#8217;, 024.001.006); both of which are gratefully acknowledged.</p>
</sec>
<sec>
<title>Competing Interests</title>
<p>The authors have no competing interests to declare.</p>
</sec>
<ref-list>
<ref id="B1"><label>1</label><mixed-citation publication-type="journal"><string-name><surname>Barbet</surname>, <given-names>C&#233;cile</given-names></string-name> &amp; <string-name><given-names>Guillaume</given-names> <surname>Thierry</surname></string-name>. <year>2018</year>. <article-title>When <italic>some</italic> triggers a scalar inference out of the blue. An electrophysical study of a Stroop-like conflict elicited by single words</article-title>. <source>Cognition</source> <volume>177</volume>. <fpage>58</fpage>&#8211;<lpage>68</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.cognition.2018.03.013</pub-id></mixed-citation></ref>
<ref id="B2"><label>2</label><mixed-citation publication-type="journal"><string-name><surname>Benjafield</surname>, <given-names>J.</given-names></string-name> &amp; <string-name><given-names>J.</given-names> <surname>Adams-Webber</surname></string-name>. <year>1976</year>. <article-title>The golden section hypothesis</article-title>. <source>British Journal of Psychology</source> <volume>67</volume>. <fpage>11</fpage>&#8211;<lpage>15</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/j.2044-8295.1976.tb01492.x</pub-id></mixed-citation></ref>
<ref id="B3"><label>3</label><mixed-citation publication-type="journal"><string-name><surname>Bott</surname>, <given-names>Lewis</given-names></string-name> &amp; <string-name><given-names>Ira A.</given-names> <surname>Noveck</surname></string-name>. <year>2004</year>. <article-title>Some utterances are underinformative: The onset and time course of scalar inferences</article-title>. <source>Journal of Memory and Language</source> <volume>51</volume>. <fpage>437</fpage>&#8211;<lpage>457</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2004.05.006</pub-id></mixed-citation></ref>
<ref id="B4"><label>4</label><mixed-citation publication-type="journal"><string-name><surname>Boucher</surname>, <given-names>Jerry</given-names></string-name> &amp; <string-name><given-names>Charles E.</given-names> <surname>Osgood</surname></string-name>. <year>1969</year>. <article-title>The Pollyanna hypothesis</article-title>. <source>Journal of Verbal Learning and Verbal Behavior</source> <volume>8</volume>. <fpage>1</fpage>&#8211;<lpage>8</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0022-5371(69)80002-2</pub-id></mixed-citation></ref>
<ref id="B5"><label>5</label><mixed-citation publication-type="journal"><string-name><surname>Breheny</surname>, <given-names>Richard</given-names></string-name>, <string-name><given-names>Napoleon</given-names> <surname>Katsos</surname></string-name> &amp; <string-name><given-names>John</given-names> <surname>Williams</surname></string-name>. <year>2006</year>. <article-title>Are generalized scalar implicatures generated by default? An online investigation into the role of context in generating pragmatic inferences</article-title>. <source>Cognition</source> <volume>100</volume>. <fpage>434</fpage>&#8211;<lpage>463</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.cognition.2005.07.003</pub-id></mixed-citation></ref>
<ref id="B6"><label>6</label><mixed-citation publication-type="journal"><string-name><surname>Carpenter</surname>, <given-names>Patricia A.</given-names></string-name> &amp; <string-name><given-names>Marcel A.</given-names> <surname>Just</surname></string-name>. <year>1975</year>. <article-title>Sentence comprehension: A psycholinguistic model of sentence verification</article-title>. <source>Psychological Review</source> <volume>82</volume>. <fpage>45</fpage>&#8211;<lpage>73</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/h0076248</pub-id></mixed-citation></ref>
<ref id="B7"><label>7</label><mixed-citation publication-type="journal"><string-name><surname>Chemla</surname>, <given-names>Emmanuel</given-names></string-name> &amp; <string-name><given-names>Lewis</given-names> <surname>Bott</surname></string-name>. <year>2014</year>. <article-title>Processing inferences at the semantics/pragmatics frontier: Disjunctions and free choice</article-title>. <source>Cognition</source> <volume>130</volume>. <fpage>380</fpage>&#8211;<lpage>396</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.cognition.2013.11.013</pub-id></mixed-citation></ref>
<ref id="B8"><label>8</label><mixed-citation publication-type="journal"><string-name><surname>Cheng</surname>, <given-names>Chao-Ming</given-names></string-name> &amp; <string-name><given-names>Huei-Jane</given-names> <surname>Huang</surname></string-name>. <year>1980</year>. <article-title>The process of verifying affirmative and negative sentences against pictures</article-title>. <source>Memory &amp; Cognition</source> <volume>8</volume>. <fpage>573</fpage>&#8211;<lpage>583</lpage>. DOI: <pub-id pub-id-type="doi">10.3758/BF03213777</pub-id></mixed-citation></ref>
<ref id="B9"><label>9</label><mixed-citation publication-type="journal"><string-name><surname>Chevallier</surname>, <given-names>Coralie</given-names></string-name>, <string-name><given-names>Deirdre</given-names> <surname>Wilson</surname></string-name>, <string-name><given-names>Francesca</given-names> <surname>Happ&#233;</surname></string-name> &amp; <string-name><given-names>Ira</given-names> <surname>Noveck</surname></string-name>. <year>2010</year>. <article-title>Scalar inferences in autism spectrum disorders</article-title>. <source>Journal of Autism and Developmental Disorders</source> <volume>40</volume>. <fpage>1104</fpage>&#8211;<lpage>1117</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/s10803-010-0960-8</pub-id></mixed-citation></ref>
<ref id="B10"><label>10</label><mixed-citation publication-type="journal"><string-name><surname>Chevallier</surname>, <given-names>Coralie</given-names></string-name>, <string-name><given-names>Ira A.</given-names> <surname>Noveck</surname></string-name>, <string-name><given-names>Tatjana</given-names> <surname>Nazir</surname></string-name>, <string-name><given-names>Lewis</given-names> <surname>Bott</surname></string-name>, <string-name><given-names>Valentina</given-names> <surname>Lanzetti</surname></string-name> &amp; <string-name><given-names>Dan</given-names> <surname>Sperber</surname></string-name>. <year>2008</year>. <article-title>Making disjunctions exclusive</article-title>. <source>Quarterly Journal of Experimental Psychology</source> <volume>61</volume>. <fpage>1741</fpage>&#8211;<lpage>1760</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/17470210701712960</pub-id></mixed-citation></ref>
<ref id="B11"><label>11</label><mixed-citation publication-type="journal"><string-name><surname>Cho</surname>, <given-names>Jacee</given-names></string-name>. <year>2020</year>. <article-title>Memory load effect in the real-time processing of scalar implicature</article-title>. <source>Journal of Psycholinguistic Research</source> <volume>49</volume>. <fpage>865</fpage>&#8211;<lpage>884</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/s10936-020-09726-3</pub-id></mixed-citation></ref>
<ref id="B12"><label>12</label><mixed-citation publication-type="book"><string-name><surname>Clark</surname>, <given-names>Herbert H</given-names></string-name>. <year>1974</year>. <chapter-title>The chronometric study of meaning components</chapter-title>. In <string-name><given-names>Jacques</given-names> <surname>Mehler</surname></string-name> (ed.), <source>Problems actuels en psycholinguistique</source>, <fpage>490</fpage>&#8211;<lpage>505</lpage>. <publisher-loc>Paris, France</publisher-loc>: <publisher-name>Centre National de la Recherche Scientifique</publisher-name>.</mixed-citation></ref>
<ref id="B13"><label>13</label><mixed-citation publication-type="book"><string-name><surname>Clark</surname>, <given-names>Herbert H.</given-names></string-name> &amp; <string-name><given-names>Eve V.</given-names> <surname>Clark</surname></string-name>. <year>1977</year>. <source>Psychology and language: An introduction to psycholinguistics</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Harcourt</publisher-name>.</mixed-citation></ref>
<ref id="B14"><label>14</label><mixed-citation publication-type="journal"><string-name><surname>Clark</surname>, <given-names>Herbert H.</given-names></string-name> &amp; <string-name><given-names>William G.</given-names> <surname>Chase</surname></string-name>. <year>1972</year>. <article-title>On the process of comparing sentences against pictures</article-title>. <source>Cognitive Psychology</source> <volume>3</volume>. <fpage>472</fpage>&#8211;<lpage>517</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/0010-0285(72)90019-9</pub-id></mixed-citation></ref>
<ref id="B15"><label>15</label><mixed-citation publication-type="book"><string-name><surname>Cremers</surname>, <given-names>Alexandre</given-names></string-name> &amp; <string-name><given-names>Emmanuel</given-names> <surname>Chemla</surname></string-name>. <year>2014</year>. <chapter-title>Direct and indirect scalar implicatures share the same processing signature</chapter-title>. In <string-name><given-names>Salvatore Pistoia</given-names> <surname>Reda</surname></string-name> (ed.), <source>Pragmatics, semantics and the case of scalar implicatures</source>, <fpage>201</fpage>&#8211;<lpage>227</lpage>. <publisher-loc>London, United Kingdom</publisher-loc>: <publisher-name>Palgrave Macmillan</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1057/9781137333285_8</pub-id></mixed-citation></ref>
<ref id="B16"><label>16</label><mixed-citation publication-type="journal"><string-name><surname>D&#228;nzer</surname>, <given-names>Lars</given-names></string-name>. <year>2020</year>. <article-title>The explanatory project of Gricean pragmatics</article-title>. <source>Mind &amp; Language</source>. DOI: <pub-id pub-id-type="doi">10.1111/mila.12295</pub-id></mixed-citation></ref>
<ref id="B17"><label>17</label><mixed-citation publication-type="journal"><string-name><surname>De Neys</surname>, <given-names>Wim</given-names></string-name> &amp; <string-name><given-names>Walter</given-names> <surname>Schaeken</surname></string-name>. <year>2007</year>. <article-title>When people are more logical under cognitive load: Dual task impact on scalar implicature</article-title>. <source>Experimental Psychology</source> <volume>54</volume>. <fpage>128</fpage>&#8211;<lpage>133</lpage>. DOI: <pub-id pub-id-type="doi">10.1027/1618-3169.54.2.128</pub-id></mixed-citation></ref>
<ref id="B18"><label>18</label><mixed-citation publication-type="journal"><string-name><surname>Degen</surname>, <given-names>Judith</given-names></string-name> &amp; <string-name><given-names>Michael K.</given-names> <surname>Tanenhaus</surname></string-name>. <year>2016</year>. <article-title>Availability of alternatives and the processing of scalar implicatures: A visual world eye-tracking study</article-title>. <source>Cognitive Science</source> <volume>40</volume>. <fpage>172</fpage>&#8211;<lpage>201</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/cogs.12227</pub-id></mixed-citation></ref>
<ref id="B19"><label>19</label><mixed-citation publication-type="book"><string-name><surname>Degen</surname>, <given-names>Judith</given-names></string-name> &amp; <string-name><given-names>Michael K.</given-names> <surname>Tanenhaus</surname></string-name>. <year>2019</year>. <chapter-title>Constraint-based pragmatic processing</chapter-title>. In <string-name><given-names>C.</given-names> <surname>Cummins</surname></string-name> &amp; <string-name><given-names>N.</given-names> <surname>Katsos</surname></string-name> (eds.), <source>Handbook of experimental pragmatics</source>, <fpage>21</fpage>&#8211;<lpage>38</lpage>. <publisher-loc>Oxford, United Kingdom</publisher-loc>: <publisher-name>University Press</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1093/oxfordhb/9780198791768.013.8</pub-id></mixed-citation></ref>
<ref id="B20"><label>20</label><mixed-citation publication-type="journal"><string-name><surname>Dieussaert</surname>, <given-names>Kristien</given-names></string-name>, <string-name><given-names>Suzanne</given-names> <surname>Verkerk</surname></string-name>, <string-name><given-names>Ellen</given-names> <surname>Gillard</surname></string-name> &amp; <string-name><given-names>Walter</given-names> <surname>Schaeken</surname></string-name>. <year>2011</year>. <article-title>Some effort for some: Further evidence that scalar implicatures are effortful</article-title>. <source>Quarterly Journal of Experimental Psychology</source> <volume>64</volume>. <fpage>2352</fpage>&#8211;<lpage>2367</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/17470218.2011.588799</pub-id></mixed-citation></ref>
<ref id="B21"><label>21</label><mixed-citation publication-type="journal"><string-name><surname>Fauconnier</surname>, <given-names>Gilles</given-names></string-name>. <year>1975</year>. <article-title>Pragmatic scales and logical structure</article-title>. <source>Linguistic Inquiry</source> <volume>6</volume>. <fpage>353</fpage>&#8211;<lpage>375</lpage>.</mixed-citation></ref>
<ref id="B22"><label>22</label><mixed-citation publication-type="journal"><string-name><surname>Fodor</surname>, <given-names>Janet D.</given-names></string-name>, <string-name><given-names>Jerry A.</given-names> <surname>Fodor</surname></string-name> &amp; <string-name><given-names>Merrill F.</given-names> <surname>Garrett</surname></string-name>. <year>1975</year>. <article-title>The psychological unreality of semantic representations</article-title>. <source>Linguistic Inquiry</source> <volume>6</volume>. <fpage>515</fpage>&#8211;<lpage>531</lpage>.</mixed-citation></ref>
<ref id="B23"><label>23</label><mixed-citation publication-type="book"><string-name><surname>Gazdar</surname>, <given-names>Gerald</given-names></string-name>. <year>1979</year>. <source>Pragmatics: implicature, presupposition, and logical form</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Academic Press</publisher-name>.</mixed-citation></ref>
<ref id="B24"><label>24</label><mixed-citation publication-type="book"><string-name><surname>Geurts</surname>, <given-names>Bart</given-names></string-name>. <year>2010</year>. <source>Quantity implicatures</source>. <publisher-loc>Cambridge, United Kingdom</publisher-loc>: <publisher-name>University Press</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1017/CBO9780511975158</pub-id></mixed-citation></ref>
<ref id="B25"><label>25</label><mixed-citation publication-type="journal"><string-name><surname>Geurts</surname>, <given-names>Bart</given-names></string-name>. <year>2019</year>. <article-title>Communication as commitment sharing: Speech acts, implicatures, common ground</article-title>. <source>Theoretical Linguistics</source> <volume>45</volume>. <fpage>1</fpage>&#8211;<lpage>30</lpage>. DOI: <pub-id pub-id-type="doi">10.1515/tl-2019-0001</pub-id></mixed-citation></ref>
<ref id="B26"><label>26</label><mixed-citation publication-type="journal"><string-name><surname>Geurts</surname>, <given-names>Bart</given-names></string-name> &amp; <string-name><given-names>Paula</given-names> <surname>Rubio-Fern&#225;ndez</surname></string-name>. <year>2015</year>. <article-title>Pragmatics and processing</article-title>. <source>Ratio</source> <volume>28</volume>. <fpage>446</fpage>&#8211;<lpage>469</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/rati.12113</pub-id></mixed-citation></ref>
<ref id="B27"><label>27</label><mixed-citation publication-type="book"><string-name><surname>Giv&#243;n</surname>, <given-names>Talmy</given-names></string-name>. <year>1979</year>. <source>On understanding grammar</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Academic Press</publisher-name>.</mixed-citation></ref>
<ref id="B28"><label>28</label><mixed-citation publication-type="journal"><string-name><surname>Glenberg</surname>, <given-names>Arthur M.</given-names></string-name>, <string-name><given-names>David A.</given-names> <surname>Robertson</surname></string-name>, <string-name><given-names>Jennifer L.</given-names> <surname>Jansen</surname></string-name> &amp; <string-name><given-names>Mina C.</given-names> <surname>Johnson-Glenberg</surname></string-name>. <year>1999</year>. <article-title>Not propositions</article-title>. <source>Journal of Cognitive Systems Research</source> <volume>1</volume>. <fpage>19</fpage>&#8211;<lpage>33</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S1389-0417(99)00004-2</pub-id></mixed-citation></ref>
<ref id="B29"><label>29</label><mixed-citation publication-type="journal"><string-name><surname>Gotzner</surname>, <given-names>Nicole</given-names></string-name>, <string-name><given-names>Stephanie</given-names> <surname>Solt</surname></string-name> &amp; <string-name><given-names>Anton</given-names> <surname>Benz</surname></string-name>. <year>2018</year>. <article-title>Scalar diversity, negative strenghtening, and adjectival semantics</article-title>. <source>Frontiers in Psychology</source> <volume>9</volume>. <fpage>1659</fpage>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2018.01659</pub-id></mixed-citation></ref>
<ref id="B30"><label>30</label><mixed-citation publication-type="book"><string-name><surname>Greenberg</surname>, <given-names>Joseph H</given-names></string-name>. <year>1966</year>. <source>Language universals, with special reference to feature hierarchies</source>. <publisher-loc>The Hague, The Netherlands</publisher-loc>: <publisher-name>Mouton</publisher-name>.</mixed-citation></ref>
<ref id="B31"><label>31</label><mixed-citation publication-type="book"><string-name><surname>Grice</surname>, <given-names>H. P</given-names></string-name>. <year>1975</year>. <chapter-title>Logic and conversation</chapter-title>. In <string-name><given-names>Peter</given-names> <surname>Cole</surname></string-name> &amp; <string-name><given-names>Jerry L.</given-names> <surname>Morgan</surname></string-name> (eds.), <source>Syntax and semantics, volume 3: Speech acts</source>, <fpage>41</fpage>&#8211;<lpage>58</lpage>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Academic Press</publisher-name>.</mixed-citation></ref>
<ref id="B32"><label>32</label><mixed-citation publication-type="journal"><string-name><surname>Grodner</surname>, <given-names>Daniel J.</given-names></string-name>, <string-name><given-names>Natalie M.</given-names> <surname>Klein</surname></string-name>, <string-name><given-names>Kathleen M.</given-names> <surname>Carbary</surname></string-name> &amp; <string-name><given-names>Michael K.</given-names> <surname>Tanenhaus</surname></string-name>. <year>2010</year>. <article-title>&#8220;Some,&#8221; and possibly all, scalar inferences are not delayed: Evidence for immediate pragmatic enrichment</article-title>. <source>Cognition</source> <volume>116</volume>. <fpage>42</fpage>&#8211;<lpage>55</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.cognition.2010.03.014</pub-id></mixed-citation></ref>
<ref id="B33"><label>33</label><mixed-citation publication-type="book"><string-name><surname>Heim</surname>, <given-names>Irene</given-names></string-name>. <year>2008</year>. <chapter-title>Decomposing antonyms?</chapter-title> In <string-name><given-names>A.</given-names> <surname>Gr&#248;nn</surname></string-name> (ed.), <source>Proceedings of Sinn und Bedeutung 12</source>, <fpage>212</fpage>&#8211;<lpage>225</lpage>. <publisher-loc>Oslo, Norway</publisher-loc>: <publisher-name>ILOS</publisher-name>.</mixed-citation></ref>
<ref id="B34"><label>34</label><mixed-citation publication-type="thesis"><string-name><surname>Horn</surname>, <given-names>Laurence R</given-names></string-name>. <year>1972</year>. <source>On the semantic properties of logical operators in English</source>: <publisher-name>University of California, Los Angeles</publisher-name> dissertation.</mixed-citation></ref>
<ref id="B35"><label>35</label><mixed-citation publication-type="book"><string-name><surname>Horn</surname>, <given-names>Laurence R</given-names></string-name>. <year>1989</year>. <source>A natural history of negation</source>. <publisher-loc>Chicago, IL</publisher-loc>: <publisher-name>University Press</publisher-name>.</mixed-citation></ref>
<ref id="B36"><label>36</label><mixed-citation publication-type="book"><string-name><surname>Huang</surname>, <given-names>Yan</given-names></string-name>. <year>2014</year>. <source>Pragmatics</source>. <publisher-loc>Oxford, United Kingdom</publisher-loc>: <publisher-name>University Press</publisher-name>.</mixed-citation></ref>
<ref id="B37"><label>37</label><mixed-citation publication-type="journal"><string-name><surname>Huang</surname>, <given-names>Yi Ting</given-names></string-name> &amp; <string-name><given-names>Jesse</given-names> <surname>Snedeker</surname></string-name>. <year>2018</year>. <article-title>Some inferences still take time: Prosody, predictability, and the speed of scalar implicatures</article-title>. <source>Cognitive Psychology</source> <volume>102</volume>. <fpage>105</fpage>&#8211;<lpage>126</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.cogpsych.2018.01.004</pub-id></mixed-citation></ref>
<ref id="B38"><label>38</label><mixed-citation publication-type="journal"><string-name><surname>Ingram</surname>, <given-names>Joanne</given-names></string-name>, <string-name><given-names>Christopher J.</given-names> <surname>Hand</surname></string-name> &amp; <string-name><given-names>Greg</given-names> <surname>Maciejewski</surname></string-name>. <year>2016</year>. <article-title>Exploring the measurement of markedness and its relationship with other linguistic variables</article-title>. <source>PLOS ONE</source> <volume>11</volume>. <elocation-id>e0157141</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1371/journal.pone.0157141</pub-id></mixed-citation></ref>
<ref id="B39"><label>39</label><mixed-citation publication-type="journal"><string-name><surname>Kennedy</surname>, <given-names>Chris</given-names></string-name> &amp; <string-name><given-names>Louise</given-names> <surname>McNally</surname></string-name>. <year>2005</year>. <article-title>Scale structure, degree modification, and the semantics of gradable predicates</article-title>. <source>Language</source> <volume>81</volume>. <fpage>345</fpage>&#8211;<lpage>381</lpage>. DOI: <pub-id pub-id-type="doi">10.1353/lan.2005.0071</pub-id></mixed-citation></ref>
<ref id="B40"><label>40</label><mixed-citation publication-type="journal"><string-name><surname>Kissine</surname>, <given-names>Mikhail</given-names></string-name>. <year>2016</year>. <article-title>Pragmatics as metacognitive control</article-title>. <source>Frontiers in Psychology</source> <volume>6</volume>. <fpage>2057</fpage>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2015.02057</pub-id></mixed-citation></ref>
<ref id="B41"><label>41</label><mixed-citation publication-type="journal"><string-name><surname>Kuznetsova</surname>, <given-names>Alexandra</given-names></string-name>, <string-name><given-names>Per Bruun</given-names> <surname>Brockhoff</surname></string-name> &amp; <string-name><given-names>Rune Haubo Bojensen</given-names> <surname>Christensen</surname></string-name>. <year>2013</year>. <article-title>lmerTest: Tests for random and fixed effects for linear mixed effect models (lmer objects of lme4 package) [R package]</article-title>.</mixed-citation></ref>
<ref id="B42"><label>42</label><mixed-citation publication-type="journal"><string-name><surname>Lehrer</surname>, <given-names>Adrienne</given-names></string-name>. <year>1985</year>. <article-title>Markedness and antonymy</article-title>. <source>Journal of Linguistics</source> <volume>21</volume>. <fpage>397</fpage>&#8211;<lpage>429</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S002222670001032X</pub-id></mixed-citation></ref>
<ref id="B43"><label>43</label><mixed-citation publication-type="journal"><string-name><surname>Lehrer</surname>, <given-names>Adrienne</given-names></string-name> &amp; <string-name><given-names>Keith</given-names> <surname>Lehrer</surname></string-name>. <year>1982</year>. <article-title>Antonymy</article-title>. <source>Linguistics and Philosophy</source> <volume>5</volume>. <fpage>483</fpage>&#8211;<lpage>501</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/BF00355584</pub-id></mixed-citation></ref>
<ref id="B44"><label>44</label><mixed-citation publication-type="book"><string-name><surname>Levinson</surname>, <given-names>Stephen</given-names></string-name>. <year>2000</year>. <source>Presumptive meanings: The theory of generalized conversational implicature</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>. DOI: <pub-id pub-id-type="doi">10.7551/mitpress/5526.001.0001</pub-id></mixed-citation></ref>
<ref id="B45"><label>45</label><mixed-citation publication-type="book"><string-name><surname>Lyons</surname>, <given-names>John</given-names></string-name>. <year>1968</year>. <source>Introduction to theoretical linguistics</source>. <publisher-loc>Cambridge, United Kingdom</publisher-loc>: <publisher-name>University Press</publisher-name>.</mixed-citation></ref>
<ref id="B46"><label>46</label><mixed-citation publication-type="journal"><string-name><surname>Marty</surname>, <given-names>Paul</given-names></string-name> &amp; <string-name><given-names>Emmanuel</given-names> <surname>Chemla</surname></string-name>. <year>2013</year>. <article-title>Scalar implicatures: Working memory and a comparison with only</article-title>. <source>Frontiers in Psychology</source> <volume>4</volume>. <fpage>1</fpage>&#8211;<lpage>12</lpage>. DOI: <pub-id pub-id-type="doi">10.3389/fpsyg.2013.00403</pub-id></mixed-citation></ref>
<ref id="B47"><label>47</label><mixed-citation publication-type="confproc"><string-name><surname>Marty</surname>, <given-names>Paul</given-names></string-name>, <string-name><given-names>Jacopo</given-names> <surname>Romoli</surname></string-name>, <string-name><given-names>Yasutada</given-names> <surname>Sudo</surname></string-name>, <string-name><given-names>Bob</given-names> <surname>van Tiel</surname></string-name> &amp; <string-name><given-names>Richard</given-names> <surname>Breheny</surname></string-name>. <year>2020</year>. <article-title>Processing implicatures: A comparison between direct and indirect sis</article-title>. <conf-name>Oral presentation at Experiments in Linguistic Meaning (ELM)</conf-name>, <conf-loc>Philadelphia, PA</conf-loc>, <conf-date>September 16&#8211;18, 2020</conf-date>.</mixed-citation></ref>
<ref id="B48"><label>48</label><mixed-citation publication-type="confproc"><string-name><surname>Mohammad</surname>, <given-names>Saif M</given-names></string-name>. <year>2018</year>. <article-title>Obtaining reliable human ratings of valence, arousal, and dominance for 20,000 english words</article-title>. In <conf-name>Proceedings of the annual conference of the association for computational linguistics (ACL)</conf-name>. <conf-loc>Melbourne, Australia</conf-loc>. DOI: <pub-id pub-id-type="doi">10.18653/v1/P18-1017</pub-id></mixed-citation></ref>
<ref id="B49"><label>49</label><mixed-citation publication-type="thesis"><string-name><surname>Moracchini</surname>, <given-names>Sophie</given-names></string-name>. <year>2019</year>. <source>Morphosyntax and semantics of degree constructions</source>: <publisher-name>Massachussetts Institute of Technology</publisher-name>, <publisher-loc>Boston, MA</publisher-loc> dissertation.</mixed-citation></ref>
<ref id="B50"><label>50</label><mixed-citation publication-type="book"><string-name><surname>Morzycki</surname>, <given-names>Marcin</given-names></string-name>. <year>2015</year>. <source>Modification</source>. <publisher-loc>Cambridge, United Kingdom</publisher-loc>: <publisher-name>University Press</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1017/CBO9780511842184</pub-id></mixed-citation></ref>
<ref id="B51"><label>51</label><mixed-citation publication-type="journal"><string-name><surname>Nieuwland</surname>, <given-names>Mante S.</given-names></string-name> &amp; <string-name><given-names>Gina R.</given-names> <surname>Kuperberg</surname></string-name>. <year>2008</year>. <article-title>When the truth is not too hard to handle</article-title>. <source>Psychological Science</source> <volume>19</volume>. <fpage>1213</fpage>&#8211;<lpage>1218</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/j.1467-9280.2008.02226.x</pub-id></mixed-citation></ref>
<ref id="B52"><label>52</label><mixed-citation publication-type="book"><string-name><surname>Nouwen</surname>, <given-names>Rick</given-names></string-name>. <year>2020</year>. <chapter-title>Evaluation, extent, and Goldilocks</chapter-title>. Unpublished manuscript, <publisher-name>Utrecht University</publisher-name>, <publisher-loc>The Netherlands</publisher-loc>.</mixed-citation></ref>
<ref id="B53"><label>53</label><mixed-citation publication-type="journal"><string-name><surname>Noveck</surname>, <given-names>Ira A.</given-names></string-name> &amp; <string-name><given-names>A.</given-names> <surname>Posada</surname></string-name>. <year>2003</year>. <article-title>Characterizing the time course of an implicature: An evoked potentials study</article-title>. <source>Brain and Language</source> <volume>85</volume>. <fpage>203</fpage>&#8211;<lpage>210</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0093-934X(03)00053-1</pub-id></mixed-citation></ref>
<ref id="B54"><label>54</label><mixed-citation publication-type="book"><string-name><surname>Noveck</surname>, <given-names>Ira A.</given-names></string-name> &amp; <string-name><given-names>Dan</given-names> <surname>Sperber</surname></string-name>. <year>2007</year>. <chapter-title>The why and how of experimental pragmatics: The case of &#8216;scalar inferences&#8217;</chapter-title>. In <string-name><given-names>N.</given-names> <surname>Burton-Roberts</surname></string-name> (ed.), <source>Advances in pragmatics</source>, <fpage>184</fpage>&#8211;<lpage>212</lpage>. <publisher-loc>Basingstoke, United Kingdom</publisher-loc>: <publisher-name>Palgrave</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1057/978-1-349-73908-0_10</pub-id></mixed-citation></ref>
<ref id="B55"><label>55</label><mixed-citation publication-type="journal"><string-name><surname>Osgood</surname>, <given-names>Charles</given-names></string-name> &amp; <string-name><given-names>Meredith Martin</given-names> <surname>Richards</surname></string-name>. <year>1973</year>. <article-title>From Yang and Yin to <italic>and</italic> or <italic>but</italic></article-title>. <source>Language</source> <volume>49</volume>. <fpage>380</fpage>&#8211;<lpage>412</lpage>. DOI: <pub-id pub-id-type="doi">10.2307/412460</pub-id></mixed-citation></ref>
<ref id="B56"><label>56</label><mixed-citation publication-type="journal"><string-name><surname>Paradis</surname>, <given-names>Carita</given-names></string-name>, <string-name><given-names>Joost</given-names> <surname>van de Weijer</surname></string-name>, <string-name><given-names>Caroline</given-names> <surname>Willners</surname></string-name> &amp; <string-name><given-names>Magnus</given-names> <surname>Lindgren</surname></string-name>. <year>2012</year>. <article-title>Evaluative polarity of antonyms</article-title>. <source>Lingue e Linguaggio</source> <volume>11</volume>. <fpage>199</fpage>&#8211;<lpage>214</lpage>.</mixed-citation></ref>
<ref id="B57"><label>57</label><mixed-citation publication-type="journal"><string-name><surname>Politzer-Ahles</surname>, <given-names>Stephen</given-names></string-name> &amp; <string-name><given-names>Matthew E.</given-names> <surname>Husband</surname></string-name>. <year>2018</year>. <article-title>Eye movement evidence for context-sensitive derivation of scalar inferences</article-title>. <source>Collabra</source> <volume>1</volume>. <fpage>1</fpage>&#8211;<lpage>13</lpage>. DOI: <pub-id pub-id-type="doi">10.1525/collabra.100</pub-id></mixed-citation></ref>
<ref id="B58"><label>58</label><mixed-citation publication-type="journal"><string-name><surname>Proctor</surname>, <given-names>Robert W.</given-names></string-name> &amp; <string-name><given-names>Yang Seok</given-names> <surname>Cho</surname></string-name>. <year>2006</year>. <article-title>Polarity correspondence: A general principle for performance of speeded binary classification tasks</article-title>. <source>Psychological Bulletin</source> <volume>132</volume>. <fpage>416</fpage>&#8211;<lpage>442</lpage>. DOI: <pub-id pub-id-type="doi">10.1037/0033-2909.132.3.416</pub-id></mixed-citation></ref>
<ref id="B59"><label>59</label><mixed-citation publication-type="journal"><string-name><surname>R&#233;canati</surname>, <given-names>Fran&#231;ois</given-names></string-name>. <year>1995</year>. <article-title>The alleged priority of literal interpretation</article-title>. <source>Cognitive Science</source> <volume>19</volume>. <fpage>207</fpage>&#8211;<lpage>232</lpage>. DOI: <pub-id pub-id-type="doi">10.1207/s15516709cog1902_2</pub-id></mixed-citation></ref>
<ref id="B60"><label>60</label><mixed-citation publication-type="book"><string-name><surname>Rett</surname>, <given-names>Jessica</given-names></string-name>. <year>2008</year>. <source>The semantics of evaluativity</source>. <publisher-loc>Oxford, United Kingdom</publisher-loc>: <publisher-name>University Press</publisher-name>.</mixed-citation></ref>
<ref id="B61"><label>61</label><mixed-citation publication-type="book"><string-name><surname>Romoli</surname>, <given-names>Jacopo</given-names></string-name> &amp; <string-name><given-names>Florian</given-names> <surname>Schwarz</surname></string-name>. <year>2015</year>. <chapter-title>An experimental comparison between presuppositions and indirect scalar implicatures</chapter-title>. In <string-name><given-names>F.</given-names> <surname>Schwarz</surname></string-name> (ed.), <source>Experimental perspectives on presuppositions</source>, <fpage>215</fpage>&#8211;<lpage>240</lpage>. <publisher-loc>Cham, Germany</publisher-loc>: <publisher-name>Springer</publisher-name>. DOI: <pub-id pub-id-type="doi">10.1007/978-3-319-07980-6_10</pub-id></mixed-citation></ref>
<ref id="B62"><label>62</label><mixed-citation publication-type="journal"><string-name><surname>Ronai</surname>, <given-names>Eszter</given-names></string-name> &amp; <string-name><given-names>Ming</given-names> <surname>Xiang</surname></string-name>. <year>2020</year>. <article-title>Pragmatic inferences are QUD sensitive: An experimental study</article-title>. <source>Journal of Linguistics</source>. DOI: <pub-id pub-id-type="doi">10.1017/S0022226720000389</pub-id></mixed-citation></ref>
<ref id="B63"><label>63</label><mixed-citation publication-type="journal"><string-name><surname>Ruytenbeek</surname>, <given-names>Nicolas</given-names></string-name>, <string-name><given-names>Steven</given-names> <surname>Verheyen</surname></string-name> &amp; <string-name><given-names>Benjamin</given-names> <surname>Spector</surname></string-name>. <year>2017</year>. <article-title>Asymmetric inference towards the antonym: Experiments into the polarity and morphology of negated adjectives</article-title>. <source>Glossa</source> <volume>2</volume>. <fpage>1</fpage>&#8211;<lpage>27</lpage>. DOI: <pub-id pub-id-type="doi">10.5334/gjgl.151</pub-id></mixed-citation></ref>
<ref id="B64"><label>64</label><mixed-citation publication-type="journal"><string-name><surname>Sassoon</surname>, <given-names>Galit W</given-names></string-name>. <year>2010</year>. <article-title>The degree functions of negative adjectives</article-title>. <source>Natural Language Semantics</source> <volume>18</volume>. <fpage>141</fpage>&#8211;<lpage>181</lpage>. DOI: <pub-id pub-id-type="doi">10.1007/s11050-009-9052-8</pub-id></mixed-citation></ref>
<ref id="B65"><label>65</label><mixed-citation publication-type="confproc"><string-name><surname>Sch&#228;fer</surname>, <given-names>Roland</given-names></string-name>. <year>2015</year>. <article-title>Processing and querying large web corpora with the COW14 architecture</article-title>. In <string-name><given-names>Piotr</given-names> <surname>Ba&#324;ski</surname></string-name>, <string-name><given-names>Hanno</given-names> <surname>Biber</surname></string-name>, <string-name><given-names>Evelyn</given-names> <surname>Breiteneder</surname></string-name>, <string-name><given-names>Marc</given-names> <surname>Kupietz</surname></string-name>, <string-name><given-names>Harald</given-names> <surname>L&#252;ngen</surname></string-name> &amp; <string-name><given-names>Andreas</given-names> <surname>Witt</surname></string-name> (eds.), <conf-name>Proceedings of challenges in the management of large corpora 3 (CMLC-3)</conf-name>, <fpage>28</fpage>&#8211;<lpage>34</lpage>. <conf-loc>Lancaster, United Kingdom</conf-loc>: <conf-sponsor>IDS</conf-sponsor>.</mixed-citation></ref>
<ref id="B66"><label>66</label><mixed-citation publication-type="confproc"><string-name><surname>Sch&#228;fer</surname>, <given-names>Roland</given-names></string-name> &amp; <string-name><given-names>Felix</given-names> <surname>Bildhauer</surname></string-name>. <year>2012</year>. <article-title>Building large corpora from the web using a new efficient tool chain</article-title>. In <string-name><given-names>Nicoletta</given-names> <surname>Calzolari</surname></string-name>, <string-name><given-names>Khalid</given-names> <surname>Choukri</surname></string-name>, <string-name><given-names>Thierry</given-names> <surname>Declerck</surname></string-name>, Mehmet U?ur Do?an, <string-name><given-names>Bente</given-names> <surname>Maegaard</surname></string-name>, <string-name><given-names>Joseph</given-names> <surname>Mariani</surname></string-name>, <string-name><given-names>Asuncion</given-names> <surname>Moreno</surname></string-name>, <string-name><given-names>Jan</given-names> <surname>Odijk</surname></string-name> &amp; <string-name><given-names>Stelios</given-names> <surname>Piperidis</surname></string-name> (eds.), <conf-name>Proceedings of the eighth international conference on language resources and evaluation (LREC&#8217;12)</conf-name>, <fpage>486</fpage>&#8211;<lpage>493</lpage>. <conf-loc>Istanbul, Turkey</conf-loc>: <conf-sponsor>ELRA</conf-sponsor>.</mixed-citation></ref>
<ref id="B67"><label>67</label><mixed-citation publication-type="journal"><string-name><surname>Sherman</surname>, <given-names>Mark A</given-names></string-name>. <year>1976</year>. <article-title>Adjectival negation and the comprehension of multiply negated sentences</article-title>. <source>Journal of Verbal Learning and Verbal Behavior</source> <volume>15</volume>. <fpage>143</fpage>&#8211;<lpage>157</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/0022-5371(76)90015-3</pub-id></mixed-citation></ref>
<ref id="B68"><label>68</label><mixed-citation publication-type="journal"><string-name><surname>Soames</surname>, <given-names>Scott</given-names></string-name>. <year>1982</year>. <article-title>How presuppositions are inherited: A solution to the projection problem</article-title>. <source>Linguistic Inquiry</source> <volume>13</volume>). <fpage>483</fpage>&#8211;<lpage>545</lpage>.</mixed-citation></ref>
<ref id="B69"><label>69</label><mixed-citation publication-type="journal"><string-name><surname>Solt</surname>, <given-names>Stephanie</given-names></string-name>. <year>2015</year>. <article-title>Measurement scales in natural language</article-title>. <source>Language and Linguistics Compass</source> <volume>9</volume>. <fpage>14</fpage>&#8211;<lpage>32</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/lnc3.12101</pub-id></mixed-citation></ref>
<ref id="B70"><label>70</label><mixed-citation publication-type="journal"><string-name><surname>Sperber</surname>, <given-names>Dan</given-names></string-name> &amp; <string-name><given-names>Deirdre</given-names> <surname>Wilson</surname></string-name>. <year>1987</year>. <article-title>Pr&#233;cis of relevance: communication and cognition</article-title>. <source>Behavioral and Brain Sciences</source> <volume>10</volume>. <fpage>697</fpage>&#8211;<lpage>754</lpage>. DOI: <pub-id pub-id-type="doi">10.1017/S0140525X00055345</pub-id></mixed-citation></ref>
<ref id="B71"><label>71</label><mixed-citation publication-type="book"><string-name><surname>Sperber</surname>, <given-names>Dan</given-names></string-name> &amp; <string-name><given-names>Deirdre</given-names> <surname>Wilson</surname></string-name>. <year>1995</year>. <source>Relevance: communication and cognition</source> (<edition>2nd</edition> edition). <publisher-loc>Oxford, United Kingdom</publisher-loc>: <publisher-name>Blackwell</publisher-name>.</mixed-citation></ref>
<ref id="B72"><label>72</label><mixed-citation publication-type="journal"><string-name><surname>Sun</surname>, <given-names>Chao</given-names></string-name> &amp; <string-name><given-names>Richard</given-names> <surname>Breheny</surname></string-name>. <year>2019</year>. <article-title>Another look at the online processing of scalar inferences: An investigation of conflicting findings from visual-world eye-tracking studies</article-title>. <source>Language, Cognition and Neuroscience</source>. DOI: <pub-id pub-id-type="doi">10.1080/23273798.2019.1678759</pub-id></mixed-citation></ref>
<ref id="B73"><label>73</label><mixed-citation publication-type="journal"><string-name><surname>Tian</surname>, <given-names>Ye</given-names></string-name>, <string-name><given-names>Richard</given-names> <surname>Breheny</surname></string-name> &amp; <string-name><given-names>Heather J.</given-names> <surname>Ferguson</surname></string-name>. <year>2010</year>. <article-title>Why we simulate negated information: A dynamic pragmatic account</article-title>. <source>Quarterly Journal of Experimental Psychology</source> <volume>63</volume>. <fpage>2305</fpage>&#8211;<lpage>2312</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/17470218.2010.525712</pub-id></mixed-citation></ref>
<ref id="B74"><label>74</label><mixed-citation publication-type="journal"><string-name><surname>Tomlinson</surname>, <given-names>John M.</given-names> <suffix>Jr.</suffix></string-name>, <string-name><given-names>Todd M.</given-names> <surname>Bailey</surname></string-name> &amp; <string-name><given-names>Lewis</given-names> <surname>Bott</surname></string-name>. <year>2013</year>. <article-title>Possibly all of that and then some: Scalar implicatures are understood in two steps</article-title>. <source>Journal of Memory and Language</source> <volume>69</volume>. <fpage>18</fpage>&#8211;<lpage>35</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2013.02.003</pub-id></mixed-citation></ref>
<ref id="B75"><label>75</label><mixed-citation publication-type="journal"><string-name><surname>van Tiel</surname>, <given-names>Bob</given-names></string-name>, <string-name><given-names>Elizabeth</given-names> <surname>Pankratz</surname></string-name> &amp; <string-name><given-names>Chao</given-names> <surname>Sun</surname></string-name>. <year>2019b</year>. <article-title>Scales and scalarity: Processing scalar inferences</article-title>. <source>Journal of Memory and Language</source> <volume>105</volume>. <fpage>427</fpage>&#8211;<lpage>441</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/j.jml.2018.12.002</pub-id></mixed-citation></ref>
<ref id="B76"><label>76</label><mixed-citation publication-type="book"><string-name><surname>van Tiel</surname>, <given-names>Bob</given-names></string-name>, <string-name><given-names>Elizabeth</given-names> <surname>Pankratz</surname></string-name>, <string-name><given-names>Paul</given-names> <surname>Marty</surname></string-name> &amp; <string-name><given-names>Chao</given-names> <surname>Sun</surname></string-name>. <year>2019a</year>. <chapter-title>Scalar inferences and cognitive load</chapter-title>. In <string-name><given-names>M.</given-names> <surname>Teresa Espinal</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Castroviejo</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Leonetti</surname></string-name>, <string-name><given-names>L.</given-names> <surname>McNally</surname></string-name> &amp; <string-name><given-names>C.</given-names> <surname>Real-Puigdollers</surname></string-name> (eds.), <source>Proceedings of Sinn und Bedeutung 23</source>, <fpage>429</fpage>&#8211;<lpage>443</lpage>. <publisher-loc>Bellaterra, Spain</publisher-loc>: <publisher-name>Universitat Aut&#242;noma de Barcelona</publisher-name>.</mixed-citation></ref>
<ref id="B77"><label>77</label><mixed-citation publication-type="journal"><string-name><surname>van Tiel</surname>, <given-names>Bob</given-names></string-name>, <string-name><given-names>Emiel</given-names> <surname>van Miltenburg</surname></string-name>, <string-name><given-names>Natalia</given-names> <surname>Zevakhina</surname></string-name> &amp; <string-name><given-names>Bart</given-names> <surname>Geurts</surname></string-name>. <year>2016</year>. <article-title>Scalar diversity</article-title>. <source>Journal of Semantics</source> <volume>33</volume>. <fpage>107</fpage>&#8211;<lpage>135</lpage>. DOI: <pub-id pub-id-type="doi">10.1093/jos/ffu017</pub-id></mixed-citation></ref>
<ref id="B78"><label>78</label><mixed-citation publication-type="journal"><string-name><surname>van Tiel</surname>, <given-names>Bob</given-names></string-name>, <string-name><given-names>Michael</given-names> <surname>Franke</surname></string-name> &amp; <string-name><given-names>Uli</given-names> <surname>Sauerland</surname></string-name>. <year>2021</year>. <article-title>Probabilistic pragmatics explains gradience and focality in natural language quantification</article-title>. <source>Proceedings of the National Academy of Sciences of the United State of America</source> <volume>118</volume>. <elocation-id>e200545311</elocation-id>. DOI: <pub-id pub-id-type="doi">10.1073/pnas.2005453118</pub-id></mixed-citation></ref>
<ref id="B79"><label>79</label><mixed-citation publication-type="journal"><string-name><surname>van Tiel</surname>, <given-names>Bob</given-names></string-name> &amp; <string-name><given-names>Walter</given-names> <surname>Schaeken</surname></string-name>. <year>2016</year>. <article-title>Processing conversational implicatures: Alternatives and counterfactual reasoning</article-title>. <source>Cognitive Science</source> <volume>41</volume>. <fpage>1</fpage>&#8211;<lpage>36</lpage>. DOI: <pub-id pub-id-type="doi">10.1111/cogs.12362</pub-id></mixed-citation></ref>
<ref id="B80"><label>80</label><mixed-citation publication-type="journal"><string-name><surname>Wason</surname>, <given-names>P. C</given-names></string-name>. <year>1959</year>. <article-title>The processing of positive and negative information</article-title>. <source>Quarterly Journal of Experimental Psychology</source> <volume>11</volume>. <fpage>92</fpage>&#8211;<lpage>107</lpage>. DOI: <pub-id pub-id-type="doi">10.1080/17470215908416296</pub-id></mixed-citation></ref>
<ref id="B81"><label>81</label><mixed-citation publication-type="journal"><string-name><surname>Wason</surname>, <given-names>P. C</given-names></string-name>. <year>1965</year>. <article-title>The contexts of plausible denial</article-title>. <source>Journal of Verbal Learning and Verbal Behavior</source> <volume>4</volume>. <fpage>7</fpage>&#8211;<lpage>11</lpage>. DOI: <pub-id pub-id-type="doi">10.1016/S0022-5371(65)80060-3</pub-id></mixed-citation></ref>
<ref id="B82"><label>82</label><mixed-citation publication-type="journal"><string-name><surname>Wason</surname>, <given-names>P. C</given-names></string-name>. <year>1972</year>. <article-title>In real life negatives are false</article-title>. <source>Logique et Analyse</source> <volume>15</volume>. <fpage>17</fpage>&#8211;<lpage>38</lpage>.</mixed-citation></ref>
<ref id="B83"><label>83</label><mixed-citation publication-type="thesis"><string-name><surname>Westera</surname>, <given-names>Matthijs</given-names></string-name>. <year>2017</year>. <source>Exhaustivity and intonation: A unified theory</source>: <publisher-name>University of Amsterdam</publisher-name>, <publisher-loc>The Netherlands</publisher-loc> dissertation.</mixed-citation></ref>
</ref-list>
</back>
</article>