Deconstructing information structure

The paper argues that a core part of what is traditionally referred to as ‘ information structure’ can be deconstructed into genuine morphosyntactic features that are visible to syntactic operations, contribute to discourse-related expressive meanings, and just happen to be spelled out prosodically in Standard American and British English. We motivate two features, [FoC] and [G], and we track the fate of those features at and beyond the syntax-semantics and the syntaxphonology interfaces. [FoC] and [G] are responsible for two distinct obligatory strategies for establishing discourse coherence. A [G]-marked constituent signals a match with a discourse referent, whereas a [FoC]-marked constituent invokes alternatives and thereby signals a contrast. In Standard American and British English [FoC] aims for highest prosodic prominence in the intonational phrase, whereas [G] lacks phrase-level prosodic properties. There is no grammatical marking of newness: The apparent prosodic effects of newness are the result of default prosody.


Introduction
Information structure, as the term is commonly used, covers concepts related to givenness, contrast, and topicality. Despite many years of research, there is no common ground on how to identify those concepts, what their place in grammar is, or whether there is any theoretical unity or value to them. Here we will set aside topicality and limit our discussion to givenness and contrast. 1 We will build a case supporting the view that Standard American and British English has two syntactically well-motivated features triggering distinct discourse requirements related to givenness and contrast. One feature, Givenness marking ([G]-marking, Féry & Samek-Lodovici 2006), signals that an individual, concept, or proposition has been mentioned before or is otherwise present in the context. The other feature, FoCus marking ([FoC]-marking), evokes alternatives to a mentioned individual, concept, or proposition, and thereby signals a contrast in the surrounding discourse. Mere newness remains unmarked.
Givenness and contrast have been discussed as categories of information structure at least since Chafe (1976). Rochemont (2016) has an in-depth discussion of givenness in the sense intended here and distinguishes it from kindred notions like presuppositionality, definiteness, repetition, and predictability. Following Rochemont, we capitalize Givenness whenever the targeted notion of givenness is the one responsible for the lack of prominence on content words under certain discourse conditions in Standard American and British English. In a similar vein, we use the spelling FoCus when the intended notion of focus is tied to the introduction of alternatives to signal a contrast. 2 Crucially, FoCus needs to be distinguished from information focus ('newness focus'), which applies to expressions that merely present new information. As will be made clear in section 8, the targeted notion of FoCus is a very general kind of contrastive focus that evokes alternatives to signal a contrast with a discourse referent. It's through its interaction with additional operators that FoCus can participate in a diverse range of more specialized contrastive interpretations, including exhaustive and mirative focus, or contrastive topic interpretations. Pretheoretically, we'll continue to use the standard, neutral, spellings for givenness and focus, and that includes occasions when we discuss examples or proposals from sources that do not necessarily assume the information structure notions we do.
Examples (1) to (2)  (2) Me: Sarah mailed the caramels. In your answer in (1), the VP have eaten the clementines is Given. The concept of having eaten the clementines has just been mentioned. Its Givenness is reflected by the lack of prosodic prominence on eaten and clementines in Standard American and British English. The context of your answer in (1) discourages (but doesn't exclude completely) an interpretation where Paula is a FoCus. On its most natural interpretation, you aren't contrasting Paula with other people who might have eaten the clementines. Paula is merely new, then. Anticipating arguments still to come, Paula isn't marked with any feature in (1). In (2), the VP mailed the caramels in your reply is Given, too, but the subject Eliza is now a FoCus, not merely new. FoCus on Eliza evokes alternatives to Eliza: other people who might have mailed the caramels. Since Sarah is one of them and has just been mentioned, your reply represents a contrast with what I said. In her reply in (2), Aunt repeats the verb mailed and places FoCus on both Ewan and the chocolates. As a result, the set of FoCus alternatives determined by her utterance contains the two propositions expressed by you and me in the earlier parts of this discourse. Aunt's contribution to the conversation in (2) thus represents a contrast with both my utterance and your reply. The phonological and semantic/pragmatic properties of [G]-marking and [FoC]-marking will be discussed in detail in sections 7 and 8, so we will not go beyond this introductory illustration for now. Features are the building blocks of natural language and are standard currency in phonology. Yet, unlike phonology, "syntax has no articulatory tract in which to ground its features" and, as a consequence, "the algorithms that build sentences are better researched than the atoms these algorithms operate over." 3 Features related to information structure notions are well-established in cartographic approaches to syntax since Rizzi (1997), and are also assumed to influence the mapping from syntactic to prosodic structure in Bocci (2013). Yet they don't appear in Corbett's (2012) list of canonical morphosyntactic features, for example. Nor are they standard currency in contemporary theories of information structure. For many scholars the existence of morphosyntactic features for givenness or focus is still very much in doubt. For example, Williams (1997: 610) concludes "that it would not be appropriate to supply syntax with a feature [+Focus] assigned to focused constituents, where that feature would have the obvious semantic, syntactic, and phonological interpretations." At the end of his paper on givenness, Schwarzschild notes that the F(ocus)-feature representations used throughout his paper "have no significant syntactic properties. From the point of view of the grammar overall, they are a nuisance and do not shed light on the real question of what semantic information is relevant to phonology and what parts of the phonology see this information. Ultimately, they should be done away with" (Schwarzschild 1999: 175). Zubizarreta & Vergnaud (2006: 561) level a similar objection against the F(ocus)-feature of Selkirk (1984;: "This feature is undesirable because it lacks independent justification. It is only needed to establish the relation between pitch accent and the informational structure of the sentence." Those authors all see a remedy in representations via metrical trees or metrically annotated syntactic trees, along with Ladd (1980;, Reinhart (2006), Szendrői (2001;2017a;2017b), Wagner (2005;, Calhoun (2010), Williams (2012), and Büring (2015b).
Metrical tree approaches to information structure can be traced back to Ladd (1980). On some implementations, the input for the computation calculating the discourse anaphoric impact of prosodic prominence in English are binary branching metrical trees, as in Liberman & Prince (1977), or metrically annotated syntactic trees, as in Zubizarreta (1998), building on Halle & Vergnaud (1987). Metrical trees or metrically annotated syntactic trees represent relative prominence relations between sister constituents. For English, it is claimed that the default is for a left-hand sister to be weak and for a righthand sister to be strong. Deviations from the default signal the impact of information structure. When a sister that should be weak by default is actually strong, we can infer that it is a focus, and when a sister that should be strong by default is actually weak, we know that it is given. On such an approach, the computation of discourse requirements attached to givenness and focus would track deviations from the default prominence pattern. 4 A separate representation of givenness or focus via features might seem superfluous.
Metrical tree approaches to givenness and focus imply that in English, for example, the computation of meanings has access not only to the actual metrical trees associated with an expression, but also to information about English default prosody. While this architecture for the computation of meanings is technically possible, as demonstrated in Büring (2015b), it flies in the face of current views on grammatical architecture that assume that the meanings of complex expressions are compositionally projected from the meanings of lexical items and features and the way they are syntactically arranged. The question is then whether the mere fact that languages like Standard American and British English happen to rely on prosody to convey meanings related to givenness and focus forces us to abandon established assumptions about the computation of meaning in grammar.

Syntactic evidence for features [FoC] and [G]
In this section, we present a typological argument against metrical tree or metrically annotated syntactic tree representations of notions related to givenness and focus, or more generally, against any representation of those notions that does not rely on features. We will review examples from the literature that establish that, crosslinguistically, givenness and contrast-related notions of focus can trigger movement, agreement, or ellipsis, hence are responsible for syntactic behavior that has been taken to be diagnostic of the presence of features. Moreover, crosslinguistically, those features are not necessarily realized pro-sodically. Featureless metrical trees or metrically annotated syntactic trees don't look like the right cure for the problems with F(ocus)-feature representations like those of Selkirk (1984;, then. A syntactically unmotivated feature representation would be replaced by a grammatical architecture that doesn't generalize beyond languages like English and its kin. We need to find a different cure. Cartographic studies since Rizzi (1997) have accumulated a rich body of evidence documenting that features related to information structure can drive movement to the left periphery. Aboh (2007a; documents that the Gbe language Gungbe (spoken in Benin) uses overt particles to mark topics and (a contrast-related type of) focus. The particles appear in left-peripheral positions and attract topical or focused constituents to the edge of their projections. 3(a) and (b) illustrate constructions with the focus particle wɛ. 5 (3) Gungbe (Aboh 2007a: 289) a. Sɛśínú wɛ̀dà Àsíàbá. Sessinou foc marry Asiaba 'sessinou married Asiaba.' b. Àsíàbá wɛ̀Sɛśínú dà. Asiaba foc Sessinou marry 'Sessinou married asiaba.' According to Aboh (personal communication), neither he nor other native speakers who have worked on those constructions perceive any prosodic difference between neutral and focused constituents, but he cautions that there hasn't yet been any systematic research on this. Fiedler & Jannedy's (2013) phonetic study of focus constructions in the related Gbe language Ewe concludes that there is no primary prosodic focus marking in Ewe. More specifically, they did not find evidence for any prosodic properties of ex situ (displaced) focus phrases that could not be attributed to general prosodic effects of phrasing and the lexical high tone of the focus marking particle.
Like Gungbe, Wolof, an Atlantic language spoken in Senegal and the Gambia, has a left-peripheral position that can serve as the landing site for (contrastively) focused constituents (Torrence 2013). 6 (4) Wolof (Torrence 2013: 182) a. Xale bi l-a-a gis. child the xpl-cop-1sg see 'It's the child that I saw.' b. Ca lekkool ba l-a-a gis-e Isaa. p school the xpl-cop-1sg see-appl Isaa 'It's at school that I saw Isaa.' c. Gaaw l-a-a ubbe-e bunt bi. quickly xpl-cop-1sg open-mann door the 'It's quickly that I opened the door.' Torrence (2013) analyzes 4(a) to (c) as cleft constructions that are the result of movement of the clefted constituent into the left periphery. Martinovič (2015) maintains that constructions like those illustrated in 4(a) to (c) are not clefts, but genuinely monocausal.
On either account, Wolof has movement into the left periphery that is triggered by focus. Most importantly for our argument, acoustic studies have shown that there are no prosodic reflexes of focus in Wolof. Rialland & Robert (2001) conducted acoustic analyses of several natural and elicited Wolof corpora and found that Wolof has no prosodic marking of focus: "The originality of Wolof is that it has no prosodic marking of focus, even optionally" (Rialland & Robert 2001: 937). There is essentially level pitch in all sentence types, except at the edges of intonational phrases, where tonal morphemes independent of information structure appear.
The fact that contrast-related notions of focus are not necessarily realized prosodically speaks against an architecture where their meaning contributions are represented by metrical trees. That the manifestation of those notions may trigger displacement into the left periphery in some languages looks like evidence for the presence of a syntactic feature. The general relevance of contrast, as opposed to mere newness, for movement into the left periphery is emphasized in Molnár & Winkler (2010), building on earlier work documenting the linguistic importance of contrast, including Vallduví & Vilkuna (1998) and Molnár (2006).
When relying on crosslinguistic facts as evidence for the existence of a feature like our [FoC], we need to contend with a concern voiced in Horvath (2010). Horvath points out that, in Hungarian, a merely contrastive interpretation is not sufficient for a constituent to appear in the left periphery. What's required is an exhaustive interpretation, excluding all but the mentioned alternative (Szabolcsi 1981). More generally, looking at constructions where focused constituents appear in the left periphery, we typically find subtly different interpretations. Apart from exhaustive interpretations, we may see contrastive topic or mirative interpretations, for example.
In current semantic work on association with focus, exhaustive, contrastive topic, or mirative interpretations are compositionally derived through the interaction of focused constituents with particular focus-sensitive operators, which might be silent, but could also be spelled out tonally or segmentally (Constant 2012;2014;Bianchi et al. 2016;Fox & Spector 2018;Bade & Sachs 2019). For example, contrastive topics in English trigger a requirement for a discourse antecedent corresponding to a complex, sorted, question (Büring 2003). Constant (2012;2014) showed that this discourse requirement can be computed compositionally from a syntactic configuration where a left-peripheral contrastive topic operator attracts a focused phrase from a sister constituent that has another focused phrase in it. On such an approach, a contrastive topic phrase and an exhaustive focus phrase, for example, both carry a focus feature. What distinguishes the two is their association with different focus attracting operators: a covert exhaustivity operator for exhaustive focus, and a covert contrastive topic operator for contrastive topics. In line with cartographic work (Rizzi 1997), those operators might very well occupy distinct left-peripheral positions, thus providing distinct landing sites. Assuming compositional accounts along those lines, our [FoC]-feature could be at the core of a whole range of subtly different interpretations, all involving FoCus in one way or other. In each case, an operator that evaluates FoCus alternatives attracts a [FoC]-marked phrase that provides those alternatives. It seems, then, that we are justified in taking the crosslinguistic facts featured in this section as evidence for the presence of a [FoC]-feature, even though the observed interpretations might be subtly different from each other. By way of illustration, we will spell out the technical details of how our [FoC]-feature associates with an exhaustivity operator in section 8. 7 A contrast-related notion of focus can also trigger overt agreement in some languages. In the 'focus concord' constructions of Sinhala, Pre-Modern Japanese, and the Ryukyuan languages (Aldridge 2018;Kishimoto 2018;Slade 2018;Whitman 1997), (contrastively) focused phrases are marked with a particle that covaries with special inflection on the predicate. (5) is an example from Sinhala, an Indo-Iranian language spoken in Sri Lanka. 8
Ranjit Chitra that book foc read-a that know-e 'It is that book that Ranjit knows that Chitra read.' In 5(a) and (b), the particle tamay marks ee potǝ ('that book') as a focus. The scope of the focus is indicated by the -e ending of the verb, which has to appear here instead of the default -a ending. In 5(a), the scope of the focus is just the embedded sentence. 5(a) conveys that Ranjit knows that what Chitra read was that book (and not anything else).
In 5(b), on the other hand, the scope of the focus is the whole sentence. 5(b) conveys that what Ranjit knows Chitra read is that book (and not anything else). 5(b), but not 5(a) should thus be compatible with a situation where, unbeknownst to Ranjit, Chitra also read a magazine. Hagstrom (1998; and Kishimoto (2018) suggest that there is feature agreement between the focus particle tamay and the scope site of focus marked by the e-form of the verb in examples like 5(a) or (b). Both authors argue (in different ways) that the nature of this relation forces the focus particle to move to its scope site overtly or covertly. 9 (6) would be the result of an overt instance of this movement.

(6)
Sinhala (Kishimoto 2018: 3) [Ranjit ee potǝ kieuw-a] tamay Ranjit that book read-a foc 'It was only that Ranjit read that book.' In (6), tamay appears clause finally and with the a-form of the verb. In this position, tamay doesn't delimit the focused constituent, as in 5(a) and (b), but marks the scope site of the focus. (6) has several interpretations depending on which part of the scope of tamay is understood as focused. It may convey that it was Ranjit who read that book, that it was that book that Ranjit read, that Ranjit did read that book, and so on.
A feature agreement relation between particles that mark (contrastively) focused constituents and inflection on a nearby predicate has also been posited for the kakari-musubi construction found in Premodern Japanese and in the Ryukyuan languages (Whitman 1997;Aldridge 2018). 10 the same model. Constant (2012;2014) and Bianchi et al. (2016) provide the technical details of how to derive contrastive topic and mirative interpretations along the lines we have outlined above. 8 Glosses: a for -a inflection, e for -e inflection, foc for focus. 9 For Hagstrom, the -e form of the verb has an uninterpretable focus feature that needs to be checked. For Kishimoto, the movement of the focus particle is criterial in the sense of Rizzi (1997): the particle moves into a dedicated focus position in the CP layer of the sentence. 10 Glosses: neg for negation, iz for izen 'realis' inflection, top for topic, mod for modal, rt for rentai 'adnominal' inflection.

(7)
Old Japanese (Aldridge 2018: 7) Pito = koso sira-ne matu = pa siru ramu. Person = koso know-neg.iz pine = top know mod.rt 'Though people do not understand, the pine may know.' In (7), the contrastive focus particle koso triggers the izen 'realis' inflection on the predicate, which would not be used here in the absence of koso. This dependence between a focus particle and inflection on the predicate has been analyzed as feature agreement by several researchers, including Ikawa (1998), Kuroda (2007), and Aldridge (2018).
Taken together, the data we have reviewed so far warrant the conclusion that there are languages that represent contrast-related notions of focus with features that are visible to syntactic operations like displacement and agreement. Most importantly for our argument here, among those languages are some where contrast-related notions of focus are not realized prosodically. This precludes grammatical architectures that directly link discourse effects related to givenness and focus to representations of prosody or to metrically interpreted syntactic trees.
Turning to potential syntactic effects of givenness, Šimík and Wierzba (2015) argue (against Kučerová 2012) that Givenness (in the sense intended here), rather than presuppositionality, is reflected in Czech word order variation. According to Šimík and Wierzba, Czech Given phrases avoid stress, but, unlike English Given phrases, they move to left-peripheral positions to escape the canonical, rightmost, sentence stress position in Czech. Šimík and Wierzba's work establishes a bridge to the work of Reinhart (2006) and Szendrői (2001;2017a;2017b), pointing to the possibility that the [G]-feature, and possibly also the [FoC]-feature in some languages, might drive movement that aims at creating an output that conforms to general prosodic constraints for a language.
Givenness also seems to play a crucial role in VP-Preposing in English, as analyzed in Samko (2016), 11 and in licensing ellipsis, as observed in Tancredi (1992), Rooth (1992b), and Winkler (2016). A syntactic feature related to Givenness, e-Givenness, was posited in Merchant (2001) to license ellipsis under certain conditions, some of which are syntactic, and some are semantic in nature. The e-Givenness feature requires a strong version of Givenness for its sister constituent, licenses non-pronunciation of that constituent at the syntax-phonology interface, and has a distribution that depends on syntactic properties of the phrase structure trees it occurs in. Merchant's e-Givenness feature thus has the kind of properties that we expect [G] and [FoC] to have,too. 12 This section has presented crosslinguistic evidence for the existence of syntactically well-motivated features related to Givenness and FoCus. Our next move is now to bring out the disconnect between syntactically well-motivated morphosyntactic features like [FoC] and [G] and the widely assumed more general F(ocus)-marking systems originating with Selkirk (1984; and standardly assumed in semantic work. The following section will present the essentials of the F(ocus)-marking systems of Rooth (1992a) and Schwarzschild (1999), and thereby steer the discussion towards a theoretical dilemma: a feature system that has rightly been accused of lacking independent morphosyntactic motivation comes with what looks like unrivalled theoretical elegance in delivering a unified account of the discourse anaphoric effects of givenness and focus. Once the dilemma is identified, we can begin to look for ways to overcome it.

Just F(ocus)-marking?
In this section, we'll present the leading ideas common to Rooth and Schwarzschild in a unified system. We will use Rooth's Alternatives Semantics (Rooth 1992a) to state Schwarzschild's (1999) discourse anaphoric requirements for Givenness and FoCus, following the presentation in Rooth (2015). 13 A fundamental assumption of Rooth (1985;1992a) and Schwarzschild (1999), which is inherited from Selkirk (1984;, is that there is a single [F]-feature that uniformly marks constituents that are FoCused (in our sense) and those that are merely new (hence not Given in our sense). Given constituents remain unmarked. This is illustrated in (8) and (9) (10) that most vividly bring out the disconnect with bona fide morphosyntactic features that could also play a role in syntactic operations like displacement or agreement. Yet Rooth and Schwarzschild have proposed accounts that are capable of computing the apparently distinct discourse requirements imposed by FoCus and Givenness from representations that only have [F]-marking. By way of illustration, look again at your answer in (9), repeated here as (11): (11) [Eliza] F mailed the caramels.
(11) is not acceptable as an out-of-the-blue utterance. The VP mailed the caramels needs to be Given in our sense, and the sentence as a whole might also express a contrast, possibly with something that was said earlier. Schwarzschild (1999) proposes a unified characterization of those two discourse anaphoric relations in terms of a generalized notion of givenness that subsumes both our Givenness and FoCus. Schwarzschild requires that any constituent that is not [F]-marked be given in this generalized sense. In (11), neither the VP or its parts, nor the sentence as a whole are [F]-marked, hence those constituents all need to come out as given on his approach. The Alternatives Semantics of Rooth provides a convenient characterization of Schwarzschild's generalized notion of givenness, referred to as A-Givenness from now on. 14 We can say that a constituent α is A-Given (in a context) just in case there is a salient discourse referent (an individual, concept, or proposition) that is a member of the alternatives set associated with α. In Alternatives Semantics, every expression is assigned two semantic values: its O(rdinary)-value, and its A(lternatives)-value, which is its alternatives set. For example, the O-value of (11) is just the proposition that Eliza mailed the caramels. Its A-value is the set of propositions in (12).
Since the proposition that Sarah mailed the caramels is in the alternatives set (12) for (11) and, in the context of (9), has just been mentioned, (11) as a whole satisfies Schwarzschild's A-Givenness requirement in that context. To compute the alternatives set for (11) compositionally, we combine the A-values of its immediate constituents, the [F]-marked subject [Eliza] F and the VP mailed the caramels. The A-value of [Eliza] F is the set of all individuals − Eliza, Sarah, Leif, and anybody else in our domain of discourse. What about the A-value of the VP mailed the caramels, which contains no [F]-marks? In Rooth's Alternatives Semantics, that VP's A-value is a singleton set, the set containing the VP's O-value as its only member. That's the singleton set containing the property of having mailed the caramels. The A-value of (11) as a whole is computed by pointwise combination of the A-values of [Eliza] F and the VP mailed the caramels: {Eliza, Sarah, Leif, …} × {'mailed the caramels'}. The result is the alternatives set (12).
Our Givenness falls out as a special case of A-Givenness. Since the VP mailed the caramels in (11) has a singleton alternatives set, it is A-Given just in case its only member, the property of having mailed the caramels, is salient in the discourse context, hence is Given in our sense. That, too, is the case in the context of (9).
Rooth's and Schwarzschild's systems provide unified accounts of the discourse requirements triggered by FoCus and Givenness. It looks like there really aren't TWO such discourse requirements. Givenness and FoCus come out as two sides of the same coin. Rooth's and Schwarzschild's systems only require a single focus-related feature: [F]-marking. Neither FoCus nor Givenness need to be represented separately. The case for a unified account of Givenness and FoCus in terms of [F]-marking is strong indeed. Given the theoretical elegance and unifying power of [F]-marking systems, should we still worry about the disconnect with syntactically motivated features?
We should. The following section will collect together empirical problems for uniform [F]-marking approaches. We will present data telling us in no uncertain terms that grammar makes a distinction between FoCus and discourse newness. The two notions shouldn't be represented by a single feature.

Newness and FoCus can't be lumped together
Since at least Chafe (1976), syntacticians, phonologists, and phoneticians, have documented differences between different types of focus in a number of languages. Many of those authors have pointed to differences in the way languages mark constituents that are merely discourse new (newness focus, information focus), as opposed to constituents that (i) Every [cat] F is a complainer.
For Schwarzschild, (i) as a whole is given just in case (ii) is entailed by prior context:  (14) is true just in case the mentioned alternative 'He would offer that Modigliani to MoMA' is the only alternative in (15) that is true. The scenario described in (13) also excludes the possibility that Modigliani in our target sentence might be a contrastive topic scoping over only. In the context of (13), (14) can't be understood as contrasting the mentioned Modigliani painting, which Gary would only offer to MoMA, with other paintings of his that he might also offer to the Metropolitan or the Guggenheim. Such an interpretation would go against what we are being told in the story, namely that Gary doesn't do business with the Metropolitan or the Guggenheim.
Finally, interpreting the merely new direct object outside the scope of only wouldn't be an option in (16) (still understood as a continuation of (13)), where the object contains a negative polarity item that is licensed by only, hence is trapped within the scope of only.
In it-cleft sentences the clefted constituent is a FoCus. In (17), Jane's phone is contrasted with her keys. In (17), it so happens that the material in the cleft clause is Given and lacks pitch accent(s). But as Prince (1978) observed, and Hedberg (1990; discussed, the material in the cleft clause may also be new, hence accented. (18)  The dilemma for an [F]-marking-only approach is that the mechanism comput-ing the set of FoCus alternatives wouldn't want to have any [F]-marks in the cleft clause, while the mechanism computing the prosody would need them. On an [F]-marking-only approach we would again have no syntactic representation from which to compute the right alternatives set on the one hand, and the right prosody on the other. Our last example in this section illustrates an alternation in answers to constituent questions that has been reported for several languages, including Italian (É. Kiss 1998;Belletti 2001), Hungarian (É. Kiss 1998), Spanish (Zubizarreta 1998), Finnish (Molnár 2001), Gungbe (Aboh 2007a;, and Kwa languages more generally (Ameka 2010). For illustration, we will look at an Italian example. In Italian, a simple constituent question like 21(a) can be answered as in 21(b) or 21(c).

(21)
Italian a. Chi ha scritto questo articolo? who has written this article 'Who wrote this article?' b. L' ha scritto Gennaro.
it has written Gennaro 'Gennaro wrote it.' c. Gennaro l'ha scritto. Gennaro it has written. 'Gennaro wrote it.' As discussed by É. Kiss and Belletti, a question like 21(a) can be answered with a postverbal subject, as in 21(b), or a preverbal subject, as in 21 (c). As an answer to 21(a), 21(c) necessarily expresses a contrast. Gennaro must be a FoCus, it can't be merely new.
In 21(b), Gennaro could also be a FoCus, but doesn't have to be. 19 It could also be merely new. Placing a FoCus on Gennaro would be natural in contexts where the possibility that others might have written this article is already in the air or could be evoked with such a FoCus. In the latter case, the information that Gennaro is the author of the article would be framed against the background of the possibility that others, one of the 'non-Gennaros', whoever they may be, might have written the article. We could think of this as an accommodated FoCus: it doesn't require an explicit discourse antecedent.
To bring out intuitions about preverbal vs. postverbal subjects in Italian more clearly, consider 22(a) and (b) below, still understood as answers to the question in 21(a). We made the answers longer, adding new information that was not explicitly asked for. This change seems to make it harder (not completely impossible) to accommodate a contrastive interpretation for the subject Gennaro. 22(a) and (b) are not naturally understood as contrasting Gennaro with others who might have written this article while being in Graduate School. As a result, there is pressure for Gennaro to appear postverbally. 22(b) is judged infelicitous as an answer to 21(a). 20 Italian a. Credo che l'abbia scritto Gennaro quando era in Graduate I think that it has-SUBJ written Gennaro when was in Graduate School. School. 'I think Gennaro wrote it when he was in Graduate School.' b. #Credo che Gennaro l'abbia scritto quando era in Graduate I think that Gennaro it has-SUBJ written when was in Graduate School. School. 'I think Gennaro wrote it when he was in Graduate School.' É. Kiss and Belletti argue, following Rizzi (1997), that the preverbal subjects in sentences like 21(c) and 22(b) occupy a left-peripheral position reached via movement. The postverbal position of the subject is either its original position (É. Kiss) or a low position in the verb's functional projection (Belletti).
What is it that prevents Gennaro in (22) from moving to a left-peripheral position? Is it because Gennaro is not contrastive, hence not a FoCus? Or could it be that it is FoCused, but movement of Gennaro to the left periphery would imply association with an operator that would result in an implausible interpretation? In answers to constituent questions, any silent left-peripheral operator would most plausibly be an exhaustivity operator. But we constructed the example so that the answers in (22) are understood to be exhaustive: we infer that Gennaro and only Gennaro wrote that article. If there was a silent leftperipheral exhaustivity operator that a FoCused instance of Gennaro could associate with, then, there should be no obstacle for Gennaro to appear preverbally, contrary to fact. We conclude that what prevents Gennaro from appearing preverbally in (22) is that it's not naturally understood as FoCused. Examples like those in (22) thus show that the syntax of Italian makes a distinction between constituents that are FoCused and those that are merely new.
Summarizing the results of this section, we conclude that an [F]-feature-only account isn't able to connect meaning, syntax, and prosody in the right way. The grammars of natural language distinguish FoCused phrases from those that are merely new. The two notions can't be lumped together into a single feature. Accepting this conclusion, it raises an important question about the status of newness. If newness and FoCus can't be lumped together into a single feature, is newness represented by a separate morphosyntactic feature or does it remain unmarked?

No feature for newness
The last section concluded that FoCus and newness cannot be lumped together into a single feature. The question before us in this section is the theoretical status of newness. Does the universal repertoire of morphosyntactic features include a separate feature marking new information? The available crosslinguistic data do not allow us to answer this question conclusively. The discussion of answers to questions in Italian in the last section brought out the empirical difficulty. We saw that Italian makes a difference between subjects that are merely new and those that are contrastive, but to bring out that difference in a convincing way, we couldn't use a simple question-answer test. We had to make some effort to construct an example that discouraged accommodation of a contrastive interpretation for the new part of the answer. Crosslinguistic research happens to mostly use answers to questions as tests for focus. Answers to constituent questions are still "the most widespread and most widely used test for focus" (van der Wal 2016: 265). But, as we have seen, the new part of an answer to a constituent question might be contrastive or merely new, hence a simple question-answer test won't distinguish the two. As a consequence, the available crosslinguistic record on whether there are languages that mark mere newness is inconclusive. 21 Given the lack of crosslinguistic evidence, we will turn our attention to English for the remainder of this article. We will argue that the grammar of Standard American and British English marks Givenness and FoCus, but is blind to newness. Material that is merely new ('information focus') remains unmarked in English. From a theoretical perspective, there is no such thing as 'information focus' in English.
The consequences of the assumption that newness might be unmarked in a language are momentous and haven't been fully appreciated in the literature. We have seen that the part of an answer that corresponds to the wh-part of a constituent question can be merely new. But if mere newness is unmarked and thus doesn't correspond to any grammatically relevant kind of focus, we can no longer say things like the "classical pragmatic use of focus is to highlight the part of an answer that corresponds to the wh-part of a constituent question" (Krifka 2008: 250), nor that in "an answer to a constituent question, the element corresponding to the <wh>-phrase must be a focus" (Büring 2016: 12). Büring, Krifka, and much of the traditional and contemporary literature on focus take 'answer focus' to be an unquestioned subtype of focus. The rationale for classifying 'answer focus' as a type of focus along with contrastive focus (our FoCus) is the assessment that these foci "behave identically in all respects" (Büring 2016: 23). The data we saw in the previous section go against this assessment.
There are even more far-reaching consequences of positing a category like 'answer focus'. If just about anything we say can be understood as being part of an answer to an implicit question under discussion, as Roberts (1996; has argued, then just about anything new we say should be part of an 'answer focus', which should then be marked for focus. But then we would again uniformly mark FoCus and newness, which we should not, as the previous section has shown. Finally, if there is evidence that newness is unmarked in the grammar of English, the problematic data discussed in section 4 could not be accounted for by simply adding a [G]-feature to an [F]-marking-only system like that presented in section 3. Nor could we have a two-feature-system like that of Beaver & Velleman (2011), where a separate [N]-feature marks new (or, more accurately, unpredictable) material.
Our argument for the unmarked status of newness ('information focus') in Standard American and British English comes from the syntax-phonology interface. The difference between marking vs. not marking newness is brought out most dramatically in out-ofthe-blue utterances like (23), where everything is new information. Accounts where newness is marked would produce representations that look minimally as in 23(a) (with an [N]-feature used for illustration), whereas accounts where newness is unmarked posit representations like 23(b).
[N]-marking every content word in 23(a) is necessary since newness of a constituent in no way implies that its parts are new as well. Representations with merely broad [N]marking like (24) for out-of-the-blue utterances would thus be inadequate.
appear in situ or ex situ in Hausa, and they conclude from this that "information focus" can be realized in both of those positions. Since they didn't' control for accommodated contrasts in the answers, we can't quite draw that conclusion, though. Interestingly, in a corpus study, Hartmann and Zimmermann found that almost 80% of all cases they identified as "information focus" were realized in situ, while more than 90% of all cases they identified as "selective, contrastive, or corrective focus" were realized ex situ.
The difference between 23(a) and (b) is critical for figuring out the architecture of the syntax-phonology interface. With 23(a), the prosody of the sentence would have to be read off a representation that has several instances of [N]-marks. With 23(b), on the other hand, the right prosody would have to be determined on the basis of a representation that does not contain any information structure features at all. This means that if 23(b) is right, there has to be a default prosody for English, all new, out-of-the-blue utterances whose principles are independent of any impact of information structure. To defend representations like 23(b) against 23(a), then, we need to make a case that there is such a default prosody and show how it can be derived within an otherwise plausible general theory of prosodic phonology. If the demonstration succeeds, we would have to reject 23(a) on conceptual grounds: All [N]-marks in 23(a) would be entirely superfluous as far as prosody is concerned. Accounts presupposing the existence of a default prosody for English that is independent of information structure have been proposed since the earliest studies of prosody within Generative Grammar (Chomsky & Halle 1968;Bresnan 1971;Chomsky 1971;Jackendoff 1972). The recognition of a default prosody for English is also at the very center of approaches linking the discourse effects of givenness and focus to prosodic structures or metrically interpreted syntactic trees. 22 While there have been dissenting opinions in the past (Bolinger 1965;Schmerling 1976), the existence of English default prosodic phonology is now generally acknowledged (Truckenbrodt 1995;Gussenhoven 2004;Féry & Samek-Lodovici 2006;Selkirk 2008;Büring 2016;Féry 2017). We therefore expect that representations like 23(a) for all-new sentences are not an option.
If there is no such thing as 'information focus' or 'newness focus' in Standard American and British English, and the apparent prosodic effects of newness are the result of default prosody, we can seriously consider the possibility that the effects of information structure on prosody in Standard American and British English can be traced back to the impact of the syntactically well-motivated features [FoC] and [G] and other equally well-motivated features like them. The cure for the disconnect between information structure features like the F(ocus)-feature of Selkirk (1984; and syntactically motivated features would then not consist in abandoning feature representations for information structure altogether. What we would need to do is reconceptualize and reconfigure the repertoire of such features. Selkirk (1984;, Rooth (1985;1992a), Schwarzschild (1999), and Beaver & Velleman (2011) went wrong by assuming that English represents newness. There is no feature for newness in English, nor is there a feature that lumps together FoCus and newness.

Deriving the prosody of all-new declarative sentences
In this section we will put a proposal on the table showing how prosodic representations of all new declarative sentences in Standard American and British English can be derived by default. Our account of the default prosody of Standard American and British English includes the representation of prosodic constituent structure, of prosodic prominence patterns, and of accent and edge tones. One part of the account specifies the relation between prosodic constituency and the morphosyntactic constituency of a sentence. The other shows how the phonology of Standard American and British English derives the distribution of tone and prosodic prominence as a function of prosodic constituency. While this is not the place to engage in detailed comparisons of competing accounts of default prosody, we will make sure that every part of our proposal is typologically well-motivated quite independently of information structure. In fact, there won't be, and there shouldn't be, any mention of any information structure notions in this section.

Theoretical background
The sentence types we will consider will be 'pragmatically neutral' in that they do not make use of any of the surface tones or tone combinations ('tunes') that are used in Standard American and British English to convey a variety of discourse-related meanings (Ladd 1980;Pierrehumbert 1980;Gussenhoven 1983;Pierrehumbert & Hirschberg 1990;Bartels 1999;Truckenbrodt 2012b). The predictable pitch accents that appear with merely new constituents in Standard American and British English will be argued to be instances of meaningless epenthetic tones such as are commonly found crosslinguistically.
Our account assumes the three distinct levels of grammatical representation illustrated in 25(a) to (c). 23

(25)
Sarah mailed the caramels a. Morphosyntactic output representation (MSO):  (25) is schematized in (26). 23 Our displays of MSOs use familiar syntactic labels, even though for the mapping from MSO to PI, category differences between Words (e.g. n vs. v) or between Phrases (e.g. np vs. vp) do not play a role.

Phonetic Interpretation
A few remarks on the notation in (25) and Figure 1: The non-italic orthographic representations of the words in PO (and PI) are a convenient substitute for the phonological feature representations of their segmental content. The notations H and L stand for high and low tones. (The ~ indicates the optional status of a following H.) The orthographic acute and grave accent marks on the vowels indicate the head or prominent ('stressed') status of the syllables containing them: orthographic acute accent for the syllable that has the head prominence ('main stress') of the prosodic word w, grave accent for the syllable with merely foot-head prominence. A superscript acute accent on a prosodic w or phonological phrase j indicates its head-prominence status at the next higher level of prosodic constituency.
Regarding the grammatical model we are assuming, we'd like to emphasize that this kind of architecture is not an unfamiliar one. Most properties of the three successive representations are widely assumed. Consider first the output phonological representation (PO), which is submitted to phonetic interpretation. Notably, its prosodic constituent organization, the prosodic prominence patterns and the tones (illustrated in 25(c) and Figure 1) have to be there since they may all contribute to the quantitative phonetic interpretation of the sentence (Shattuck-Hufnagel & Turk 1996; Keating et al. 2004).
A fundamental property of the phonological output representation PO (illustrated in 25(c)) is that it includes a linear organization of the words of the sentence. This linearization is largely inherited from the phonological input PI (illustrated by 25(b)). A linearized PI for the sentence as a whole, as in 25(b), is presupposed in any work which treats phonological phenomena that apply between successive words of a sentence. These phenomena include inter-word vowel coalescence, segmental feature assimilation or lexical tone spread, for example. In cases of inter-word phonological phenomena there are phonological constraint-induced changes between the distribution of segmental and tonal phonological features in the PI and their distribution in the PO of the sentence (see e.g. Nespor & Vogel 1986, Truckenbrodt 1999, and the articles in Selkirk & Lee 2015). The contribution of the morphosyntactic output representation (MSO) to the linearization of PI is the topic of active research, and so are possible phonologically induced changes in linear order. In the model of grammar we are entertaining, phonological effects on linearization might be understandable as part of spellout (the mapping from MSO to PI) or as part of the phonology per se (the mapping from PI to PO). 24 The grammatical architecture displayed in (26) is compatible with Kiparsky's (2000) proposal that word phonology is characterized in terms of an optimality theoretic constraintbased input-output phonology for words, and that the output of the word phonology in 24 Recent work has shown a broad range of phonological factors to be involved in characterizing attested variation in the order of words (see e.g. the papers in Gribanova & Shih 2016). Of special interest to our concerns here are the demonstrations by Agbayani & Golston (2010;, Bennett, Elfner & McCloskey (2016), and Kusmer (2019) that certain word orders observed in the phonological output representation involve movements of constituents defined in prosodic structure terms from and/or into positions that must be defined in terms of prosodic structure. Such cases would have a place in the Phonology module in (26), which relates a prosodically structured phonological input representation PI to its phonological output representation PO, as illustrated above in (25)/ Figure 1.
turn provides the input for an OT constraint-based input-output phonology for sentences. We could assume that output representations of the word phonology form part of the MSO representation (25a) that maps onto the PI in (illustrated in 25 (b)). Our writing with italics of the component words of the MSO is meant to indicate that we remain agnostic as to the precise phonological status of these words in MSO, an issue not central to our concerns. The architecture displayed in (26) is also compatible with a Distributed Morphology theory of post-syntactic morphological operations (see e.g. Halle & Marantz 1993;Embick & Noyer 2007). Our choice of the term 'MSO' ('morphosyntactic output') reflects this possible connection.
Since the phrasal head prominence ('stress') and tone of all-new pragmatically neutral declarative sentences are phonologically predictable in Standard American and British English, they are only introduced in the PO, as language-particular consequences of phonological markedness constraints relating prosodic constituent structure to prosodic head prominence and, directly or indirectly, to tone. The demonstration that the surface (output) phrasal tone and prominence in all new pragmatically neutral declarative sentences is phonologically predictable is our main argument that no morphosyntactic feature for newness is required in the grammar. 25

Deriving the prosodic constituent structure of a sentence
Recall our illustration of the proposed derivation of the prosody of all-new utterances in American and British English, which is reproduced here: In line with Match Theory (Selkirk 2009;Elfner 2012; we assume that the morphosyntactic Words of the MSO representation 25(a) correspond to prosodic words, labeled w, in the PI representation 25(b), the syntactic Phrases correspond to phonological phrases, labeled j, and the Clause or Root node corresponds to an intonational phrase, labelled i. The type of morphosyntactic constituent we are calling a "Word" consists of a lexical root and whatever lexical or functional heads might be adjoined to it, through head movement or morphological operations. 26 What we are calling a "Phrase" minimally dominates a Word and possibly another Phrase. 27 (27) below states our first constraint on phonological spellout, MatchWord. 28 MatchWord says that every instance of a Word in MSO corresponds to exactly one instance of a prosodic word in PI. 25 At the same time this demonstration will show that there is also no motivation for a focus morpheme with lexical H tone contra Selkirk (1984). 26 Inclusion of (certain) functional heads within a Word has the consequence of eliminating the related functional projections in MSO. It's for this reason that the phrase structure of the MSO 25(a) looks so simple -the only overt functional head that heads a Phrase is the determiner the. 27 Research on what counts as a Clause, a Phrase, or a Word in spelling out MSO is ongoing. See e.g. Selkirk (2009;, Ito & Mester (2012;, the articles in Selkirk & Lee (2015), Bennett et al. (2016), Bennett & Elfner (2019). 28 The statements of MatchWord and MatchPhrase here adopt a formalization in terms of a terminals-based correspondence of syntactic and prosodic constituents, as proposed by Elfner (2012;.

(27) MatchWord
For every instance of a Word in MSO there is exactly one instance of a prosodic word w that spells it out phonologically in PI.
Similarly, on the basis of the Phrase-level constituency in 25(a), a matching j-level constituency is generated in the PI of 25(b), in response to the phonological spellout condition MatchPhrase (28): For every instance of a Phrase in MSO there is exactly one instance of a phonological phrase φ that spells it out phonologically in PI.
The strongest arguments in favor of positing a categorial distinction between corresponding syntactic and prosodic constituents (Word vs. w and Phrase vs. φ) come from mismatches between morphosyntactic constituency and the constituency of phonological output representations which directly affects the phonological patterning observed in the output PO, as well as its phonetic interpretation. The majority of such mismatches appear to be driven by phonological markedness constraints. A simple example of a phonologically driven mismatch is provided by the monosyllabic functional head the in (25), best viewed in Figure 1. It is j-initial in the PI representation 25(b), because it is Phrase-initial in the MSO representation 25(a). But it is prosodically encliticized to a preceding w in the PO representation 25(c)/ Figure 1. 29 The key to understanding this particular mismatch is the purely phonological markedness constraint StrongStart (Selkirk 2011;Bennett et al. 2016), which would be violated by the PI-faithful PO representation (s j) j , but not by the actual PO representation (w s) w . A mismatch between PI and PO will arise when a phonological markedness constraint outranks a phonological faithfulness constraint on the relation between PI and PO. Inputoutput mismatches of various types are a core matter for the phonology (McCarthy & Prince 1995;. We account for phonology-driven constituency mismatches between MSO and PO by proposing that Match constraints form part of the interface between MSO and PI. A PI supplied with prosodic constituent structure as defined by MatchWord and MatchPhrase forms the crucial link to understanding such mismatches.
An important advantage of our proposal that prosodic constituent structure forms part of the PI representation of the sentence is that it makes it possible to exploit McCarthy & Prince's theory of faithfulness constraints on the relation between input and output phonological representations as the basis for a theory of possible mismatches between the prosodic constituency of PI and PO. 30 In section 7, where we discuss the phonological properties of sentences containing [G]-marked and [FoC]-marked constituents, we will see additional advantages of assuming that prosodic constituency at word level and above forms part of the input phonological representation PI.
The following subsections 6.3 and 6.4 are devoted to showing that the presence of prosodic constituency in the phonological output representation PO makes it possible to offer a typologically well-motivated account for the distribution of default tone and default prominence in the all-new sentences of Standard American and British English.

Default tones in all-new sentences
In standard American and British varieties of English, there are no segmental morphemes that include lexical tone. But there is a rich array of surface tones and tone combinations ('tunes') in non-declarative and pragmatically non-neutral sentences (Ladd 1980;Pierrehumbert 1980;Gussenhoven 1983;, Pierrehumbert & Hirschberg 1990Bartels 1999;Truckenbrodt 2012b). This suggests that these phonologically nonpredictable tonal entities are segment-less tonal morphemes which carry distinct pragmatic meanings. No such tonal morphemes appear in the MSOs of the all-new, pragmatically neutral, declarative sentences that we are considering here.
A generalization that has emerged over time is that the tones in the PO representations of pragmatically neutral all-new declarative sentences in Standard American and British English are (i) H tone 'pitch accents', which are associated with the head/prominent syllable of a phrase level prosodic constituent, and (ii) L 'boundary tones', or 'edge tones', which appear on a syllable at the right edge of a prosodic phrase. These factual generalizations come from accumulated informal observations and from experimental investigations (e.g. Schmerling 1976;Gussenhoven 1983;, Selkirk 1984Ladd 1996;, Truckenbrodt 1995Katz & Selkirk 2011). The surface pattern of H and L tone distribution is illustrated in the PO representation of the all-new sentence Sarah mailed the caramels in 25(c), repeated here. The phonological j phrases corresponding to the subject and object noun phrases Sarah and caramels necessarily contain two tones: a high tone H on the head/prominent syllable of the sole prosodic word w in the j, and a low tone L on the final syllable of the j. The transitive verb mailed, which is not a j, predictably carries no L phrase-edge tone in an all-new sentence like (25). It may carry an optional ~H tone, however, discussed below. In many other languages as well, tones which have no lexical or morphemic status appear predictably, 'by default', in PO representations. It is not uncommon for a H tone to be predictably associated with the head syllable of every prosodic word w in any sentence of some language. Cairene Arabic (Hellmuth 2006; offers a particularly well-studied example. It is also not uncommon for the head syllable of a phonological phrase j to be predictably associated with a H tone. An example is modern Irish (Elfner 2012;. Kolkata Bengali is an example of a language where it's a predictable L tone that appears on the head syllable of any j in all-new sentences (Hayes & Lahiri 1991). Several varieties of German also show an L tone on the phrasal head, with a H tone at the right phrase edge (Truckenbrodt 2002). Truckenbrodt (2006; and Ladd (1996; argue that the presence of accent tone is predicted by the distribution of phrasal prominence ('stress').
In an optimality theoretic phonology the appearance of predictable elements in the phonological output is driven by phonological markedness constraints (Prince & Smolensky 1993;McCarthy & Prince 1995;. Markedness constraints collectively define the 'ideal', unmarked, PO form of a sentence. Faithfulness constraints, on the other hand, call for identity between PI and PO representations. They ensure that the distinctive lexical (meaning-tied) phonological properties of the PI are retrievable from the PO representation. In OT, it is a language-particular ranking of universal phonological markedness and faithfulness constraints which determines whether and under what prosodic conditions a predictable, 'default', tone will be inserted (epenthesized) on a syllable of the PO representation.
The widely attested predictable appearance of tone on a prosodically prominent syllable is evidence that the universal set of phonological markedness constraints on PO representations must include constraints which call for a prosodic head prominence to be associated to some tone (deLacy 2002;Yip 2002;Elordieta & Selkirk 2018). A phonological markedness constraint like Head(j):Tone in (29) below would have responsibility for the predictable epenthesis of the tone commonly referred to as a H 'pitch accent' associated with phrasal head/prominence in all-new sentences in English. In languages like English, Head(j):Tone would be higher ranked than the tonal inputoutput faithfulness constraint that would rule out any presence of a tone in the output that is not already present in the input (Myers 1997).

(29)
Head-of-j-has-Tone [Head(j):Tone] The head syllable (s´) of a phonological phrase (j) must be associated with tone.
The head syllable of a phonological phrase is the head syllable of the head foot of the head prosodic word of a phonological phrase.
Preference for it to be a H tone that is associated with a prosodic head, whether alone (as in English H) or in combination with a L tone (as in Tokyo Japanese HL), could be independently specified by further phonological markedness constraints (de Lacy 2002). 31 Additional evidence for the existence of a universal phonological markedness constraint like Head(j):Tone comes from other, very different, types of tonal phenomena. In the Bantu languages Digo (Kisseberth 1984) and Giryama (Volk 2011: 17), a lexical H tone associated with a morpheme in a nonfinal word of a j in PI is shifted rightwards in PO to the penultimate syllable of the final word of that j. In that j, the phrasal penult syllable is plausibly the head syllable s´ of the j (Hyman 2011; 2019). The tone shift thus allows for the satisfaction of Head(j):Tone in PO. In the so-called pitch accent language Lekeitio Basque, where morphemes are lexically distinguished in PI by whether they are 'accented' (that is, tone-bearing) or not, any lexical accent tone(s) of morpheme(s) within a word must be realized in PO on the (plausibly stressed) penultimate syllable of that word and nowhere else in the word (Hualde et al. 1994;Elordieta & Selkirk 2018).
The epenthesis of a H tone onto the head syllable of j in Standard American and British English, the shift of lexical H tone to the head syllable of j in Digo and Giryama, and the shift of a lexical H tone to the head syllable of a prosodic word w in Lekeitio Basque are all a matter of the phonology per se. We propose to see them as language-particular responses to the need to satisfy universal phonological markedness constraints like Head(j):Tone or Head(w):Tone. In the phonological constraint ranking of languages like Digo or Lekeitio Basque, where tone spreads or shifts instead of being epenthesized, any faithfulness constraint(s) calling for the toned or toneless status of a tone-bearing unit in PI to be the same in PO must then be lower ranked than a markedness constraint like Head(j):Tone or Head(w):Tone (see Myers 1997).
Another family of phonological markedness constraints that plays a role in determining the default surface distribution of tone in Standard American and British English involves the edges, or boundaries, of prosodic constituents. The sentence in (25), with its predictable L tones at the right edge of the phonological phrases (Sarah) and (mailed 31 In some varieties of English, a predictable, epenthetic, L accent tone before the morphemic H edge tone appears in a yes-no question. Arguably the Obligatory Contour Principle (OCP) rules out sequences of identical tones within a j in these varieties. The lexical H quality of the morphemic edge tone is preserved by a tonal faithfulness constraint while satisfying Head(j):Tone together with the OCP results in a L accent tone that is epenthesized in this case. See Hayes & Lahiri (1991) and Selkirk (2007) on the role of the OCP in determining the quality of predictable tonal accents in Bengali.

Right-Edge-of-j-has-Tone [R-Edge(j):Tone]
The syllable at the right edge of a j must be associated with some tone. 33 It is very common crosslinguistically for there to be a predictable appearance of some tone or tonal complex at the right or left edge of a phrase in the surface prosodic structure representation of a sentence (see e.g. Beckman & Pierrehumbert 1986;Gussenhoven 2004; the papers in Jun 2005; 2014; Féry 2017). The phonological markedness constraint (30) can be seen as providing the pressure for the predictable insertion/epenthesis of a default edge tone at the right edge of a j in Standard American and British English. Looking beyond languages like English, which lack lexical tone, (30) could also be seen as responsible for the special variety of rightward H tone spread in Copperbelt Bemba reported in Kula & Bickmore (2015). There a lexical H tone spreads rightward from its underlying position in PI through a sequence of toneless syllables within a j in PO, but only if it can reach a toneless syllable that is in final position in that j. There is no spreading at all from its underlying source position if that H tone can't reach the right edge of j -due to the presence of a lexical H tone in j-final position, for example. This suggests that this spreading is the consequence of the requirement expressed in the constraint (30) that in PO there be some tone on the syllable at the right edge of a j. If the demand is already met by the presence of an underlying tone associated with the final syllable of a j in PI, then there's no pressure for a tone located earlier in the j in PI to spread from the left in order to satisfy R-Edge(j):Tone in PO. Finally, there's a third type of predictable, default, epenthetic tone in all-new sentences of Standard American and British English. Recall that a H tone may optionally appear on the verb mailed preceding the necessarily pitch-accented direct object in 25(c). It has already been observed (e.g. Gussenhoven 1992;Ladd 1996;) that in Standard American and British English additional accent tones may optionally appear preceding, but not following, the 'nuclear accent' that is associated with the head/'main prominence' of a j. We should thus entertain the possibility that these optional tones in Standard English represent a third category of epenthetic tone, introduced by a constraint with the properties of (31).

(31)
Head-at-left-edge-of-j-has-tone [Hd@L(j):T] The closest prosodic head to the left edge of a j is associated with some tone.
As Gussenhoven (2004: 285) points out, in a recursive j structure like the one that characterizes (mailed the (caramels)j )j in (25), where the verb mailed lies at the left edge of a j, its first (and only) stressed syllable could -but only optionally -receive an 'edge-accent' H tone. In its optionality and in its restriction to a left-j-edge word in Standard American and British English, 34 this epenthetic edge accent tone is distinct from the nuclear accent tone that appears on the head syllable of a j.
32 Barnes et al. (2010) point to some variability in the precise phonetic alignment position of the phrasal rightedge L boundary tone in Standard American English. 33 The quality (L vs. H) of the edge tone is not specified in (30); it is sometimes contextually predictable on the basis of the quality of the preceding pitch accent, or the tonal quality of the pitch accent may determine the quality of the edge tone, depending on the language. 34 A promising approach to the characterization of optionality or variation in the pitch accent epenthesis due to the positionally-sensitive phonological markedness constraint (31) is seen in the optimality theoretic accounts by A. Kaplan (2016) of variation in flapping in standard American English and schwa deletion in standard French.
There is crosslinguistic motivation for the sort of edge-accent called for in (31). A simultaneous appeal to a righthand phrase edge and a lower-level prosodic prominence nearest to that edge characterizes the sort of post-nuclear 'phrase accents' which Grice, Ladd & Arvaniti (2000) document for various languages of Eastern Europe. Moreover, Elfner (2012; shows that in Connemara Irish, along with the H tone that always appears on the prosodic head of the final word in any j, there is a LH tonal complex that obligatorily appears on the head syllable of the w lying at the left edge of a nonminimal j. 35 Summing up, the distribution of the tones of all-new, pragmatically neutral, declarative sentences in Standard English is characterizable via crosslinguistically supported phonological markedness constraints targeting the relation between tone and prosodic structure in PO representations. This phonological markedness approach meshes well with the recent proposal of Torreira & Grice (2018) that metrical/prosodic structure plays a role in determining the association properties of 'intonational tones' in languages without lexical tone.

Prosodic head prominence
The PO representations 25(c) and Figure 1 (the tree version is repeated below) contain representations of prosodic headedness or prominence.
The formal status of a prosodic constituent as a prosodic head or prominence of a higher prosodic constituent is abstract, but it comes with a broad variety of phonological and/or phonetic consequences, depending on the language. As we've just seen, in all-new, pragmatically neutral, declarative sentences in Standard American and British English, a phonological consequence of prosodic headedness is the appearance of a predictable H tone on the head syllable of a phonological phrase. Moreover, in Standard American and British English, the prosodic head status of a syllable within a foot (f) or a prosodic word (w) − with or without accent tone − is interpreted by the phonetics with degrees of duration reflecting its hypothesized degrees of abstract prominence (see e.g. Sugahara 2012). Duration patterns reflecting higher levels of prominence are seen as well in the Katz & Selkirk (2011) data on phrasal prominence and FoCus, which we will come back to in section 7. 36 By comparison, Standard Japanese, for example, shows no such durational effect on syllables with accent tone (Beckman 1982;, even though tonal pitch accent is arguably associated with prosodic headedness or prominence in Standard Japanese (Ito & Mester 2016;Elordieta & Selkirk 2018). Both the predictable epenthesis/insertion of prominence-dependent accent tones in Standard British and American English and the phonetic interpretation of prominence as duration (and other phonetic properties) require us to assume that prosodic head prominence is represented in the PO representation of the phonology. The question is how this abstract prosodic prominence becomes a property of the PO of all-new sentences.
We would expect that predictable prosodic head prominence ('stress') is only represented in the PO representation of all-new sentences of Standard British and American English, and this is reflected in the MSO-PI-PO triple for (25) introduced earlier.

mèls)w′ )j′ )j )i H L~H H L
A purely phonological, entirely general, characterization of prosodic headedness that applies to any prosodic constituent that dominates another prosodic constituent is definable for PO representations. Just as a foot has a unique most prominent head (a 'stressed syllable'), so a phonological phrase (j) has a unique prominent head w (which contains the 'nuclear stressed' syllable of the j), and so on. Head-Prominence-in-j in (32) is one example of a relevant constraint.
HeadProminence-in-j belongs to a larger family of phonological markedness constraints that call for a head/prominence within prosodic constituents of all types that dominate another prosodic constituent, be they f, w, j, or i. It has been recently argued that phonological constraints that assign prosodic headedness are violable (Ito & Mester 2016;Elordieta & Selkirk 2018). This violability supports the notion that headedness is assigned by a phonological markedness constraint and is not part of the necessary 'hardware' of prosodic constituents. 37 Truckenbrodt (1995;) makes a case for the descriptive generalization behind HeadProminence-in-j in Standard American and British English. The generalization is that, by default, every syntactic phrase of a sentence contains a stress/prominence associated with a pitch accent. For Truckenbrodt, the generalization is accounted for via a syntax-phonology interface constraint, Stress XP, that directly links syntactic representations to phonological output representations: Each XP of a sentence corresponds to a phonological phrase containing stress (Truckenbrodt 2006). But Stress XP is superfluous in a theory of prosodic constituent structure representation like the one assumed here, in which the Phrase structure of MSO is spelled out by MatchPhrase as the (potentially recursive) j structure of prosodic structure representation. Our HeadProminence-in-j is an instance of a general set of markedness constraints calling for head prominence that hold of all levels of prosodic structure, including the purely phonological foot. In other words, Truckenbrodt's generalization can now be captured via an independently needed account of the morphosyntactic-prosodic constituency relation and a general theory of the distribution of prosodic prominence within prosodic constituents of the various types f, w, j, or i.
The question still remains why it is that the head prominence of the j corresponding to the VP in (25(c))/ Figure 1 is the j' corresponding to the object rather than the w corresponding to the verb. This is a general pattern. As Cinque (1993) and Truckenbrodt (1995; have pointed out, in German transitive sentences the verb (a word) will never have greater prominence than its sister phrase, regardless of the linear order in which they appear. Kahnemuyipour (2009) gives crosslinguistic support for this generalization. This generalization suggests an additional phonological markedness constraint determining which one of multiple daughters of a prosodic category node has the status of head. (33)

HeavySisterProminence [HvSisProm]
When sisters within a prosodic constituent are of unequal prosodic category, the heavy sister is the head. The heavy sister is the constituent whose category is higher in the prosodic hierarchy.
HeavySisterProminence imposes head status for the object phrase (caramels)j´ within the j corresponding to the VP in (25). Importantly, HvSisProm makes the general prediction that the higher, 'heavier', prosodic category type will be the head: in a prosodic word consisting of a foot and a syllable, it calls on the foot to be the head; in a recursive w structure where a w dominates another w and a clitic s, the sister w would be the head, and so on, regardless of order.
Summarizing this section as a whole, we have seen that the distribution of phrasal head prominences and of accent and edge tones in the phonological output representations of all-new pragmatically neutral sentences in Standard American and British English is phonologically predictable on the basis of the morphosyntactic output structure (MSO) of a sentence. A system of independently motivated interface Match constraints yielding prosodic constituency in PI and a system of prosodic-structure-sensitive markedness constraints in the phonology have key roles in explaining the presence and location of prominence and tone properties in PO. This means that the phonological system of Standard American and British English generates the output prosodic properties of all-new sentences from their morphosyntactic output representations without any appeal to newness, or to any other information structure notion for that matter.

Spelling out [FoC] and [G] in Standard American and British English
It's generally recognized that focus and givenness in Standard American and British English are phonologically realized in terms of prominence, or the lack of it. Section 2 established that both [FoC] and [G] are syntactically well-motivated features. Crosslinguistically, they can trigger displacement and agreement or license ellipsis. As syntactically represented features, the two features should also be able to influence the mapping from syntactic structure to phonological representation at the syntax-phonology interface. However, beyond the syntax-phonology interface, we would expect the phonology, and the phonology alone, to determine the final output. In this section we present an account of English prosody that conforms to those expectations within the MSO-PI-PO model of sentence phonology introduced in the previous section.

Spelling out [FoC]
Our account honors the longstanding insight that in Standard American and British English the focus of a sentence is the locus of greatest prominence, where this prominence has been defined as the main (nuclear) stress of a sentence (Jackendoff 1972), as the constituent that is dominated by an unbroken chain of strong (S-labelled) nodes in the metrical tree mirroring the syntactic structure of a sentence (Ladd 1980, and many others), or as the prosodic constituent that is more prominent than any other within the domain of a focus (Truckenbrodt 1995).
Within our framework of assumptions, the prosodic impact of [FoC] in Standard American and British English would have to be a spellout constraint affecting the mapping from MSO to PI and it would be expected to be a constraint on prosodic structure. We propose that a [FoC]-marked constituent at MSO is spelled out as an i-level head in PI. 38

(34)
[FoC]=i-Level-Head A [FoC]-marked constituent of MSO is spelled out as an i-level head in PI. An i-level head is a link in a chain of prosodic heads that ends in the prosodic head of i.
The proposal in (34) presumes that a sentence (Clause) constituent in MSO is spelled out as an i by MatchClause, one in the family of constituency spellout constraints that relates MSO and PI.
(35) MatchClause For every instance of a Clause of MSO there is exactly one instance of an intonational phrase i that spells it out phonologically in PI.
Since a prosodic constituent like i can have at most one prosodic head, it follows from (34) that a [FoC]-marked constituent will be more prominent than any other in the same i. By way of illustration, consider the case of subject FoCus in Figure 2. The non-FoCus constituents in this example are new in the discourse, not Given. The sentence in Figure 2 could be spoken in a context where mailed the caramels is not Given in the current discourse, hence not [G]-marked, and where Sarah's having mailed the caramels is being contrasted with possible alternatives like Ewan's or Allie's having done so. 39 In Figure 2 above, i-level heads are indicated by an arrow and are marked in bold. Figure 2(b) only includes the head prominence that spells out [FoC] in the MSO Figure 2(a). This prominence has to be maintained in the PO Figure 2 tive within our framework of assumptions. Prosodic i-structures would have to be stipulated to represent FoCus domains, that is, they would have to be stipulated to have representational properties that serve the exclusive needs of languages that realize FoCus prosodically. 39 For graphic convenience, the MSO representations of the sentence Sarah mailed the caramels in this section will not include any functional projections. The DP dominating the is therefore absent. 40 To ensure the appearance in PO of the i-level heads that spell out [FoC] in PI, a prosodic faithfulness constraint requiring i-level heads of PI to correspond to i-level heads of PO must be higher ranked than any other constraints which might weigh against the presence of those i-level heads in PO. The prosodic properties of the PO representation of a sentence containing a [FoC]marked constituent are thus only partly determined by the greater prominence of the [FoC]-marked constituent. In sentences where a [FoC]-marked phrase and phrases that are merely new cohabit, the default phonology supplies j-level prominence for those merely new constituents and for the [FoC]-marked constituent as well. The result is that in the PO representation, both [FoC]-marked and merely new constituents carry the tonal pitch accents supplied by the phonology to the prominent w' head of a j. What distinguishes [FoC]-marked from merely new, unmarked, constituents is the status of their corresponding j as an i-level head prominence in PI. That prominence in PO is inherited from PI.
The necessity of making a distinction between FoCus and merely new, and the possibility of combining them in the same sentence, has largely gone unrecognized in the previous literature on the phonology and phonetics of sentences containing a FoCus. 41 Most studies have examined sentences where a FoCus is preceded and/or followed by Given constituents, which lack both j-level prominence and pitch accenting. But, as pointed out in Katz & Selkirk (2011), it is only by investigating combinations of FoCused and merely new constituents in the same sentence that it can be established that it is the higher prominence, or 'stress', of a FoCused constituent that is grammatically distinctive. 42 Katz & Selkirk (2011) Drawing on a rich literature, Katz & Selkirk (2011) discuss the pitch scaling effects in triples like those in (36). In 36(a) the considerable downstepping (⇓) between the FoCused direct object, and the following merely new phrase testifies to a higher level of prosodic prominence for the FoCused constituent (Gussenhoven et al. 1997;Truckenbrodt 2004 Kügler & Féry (2017) for German, and Ishihara (2007; for Japanese. Post-FoCus 'destressing', 'deaccenting', or 'dephrasing' could be a language-particular phonological effect in some languages. In Standard American and British English, it is a consequence of the [G]-marking of a constituent. New-FoCus case 36(b), the lack of downstep, or presence of an optional small upstep (↑), again implies greater prominence for the FoCused phrase (Ladd 1996(Ladd /2008Gussenhoven 2004). An overall, non-prominence-sensitive, default pitch downtrend which is present in any sentence (van den Berg et al. 1992;Truckenbrodt 2004) explains the lesser pitch height of the FoCus. In the all-new case 36(a), by comparison, the significantly different, but small degree of default downstepping (↓) between the two post-verbal phrases is consistent with a PO representation where neither j carries i-level prominence, suggesting that, in the default case of all-new sentences, an i-level head prominence is not defined. This suggestion is in line with the conclusion by Ladd (2008: 257) that "there is no phonetic justification for treating the last accent of a sentence [of an all-new sentence] as more prominent." Frota (2000) reports a similar finding for European Portuguese.
We assume here that the presence of head prominence in a prosodic constituent of PO is called for by phonological markedness constraints and so may be found in constituents at any level of the prosodic hierarchy. This means that the apparent lack of i-level head prominence for j daughters of i in all-new sentences like 36(c) requires an explanation. Our suggestion is that, in the phonological constraint ranking of English, the markedness constraint HeadProminence-in-i, which calls for head prominence in i, is subordinated to a prosodic faithfulness constraint that prohibits the presence in PO of any i-level head prominence not present in PI. As a consequence, only i-level heads required by the spellout of [FoC]-marking in PI would be present in PO. The absence of any i-level 'nuclear stress' on the rightmost j in the all-new 36(c) would thereby be explained.
Summarizing, it is just FoCused constituents, rather than merely new ones, that are given distinctive phonetic salience in Standard American and British English. On our account, this can be ascribed to (i) their being spelled out as i-level heads in PI, in accordance with the spellout constraint [FoC]=i-Level-Head, and (ii) their corresponding i-level head-prominence in PO, faithfully inherited from PI.
Sentences with multiple [FoC]-marked constituents present a challenge for the proposal that a [FoC]-marked constituent of MSO corresponds to a i-level-head in PI. Assuming an inviolable constraint saying that a prosodic constituent can have at most one head, only one daughter of an i can be the head of that i. The two conjuncts in (37) illustrate the problem: each conjunct sentence in (37) The spellout constraint FoCus=i-Level-Head requires that each [FoC]-marked constituent be an i-level-head in PI, but this would be impossible in Dual FoCus cases like (37), unless each conjunct was broken up into two i in PI. To our ear, there seems to be no phonetic 'disjunction' between the two js in the conjuncts of (37), however. It looks like there is just a single i. Moreover, it does not seem, to our ear, that either of the two FoCi is systematically more prominent than the other. If future experimental investigation confirms our impression, we could assume that FoCus=i-Level Head goes unsatisfied with one of the two [FoC]-marked constituents, and we will need to search for an explanation for this. The tonal facts of (37) suggest nonetheless that at this point more can be said about multiple FoCus cases. In (37) all four [FoC]-marked constituents (the v and np of each conjoined sentence) have an obligatory H accent tone, indicating that both verbs and both nouns bear at least j-level prominence. Moreover, each of the four [FoC]-marked constituents in (37) shows the characteristic L edge tone indicating j-final position in PO. Each [FoC]-marked verb, then, should correspond to a headed j in PO, just like each [FoC]-marked object phrase. This is a significant fact, since, given the status of verbs as mere Words in MSO, we would expect them to have mere w status in PI. The 'promotion' of the verbs to j status in double FoCus cases like (37) must therefore be an effect of their being [FoC]-marked. This effect could be due to an additional spellout constraint for [FoC] that would apply at the j level, namely [FoC]=j-Level-Head. For [FoC]-marked verbs in MSO to satisfy [FoC]=j-Level-Head in PI, their j-level head status would have to be introduced in PI. That is, in PI the [FoC]-marked verb would both have the status of a j and be the prominent w´ that is the head of that j . And, assuming that the js and heads of js of PI are faithfully inherited as corresponding js and heads of js in PO, their H accent tone and the L edge tone to their right in PO would be explained.
Summarizing, multiple FoCus cases like (37) suggest that the spellout of [FoC]-marking is not limited to [FoC]=i-Level Head.
[FoC]=i-Level Head seems to be just one of a family of Syntax-Phonology interface constraints that also includes [FoC]=j-Level Head, and even [FoC]=w-Level Head, as we will argue in 7.3. In general, we are proposing that languageparticular choices of spellout constraints for the phonological expression of [FoC]-marking determine the input phonological representation for the phonology per se, where languageparticular rankings of relevant prosodic markedness and faithfulness constraints determine the character of the PO representation which is submitted to phonetic interpretation.
A comparison to the spellout, phonology, and phonetics of focus in Mandarin Chinese discussed in Kabagema-Bilan et al. (2011) supports this approach to the spellout of FoCus in terms of head prominence and to its phonological and phonetic interpretation. Their study targets the prosodic properties of sentences with multiple focus constituents in Mandarin Chinese. Unlike English, in Mandarin Chinese, the tone borne by a word in PO is an idiosyncratic lexical property; it is not present as a reflex of head prominence at either word or phrase level. Xu (1999) found that in all-new subject-verb-object sentences containing a sequence of words with only lexical H tones the F 0 contour is absolutely flat, with no pitch protrusion anywhere. By contrast if the same type of sentence has a focused constituent, it shows a significant F 0 peak in relation to the surrounding H-tone words. Kabagema-Bilan et al. (2011) go on to examine H-tone-word sentences containing two focused constituents. They find that both of the focus constituents show a significant F 0 peak, and that the rightmost of the two peaks is significantly higher than the first. This result is interpreted as showing (i) that each of the two focus constituents must carry prosodic prominence/stress at the phonological phrase (j) level and (ii) that i-level prominence is placed on the rightmost of the two constituents bearing j-level prominence.

Spelling out [G]
The prosody of Given constituents has been described as absence of prominence or 'stress', whether in the metrical structure formulation of 'stress shift away from given' of Ladd (1980), or in the Destress Given constraint of Féry & Samek-Lodovici (2006). Positing Destress Given as the constraint that spells out [G]-marking in PI would not be implementable in our approach, however, because no distinction could be made between a [G]-marked and a merely new constituent in PI. Neither [G]-marked nor merely new constituents would have j-level prominence in PI, but both would get j-level prominence by default in PO. Destress Given is not an option, then. We propose instead that a [G]-marked constituent in MSO is spelled out with a lack of j status in PI. Lack of prominence would follow from the lack of j in PO, inherited from PI. We might dub this spellout constraint for [G]-marking in Standard American and British English 'DephraseGiven'. (38) [G]=No-j (DephraseGiven) A [G]-marked constituent in MSO corresponds to a prosodic constituent in PI which is not a j and contains no j.
(38) has further empirical consequences. Recall that the j status of a PO constituent is responsible for the presence of an obligatory H accent tone and for the presence of the L edge tone at the right edge of j in Standard American and British English. If a [G]-marked constituent lacks j status in PI, then, the corresponding constituent in PO should lack both the H accent tone and the L edge tone. These predictions are borne out. The three sentences in (39) show contrasts in the distribution of tone, even though they differ only with respect to [G]-marking: Sarah would be Given in that context. Assuming that there has been no mention of mailing the caramels in that particular discourse (even though the issue is a familiar one), the VP mailed the caramels would count as new. Ladd (2008: 300 ff.) discusses the tonal properties of Given subject phrases and their implications for a prosodic structure analysis. Here is how his observations might apply to 39(b) when adapted to our framework of assumptions. First, there is no obligatory H accent tone on the [G]-marked subject. This indicates absence of phrasal stress, hence absence of head/prominence of j, hence absence of a head w´ (marked with an arrow in the tree Figure 3 below). Second, there is no L edge tone at the right edge of the [G]-marked subject. This indicates the absence of j constituent status. The optional ~H at the left edge of the sentence as a whole must be an edge accent, coinciding here with the left edge of i. 44 In sum, then, the 'dephrasing' induced by [G] explains the particular prosodic properties of Sarah in 39(b), which is given in tree form in Figure 3. 44 Since there is no left edge of j preceding the sentence-initial [G]-marked subject, it must be that a markedness constraint analogous to the edge-tone constraint (30) calls for the presence of a tone at the left edge of an intonational phrase i. Our proposal that a [G]-marked phrase of MSO is 'dephrased' in PI predicts the correct prosodic properties of the Given direct object phrase in 39(c) as well. The PO representation of 39(c) is given in Figure 4.
In Figure 4, the absence of the H accent tone on the Given object phrase caramels reflects the lack of j-level head/prominence. Since the direct object does not correspond to a j in PO, the lack of any optional ~H accent at the left edge of the direct object is expected, too. The right-edge L tone present in Figure 4 has typically been analyzed as a clause-level tonal morpheme (Pierrehumbert & Hirschberg 1991;Bartels 1997).
DephraseGiven also lays the groundwork for explaining the appearance of the obligatory H accent tone on the VP-initial, merely new, verbal w' sister of the w corresponding to the [G]-marked object in the PO in Figure 4. This apparent 'stress shift' to the verb has been much discussed since Ladd (1980). Since the [G]-marked direct object in Figure 4 corresponds to an w in PI that is preceded by another w (the one corresponding to the verb), the locus of head prominence within the j corresponding to the VP would have to be determined by a 'head assigning' prosodic markedness constraint. The HeavySisterProminence constraint (33) is inapplicable here. A 'head assigning' constraint would come into play when a node in prosodic structure only has same-level daughters. Depending on the language and the level of prosodic constituency at issue, head prominence may then be 'assigned' leftmost or rightmost (Prince 1983;Hayes 1995;Kager 2001;McCarthy 2003). For example, in Standard American and British English, the head prominence of a foot is the leftmost syllable, the head prominence of a w is the rightmost foot, the head prominence of a simple compound w falls on the leftmost w. We propose, then, that the apparent 'stress shift' to the verb in VPs with a Given direct object, is the result of a phonological markedness constraint like that in (40)  An optimality theoretic constraint-ranking-based approach to the MSO-PI interface module of grammar also allows for an account of the phonological properties of [FoC]-marked constituents that are contained within [G]-marked constituents, to be discussed in the next subsection.

Second Occurrence FoCus: [FoC]-marking within [G]-marking
A Second Occurrence FoCus (SOF) is a FoCused constituent contained within a Given constituent. SOF constructions provide a window into the interaction of [FoC] and [G], and are thus ideal testing grounds for our account of the phonology of the two features. The phonological puzzle posed by SOFs is that SOFs do not display the obligatory H accent tone expected of a [FoC]-marked constituent in Standard American and British English. The small discourse in (42) illustrates. The example is from Beaver et al. (2007: 256), which experimentally examined the phonological and phonetic properties of SOF sentences. (42)  marking seems to be responsible for the requirement that a SOF contain at least a w-level prosodic head, namely a head f´. This requirement can be seen at work in constructions where the SOF is a pronoun. As first observed by San Tunstall and reported in von Fintel (1994), when an SOF is a pronoun, it must appear in its strong form, not in its weak form in English. For him, for example, the weak form is the footless, 'stressless', syllabic nasal m. That weak form could not occupy the place of the SOF [Sid] FoC in the MSO 42(b). The strong form him would be required. We conclude that the spellout of [FoC] must be responsible for the realization of SOF pronouns as w-level heads, that is, with the status of a head foot of a w. Due to MatchWord, nouns like Sid should have the status of w, and the sole syllable of Sid would thus be the head of the foot that is the head of that w. 45 It seems, then, that the syntax-phonology interface module of Standard American and British English contains the spellout constraint [FoC]= w-Level-Head, in addition to [FoC] FoC] in Standard American and British English that affect the prosodic constituency of the counterparts of [G]-marked phrases and the prominence level of the counterparts of [FoC]-marked phrases in the input phonological representation. We showed that in interaction with the proposed constraints of the default phonology and their ranking, our proposals for [FoC] and [G] explain well-known and much-discussed observations like the destressing and dephrasing of Given phrases and the lack of a pitch accent for a Second Occurrence FoCus. Most importantly, though, we showed that our account also derives the Katz & Selkirk facts relating to combinations of FoCus and merely new, which have mostly gone unnoticed or unexplained in the literature.

The meaning and distribution of [FoC] and [G]
This section will track the effects of [FoC] and [G] at the syntax-semantics/pragmatics interface and show that the two features are responsible for two distinct pressures for marking discourse coherence in Standard American and British English. The existence of two distinct pressures telling discourse participants to signal both matches and contrasts in the ongoing discourse is obscured by the unifying F(ocus) approaches of Rooth and Schwarzschild, as well as by the various featureless approaches to information structure.
As in section 3, we will rely on the Alternatives Semantics of Rooth (1992a;. Since the discourse requirements triggered by [FoC] and [G] depend on the linguistic and non-linguistic discourse context, we also need some representation of context. We are not assuming any particular representations of discourse contexts here, as long as they provide an updatable record of available discourse referents within a relevant window. Recall that in Alternatives Semantics, expressions are assigned two semantic values: O-values (ordinary meanings) and A-values (sets of alternatives). The direct contribution of [FoC] is to introduce alternatives as A-values, as stated in (45). The [G]-feature places a Givenness requirement on the discourse context, which amounts to establishing an anaphoric relationship with a preceding discourse referent.

(46)
Givenness An expression α is Given with respect to an individual, property, or proposition in C iff ⟦α⟧ A, C = { }.
For an expression α to have a chance to be Given with respect to some discourse referent according to (46), it has to have a singleton alternatives set. For this to be possible, it can't contain any [FoC]-marked constituent that has generated alternatives that have not been blocked from propagating ('consumed') by some alternatives-evaluating operator within α. This property of Givenness has important consequences for the analysis of Second Occurrence FoCus (SOF), as we will see shortly.
To implement the Givenness requirement attached to [G], we are assuming that, technically, [G] is indexed with a contextually salient discourse referent, a point we have neglected up to now in our representations, and will continue to neglect for convenience when not relevant. We have then: 46 (47)

The contribution of [G]
O-values ⟦ [α] G ⟧ O, C is defined iff is a discourse referent in the window preceding C, and α is Given with respect to . 46 If defined, The [G]-feature introduces a Givenness requirement that has to be satisfied by the utterance context, regardless of how deeply embedded the feature may be. [G] does not contribute anything to the truth-conditional content of the expressions it attaches to, nor does it affect the computation of alternatives. Its contribution is purely use-conditional or expressive in the sense of Kaplan (1999), Kratzer (1999;, Potts (2005), McCready (2010), or Gutzmann (2015;. In that respect, [G] resembles discourse particles like German ja or doch, as suggested in Kratzer (2004). However, [G] also differs from discourse particles and other expressive meanings discussed in the literature in placing a much simpler condition on the discourse context. The appropriateness of [G] does not depend on the mutual beliefs or expectations of discourse participants. All [G] cares about is whether something has been said before or is otherwise contextually salient. We chose to implement the contribution of [G] as a contextual presupposition (Schlenker 2007), but we remain open to the possibility that expressive (use-conditional) meanings are a distinguished class of meanings requiring their own architecture, as argued in Potts (2005) and Gutzmann (2015;. Comparing (45) and (47), a crucial difference between [G] and [FoC] becomes apparent.
[G] directly imposes a discourse requirement related to Givenness. [FoC] all by itself does not trigger any discourse requirement, it merely introduces alternatives that can then be used by alternatives-evaluating operators (see Beck 2016 for an overview). Following Rooth (1992a), we are assuming that the contrast requirement that comes with [FoC]marking is introduced by a separate operator ~ (the 'squiggle') marking the scope of the FoCus. The ~ operator can have a detectable presence in syntax. For example, we saw in section 2 that Sinhala has verbal inflection marking the scope of a possibly distant FoCused constituent, suggesting a configuration similar to wh-constructions.
[FoC]-marked constituents must be c-commanded by a ~ operator. 47 This requirement might be enforced in the syntax via a feature agreement relationship between an interpretable occurrence of [FoC]  Like [G], the meaning contribution of the ~ operator is expressive (use-conditional). Also like [G], the ~ operator establishes an anaphoric relation to a discourse referent. But with the ~operator, the relation has to be one of contrast. In our example, the proposition that Sarah mailed the caramels is an obvious propositional antecedent. That proposition is distinct from the proposition that Eliza mailed the caramels and is also among the alternatives determined by the scope of the ~ operator. In this way your reply in (48) represents a contrast with what I said before. (49) defines the notion of contrast representation that we are after. (49)

Contrast representation
An expression α represents a contrast with a discourse referent (individual, property, proposition) in context C just in case conditions (i) to (iii) are satisfied: (iii) There is no FoC/G-variant β of α such that ⟦β⟧ A, C ⊂ ⟦α⟧ A, C and ∈ ⟦β⟧ A, C .
Conditions (i) and (ii) say that a contrasting discourse referent for an expression α has to be in the alternatives set for α and it can't have the same meaning (ordinary denotation) as α. 49 Condition (iii) is there to prevent overFoCusing. Its formulation is adapted from unpublished work by Roger Schwarzschild (Schwarzschild 1993, reported in Truckenbrodt 1995. Without condition (iii), (49) would incorrectly predict that 50(b) below represents a contrast with the proposition expressed by 50 (  It's too easy for those representations to satisfy the contrast requirement for FoCus, since the alternatives sets generated for such [FoC]-marking configurations are completely unrestricted: they contain any entity whatsoever of the relevant type. Such alternatives sets are trivial in the sense that they coincide with a semantic domain D t for some type t. Definition (49) doesn't make 52(a) to (c) ineligible for representing contrasts with some discourse antecedent. We think that this is right, since examples like 52(a), for example, might easily be uttered by a grade school teacher giving dictation. However, as we will discuss shortly, there is pressure in Standard American and British English to represent contrasts with discourse referents from the preceding context. That pressure does not seem to extend to trivial contrasts. While there is nothing ungrammatical about representing trivial contrasts on our account, then, there is no pragmatic pressure to do so. With a workable condition for contrast representation in place, a meaning definition for the ~operator can be stated as in (53).

(53)
The ~ operator O-values ⟦~ α ⟧ O, C is only defined, if is a discourse referent in the discourse window surrounding C, and α represents a contrast with . If defined, Rather than merely representing a contrast with a single discourse antecedent , as assumed in (53), an expression α might also represent a contrast with each member of a set of contextually salient discourse antecedents ℭ. A relevant example was the Katz & Selkirk example from section 4. There, we were told about an art dealer, Gary, who doesn't do business with the Guggenheim or the Metropolitan, but would only offer his Modigliani painting to MoMa. In this mini-discourse, the proposition that Gary would offer his Modigliani to MoMa is contrasted with two contextually salient propositions, namely that he would offer the painting to the Guggenheim, and that he would offer it to the Metropolitan. To allow for multiple contrasting antecedents, we could adopt (54) instead of (53).

(54)
The ~ operator (allowing multiple antecedents) O-values ⟦~ℭ α ⟧ O, C is only defined, if ℭ is a set of discourse referents in the discourse window surrounding C, and α represents a contrast with each member of ℭ. If defined, While (54) allows multiple contrastive antecedents, it differs crucially from Rooth (1992a) and Schwarzschild (1999) in having no provision for questions as possible antecedents. As we saw in section 4, questions do not have a special, direct, relation to FoCus. While constituent questions denote sets of propositional alternatives (Hamblin 1973), they do not automatically make those alternatives available as contrasting discourse antecedents without additional contextual support. Unlike Rooth's ~ operator, ours has the effect that [FoC]-marked constituents always signal a contrast.
With Rooth (1992a), (53) and (54) assume that the ~ operator stops the propagation of alternatives and thereby blocks access to those alternatives for higher operators. 51 Association with FoCus-sensitive operators like only or even is then indirect on Rooth's account, and is mediated via two occurrences of a contextual variable (our ℭ), as illustrated in 55(b), which presents the essential pieces of the logical form of 55(a). A simplified version of the semantics of only could now look as follows.
(56) The semantics of only (good enough for now) The semantics for only in (56) has the effect that for a sentence like 55(a) to be true, Amanda has to have passed the bar exam and all other propositions in a contextually salient set of propositions ℭ have to be false. Since there is a second instance of ℭ that comes with the ~ operator adjacent to only (as shown in 55(b)), each proposition in ℭ has to be in the alternatives set 57(b) for 57(a). In this way, only indirectly associates with a FoCus in its scope.
[Amanda] FoC passed the bar exam. b. {Amanda passed the bar exam, Noah passed the bar exam, Eliza passed the bar exam, Leif passed the bar exam …}.
On our account, all the ~ operator itself contributes to the interpretation of 55(a), formalized as 55(b), is an expressive meaning component, that is, a requirement for contrast with one or more salient discourse referents. In this particular example, the contrasting discourse referents might be contextually salient propositions about other candidates who might or might not have passed the bar exam. In contexts where it's not mutually known who those other candidates were, Amanda could simply be contrasted with the 'non-Amandas', whoever they may be. In the place of only in 55(a) and (b), we might also have a silent exhaustivity operator of the kind identified for Hungarian (Szabolcsi 1981;É. Kiss 1998;Horvath 2010 The first instance of the faculty in (60) is [FoC]-marked to convey the intended contrastive interpretation. But then it can't also be [G]-marked. This instance of the faculty is thus expected to show the prosodic effects of FoCus. It has a pitch accent. The second occurrence of the faculty in (60), on the other hand, is a [FoC]-marked phrase contained within a VP that is headed by only next to a ~ operator. 53 The ~ operator consumes the alternatives generated in its scope, and only isn't FoCused, hence doesn't introduce any alternatives of its own. As a consequence, the VP headed by only has a singleton alternatives set. Since the VP also has a match in the previous discourse, it can be [G]-marked. The second occurrence of the faculty in (60) is thus expected to show the prosodic effects of SOF. It has no pitch accent. However, to complete our account of SOF, we still need to reckon with the finding of Beaver et al. (2007) that the duration and intensity of an SOF word is not identical to that of a merely Given word, an issue we had to leave open in the previous section. Since an SOF word is [FoC]-marked, hence is not Given and thus cannot be [G]-marked, we could account for the difference between SOF words and Given words by positing an additional spellout constraint saying that a [G]-marked word cannot be an w. 54 The difference between SOF words and Given words would then be that an SOF word has to be an w, while a Given word can't be.
We can now see that the simple and intuitive idea of a Second Occurrence FoCus as a FoCus contained within a Given constituent, as proposed in Selkirk (2008), is viable, contrary to the assessment of Büring (2015a). The facts can be accounted for by natural assumptions about FoCus, Givenness, and the propagation and consumption of alternatives within Alternatives Semantics.
Our analysis of (60) is not complete yet, though. There are still issues we need to attend to. In the mini-discourse (60), both instances of the faculty are intuitively given. Yet both are [FoC]-marked in (60) We are not out of the woods with example (60), though.
[FoC]-marking of the first instance of the faculty isn't a mere option in (60). It is obligatory. Why is that? It looks like there isn't only pressure for [G]-marking, there is also pressure for [FoC]-marking. To probe into this pressure, we will look at an example of the kind discussed in Williams 53 Note that the embedded ~ operator has no linguistic antecedent. We have to accommodate a contrasting discourse referent, then, like the property of quoting the people who are not members of the faculty, for example. 54 The prosodic structure representations Figure 3 and Figure 4 in section 7 would have to be adjusted accordingly. 55 The formulation in (61) (62) is deviant in the given context, and severely so. According to Williams (1997: 599), the problem with cases like (62) is that "cumulative destressing is not allowed. One cannot destress one phrase and then destress its neighbor, with separate licensing of each." Williams (1997;, Schwarzschild (1999), and Wagner (2005; have accounts that, each in their own way, converge on Williams' diagnosis of what is wrong with cases like (62). Rephrasing the diagnosis in our own words, there appears to be a problem when two sister constituents α and β lack prominence without their mother constituent αβ being Given. In your reply in (62), both doubted and that it would be cold lack prominence, but doubted that it would be cold as a whole is not Given.
Williams' diagnosis doesn't seem quite right yet. 63(a) and (b)  63(a) and (b) are acceptable without prominence on any of the Given constituents, even though Max's getting some of Sally's money or the possibility of his getting a Borsalino hat are not understood to be contextually implied by the respective stretches of preceding discourse. But then 63(a) and (b) have configurations where two sister constituents lack prominence without their mother constituent being Given. What, then, is the difference between the deviant example (62) on the one hand, and the acceptable 63(a) and (b) on the other? Our assessment of the deviance of your reply in (62) is that in Standard American and British English, there is pressure for representing non-trivial contrasts with salient discourse referents from the preceding discourse. 58 (62) is deviant because opportunities for representing non-trivial contrasts have been overlooked. There were two such opportunities, those in (64) 57 We are assuming that [Max some of the money] is a constituent in 63(a). This is compatible with Harley (1995; and other analyses of double object constructions. With Green (1974) and Harley, we assume further that the double object configuration here includes a silent HAVE, so that we have a constituent [Max [HAVE some of the money]]. 58 While English allows contrasts to be represented with discourse referents in the subsequent context, there doesn't seem to be pressure to do so. 63(a) and (b) were constructed so as to not provide opportunities for representing (nontrivial) contrasts with salient discourse referents from the preceding context. Without those opportunities, the pressure for representing contrasts seems off. We propose the principle in (66), which forces [FoC]-marking when there are opportunities for representing non-trivial contrasts. (66)

Pressure for [FoC]-Marking
Represent non-trivial contrasts with salient discourse referents in the preceding discourse.
(66) is a very different kind of principle from (61). (66) faults an expression if it could have represented a (non-trivial) contrast with a salient discourse referent, but missed the opportunity. Unlike (61) (60)) that represents a contrast with a salient discourse referent in the preceding discourse. An important question which we will have to leave for more detailed corpus and experimental investigations is what happens when there are several opportunities for representing non-trivial contrasts that couldn't all be realized simultaneously. (64) vs. (65) present a situation of this kind. In this case, there are two mutually incompatible ways of representing a contrast with the previous discourse, and both options are equally acceptable. But might there also be cases where there is a ranking, as Truckenbrodt (1995) has suggested? We don't know (yet).
A more fundamental question that is now on the table is why there are such principles as Pressure for [FoC]-Marking and Pressure for [G]-Marking to begin with. What is it that makes [G]-marking and [FoC]-marking obligatory under certain discourse conditions in American and British English? It is tempting to hold Heim's (1991) Maximize Presuppositions principle responsible, as proposed in Wagner (2005;. As Heim was well aware, though, such a principle would need to be formulated very carefully. In our case, we would have to explain, for example, why discourse particles like German ja and doch, which only have expressive meanings, too, are not obligatory, even when the presuppositions they place on the discourse context are mutually known to be satisfied. Why should presence vs. absence of [G]-marking or [FoC]-marking be decided by Maximize Presuppositions, while presence vs. absence of a discourse particle would not be? The answer to this question requires an answer to the more general question of what it is that makes two or more expressions competitors with respect to Maximize Presuppositions. This last question is rarely addressed in the extant literature. We thus do not (