1 Rationale for our Experimental Study

We find gender systems in a significant number of languages. Yet evidence of how these systems arise is limited. The best evidence points to sources with a semantic basis, and the semantic justification of the system may be preserved in gender systems or may be partly obscured (Corbett 2013). This fact already prompts the question as to why a potentially useful semantic distinction should not be fully retained. We report on research that is novel in showing an additional, and unlikely potential source for gender, namely possessive classifiers. These seem an improbable source, since they express the changeable relation of the possessor to the possessum. Again, we face the loss of useful information. Furthermore, the change is in progress; we have no tradition of texts for these languages, but we are able to conduct revealing experiments in the field.

We are intrigued by the old questions of where gender systems come from and what they do. These systems are increasingly seen as a part of a larger typological domain, namely nominal classification. In other words, instead of seeing gender and the various types of classifiers as forming a dichotomy (as in Dixon’s 1982 influential account), we see them as differing along several dimensions, with numerous intermediate possibilities (Fedden & Corbett forthcoming). Against this background, an intriguing result was reported in Franjieh (2016), namely a system with certain gender-like properties, in a language (North Ambrym, Vanuatu) previously assumed to have possessive classifiers. Specifically, while possessive classifiers are used to signal the relation of the possessor to the possessum, which is changeable, speakers of North Ambrym are increasingly fixing the classifier, according to the lexical semantics of the noun. Now one of the dimensions (criteria) for analysing nominal classification systems is precisely redundant vs informative, and North Ambrym suggests a dramatic development along this criterion. We wished to investigate this change, both in North Ambrym and potentially in related languages, which have similar but usefully different nominal classification systems.

The interest of the change goes well beyond the specific languages of our study. There is no complete story of the rise of gender; rather we have a patchwork of evidence, which suggests how such systems may arise. Thus Greenberg (1978) pointed to the Daly languages of the Australian Northern Territory, where noun classifiers appear to develop towards a gender system. One of these languages, Ngan’gityemerri, has been studied in detail by Reid (1997). The system is illuminating. There are arguably fifteen gender values, assigned semantically (including male, female, canine, other non-human animates, striking instruments, and two types of spear). Nine genders have bound agreement markers, found on agreement targets, such as adjectives (Reid 1997: 181):

    1. (1)
    1. a-syensyerrgimi
    2. animate-white.rock.wallaby
    1. a=tyentyenmuy
    2. animate=tame
    1. ‘a tame white rock wallaby’ (‘=’ is used for clitics and ‘–‘ for affixes)

Six genders are indicated by optional freeform classifiers (Reid 1997: 177):

    1. (2)
    1. (syiri)
    2. strike
    1. magulfu
    2. cylindrical.fighting.stick
    1. (syiri)
    2. strike
    1. marrgu
    2. new
    1. ‘a new cylindrical fighting stick’

Syiri is used for weapon-like objects that have a striking type of contact. Its first use in (2) is more classifier-like, while its second use approaches an agreement marker. The two types of realization shown in (1) and (2) form a single system: (i) for some gender values there is a noun classifier available (in addition to a marker on the noun and to a bound agreement marker); and (ii) while the use of the classifiers is optional, so too is agreement (Reid 1997: 168). Reid’s fine study provides a window into the rise of gender systems; Ngan’gityemerri shows the development from freeform classifier to bound agreement marker, a process that is ongoing (Reid 1997: 211–222). While this picture is convincing, we must remember that the evidence concerns additional gender values within an already existing gender system. Thus, noun classifiers are a potential source of gender, but we do not see a system arising ex nihilo in Ngan’gityemerri. We do see a fascinating system, split in terms of exponence.

Nominal classification systems may also be split ‘in the other dimension’; that is, there can be levels of semantic generality, with these being realised through similar means. A good example is the Northwest Amazonian language Bora-Miraña (Seifart 2005; 2018). Here we find around 60 rather specific classifying elements (such as ‘one-dimensional, pointed’), and a second more general system, with fewer items (for animacy, sex and number). They all occur in the same morphosyntactic slots, in an agreement-like fashion. Taken together, the Ngan’gityemerri and the Bora-Miraña systems suggest that the development from more classifier-like to more gender-like patterns is not unitary: there are different paths of development.

As we put together the different pieces of evidence, one part of the story is less clear, namely how a ‘useful’ semantic distinction is given up, at least in part, leading to the system being less informative (and more redundant). As Seifart stresses (2018: 20–23), we have rather little evidence on how transparent classification becomes opaque. Our results from the languages of Vanuatu and New Caledonia can shed light on this. Thus, the local development, important for how gender systems can emerge, has wider significance for what it adds to the more general patchwork of evidence.

Where might we look for evidence of this change in progress? It has been claimed that different systems of nominal classification can function as reference tracking devices across large units of discourse (Contini-Morava & Kilarski 2013), though the evidence is not unequivocal (Feist 2020). Precisely for reference tracking, a fixed, gender-like system could be argued to have advantages, since if each noun has a single gender value (Corbett & Fedden 2016: 503), the anaphoric material (e.g., the pronoun) provides the hearer with a clear indication of possible antecedents. As possessive classifiers allow freer association (and are thus more informative), they are less obviously purposed for reference tracking. It is precisely here that we look: we ask what happens when possessive classifiers are in the frame to be used for reference tracking.

Furthermore, corpora only exist in two of our six sample languages from Vanuatu and New Caledonia in our study, for North Ambrym and Vatlongos. There are no corpora for Merei, Lewo, Iaai and Nêlêmwa, which limits the possibilities for an in-depth corpus investigation. And where there is evidence, it is limited. Feist (2020) investigated the use of classifiers and gender in four languages as reference disambiguation aids. However, in the two languages in his sample that have classifiers, Mian (Trans-New Guinea, Papua New Guinea) and Kilivila (Austronesian, Papua New Guinea), the use of classifiers as reduced referential devices with no overt NP, occurred in only 3% to 5% of all instances of reference tracking in the collection of texts used. This limited use of classifiers without a full referential NP mirrors the lack of examples in the two corpora of Vatlongos and North Ambrym and provides further justification for an alternative approach (c.f. section 2.2 for reference tracking examples). We therefore designed an experimental approach using storyboards (Burton & Matthewson 2015) to elicit comparable targeted responses across our language sample, detailed in section 2.1.

The purpose of this study is not to investigate the use of possessive classifiers as devices for disambiguating between potential referents, for which see Feist (2020). We instead focus on the redundant vs informative criterion in nominal classification systems (Fedden & Corbett forthcoming). We are looking for the emergence of fixed (hence redundant) assignment. We expect possessive classifiers to show variable assignment, that is, with different classifiers functioning as the anaphor across large stretches of discourse, dependent on a particular semantic/pragmatic context (hence being informative). We are interested in instances where possessive classifiers become fixed, that is, where nouns are lexically specified according to which classifier they require, and therefore the same classifier must be used across large stretches of discourse to refer to the antecedent. This would indicate a development in the direction of canonical gender.

Note that assignment varies across the different types of nominal classification system. It is often thought that gender systems are redundant, since a given noun will be assigned to a single gender. This view has arisen, we believe, from the historical accident of the order of discovery and study of the different types of systems. Classical gender systems (redundant) and classifier systems (informative) were discovered and described. But then as new systems were found, they tended to be accommodated to these two poles, which means that the full range of variability has not yet been fully assimilated. Thus, there are nominal classification systems that, on most criteria, look rather like gender systems, but which allow reassignment of nouns, to different degrees. Good examples are Mawng (Australia), for which see Singer (2016: 63–64), and Savosavo (Solomon Islands), for which see Wegener (2012: 60–63). These systems are more informative than canonical gender systems. Conversely, the choice of classifier can be relatively fixed, and so less informative, as in Chinese, Thai, and Hmong (Bisang 1993: 17). Given this range of systems, it is useful to have a baseline from which to measure, and we take redundancy (each noun has a fixed gender value) as the canonical baseline. The key consideration for this paper, however, is that we can investigate and document a progression from informative systems towards less informative (redundant) systems. We stress that what is redundant may still be useful. Thus, if a noun always has the same classification in the system, that provides no information directly, but it could aid reference tracking, since an antecedent can be established more readily if its assignment is fixed.

2 Oceanic Possession and Reference Tracking

We give an overview of Oceanic possession, with specifics on the constructions found in our sample languages, in section 2.1. We discuss the use of possessive classifiers as reference tracking devices in section 2.2, giving justification for our study.

2.1 Oceanic Possession

Oceanic possession is widely discussed in the areal literature (Lichtenberk 1985; 2009; 2018; Ross 2004; and Lynch et al. 2002, among others);1 we give a brief overview of the possessive constructions found in the six sample languages in our study. These languages show a split in the class of nouns: some allow suffixation of a possessor index directly, whereas others cannot be directly suffixed. This two-way split is commonly termed ‘direct’ and ‘indirect possession’ in the Oceanic literature. In the typological literature it is called the alienability split (Bugaeva et al. 2022) or obligatory/optional possession (Bickel & Nichols 2013).

The rationale for this differing terminology is that nouns that are directly possessed tend to be semantically inalienable (including body parts, kin terms, part-wholes, and personal possessions). Nouns that are indirectly possessed are more likely to be easily transferable from one possessor to another. The following example from Iaai shows a direct possessive construction ((3)a) with a kin term directly suffixed by a possessor suffix. Example ((3)b) shows an indirect possessive construction where the possessor suffix does not attach directly to a possessed noun, but instead to one of a set of possessive classifiers.

    1. (3)
    1. Iaai: Dotte (2017: 341)2
    1.  
    1. a.
    1. hinyö-k
    2. mother-1sg
    1. ‘my mother’
    1.  
    1. b.
    1. haalee-k
    2. animal.cl-1sg
    1. kuli
    2. dog
    1. ‘my dog (pet)’

It is the possessive classifiers, as in ((3)b), which are our focus. The number of such possessive classifiers varies considerably; the number found in each of our sample languages is shown in Table 1.

Table 1

The sample languages and their number of possessive classifiers.

Language Number of possessive classifiers
Merei (Santo island, Vanuatu) 3
Lewo (Epi island, Vanuatu) 3
Vatlongos (Ambrym island, Vanuatu) 4
North Ambrym (Ambrym island, Vanuatu) 5
Nêlêmwa (Grand-terre, New Caledonia) >19
Iaai (Ouvea island, New Caledonia) >23

The possessive classifiers in Oceanic languages are often called relational classifiers (Lichtenberk 1983), since they encode the relation (intended or actual) between the possessor and possessed, rather than a particular salient attribute of the referent of the possessed noun. Hence, they are informative. Nouns can often occur with different classifiers, depending on the relationship that the speaker highlights. Example (4) from Iaai shows that the noun wââ ‘fish’ can occur with three different possessive classifiers depending upon the interactional context. This same noun can occur with the food classifier if the speaker intends to eat the fish; with the catch classifier if they want to highlight that they caught the fish; or with the general classifier if they want to highlight another use not specified by the available set of classifiers in their repertoire, for example if they wish to sell the fish.

    1. (4)
    1. Iaai: Dotte (2017: 346)
    1.  
    1. a.
    1. ö-k
    2. food.cl-1sg
    1. wââ
    2. fish
    1. ‘my fish (to eat)’
    1.  
    1. b.
    1. hanii-k
    2. catch.cl-1sg
    1. wââ
    2. fish
    1. ‘my fish (caught)’
    1.  
    1. c.
    1. anyi-k
    2. general.cl-1sg
    1. wââ
    2. fish
    1. ‘my fish (to sell)’

Both the New Caledonian languages in our sample, Nêlêmwa and Iaai, have more ‘open’ systems of classifiers; speakers are able to create new classifiers when a particular need arises. This is a productive process in Iaai and Nêlêmwa. New classifiers can emerge from existing directly possessed nouns, or from verbs by attaching possessor suffixes. To take an example from Iaai, the verb tûö ‘to smoke’ can be used as a classifier for possessions which you can smoke by the addition of possessor suffixes, e.g., tûe-n sigââ ‘his/her cigarette (smoke.cl-3sg cigarette)’ or tûe-n paep ‘his/her pipe (smoke.cl-3sg pipe)’. The Vanuatu languages in our sample do not allow the creation of new classifiers; their class of classifiers is closed.

In contrast to the widely held view that Oceanic possessive classifiers are relational classifiers, the North Ambrym language has a much stricter relation between noun and classifier, with the majority of nouns only being able to occur with just one classifier. For example, we ‘water’ in (5) can occur only with the drink classifier, regardless as to whether the possessor intends to drink the water or not:

    1. (5)
    1. North Ambrym: Franjieh (2016: 95)
    1.  
    1. a.
    1.   mwe-ng
    2.   drink.cl-1sg
    1. we
    2. water
    1.   ‘my water (for drinking, washing, etc.)’
    1.  
    1. b.
    1. *mwene-ng
    2.   general.cl-1sg
    1. we
    2. water
    1.   Intended: ‘my water’

Across the Oceanic languages, the directly possessed nouns tend to be a closed class, disallowing new members. In contrast, the class of indirectly possessed nouns is open, allowing new words, such as borrowings, to be possessed. Across the sample languages, the Vanuatu languages fit this description well. However, the two New Caledonian languages appear to have a semi-open class of directly possessed nouns. In Iaai, indirectly possessed nouns and verbs can be a source of new directly possessed nouns. Equally, directly possessed nouns act as a source for new possessive classifiers, which are an open class in Iaai.

Possessive classifiers are not homogenous within and across languages. We make a distinction between major and minor classifiers. Major classifiers occur with a large number of nouns and can allow neologisms or borrowings. Minor classifiers typically occur with only a handful of nouns and are essentially closed to new ones. North Ambrym has two minor classifiers: fire and basket. The latter classifier allows just a handful of nouns that refer to traditionally made baskets, whereas borrowings such as bak ‘bag’ are disallowed, and instead occur with the major general classifier. As minor classifiers occur with only limited numbers of nouns, there is a possibility that these are more likely to diminish in use. For example, speakers of North Ambrym are happy to possess arrbol ‘basket’ with either the basket or the general classifier with no change in semantics. Similar examples can be found in Iaai and Nêlêmwa, which have varying numbers of minor classifiers, with a shift in use to major classifiers.

The possessive classifiers in the languages of Vanuatu in our sample show signs of heterogenous semantic domains. Thus, the labels given to the classifiers are mnemonic devices, which refer to commonalities in the semantic domain across languages. For example, the food classifier in North Ambrym can also occur with nouns referring to tools, utensils, crockery, units of time and some kinship terms. The drink classifier in North Ambrym, Vatlongos and Lewo also occurs with nouns referring to buildings and mats. In Vatlongos and Lewo the drink classifier also encodes some items of clothing. Generally, the possessive classifiers in the languages of New Caledonia in our sample cover more homogenous semantic domains, due to the large number of classifiers that have carved up the semantic space in more fine detail. However, there are some classifiers which do have heterogenous semantic domains. The animal classifier in Iaai is typically used for domesticated animals, though some speakers also use it for domesticated trees and plants they look after.

Both North Ambrym and Nêlêmwa also have a further indirect construction which uses prepositions to link possessor and possessed. We include this here, since these indirect prepositional possessive constructions occur in speakers’ responses to some of the storyboards. In North Ambrym the general associative preposition ne can encode a looser connection between possessor and possessed. Schneider (2011: 385) discusses similar constructions across other languages of Vanuatu, describing them as “where two NPs are juxtaposed in a difficult to define, not-quite-possessive relationship.” Often these constructions can be used when the ‘possessor’ is inanimate, such as rrem ‘yam’ in example (6), where it is not an intrinsic part of saaroan ‘story’, but it is what the story is about:

    1. (6)
    1. North Ambrym: Franjieh (2012: 256)
    1. Saaroan
    2. story.nmlz
    1. ne
    2. ass
    1. rrem
    2. yam
    1. lo
    2. then
    1. mi
    2. rec.pst[3sg]
    1. yi
    2. like
    1. a
    2. dem
    1. ‘The story of the yam is like this’

Furthermore, the general associative preposition in North Ambrym can be used for some part-whole relationships as well, such as internal body parts (7).

    1. (7)
    1. North Ambrym: Franjieh (2012: 253)
    1. Olvaa
    2. Brain
    1. ne
    2. ass
    1. ni
    2. 1sg
    1. ‘my brain’

Schneider (2011) discusses the potential difference between indirect and associative constructions as being defined by control, where associative constructions encode a lack of control of the possessor over the possessed.

Nêlêmwa, unlike the other sample languages, does not have a general possessive classifier. Instead, Nêlêmwa3 uses an indirect linker i shown in example (8):

    1. (8)
    1. Nêlêmwa (Bril 2012: 72)
    1. bu
    2. hook
    1. i4
    2. link
    1. na
    2. 1sg
    1. ‘my hook’

In Nêlêmwa, we have found that some directly possessed nouns can also be possessed using the general linker preposition (though this is speaker-dependent), thus making the use of the linking preposition not restricted to a particular class of noun:

    1. (9)
    1. Nêlêmwa (elicited – no semantic difference)
    1.  
    1. a.
    1. balâ-ny
    2. ball-1sg
    1. ‘my ball’
    1.  
    1. b.
    1. balâ
    2. ball
    1. i
    2. link
    1. na
    2. 1sg
    1. ‘my ball’

2.2 Reference Tracking

In the sample languages, possessive classifiers can function as reference tracking devices. In these instances, possessive classifiers can be used without a possessed noun. The possessed noun is either recoverable from prior discourse or evident from context. In example (10) from Vatlongos, a section of discourse between a pigeon and a heron on the reef, the first instances of the a- food classifier occur with the possessed noun tiei ‘clam’ ((10)a). However, the second occurrence of the food classifier occurs on its own without the possessed noun ((10)c). Here, the classifier functions as a reference tracking device, and refers back to the first mention of tiei ‘clam’.

    1. (10)
    1. Vatlongos (Ezekiel & Ridge 2014)
    1.  
    1. a.
    1. Ui,
    2. intj
    1. tiei
    2. clam
    1. tei
    2. one
    1. a-van
    2. food.cl-1sg
    1. xitak
    2. this_one
    1. ‘“Hey there’s a clam for me over here!”
    1.  
    1. b.
    1. e
    2. conj
    1. koh
    2. heron
    1. bitene
    2. 3sg.say.ind
    1. inou
    2. 1sg
    1. mu
    2. first
    1. na-pus-i;
    2. 1sg-see-obj
    1. na-danga
    2. 1sg-stay_behind.ind
    1. ioxor
    2. there
    1. na-ketteh
    2. 1sg-look
    1. hira-k
    2. neck-1sg
    1. teviei
    2. long
    1. na-ketteh
    2. 1sg-look
    1. na-pus-i
    2. 1sg-see-obj
    1. vari.
    2. at_once
    1. but the heron said: “I saw it first when I was back there I saw it, my neck is long and I saw it at once.”
    1.  
    1. c.
    1. Ma
    2. then
    1. mai
    2. pigeon
    1. bitene
    2. 3sg.say.ind
    1. bos
    2. 3sg.good.ind
    1. ta,
    2. just
    1. a-m.
    2. food.cl-2sg
    1. So the pigeon said: “Okay fine, it’s yours.”’

In our sample, five of the six languages allow full possessed noun elision when the referent of the possessed noun in a possessive classifier construction is recoverable from prior discourse or from pragmatic/external context. However, North Ambrym allows only partial elision of the possessed noun of a classifier construction. In example (11), instead of the possessed noun, the partitive marker ge (ptv) must occur. The antecedent of the possessive phrase is bwehel ‘bird’ mentioned first in the sentence. The possessive phrase is interpreted as ‘his (food) one’.

    1. (11)
    1. North Ambrym (Franjieh 2018: 41)
    1. [Bwehelx
    2. bird
    1. ge
    2. REL
    1. mōrō
    2. 3du.real
    1. rrya
    2. carry
    1. bya
    2. go
    1. le],
    2. dem
    1. Batik
    2. Batik
    1. bya
    2. go
    1. rrō
    2. prog
    1. fne
    2. roast
    1. [a-nx
    2. food.cl-3sg
    1. ge].
    2. ptv
    1. ‘The (two) birds which they carried there, Batik went and roasted his one.’

In the available corpora of Vatlongos and North Ambrym, the use of classifiers as reference tracking devices is rare. As both languages have limited documentation, they have small corpora: Vatlongos has a total corpus of 51,302 tokens and North Ambrym 31,788 total tokens. Table 2 shows that the classifier constructions only make up a small part of the total number of indirectly possessed noun tokens in both corpora.

Table 2

Details of the language corpora for North Ambrym and Vatlongos.

Language Total indirectly possessed noun tokens Total classifier with possessed noun tokens (% of total indirectly possessed noun tokens) Total classifiers without possessed noun tokens (% of total classifier tokens)
North Ambrym 3886 427 (10.99%) 20 (4.68%)
Vatlongos 8514 1133 (13.31%) 45 (3.97%)

3 Methodology

3.1 Participants

The Participants were 125 adults over the age of 18 from the language communities in Vanuatu and New Caledonia. A power analysis was conducted using G*Power (v3.1.9.7), ensuring the sample is appropriate for detecting small-medium effects (Faul et al., 2009). The sample included 68 (54.4%) male and 57 (45.6%) female participants. Their ages ranged between 18 and 80 years (M = 44.37, SD = 15.46). The level of education was measured in years of formal education and ranged between 0 and 23 years (M = 8.89, SD = 4.00). Further participant information broken down by language is shown in Table 3. Participants were recruited using convenience and snowball methods via contacts in each language community. Favourable ethical opinion was obtained from the University of Surrey prior to the study.

Table 3

Demographic information for each language community.

Language Participants Sex Age Years of education
Male Female M SD M SD
Merei 21 13 8 32.52 13.12 6.14 4.10
Lewo 22 13 9 41.86 13.11 8.27 2.33
Vatlongos 22 8 14 39.82 13.69 8.05 3.07
North Ambrym 22 12 10 43.77 14.00 7.09 2.47
Nêlêmwa 19 12 7 57.95 15.85 10.47 3.49
Iaai 19 10 9 50.32 11.26 14.11 3.74

3.2 Experimental Setup

The storyboard experiment was conducted alongside two other experiments – a possessive labelling task and an eye-tracking task. Only the storyboard experiment will be discussed here; the other two experiments will be reported elsewhere. The order of experiments was randomised across participants. Testing took place across 56 days in the six language communities during July to October 2022. One to four participants were tested on each of the data collection days. All participants were provided with information about the study and gave informed consent. For participants in Vanuatu, the information sheet and consent form were read aloud in Bislama (the lingua franca of Vanuatu) and oral consent recorded. Participants in New Caledonia read the information sheet and signed written consent forms in French (the lingua franca of New Caledonia). For Iaai and Nêlêmwa, the experiments were conducted either at a participant’s home on a veranda or underneath a shelter, or in the building where the experimenter was staying. For North Ambrym, Vatlongos, Lewo, and Merei, the experiment was set up in a house on a table. Participants were reimbursed for their time with small monetary payments (Vanuatu) or gifts (New Caledonia), following the different cultural norms.

3.3 Materials and Design

Eight storyboards were designed to investigate how different interactional contexts affected the use of classifiers as anaphoric reference tracking devices across stretches of discourse. Each storyboard consisted of four images that represented different interactions related to a target noun (see Figure 1).

Figure 1
Figure 1

Storyboard targeting ‘fish’ with different interactions: catching (top left), carrying home (top right), eating (bottom left), and selling (bottom right).5

A representative sample of nouns were chosen to test across all six languages, combining variation in frequency and variation in cognitive salience. Corpora exist for North Ambrym and Vatlongos (c.f., Table 2); nouns with similar semantic definitions that appeared in both corpora were chosen as a starting point, resulting in 116 nouns. These nouns were cross checked with results from a prior free-list experiment (see Franjieh et al. 2022 for methodology). The free-list experiment elicited the most salient nouns associated with each classifier in each language. Cognitive Salience Index (CSI) scores (Sutrop 2001) were calculated to give an indication of the psychological saliency of the association of each noun to a particular classifier. CSI scores combine the frequency and mean position of a term within a list, across all participants. They range from 0 to 1, with 1 denoting maximum salience. The maximum CSI score of each noun across the sample languages was used to divide the list of 116 nouns into four ranks (>0.75 = 1; >0.5 and <=0.75 = 2; >0.25 and <=0.5 = 3; <=0.25 = 4). Token frequency was normalised per 10,000 words and averaged across both the North Ambrym and Vatlongos corpora. The list of 116 nouns was then divided into four frequency quartiles.

Abstract nouns, nouns referring to large entities (e.g., home, cliff), superordinate terms (e.g., animal, thing, fruit), and nouns relating to people and their relationships (e.g., old woman, teacher) were removed. This was because abstract nouns, superordinate terms and large entities are hard to create concrete images for. In some languages, people and relationship terms are possessed using the direct construction, rather than with a classifier. Equally, these nouns are hard to fit into different interactional contexts.

The remaining list of 66 nouns was used as a basis for developing the eight storyboards. Two nouns from each CSI rank were chosen, which were mapped onto being either high frequency (top two frequency quartiles) or low frequency (bottom two frequency quartiles) nouns. However, the third CSI rank nouns all corresponded to high frequency nouns (top frequency percentile) so it was not possible to fully choose a high and a low frequency noun for the third CSI rank. Nevertheless, this technique enabled us to select eight nouns ranging in cognitive salience and frequency. The order of the eight storyboards (c.f. Table 4) was randomised for each participant.

Table 4

the eight storyboards, their interactional contexts, CSI rank and frequency quartiles.

Target Noun Picture number and interactional context Max CSI rank Frequency quartile
Fish 1: catch
2: carry home
3: eat
4: sell
1 2
Sugarcane 1: cut
2: carry home
3: eat
4: sell
1 3
Firewood 1: collect/gather
2: carry home
3: make fire with
4: sell
2 3
Water 1: catch
2: carry home
3: drink
4: wash with
2 1
Yam 1: dig up
2: carry home
3: eat
4: sell
3 1
Pig 1: catch
2: carry home
3: eat
4: sell
3 1
Fly 1: catch
2: carry home
3: eat
4: sell
4 4
Rat 1: catch
2: carry home
3: eat
4: sell
4 1

3.4 Procedure

Prior to data collection, experimental protocols were translated into Bislama and French and were checked by a French and a Bislama native speaker. The protocols were then back translated into English to validate the translations. Participants were first read the instructions below in either Bislama or French. Participants were encouraged not to repeat the possessed noun in the stories, so as to elicit classifiers used anaphorically without a possessed noun. Participants were shown the story on a laptop screen. The story was presented one picture at a time and the Bislama/French text read out. Then the storyboard was presented again picture by picture and the participant was asked to retell the story in their language. Often the French or Bislama version was repeated during this time as a prompt. Answers were recorded directly onto the laptop underneath each picture.

Instructions

There are eight short stories to look at. I will explain each story first in Bislama/French and then I would like you to retell the story in your language. Each story has four pictures. For each picture I want to know how you say that the object in the picture belongs to the man or the woman in the picture.

After the first picture you do not need to repeat the name of the object, just the word meaning ‘his’ or ‘hers’. For example, if the first picture is of a man and a woman making coffee, you can say ‘they are making their coffee’. If the next picture shows the man drinking the coffee, you should say ‘he is drinking his one’ but not ‘he is drinking his coffee’.

Some stories will sound a bit strange, but I want you to translate them even though they sound strange.

I will write down what you have said and ask you to help translate it. I will record what you say on the audio recorder.

The instructions explicitly asked that, after establishing the referent of the possession in the first picture using a noun, the participant was to avoid further uses of that noun and instead use just the possessive classifier, if possible. If a participant gave a response that deviated from expected, they were prompted to reformulate, using a classifier without a possessed noun. If a participant only used a noun as the referring device in the second, third or fourth picture, they were asked if it was possible to replace the noun with just a possessive classifier. Similarly, if a participant gave a full possessive construction consisting of classifier and possessed noun, they were asked if it was possible to say the sentence again, without the possessed noun. The goal of the experiment was not to uncover whether possessive classifiers can occur on their own without a possessed noun as a reference tracking device, but whether the classifiers used as reference tracking devices would vary according to the interactional contexts shown by the different pictures of the storyboards. All responses were recorded from the participants.

4 Results and Analysis

The data were analysed in four ways. First, we analysed only the responses where classifiers were used anaphorically without a possessed noun (4.1). Second, we included all possessive constructions used by participants, including directly possessed nouns, prepositional possession, and classifiers that occurred with a possessed noun (4.2). Third, we analysed the patterns of classifier responses (4.3). Finally, we analysed the results for each individual storyboard (4.4). All statistical tests were conducted using R software (R Core Team 2023).

4.1 Classifiers used Anaphorically

We isolated all responses where participants used the possessive classifiers anaphorically without a possessed noun. We did this to track the use of classifiers across discourse units to see whether they matched or differed from their antecedent. The following types of responses were excluded: (i) no possessive construction, (ii) a directly possessed noun, (iii) a preposition, or (iv) a classifier with a possessed noun. This is a strict view of the use of classifiers as reference tracking devices, since we excluded classifiers that occur with a possessed noun. Then in section 4.2 we offer a less strict analysis, which includes all possessive constructions.

A minimum of two classifiers used anaphorically per storyboard were required in order to assess whether participants used a matching or non-matching strategy. The first picture of each four-picture storyboard sets the scene and introduces the target noun. Participants were not required to use a possessive phrase to introduce the target noun. However, some participants did use a classifier construction, which we counted in our analysis. In the subsequent three pictures (pictures 2, 3 and 4), only anaphoric uses of classifiers were counted. The amount of data required to determine if the anaphoric uses of classifiers matched an antecedent was either:

  1. A minimum of two anaphoric occurrences of classifiers for pictures 2, 3 and 4. (see Table 4 for the interactional contexts for each picture).

  2. A minimum of one anaphoric occurrence of a classifier for pictures 2, 3 and 4 and a classifier used for picture 1.

We give two glossed examples from the experiment below from the fish storyboard. Example (12), from North Ambrym, shows the introduction of the noun myaalo ‘fish’ in the first picture. The possessive classifiers in pictures 2–4 are all used anaphorically within the text to refer back to the antecedent myaalo ‘fish’. The North Ambrym example shows a matching strategy, as all classifiers are the same, regardless of the contextual interactions.

    1. (12)
    1. North Ambrym
    1.  
    1. 1.
    1. Jon
    2. John
    1. nyerō
    2. 3du
    1. Meri
    2. Mary
    1. mōrō
    2. 3du.real
    1. a
    2. go
    1. kokoō
    2. fish
    1. letee
    2. coast
    1. te
    2. conj
    1. mōrō
    2. 3du.real
    1. sene
    2. catch
    1. myaalo
    2. fish
    1. ‘John and Mary went to catch fish at the coast and they caught fish’
    1.  
    1. 2.
    1. Jon
    2. John
    1. nyerō
    2. 3du
    1. Meri
    2. Mary
    1. mōrō
    2. 3du.real
    1. rrya
    2. carry
    1. arō
    2. food.cl.3du
    1. ge
    2. ptv
    1. bya
    2. go
    1. bsaō
    2. home
    1. ‘John and Mary carried theirs home’
    1.  
    1. 3.
    1. Jon
    2. John
    1. myar
    2. hunger
    1. ma
    2. 3sg.real
    1. gte
    2. bite
    1. te
    2. conj
    1. ma
    2. 3sg.real
    1. ngne
    2. eat.tr
    1. an
    2. food.cl.3sg
    1. ge
    2. ptv
    1. ‘John was hungry and he ate his’
    1.  
    1. 4.
    1. Meri
    2. Mary
    1. me
    2. 3sg.real
    1. a
    2. go
    1. lon
    2. in
    1. maket
    2. market
    1. te
    2. conj
    1. bya
    2. go
    1. foone
    2. sell
    1. an
    2. food.cl.3sg
    1. ge
    2. ptv
    1. ‘Mary went to the market and sold hers’

Example (13), from Iaai, shows the noun wââ ‘fish’ being introduced for the first picture along with the food classifier. The further three instances of classifiers are all used anaphorically without a possessed noun and refer back to the antecedent wââ ‘fish’. The Iaai example shows a non-matching strategy as different classifiers are used across an individual storyboard.

    1. (13)
    1. Iaai6
    1.  
    1. 1.
    1. Ioane
    2. John
    1. me
    2. conj
    1. Maria
    2. Mary
    1. ödrume
    2. 3du.prs
    1. he
    2. go
    1. ka
    2. conj
    1. thutri
    2. catch
    1. ödru
    2. food.cl.3du
    1. wââ
    2. fish
    1. ‘John and Mary went and caught their fish’
    1.  
    1. 2.
    1. Ödrume
    2. 3du.prs
    1. hmetu
    2. return
    1. hnyabai
    2. home
    1. me
    2. conj
    1. hanidru
    2. catch.cl.3du
    1. ‘they returned home with theirs’
    1.  
    1. 3.
    1. Ioane
    2. John
    1. e
    2. 3sg
    1. be
    2. want
    1. han,
    2. eat
    1. ame
    2. 3sg.prs
    1. omesökâ
    2. cook
    1. haniiny
    2. catch.cl.3sg
    1. ‘John was hungry, he went and cooked his’
    1.  
    1. 4.
    1. Maria
    2. Mary
    1. ame
    2. 3sg.prs
    1. salemâ
    2. sell
    1. haniiny
    2. catch.cl.3sg
    1. ‘Mary went and sold hers’

Table 5 shows different permutations of dummy data to explicate the permissible responses accepted for analysis detailed above, where CL indicates ‘classifier with possessed noun’; ANA indicates ‘anaphoric use of a classifier’; and NO indicates ‘no possessive construction’. A classifier with a possessed noun was only included for analysis if it occurred as a response for the first picture as this was acceptable to introduce the target noun and function as the antecedent for future anaphoric uses of classifiers. Any further instances of classifiers with possessed nouns were deliberately omitted in this analysis (but see section 4.2 for an analysis where they are included).

Table 5

Example dummy permissible responses included for analysis.

Pic 1 Pic 2 Pic 3 Pic 4 analysable
P01 Response NO ANA ANA ANA yes
Accepted
P02 Response NO CL CL ANA no
Accepted X X
P03 Response CL CL ANA ANA yes
Accepted X
P04 Response CL NO CL ANA yes
Accepted X X
P05 Response NO ANA CL CL no
Accepted X X

Table 6 shows the different classifiers used anaphorically across the sample languages. One important by-product of this methodology is that we were able to describe three different classifiers that had previously not been documented (marked by an asterisk in Table 6).7

Table 6

Classifiers used anaphorically by participants across the sample languages. Classifiers marked with an asterisk had not been previously documented.

Merei (3) Lewo (3) Vatlongos (4) N. Ambrym (4) Iaai (12) Nêlêmwa (12)
general general general general general starch
consumable food food food food meat
*carry drink drink drink drink vegetable
land fire fire drink
catch sugarcane
land plant
plant animal
sugarcane catch-line
chew burden
burden weapon
animal *contents-gather
thought *contents-weapon

Responses were coded into a dichotomous variable to indicate one of two strategies: either matching classifiers used anaphorically across a storyboard, or non-matching (different) classifiers used anaphorically across a storyboard.

Figure 2 shows the mean percentage of matching classifiers used anaphorically across all storyboards by language. North Ambrym, with over 90% mean matching responses, shows a different pattern to the other languages in the sample. This supports previous research illustrating that North Ambrym has largely fixed assignment of classifiers (Franjieh 2018), in contrast to the other five languages. We investigate this further to establish whether these five languages show statistically significantly different patterns to North Ambrym.

Figure 2
Figure 2

Bar plot showing the mean percentage of matching anaphoric strategies for each sample language (total number of classifiers across all participants for each sample language given in brackets). Error bars show the 95% confidence intervals.

There were fluctuations in the number of matching strategies according to storyboard across languages (c.f. Figure 7). To account for these differences, we ran a binomial generalised linear mixed model with random intercepts.8 The dependent variable was the matching/non-matching strategy for a storyboard, with language as the fixed effect. The model had two random effects – individual storyboard and participant. Models were created in R using the glmer function from the lme4 package (Bates et al. 2015). Table 7 gives an overview of the results. R2 values were calculated using the MuMIn package in R (Bartoń 2024).9 There were 870 observations, 124 participants and 8 storyboards. Iaai was used as the reference level for the fixed effects of language.

Table 7

Generalised Linear Mixed Model for anaphoric possessive constructions. Significant p values for fixed effects in bold.

Model
Fixed Effects Estimate Standard Error z p
Intercept .166 .405 .409 .682
Lewo .353 .499 .709 .478
Merei .664 .512 1.296 .195
Nêlêmwa .282 .547 .515 .607
North Ambrym 2.669 .562 4.750 <.001
Vatlongos .361 .503 .718 .473
Random Effects Variance SD
Storyboard .121 .348
Participant 1.510 1.223
Marginal R2 Conditional R2
Theoretical R2 .153 .433

The model shows that the dependent variable is more often 1 (a matching strategy) in North Ambrym than in Iaai, and there is no significant difference between any other languages and Iaai. The model shows that North Ambrym is a significant predictor of matching reference tracking strategies. The model’s R2 values show that — when including the random effects of storyboard and participant — the model can account for 43.3% of the matching strategies.

4.2 All Possessive Constructions

Although we asked participants to give an anaphoric use of a classifier after the target noun was introduced, not every participant did so, even after prompting. This resulted in some instances where the minimum amount of data for analysis was not met (c.f. section 4.1), especially for the two New Caledonian languages (Iaai and Nêlêmwa) shown in Table 8. These missing data were due to several reasons. First, some participants saw each picture within a storyboard as an individual scene to be described, rather than as part of a continuous story. In these instances, participants preferred to give a classifier with a possessed noun, rather than use a classifier anaphorically without a possessed noun. Second, some missing data stemmed from participants using a directly possessed noun or avoiding a possessive phrase by giving just the noun without a classifier. Finally, in the case of Nêlêmwa, there is no general classifier, instead a general preposition is used, which must occur with a possessed noun (c.f., example (8)).

Table 8

Data points and missing data points per language.

Language Total potential data points Missing data including only classifiers used anaphorically (without a possessed noun) Missing data including all possessive constructions
Merei 168 13 (7.74%) 0
Lewo 176 0 0
Vatlongos 176 2 (1.14%) 2 (1.14%)
North Ambrym 176 1 (0.57%) 0
Iaai 152 51 (35.55%) 6 (3.95%)
Nêlêmwa 152 63 (41.45%) 4 (2.63%)

In Nêlêmwa we found examples where different participants could use a particular noun with either possessive classifiers or with the general linking preposition. For the yam storyboard, one participant used kuvic ‘yam’ only with the linking preposition: e.g., kuvi i ye (yam link 3sg), whereas another participant varied in choice between the ââ- plant classifier or the caa- starch classifier for kuvic ‘yam’. Another example from Nêlêmwa involves wi ‘water’, which could be either directly possessed wi-n ‘water-3sg’ or referred to anaphorically with the drink classifier, kêâ-, even by the same participant.

In order to account for the missing data, we investigated the use of possessive constructions across the four pictures in each storyboard regardless of their type. The following types of constructions were included for comparison:

  • Possessive classifiers used anaphorically without a possessed noun

  • Possessive classifiers with a possessed noun

  • Directly possessed nouns

  • Prepositional possessive constructions

The analysis sought to discover whether the same strategy (matching vs. non-matching) was used across the four pictures of a storyboard. As we include all types of possessive constructions, this led to an increase in the number of categories used for the comparison for the following languages, in addition to those given in Table 6:

  • For Iaai, participants also used directly possessed nouns and two previously undescribed classifiers: wash and bathe.10

  • For Nêlêmwa, participants also used directly possessed nouns, and the general prepositional possessive construction (example (8)).

  • For North Ambrym, participants also used an associative preposition (example (6)).

Table 9 shows the most frequent possessive construction used across each storyboard. Speakers of Merei have a preference for the general and consumable classifiers — the two main classifiers in the language. Speakers of the three languages of Central Vanuatu — Lewo, Vatlongos and North Ambrym used the food or drink classifier more than other classifiers. Speakers of the two New Caledonian languages — Nêlêmwa and Iaai — who have access to larger inventories of classifiers, used a mix of classifiers. Note that these are just the most frequent classifier used for each storyboard, and this does not show whether participants used a matching or a non-matching strategy, which will be discussed below.

Table 9

Most frequent classifier per storyboard.

Merei Lewo Vatlongos North Ambrym Nêlêmwa Iaai
Firewood general food general fire burden burden
Fish consumable food food food meat catch
Fly general food food food general preposition catch
Pig general food food food animal animal
Rat general food food food meat catch
Sugarcane consumable drink drink drink sugarcane general/food
Water general drink drink drink drink drink
Yam general food food food plant food

Figure 3 shows the mean percentage of matching classifiers used anaphorically across all storyboards by language. North Ambrym’s pattern of matching responses is greater than all other languages in the sample. In contrast, both Nêlêmwa and Iaai have the fewest mean matching responses, compared to all other languages.

Figure 3
Figure 3

Bar plot showing the mean percentage of matching strategies for each sample language (total number of possessive strategies across all participants for each sample language shown in brackets). Error bars show the 95% confidence intervals.

The results show both similarities and differences to the analysis given in section 4.1, which looked at the strict anaphoric use of classifiers. In both analyses, the results show that North Ambrym has a higher number of instances of matching classifiers compared to the other languages. When considering all possessive strategies, used anaphorically or non-anaphorically, the two New Caledonian languages (Iaai and Nêlêmwa) have much lower matching responses across the storyboards (c.f. Figure 3). This indicates that they have more variable classification systems compared to the Vanuatu languages. The three Vanuatu languages other than North Ambrym, that is Merei, Vatlongos and Lewo, have intermediate systems, which have a slight preference for using the same possessive strategy across the storyboards.

Multiple linear regression was conducted to investigate whether the percentage of non-matching possessive strategies used by each participant was affected by the number of different possessive constructions used across the storyboards and the language spoken. Homoscedasticity was confirmed with a Breusch-Pagan test result of x² = 1.175 p = .278. However, the Shapiro-Wilk test showed the residuals were non-normal, W = .970, p = .006. Despite this, given the nature of our data, transformation of the data or exclusion of valid data points that presented as outliers, were deemed more problematic as approaches than cautionary interpretation of the findings (see Pek, Wong & Wong 2018 for a review). The model was significant, F(6,118) = 12.02, p < .001, indicating at least one predictor significantly affects the percentage of non-matching strategies. The model explains 35% of the variance in number of non-matching strategies, with an adjusted R² of .348. The number of possessive constructions that a participant used across storyboards was a significant predictor t(3.449) = 7.461, p < .001, indicating a strong positive effect on the percentage of non-matching possessive strategies. Only the North Ambrym language was a significant predictor t(–2.819) = –27.826, p = .006, indicating a strong negative effect on the percentage of non-matching possessive strategies.

Similar to the analyses detailed in section 4.1, we ran a GLMM with random intercepts. The dependent variable was the matching/non-matching strategy for a storyboard for all possessive constructions, with language as the fixed effect. The model had two random effects — individual storyboard and participant. Table 10 gives an overview of the results. There were 988 observations, 125 participants and 8 storyboards. Iaai was used as the reference level for the fixed effects of language.

Table 10

Generalised Linear Mixed Model for all possessive constructions. Significant p values for fixed effects in bold.

Model
Fixed Effects Estimate Standard Error z p
Intercept –.570 .347 –1.645 .100
Lewo 1.073 .419 2.559 .011
Merei 1.260 .427 2.948 .003
Nêlêmwa .050 .434 .0115 .908
North Ambrym 3.159 .479 6.594 <.001
Vatlongos 1.085 .423 2.565 .010
Random Effects Variance SD
Storyboard .200 .447
Participant 1.084 1.041
Marginal R2 Conditional R2
Theoretical R2 .194 .420

The model shows that the dependent variable is more often 1 (a matching strategy) in North Ambrym, Merei, Vatlongos and Lewo than it is in Iaai. There was no significant difference between Nêlêmwa and Iaai. This model gives a nuanced picture as all the Vanuatu languages are significant predictors of matching strategies, with North Ambrym the most significant. The R2 values in the model show that when including the random effects of storyboard and participant, the model accounts for 42% of the matching strategies.

To investigate if the three Vanuatu languages of Lewo, Merei and Vatlongos function as intermediate systems different to North Ambrym on the one hand, and Iaai and Nêlêmwa on the other hand, we re-ran the GLMM with Vatlongos as the reference level. Table 11 shows the results for the fixed effects of language.

Table 11

Generalised Linear Mixed Model with Vatlongos as the intercept. Significant p values in bold.

Model
Fixed Effects Estimate Standard Error z p
Intercept .515 .328 1.567 .117
Iaai –1.085 .423 –2.565 .010
Lewo –.012 .403 –.029 .977
Merei .175 .410 .426 .670
Nêlêmwa –1.035 .420 –2.464 .014
North Ambrym 2.074 .460 4.508 <.001

The model shows that the dependent variable is more often 1 (matching strategy) in North Ambrym than in Vatlongos and that the dependent variable is more often 0 (non-matching strategy) in Iaai and Nêlêmwa than in Vatlongos. There is no significant difference between Lewo and Vatlongos or Merei and Vatlongos. The model confirms that Merei, Lewo and Vatlongos function as intermediate systems.

To visualise the difference between participants and languages, Figure 4 shows the cumulative percentage of participants using a matching strategy across storyboards. The eight storyboards are represented on the x-axis, where the number 8 represents the percentage of participants who used matching strategies for all eight storyboards. The number 7 represents the cumulative percentage of participants who gave matching strategies for either all eight storyboards, or seven out of eight of the storyboards. Figure 4 gives a clear representation of the differences between the languages, with three clusters of clines. North Ambrym, with the most matching strategies, has over 35% of participants using a matching strategy for all eight storyboards, with 90% of participants using a matching strategy for either all storyboards, or seven out of eight storyboards. The three other Vanuatu languages (Merei, Vatlongos and Lewo) all have smaller percentages of participants using matching strategies for all eight storyboards (between 9% and 14%), rising to between 22% and 32% for all eight storyboards and for seven out of eight storyboards. It is important to note therefore, that some speakers in these languages do prefer to use a matching strategy regardless of interactional context. There were no speakers of the two New Caledonian languages, Iaai and Nêlêmwa, who gave a matching strategy for either all eight storyboards or seven out of eight storyboards. Around 16% of speakers of Iaai and Nêlêmwa used matching strategies for six out of eight of the storyboards. These three clusters confirm the mixed model analysis above.

Figure 4
Figure 4

Line plot showing the cumulative percentage of participants using matching strategies across the storyboards.

4.3 Response patterns

In this section, we provide a brief overview of the use of classifiers across each language and storyboard to get a better understanding of how speakers of each language use their system. To get a more in-depth understanding, we then abstract away from the semantic/mnemonic labels used to compare across languages in a more equal way.

For understanding the move towards fixed assignment in classification system, we have to investigate what types of matching and non-matching strategies were chosen by the participants across the storyboards, and whether certain strategies are preferred. The storyboards followed a similar four picture pattern of (i) finding or catching a particular object, (ii) returning home with it, (iii) doing something natural with it (e.g., eating, drinking, making fire), and (iv) doing something less obvious, such as selling or washing with it. As each language has a varying inventory of classifiers, which embody different semantic domains, and a number of other possessive constructions, it is not a simple task to compare like for like. In order to create a fair comparison across languages, we abstracted away from the semantics of each possessive construction and instead focussed on whether a particular construction was used and then if it was repeated later in the story. There are five logical values based on the four possible responses for a storyboard:

  • 0: no possessive strategy given. For example, if only a noun, a pronoun or zero anaphor was used by a participant in a particular response.

  • 1: the first possessive construction used. For example, a particular possessive classifier, such as food or drink, or a directly possessed noun or prepositional construction. Any further uses of the same strategy will also receive this code.

  • 2: the number given to the second different possessive construction used and any uses of that same construction thereafter.

  • 3: the number given to the third different possessive construction used and any uses of that same construction thereafter.

  • 4: the number given to the fourth different possessive construction used.

Table 12 gives some fictitious data to show how the patterns are coded. Fictitious data is shown for eight different participants and their responses to the fish storyboard. Different classifiers are shown in small caps – gen ‘general classifier’, food ‘food classifier’, burden ‘burden or carry classifier’. Some responses have both a classifier and a possessed noun, other cells have anaphoric use of classifiers without a possessed noun, and yet other cells have just a noun which does not appear in a possessive construction. The pattern row gives a four-digit code corresponding the pattern used by the participant, outlined above. Participants 1–3 used matching strategies, and participants 4–8 used non-matching strategies.

Table 12

Example pattern coding for fictitious data for eight participants for the ‘fish’ storyboard.

Picture Participants
1 2 3 4 5 6 7 8
1 gen fish food fish fish fish fish catch fish food fish fish
2 gen food food gen gen fish burden fish burden fish fish
3 gen food food food food food food fish food fish
4 gen food food gen food gen burden gen
Pattern 1111 1111 0111 0121 0122 1234 1212 0012
Matching Yes Yes Yes No No No No No

Descriptive statistics for the number of pattern responses for each language are shown in Table 13.

Table 13

Descriptive statistics for number of possessive strategy patterns across the sample languages, sorted from lowest to highest mean number of patterns.

language mean sd median IQR
North Ambrym 2.273 0.550 2.000 1.000
Vatlongos 3.409 1.403 3.000 2.750
Merei 3.857 1.315 4.000 2.000
Lewo 3.955 1.214 4.000 2.000
Nêlêmwa 4.368 1.212 4.000 1.500
Iaai 4.895 1.150 5.000 1.500

We conducted an analysis of variance using the rstatix package (Kassambara 2023). The ANOVA shows a significant difference between number of patterns given by participants across languages F(5, 119) = 12.579, p < .001, ηp2 = .346. Pairwise comparisons comparing individual languages indicated that North Ambrym had significantly lower numbers of patterns than all other languages (largest p < .002). Iaai participants gave significantly more patterns than Lewo, Merei, Vatlongos and North Ambrym (largest p = .012). Nêlêmwa participants gave significantly more pattern responses than North Ambrym and Vatlongos (largest p = .005). Figure 5 shows the mean number of patterns across the language sample. The pattern closely mirrors that shown in Figure 3, revealing that languages with more matching possessive strategies used lower overall patterns. This is expected as there were only 9 matching possessive strategies (patterns that either have 0 or 1), lower than the twenty patterns for non-matching possessive strategies (patterns that have 2 or higher in them).

Figure 5
Figure 5

Bar plot showing the mean number of patterns for each sample language (total number of possessive strategies across all participants for each sample language shown in brackets). Error bars show the 95% confidence intervals.

The stacked bar plot in Figure 6 illustrates the different patterns used across the languages. In order to simplify the data, patterns where there were insufficient numbers of classifiers to be counted as either matching or non-matching are not included. Patterns such as 0000, 0100 or 0010, where only one or no classifiers were given for a storyboard amounted to 3.95% of the data for Iaai, 2.63% for Nêlêmwa, and 1.14% for Vatlongos.

Figure 6
Figure 6

Stacked bar plot showing matching and non-matching strategies across the sample languages.

Table 14 shows a summary of the highest frequency patterns for both matching and non-matching strategies. Out of the nine fixed patterns, the 0111 pattern had the highest frequency across all languages. Out of the twenty variable patterns, 0121 had the highest frequency for Merei, Nêlêmwa and Iaai. 0112 had the highest frequency for Lewo and Vatlongos, and 0122 was the highest percent for North Ambrym.

Table 14

Total number of patterns, matching and non-matching with details of highest frequency patterns for matching and non-matching.

Language total matching patterns Highest frequency matching pattern total non-matching patterns Highest frequency non-matching pattern Total patterns
N. Ambrym 4 0111 (67.05%) 5 0122 (5.68%) 9
Merei 5 0111 (35.71%) 10 0121 (10.12%) 15
Lewo 2 0111 (36.36%) 10 0112 (11.93%) 12
Vatlongos 4 0111 (43.18%) 15 0112 (17.61%) 19
Nêlêmwa 6 0111 (25%) 12 0121 (16.45%) 18
Iaai 6 0111 (26.32%) 14 0121 (17.11%) 20

Individual languages were analysed using paired samples t-tests comparing the total number of unique matching and non-matching patterns. North Ambrym showed significantly more matching patterns than non-matching patterns (t = 6.736, p < .001). All other languages had no significant difference between the frequency of matching and non-matching patterns (lowest p = .752).

4.4 Individual Storyboard Analysis

The storyboards investigate eight different target nouns that differ both in noun frequency and saliency. Figure 7 details the percentage of matching possessive strategy across individual storyboards (same criteria as section 4.2) for all participants across each sample language. The x-axis shows each individual storyboard in left to right descending order according to average matching possessive strategy percentages across all six languages (i.e., ‘firewood’ has the highest mean agreement percentages across the six languages, and ‘water’ has the lowest mean agreement percentages).

Figure 7
Figure 7

Matching possessive strategies (%) across each storyboard and language. The brackets after each storyboard name indicate the CSI rank followed by the frequency quartile.

Chi-squared tests with Yates’ continuity correction showed that there were significant differences between storyboards and whether participants used a matching or non-matching strategy in their response (either anaphorically or with a possessed noun). There was a significant difference for Merei, with a moderate association (X2 = 17.485, p = .015, V = .32311). Both Lewo (X2 = 32.273, p < .001, V = .428) and Vatlongos (X2 = 21.016, p = .004, V = .428) showed significant differences, with a relatively strong association. For North Ambrym, expected values were less than 5 for non-matching strategies, hence a Fisher’s test of exactness was conducted, which resulted in a significant difference, but with a moderate association (p = .038, V = .278). Nêlêmwa has a moderate association (X2 = 14.459, p = .044, V = .313). For Iaai, a Chi-squared test was non-significant (p = .179).

For the languages with significant differences and a relatively strong association (Lewo and Vatlongos), Bonferroni corrected pairwise comparisons (adjusted p = .003) were conducted by comparing adjusted standardised residuals. Lewo showed significantly fewer matching strategies for the pig storyboard and significantly more matching strategies for the yam storyboard. Vatlongos showed significantly more matching strategies for the firewood storyboard.

5 Discussion

A clear cline emerges from the analyses of the storyboard data in our language sample. In our experiment the North Ambrym language is set apart from the other languages, since it has significantly higher matching anaphoric uses of classifiers. North Ambrym has a mean anaphoric use of matching classifiers of above 90%, with the lowest standard deviation amongst the language sample (c.f., section 4.1). Participants from North Ambrym used the same classifier anaphorically more consistently across an individual storyboard than participants from other languages, indicating that the North Ambrym classifier system bears comparison with a system which is redundant in having fixed assignment and which allows anaphoric agreement (in this respect comparable to many gender systems).

When taking into account different types of possessive constructions (section 4.2), used either anaphorically or non-anaphorically, North Ambrym’s system still had more uses of matching strategies across a storyboard. At the other end of the cline, both Nêlêmwa and Iaai with large (and fairly open) systems of classifiers had higher amounts of non-matching reference tracking strategies in a storyboard, indicating a lack of fixed assignment. In the middle of the cline are Merei, Lewo and Vatlongos. Participants from these three languages used on average more matching possessive strategies than non-matching strategies. The GLMM confirmed that North Ambrym had significantly more matching strategies than Merei, Lewo and Vatlongos, and Iaai and Nêlêmwa had significantly more non-matching strategies. Language, storyboards, and individual variation accounted for 42% of the overall variation in the GLMM. We did not have enough participants to stratify the sample for age or gender and this may be the reason for the lower explanatory power of the model. Multiple linear regression revealed that the more different possessive constructions a participant used across the storyboards, the more likely they were to use non-matching reference tracking strategies. This supports the findings that Iaai and Nêlêmwa, with the largest inventory of classifiers in the sample, had the highest amount of non-matching reference tracking strategies. Furthermore, the only language to have an effect on the number of non-matching reference tracking strategies was North Ambrym, with a significant negative effect, supporting the difference of this language to the others in the sample.

By abstracting away from the different semantics of the possessive constructions, we identified different patterns of matching and non-matching strategies used by participants (section 4.3). North Ambrym, with a more fixed (redundant) assignment system showed significantly fewer different patterns of reference tracking strategies overall than all other languages. Both Iaai and Nêlêmwa, had significantly more patterns of reference tracking strategies available than most other languages. Individual paired t-tests for each language demonstrate that North Ambrym does use significantly more matching strategies than non-matching strategies, indicating a system which is more gender-like in this respect.

Investigations of matching responses across the eight storyboards in each individual language (section 4.4) revealed only minor differences for Lewo and Vatlongos, but these differences were not shared by other languages. There is little evidence suggesting an overall effect of particular storyboards on whether participants used more matching responses. This also points to there being little influence of our linguistic measure of frequency and our psychological measure of salience on participants’ matching strategies.

6 Conclusion

The data presented reveals that – within classifier systems – redundancy can develop, as systems of assignment become fixed, and this brings them closer to grammatical gender. North Ambrym had previously been presented as showing this phenomenon (Franjieh 2016; 2018). However, North Ambrym’s system had not been rigorously tested for use in anaphoric contexts. By comparing North Ambrym against other similar Oceanic languages with varying inventories of classifiers, we have shown that North Ambrym has an average of just over 90% of instances of matching classifiers in the storyboard experiment, with the lowest levels of variation amongst participants.

Languages with large inventories of classifiers are less likely to develop fixed assignment systems. Participants who used more possessive constructions in their responses resulted in a higher amount of different matching and non-matching strategies. This points to a situation where participants from languages with more possessive constructions, such as Iaai and Nêlêmwa, used a higher amount of non-matching strategies. The other languages in our sample, Merei, Lewo and Vatlongos with smaller inventories of classifiers showed around 50% matching strategies. The results from Merei reveal that the mean number of matching strategies was higher than 50%, showing that participants were more likely to use matching rather than non-matching strategies. This should be taken together with the results from the mixed effects logistic regression, which showed that the languages of Vanuatu (North Ambrym, Lewo, Merei and Vatlongos) had significantly more matching strategies than those of New Caledonia (Iaai and Nêlêmwa). North Ambrym also had significantly more matching strategies than Merei, Lewo and Vatlongos with Iaai and Nêlêmwa having significantly more non-matching strategies. Individual participant variation is also an important factor in predicting matching strategies, revealing that some speakers are more likely to have stricter (more fixed) reference tracking systems, indicating a move towards a more gender-like system. Potentially, Merei, Vatlongos and Lewo may be showing initial signs of moving towards greater redundancy overall (a more fixed assignment system).

Evidence is accumulating from different languages around the world showing that there is no sharp dividing line between classifier systems and grammatical gender. Our evidence from Oceanic classifier systems shows that systems with small inventories of classifiers have the potential to become fixed in their assignment of noun to classifier. The data presented here provides a further argument for treating grammatical gender and classifiers as part of the same general domain of nominal categorisation.

Grammatical gender is defined by its participation in agreement (Hockett 1958: 231; Corbett 1991: 1). Yet agreement itself is subject to different views, particularly on the potential domains of agreement. Specifically, some would restrict agreement to the phrase and the clause, while others allow for agreement across larger syntactic units of discourse, so that anaphoric agreement is included in a broader definition (see Corbett 2006: 21–23, 41–42, 227–230; Enger 2013; Haig & Forker 2018; Paparounas & Akkuş 2023). Whatever one’s position on that issue, the fixed assignment of noun to classifier, coupled with use in reference tracking, as we have demonstrated with North Ambrym, are key components in the development towards a gender-like system. And more generally, we have seen how useful distinctions can be reduced, as transparent classification becomes more opaque.

Abbreviations

Non-standard glosses are listed below.

      1. animal
      2. ass
      3. catch
      4. cl
      5. conj
      6. drink
      7. food
      8. general
      9. intj
      10. link
      11. ptv
      12. real
      1. domestic animal classifier
      2. associative
      3. catch classifier
      4. classifier
      5. conjunction
      6. drink classifier
      7. food classifier
      8. general classifier
      9. interjection
      10. linking preposition
      11. partitive
      12. realis

Data availability/Supplementary files

The individual participant data files, Python coding files and R analysis files can be found on the following link: https://doi.org/10.5281/zenodo.14250835 and on our project website: https://nominal-categorisation.surrey.ac.uk/projects/optimal-categorisation/experiments/.

Ethics and consent

This research has undergone a full ethical review and been granted ethical approval by the University of Surrey’s ethics committee.

Funding information

The support of the ESRC (UK), grant: ES/R00837X/1 ‘Optimal categorisation: the origin and nature of gender from a psycholinguistic perspective’ is gratefully acknowledged.

Acknowledgements

We wish to thank the 125 participants from the six different language communities in Vanuatu and New Caledonia. In particular, we are grateful to Willie Salong from the North Ambrym community; Madleen Ben from the Vatlongos community; Jack Graham Lui from the Lewo community; Melkio Wulmele and Adam Pike from the Merei community; Willion Phadom from the Nêlêmwa community; and Wejë Bae from the Iaai community. We thank Eleanor Ridge for allowing us access to her corpus of Vatlongos, and Sebastian Fedden for discussion of the typology of nominal classification systems. We are greatly indebted to three anonymous referees for their insightful comments, and to Lyn Tieu for helpful editorial suggestions.

Competing interests

The authors have no competing interests to declare.

Authors’ contributions

Conceptualisation, Funding Acquisition: Michael Franjieh, Alexandra Grandison, Greville G. Corbett; Methodology, Formal Analysis, writing – Original Draft: Michael Franjieh, Alexandra Grandison, Anne-Laure Dotte, Greville G. Corbett; Investigation: Michael Franjieh, Anne-Laure Dotte; Data Curation and Visualisation: Michael Franjieh.

Notes

  1. For comparison with Malayo-Polynesian see Schapper & McConvell (2024). [^]
  2. Example glosses for possessive constructions have been standardised across the paper, but original transcriptions kept. [^]
  3. Nêlêmwa is not alone in not having a general possessive classifier: related languages found in northern New Caledonia such as Zuanga-Yuanga also lack a general classifier and have prepositional possession (Bril 2023). [^]
  4. There are two linkers in Nêlêmwa. The linker i is used with human possessors and the linker o is used with collective/non-specific human or inanimate determiners (Bril 2012: 72). [^]
  5. These gorgeous illustrations we commissioned from Erin Aniker. They are being reused in literacy materials for schools in Vanuatu and New Caledonia. To date, these literacy materials include thematic dictionaries/grammars for four languages (North Ambrym, Lewo, Vatlongos and Iaai) and for storybooks for five languages (North Ambrym, Lewo, Vatlongos, Merei and Iaai). They are provided free of charge to the schools and are available for download at: https://nominal-categorisation.surrey.ac.uk/projects/optimal-categorisation/outreach/. [^]
  6. It is interesting that the initial classifier in example 13–1 is the food classifier, rather than the catch classifier. As Oceanic classifier systems are relational, they can encode intended relations as well as actual relations. Here it is the intention to eat the fish that is encoded, rather than the actual relation of catching it. [^]
  7. A brief explanation of the previously undescribed classifiers is given here. For Merei, the sor- carry classifier is used for items that are carried and resembles the semantics of the burden classifiers described for Nêlêmwa (Bril 2012: 367) and Iaai (Dotte 2017). The example from the firewood storyboard: sor-na tavaiabu ‘his carried firewood (carry.cl-3sg firewood)’. For Nêlêmwa, the shâ-yalavi- content-gather classifier occurred in the rat storyboard. On further elicitation we found that the classifier is used for items that are collected by gathering, such as shâ-yalavi-ny büa ‘my gathered blue-giant clams (content-gather-1sg blue-giant clam)’. We found that this classifier can also be used with kuvic ‘yam’. An alternative construction was offered using the general linking preposition, with our consultant saying that it has an identical meaning to the classifier usage: shâ-yalaw i na ‘my gathered possessions (content-gather link 1sg)’. This alternative construction was also given by a participant in one of the storyboard responses. The second classifier, shâ-ââdaxi- content-weapon, is used for animals that have been killed by your weapon. This classifier was given in the pig storyboard. On further elicitation, our language consultant also gave shâ-ââdaxi-ny dia ‘my deer, killed with a weapon (content-weapon-1sg deer)’. [^]
  8. We thank a reviewer for suggesting this approach. We explored other modelling strategies but found them to hold less explanatory power. Results for ANOVA and a GLMM with a single random intercept of individual storyboards can be found in the supplementary files. [^]
  9. For one Iaai participant all their responses did not meet the criteria for inclusion described previously and were omitted from this analysis. This participant’s responses were included in the subsequent analysis in section 4.2. [^]
  10. For Iaai, the hûe- wash classifier occurred with the noun köiö ‘water’ in the water storyboard: hûe-n köiö ‘his washing water (wash.cl-3sg water)’. This might be related to the verb hneû/hnûû ‘to wash’ Ozanne-Rivierre 1984: 62). The sûe- bathe classifier also occurred with the noun köiö ‘water’ in the water storyboard: sûe-n köiö ‘his bathing water (bathe.cl-3sg water)’. This classifier is more transparently derived from the verb sooû/sûûa ‘to wash’ (Ozanne-Rivierre 1984: 104), though translated as ‘to bathe (se baigner)’ by the participant. [^]
  11. Cramér’s V calculated using the rcompanion package in R (Mangiafico 2024). [^]

References

Bartoń, Kamil. 2024. Mumin: Multi-model inference. R package version 1.48.4. https://CRAN.R-project.org/package=MuMIn.

Bates, Douglas & Mächler, Martin & Bolker, Ben & Walker, Steve. 2015. Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67(1). 1–48. DOI:  http://doi.org/10.18637/jss.v067.i01

Bickel, Balthasar & Nichols, Johanna. 2013. Obligatory possessive inflection. In Dryer, Matthew S. & Haspelmath, Martin (eds.), WALS Online (v2020.3) [Data set]. Zenodo. DOI:  http://doi.org/10.5281/Zenodo.7385533. (Available online at http://wals.info/chapter/58, Accessed on 2024-03-13).

Bisang, Walter. 1993. Classifiers, quantifiers and class nouns in Hmong. Studies in Language 17. 1–51. DOI:  http://doi.org/10.1075/sl.17.1.02bis

Bril, Isabelle. 2012. Ownership, part–whole, and other possessive–associative relations in Nêlêmwa (New Caledonia). In Aikhenvald, Alexandra Y. & Dixon, Robert M. W. (eds.), Possession and ownership, 65–89. Oxford, New York: Oxford University Press. DOI:  http://doi.org/10.1093/acprof:oso/9780199660223.003.0002

Bril, Isabelle. 2023. Categorizing possession in Zuanga-Yuanga and other Kanak languages (New Caledonia): A typological perspective. Linguistics 61(6). 1403–1446. DOI:  http://doi.org/10.1515/ling-2022-0035

Bugaeva, Anna & Nichols, Johanna & Bickel, Balthasar. 2022. Appositive possession in Ainu and around the Pacific. Linguistic Typology 26(1). 43–88. DOI:  http://doi.org/10.1515/lingty-2021-2079

Burton, Strang & Matthewson, Lisa. 2015. Targeted construction storyboards in semantic fieldwork. In Bochnak, M. Ryan & Matthewson, Lisa (eds.), Methodologies in Semantic Fieldwork, 135–156. Oxford, New York: Oxford University Press. DOI:  http://doi.org/10.1093/acprof:oso/9780190212339.003.0006

Contini-Morava, Ellen & Kilarski, Marcin. 2013. Functions of nominal classification. Language Sciences 40. 263–299. DOI:  http://doi.org/10.1016/j.langsci.2013.03.002

Corbett, Greville G. 1991. Gender. Cambridge: Cambridge University Press. DOI:  http://doi.org/10.1017/CBO9781139166119

Corbett, Greville G. 2006. Agreement. Cambridge: Cambridge University Press.

Corbett, Greville G. 2013. Systems of gender assignment. In Dryer, Matthew S. & Haspelmath, Martin (eds.), WALS Online (v2020.3) [Data set]. Zenodo. DOI:  http://doi.org/10.5281/Zenodo.7385533 (Chapter available online at http://wals.info/chapter/32, Accessed on 2024–03–15.)

Corbett, Greville G. & Fedden, Sebastian. 2016. Canonical gender. Journal of Linguistics 52. 495–531. DOI:  http://doi.org/10.1017/S0022226715000195

Dixon, Robert M. W. 1982. Nominal classification. In Dixon, Robert. M. W. (ed.), Where have all the adjectives gone? and other essays in semantics and syntax, 157–233. Berlin: De Gruyter. DOI:  http://doi.org/10.1515/9783110822939

Dotte, Anne-Laure. 2017. Dynamism and change in the possessive classifier system of Iaai. Oceanic Linguistics 56(2). 339–363. DOI:  http://doi.org/10.1353/ol.2017.0017

Enger, Hans-Olav. 2013. Scandinavian pancake sentences revisited. Nordic Journal of Linguistics 36(3). 275–301. DOI:  http://doi.org/10.1017/S0332586513000280

Ezekiel, Sopa & Ridge, Eleanor. 2014. The pigeon and the heron. Pangloss collection. DOI:  http://doi.org/10.24397/pangloss-0000079

Faul, Franz & Erdfelder, Edgar & Buchner, Axel & Lang, Albert-Georg. 2009. Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses. Behavior Research Methods 41(4). 1149–1160. DOI:  http://doi.org/10.3758/BRM.41.4.1149

Fedden, Sebastian & Corbett, Greville G. Forthcoming. Typology of gender and classifiers. To appear in Schiller, Niels & Kupisch, Tanja (eds.), Oxford Handbook of Gender and Classifiers. Oxford: Oxford University Press.

Feist, Timothy. 2020. Nominal classification: Does it play a role in referent disambiguation? Studies in Language 44(1). 191–230. DOI:  http://doi.org/10.1075/sl.19026.fei

Franjieh, Michael. 2012. Possessive classifiers in North Ambrym, a language of Vanuatu: explorations in semantic classification. London: School of Oriental and African Studies, University of London PhD thesis.

Franjieh, Michael. 2016. Indirect possessive hosts in North Ambrym: Evidence for gender. Oceanic Linguistics 55(1). 87–115. DOI:  http://doi.org/10.1353/ol.2016.0011

Franjieh, Michael. 2018. North Ambrym possessive classifiers from the perspective of canonical gender. In Fedden, Sebastian & Audring, Jenny & Corbett, Greville G. (eds.), Non-canonical gender systems, 37–67. Oxford, New York: Oxford University Press. DOI:  http://doi.org/10.1093/oso/9780198795438.003.0003

Franjieh, Michael & Grandison, Alexandra & Dotte, Anne-Laure & Corbett, Greville G. 2022. Implementing free-listing: Possessive classifiers in Oceanic. Semantic Fieldwork Methods 4(2).

Greenberg, Joseph H. 1978. How does a language acquire gender markers? In Greenberg, Joseph H. & Ferguson, Charles A. & Moravcsik, Edith A. (eds.), Universals of Human Language: III: Word Structure, 47–82. Stanford: Stanford University Press.

Haig, Geoffrey & Forker, Diana. 2018. Agreement in grammar and discourse: A research overview. Linguistics 56(4). 715–734. DOI:  http://doi.org/10.1515/ling-2018-0014

Hockett, Charles F. 1958. A course in modern linguistics. New York: Macmillan.

Kassambara, Alboukadel. 2023. Rstatix: Pipe-friendly framework for basic statistical tests. R package version 0.7.2, https://rpkgs.datanovia.com/rstatix/.

Lichtenberk, Frantisek. 1983. Relational classifiers. Lingua 60(2–3). 147–176. DOI:  http://doi.org/10.1016/0024-3841(83)90072-4

Lichtenberk, Frantisek. 1985. Possessive constructions in Oceanic languages and in Proto- Oceanic. In Pawley, Andrew & Carrington, Lois (eds.), Austronesian linguistics at the 15th Pacific Science Congress, 93–140. Canberra: Pacific Linguistics, The Australian National University. DOI:  http://doi.org/10.15144/PL-C88.93

Lichtenberk, Frantisek. 2009. Attributive possessive constructions in Oceanic. In McGregor, William B. (ed.), The expression of possession, 249–252. Berlin and New York: Mouton de Gruyter. DOI:  http://doi.org/10.1515/9783110213232.249

Lichtenberk, Frantisek. 2018. The diachrony of Oceanic possessive classifiers. In McGregor, William B. & Wichmann, Søren (eds.), The diachrony of classification systems, 165–200. Amsterdam: John Benjamins. DOI:  http://doi.org/10.1075/cilt.342.07lic

Lynch, John & Ross, Malcolm & Crowley, Terry. (eds.) 2002. The Oceanic Languages. Richmond, Surrey: Curzon.

Mangiafico, Salvatore S. 2024. Rcompanion: Functions to support extension education program evaluation. Rutgers Cooperative Extension, New Brunswick, New Jersey. version 2.4.35, https://CRAN.R-project.org/package=rcompanion/.

Ozanne-Rivierre, Françoise. 1984. Dictionnaire Iaai-Français (Ouvéa, Nouvelle-Calédonie). Paris: SELAF.

Paparounas, Lefteris & Akkuş, Faruk. 2023. Anaphora and agreement in the Turkish DP: Delimiting binding-through-agree. Natural Language & Linguistic Theory 42. 633–700. DOI:  http://doi.org/10.1007/s11049-023-09583-4

Pek, Jolynn & Wong, Octavia & Wong, Augustine C. M. 2018. How to address non-normality: a taxonomy of approaches, reviewed, and illustrated. Frontiers in Psychology: Quantitative Psychology and Measurement 9. 2104. DOI:  http://doi.org/10.3389/fpsyg.2018.02104

R Core Team. 2023. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.

Reid, Nicholas. 1997. Class and classifier in Ngan’gityemerri. In Harvey, Mark & Reid, Nicholas (eds.), Nominal Classification in Aboriginal Australia (Studies in Language Companion Series 37), 165–228. Amsterdam: John Benjamins. DOI:  http://doi.org/10.1075/slcs.37.10rei

Ross, Malcolm. 2004. The morphosyntactic typology of Oceanic languages. Language and Linguistics 5(2), 491–541. http://hdl.handle.net/1885/87569.

Schapper, Antoinette & McConvell, William. 2024. Adnominal possession. In Adelaar, Alexander & Schapper, Antoinette (eds.), The Oxford Guide to the Malayo-Polynesian Languages of Southeast Asia, 825–844. Oxford: Oxford University Press. DOI:  http://doi.org/10.1093/oso/9780198807353.003.0048

Schneider, Cynthia. 2011. Na passive and na- associative in Abma: Shared properties; shared origin? Oceanic Linguistics 50(2). 380–398. DOI:  http://doi.org/10.1353/ol.2011.0030

Seifart, Frank. 2005. The structure and use of shape-based noun classes in Miraña (North West Amazon) (= MPI Series in Psycholinguistics, 33). Nijmegen: Max Planck Institute for Psycholinguistics. http://hdl.handle.net/2066/26990.

Seifart, Frank. 2018. The semantic reduction of the noun universe and the diachrony of nominal classification. In McGregor, William B. & Wichmann, Søren (eds.), The diachrony of classification systems, 9–32. Amsterdam: John Benjamins. DOI:  http://doi.org/10.1075/cilt.342.02sei

Singer, Ruth. 2016. The dynamics of nominal classification: Productive and lexicalised uses of gender agreement in Mawng. Berlin: De Gruyter Mouton. DOI:  http://doi.org/10.1515/9781614513698

Sutrop, Urmas. 2001. List Task and a Cognitive Salience Index. Field Methods 13(3). 263–276. DOI:  http://doi.org/10.1177/1525822X0101300303

Wegener, Claudia. 2012. A grammar of Savosavo. Berlin: De Gruyter Mouton. DOI:  http://doi.org/10.1515/9783110289657