The linking constraint and exhaustification

Hayes (1986) proposes the linking constraint, a convention for the interpretation of autosegmental rules. As stated, it holds that association lines should be “interpreted as exhaustive”. In the context of a rule, this means that the target and triggers are not permitted to have additional linkages not mentioned in the rule.

Later in the paper, Hayes makes it clear that this is to be interpreted with respect to whatever tiers are mentioned. For example, imagine a rule that manipulates the melodic/featural tier but is conditioned in part by the CV tier—Hayes adopts CV theory, but the “constraint” is just as applicable to approaches which use an X and/or moraic tier—then the rule does not apply to any susbstring of the melody whose melodies contain associations to the CV tier not mentioned. Similarly, imagine a rule that targets elements on the CV tier but is conditioned in part by the melody: such a rule would not apply to any substring of the CV tier with associations to the melody not explicitly stated in the rule.

I would like to claim that this is all too informal. It should be possible to state the substring that matches the rule using something like first-order logic (FOL), and similarly to translate the change into a logical statement. However, it’s not immediately clear how to write the procedure that translates autosegmental diagrams into the appropriate FOL sentences. (I put aside the encoding of the change: I think this will be comparatively easy.) Autosegmental diagrams itself are essentially an fragment of undirected graph, and translating these into FOL statements is straightforward enough: the description of the graph is defined by the logical conjunction of:

one-place predicates stating what type each element is (i.e., what tier its on),
two-place immediate-precedence predicates (when the rule refers to multiple elements on a given tier),
two-place (unordered) predicates indicating the association lines between tiers.

To enforce the linking constraint, one needs to add additional predicates to this conjunction that rule out associations not mentioned. Conceptually, I think of this as an exhaustification function (with apologies for the abuse of terminology) which takes the graph description above and returns the predicates needed to rule out forbidden associations in the relevant subgraphs. While I think I know what these exhaustifying predicates need to be for toy examples, I don’t yet know what the general algorithm is.

I am also somewhat concerned whether phonologists are (were?) applying the linking convention in a vibes-based fashion depending on the example in question, in which case no algorithm could properly describe analytical practices. Finally, I am interested in the feasibility of the opposite approach—a road not taken, as far as I know, in autosegmental theory— whereby undesired associations are ruled out simply by making the rule more explicit.

Does anyone know of any relevant work on this topic? Surely I am not the first person to be bothered by this.

References

Hayes, B. 1986. Inalterability in CV phonology. Language 62: 321-351.

In memoriam: Eugene Buckley

It was just announced that Gene Buckley has passed away. I took nearly all of his classes in graduate school, TAed for him, and he served on my dissertation committee, so he definitely had a profound influence on me and my work. I particularly remember his seminar on opacity, which introduced me to substance-free phonology, and his seminar on Kashaya, where I learned about the ancient, mysterious i → d / _ u rule and the many Russian loanwords from their contact with Fort Ross (e.g., [jaːpalka] ‘apple’ < яблоко, with devoicing, stress-conditioned lengthening, and CV-metathesis). Gene was an island of relative sanity and calm during my turbulent grad school year. He’ll be missed.

[Gene’s colleague Rolf Noyer wrote a brief memorial here.]

How not to acquire phonological exchanges

I recently gave a talk at the Canadian Linguistic Association (sort of like the LSA, but Canadian and frankly a lot better because it doesn’t have nearly as many prominent crackpots and cranks) with Charles Reiss on the notion of “exchange rules” as they’re understood in Logical Phonology (LP). Whereas SPE-era theories can use alpha-notation to generate exchange rules, LP can model exchange processes via a series of seemingly complex rules, either via a Duke of York gambit or opportunistic abuse of underspecification. Since it seems quite likely that purely phonological exchanges don’t in fact exist, we suggest that the language acquisition device is constrained so as to not profer analyses of those types, though we also consider that this may be an accident of the “diachronic filter” of the sort developed by Ju. Blevins, M. Hale, and J. Ohala. The handout is here for those interested, and like other things we’ve been writing about LP, will probably be included in a forthcoming book. One interesting question that we raise, but don’t answer, is what one ought to do about exchanges in Optimality Theory. Alderete (2001), for example, proposes to model them with a family of “anti-faithfulness constraints”. While one could predict the absence of exchanges by eliminating this constraint family, Alderete also uses anti-faithfulness for phenomena other than exchanges, and some of these may be more robustly attested; it’s not clear what ought to be done thus.

References

Alderete, J. 2001. Dominance effects as transderivational anti-faithfulness. Phonology 18: 201-253.

NAPhC talk on Polish and Logical Phonology

I just gave a talk at NAPhCxiii in Montréal in which I used Logical Phonology to characterize major patterns in noun stem allomorphy. With assiduous use of underspecification, I am able to render these patterns largely exceptionless. The handout is on LingBuzz and LOA here.

Tutorial on Substance-Free Logical Phonology

A few days ago, we (myself with the help of graduate student Rim Dabbous and professor Charles Reiss) gave a detailed tutorial on Substance-Free Logical Phonology (LP) at the LSA meeting in Philadelphia. I was pleasantly surprised with how many people showed up (I printed about 30 handouts, and we ran out) and how engaged they were. If the question period is any indication, our colleagues understood the theory quite well. The handout is now on LingBuzz and we have a lot more material coming for you soon.

I’ll try to summarize how LP fits into theories of exceptionality in a separate post soon.

Postscript

I also took the liberty of creating a simple archive for our Logical Phonology papers, the Logical Phonology Archive (LOA for short, pronounced [lwa] like French loi ‘law’ or Kreyòl loa ‘vodou spirit’). Technically, this site is sort of interesting: other than some static text, the table containing papers is an HTML iframe, dynamically generated from a Google Sheet using a template populated by server-side Javascript using the Google Apps Scripts platform; hat tip to my colleague Rivka Levitan for making me aware of this very handy possibility.

Underspecification: the Inkelasverse

Before I continue the series on theories of lexical exceptionality, I need to pause for a minute and review some work by Sharon Inkelas and colleagues: Inkelas and Cho 1993, Inkelas 1995, Inkelas and Orgun 1995, Inkelas et al. 1997. Inkelas and colleagues’ work has recently been generalized by Charles Reiss and colleagues (e.g., Bale et al. 2014, Reiss 2021, and Gorman & Reiss 2024, the latter under review and available on LingBuzz), turning Inkelas’s original idea into a generalized theory of exceptionality, and I’ll review that before long, in a separate post. I don’t consider the Inkelas(verse) approach to represent a complete theory of exceptionality and so I am not presenting them as such.

Preliminaries

A few other caveats are in order before I begin. First, underspecification is hardly a novel idea: it wasn’t even a new one when it first became a major subject of consideration in the 1980s. The American structuralists espoused the axiom of biuniqueness, which in some sense presages underspecification. At a high level, biuniqueness holds that there is a one-to-one correspondence between the phonemic and allophonic (“phonetic”) level. In practice, this rules out the correspondence modern phonologists term neutralization, in which the allophones of two distinct phonemes overlap. Formally, neutralization occurs when two phonemes /x, y/ (where x ≠ y) both have an allophone [z] (regardless of whether z ∈ {x, y} or not). A well-known example of neutralization in many dialects of English (Joos 1942) is the intervocalic [ɾ] in writer and rider. On the basis of write and ride, one would be tempted to set this up as an allophone of /t, d/, respectively, but this neutralization violates biuniqueness.¹

This axiom is generally consonant with the overall late structuralist search for a deterministic discovery procedure. One is tempted to liken the late structuralists to alchemists, a workable discovery procedure to the philosopher’s stone, and their failed search to the magnum opus.

The structuralists imposed no biuniqueness constraint on a more abstract, poorly-theorized “morphophonemic” level which would, apparently, contain generalizations like English post-tonic flapping. The morphophonemic level was poorly theorized so how exactly this ought to work is a little unclear. The archiphoneme, popularized by the Prague School structuralists Nikolai Trubetzkoy and Roman Jakobson, suggests one possibility. An archiphoneme is a sort of morphophonemic symbol not yet specified for features which are to be later neutralized. By convention they are denoted in derivations with capital letters not otherwise used for IPA symbols. In the case of post-tonic flapping, one might write the flap character as the arciphoneme /D/, and propose allomorphic or morphophonemic rules which insert it into writer and rider in place of /t, d/. In the Inkelasverse, archiphonemes are simply a way to annotate underspecification (here presumably of the Voice feature that distinguishes /t, d/), and their convention of using capital letters for underspecified segments calls back to this tradition.

Given the archiphonemic “escape hatch” and noting that many late structuralists also published influential morphonemic sketches (e.g., Bloomfield 1939, Chomsky 1951, Hamp 1953), one might reasonably question how closely held the biuniqueness axiom really was at the time. Chomsky and Halle (e.g., Halle 1959, Chomsky 1964, Chomsky & Halle 1965) argue that biuniqueness introduces a great deal of duplication, with morphophonemic rules reprising the same generalizations as phonemic ones. They thus propose to reject biuniqueness and to merge the phonemic and morphophonemics into a single level of description. Yet we see that Chomsky and Halle got a lot of pushback (see cited papers) from late structuralist éminences grises, so their rejection of biuniqueness was not uncontroversial at the time.

Secondly, there are a rather-distinct set of arguments, not considered in any detail by Inkelas and colleagues, for surface underspecification. Keating (1988), for example, provides phonetic arguments for the use of underspecification. If one permits underspecification on the surface, it is not obvious why it should not also be permitted in earlier stages of the derivation.

Jumping forward a bit, underspecification theory received a lot of attention in the late 1980s and early 1990s, but interest seems to have waned after Donca Steriade’s chapter in the first Blackwell Handbook of Phonology (1995), which reviews a number of theories of underspecification, taking a dim view of the overall enterprise. Steriade’s review is very good—I would recommend you read it if you haven’t—but as I remember it, the critiques mostly do not apply to the Inkelasverse.

Inkelas and Cho 1993

Inkelas and Cho (1993; henceforth I&C) propose a theory in which underspecification (or its absence, prespecification) is used to derive a number of interesting effects. Their slogan is: prespecification is inalterability; one might add that thus underspecification is mutability.

I&C begin with a common-place observation that geminates are often immune to processes which singleton segments undergo. They review attempts to derive this effect using autosegmental representations and various rule-application conventions, but conclude that no set of representations and conventions work in general. I&C propose that it is not geminate status per se but rather a more general convention of prespecification that is at play.

For example, in certain dialects of Berber, singleton consonants spirantize (and voice if they are pharnygealize), but geminates are always voiceless stops. Note that this is not merely a static generalization: the singleton and geminate forms are morphologically related, because these languages have templatic morphology and certain inflectional and/or derivational forms call for consonants to be geminated. Noting that Continuancy and Voice are generally non-contrastive for consonants, I&C propose:

(1) Berber consonants (after I&C, §5.2.1):
a. consonants, either singleton or geminate, are not underlyingly specified for Continuant or Voice.
b. a geminate-specific feature-filling rule inserts [−Continuant], and if [+Pharyngeal], [-Voice].²
c. a later feature-filling rule inserts [+Continuant], and if [+Pharyngeal], [-Voice].

Crucially, rule (1b) bleeds rule (1c) by virtue of being feature-filling, and thus geminates are exempt. This opens up a few new avenues to consider.

First, they note that some of the relevant dialects of Berber have apparent lexical exceptions: voiced, continuant geminates […ɣɣ…]. They propose that these apparent exceptions can be prespecified [+Voice, +Continuant]. Given the condition that rules (1bc) are feature-filling, these “exceptions” are not really exceptional at all! I&C note (§5.7) that this is a highly-restrictive theory of lexical exceptionality compared to earlier approaches but claim some advantages as well.

In a footnote (p. 556, fn. 26), I&C note that they cannot handle cases in which “exceptionality takes the form of failure to trigger, rather than failure to undergo, a rule” which “remain a problem for us until they can be resolved in a representational fashion”.

Secondly, they observe that the account given in (1) is in no way specific to geminate status. It requires, of course, that (1b) can target geminates to the exclusion of singletons, but this follows from their assumption that geminates are prespecified with two moras and a standard interpretative procedure; i.e., if a rule specifies that a target is linked to two moras, a segment linked to only one is not a target.³ Thus, this account generalizes to instances of “inalterability” which are not related to geminacy. For example, exceptions to vowel harmony in Turkish may simply reflect pre-specified Back and Round features which are “inalterable” with respect to structure-filling harmony rules.

Third, they note that this sort of account cannot be applied to structure-changing rules, and find no counterexamples. Indeed, they suggest, as many others have, that so-called structure-changing rules are decomposed into a feature-delinking rule followed by feature-insertion (or spreading) rule.

I note that a few examples given by I&C, such as Klingenheben’s Law in Hausa, are of no obvious relevance to synchronic phonology. For example, this law can be thought of as a diachronic change, which may or may not have had a synchronic analogue in the past, or as a static phonotactic constraint deriving (diachronically) from said change, which may not have any synchronic force.

I&C take their proposal to be an instance of Kiparsky’s notion of radical underspecification (RU), according to which features are binary and only marked feature values are present underlyingly. Indeed, their account of inalterability as prespecification is a good argument against privativity and thus for binarity. For instance, if Voice is privative (and thus equivalent to [+Voice] in a binary system), then there is no way for (1b) to block (1c). However, in my opinion RU does not acquit itself well in I&C’s account. They note some cases where it conflicts with their assumptions about feature markedness. Furthermore, it’s not clear to me what the presence of prespecified faux-exceptions I&C posit like the […ɣɣ…]s in Berber dialects means for RU.

Inkelas 1995

Inkelas (1995) attempts to relate some of I&C’s earlier insights to ideas in Optimality Theory (OT).

Inkelas begins with some now-famous data from Turkish. There are three kinds of consonant-final roots: those in which the final consonant is consistently voiceless (e.g., sanat/sanatı ‘art’; the second form is the accusative), those which are consistently voiced (etüd/etüdü ‘etude’), and those which alternate (e.g., kanat/kana[d]ı ‘wing’). Short of root suppletion, it is hard to imagine how one would handle this three-way distinction without making use of underspecification. Inkelas proposes, naturally, that the alternating kanat is underlyingly /kanaD/, where /D/ denotes a coronal stop underspecified for Voice, and the non-alternating final consonants are prespecified for Voice. This is once again an instance of prespecification as inalternability, because presumably the processes which insert final voice specifications are feature-filling and thus do not apply to fully-specified root-final /t, d/.

Inkelas considers but rejects an alternative using morpheme-level exceptionality, an idea pursued in some detail in SPE and subsequent work and still in vogue in OT circles. She suggests this is no different than introducing morpheme-specific multiple grammars, and that with this tool in place, it is hard to know when to stop proliferating grammars. For instance, while it might be sensible to treat one of the three voicing patterns in Turkish as exceptional, giving us two grammars, one could just as well derive all three patterns using the same underlying stem-final /d/ and three different grammars. In my opinion, one might equate “exceptionality” with the lack of productivity, and thus use standard diagnostics (e.g., perhaps child product errors, wug-tests, or evidence from loanword adaptation) and/or theories (e.g., Tolerance) to address this issue, and one might also argue against grammar proliferation on evaluation-metric grounds. But I agree that the morpheme-level theory is radically underconstrained without additional theoretical commitments along these lines.

At the same time, Inkelas also claims the use of underspecification to account for such phenomena as unconstrained. She suggests that the notion of lexicon optimization (LO), slightly revised, provides an appropriate constraint.

She points out that LO, as originally defined, is intended to pick out an “faithful” UR for cases of pure allophony, and proposes a variant of LO which is sensitive to the presence of alternations (her 16). Basically, the principle is to pick as the UR of a morpheme M the form which is most harmonic with respect to all allomorphs of M. While I understand the motivation here, this does not seem to be formal enough to implement. For instance, let us suppose S is the set of surface forms of M, and let I be the set of candidates for the UR. Note also that S and I may be disjoint (i.e., elements of I need not be observed surface forms), an assumption which will become important in a moment. Let us assume that, somehow, the grammar is already known. Finally, suppose that the child has computed, for each hypothetical UR in I, a tableau for surface forms s with respect to the grammar; there will be |S| × |I| such tableaux. Then, for candidate UR i, how exactly is the child to combine the harmony scores of |S| tableaux so that it can be compared to the scores for candidate UR j? In the cases Inkelas considers, |I| = 3 (because she considers, e.g., in the Turkish case, /t, d, D/), and the winning i harmonically bounds the other candidate URs, but it is not clear to me that this is generally true. However, I put these big issues aside for sake of argument.

Inkelas argues that, in the case that there are predictable alternations, this revised form of LO will select a UR with the underspecified segment with respect to the alternating features, and the UR will be fully specified otherwise. For instance, if Turkish in fact has high-ranked constraints motivating word-final plosive devoicing, it will select Voice-underspecified /kanaD-/ for kanat, whereas non-alternating sanat and etüd will be fully specified as /t, d/ respectively. This, she shows, is also the case in simpler two-way alternations, like ATR harmony in Yoruba: she argues that the revised form of LO will leave prefix vowels which undergo ATR harmony underspecified for ATR.

A key idea here is that underspecification is not driven by universal principles (as in earlier markedness-based approaches) but is quite intimately tied to the grammar. Indeed, her form of LO depends on the child having already converged on a grammar before setting (or perhaps “optimizing”) the URs.

This paper concludes with a number of implications for the proposal. She compares and constrasts her proposal with the Prague school archiphoneme, which shares some important similarities. More importantly, she notes that underspecification which gives rise to ternarity is an argument against privativity, since there is no way to derive ternarity in a privative feature system. I would add that, since there does not seem to be any principle that governs which features can be underspecified nor any evidence suggesting that some features cannot, there is no evidence that any features are privative.

Inkelas’s proposal is an dialogue with (her advisor) Kiparsky’s notion of absolute neutralization. Kiparsky (1973) introduces this notion to critique SPE‘s roundly-criticized notion that nightingale contains an underlying, never-surfacing velar fricative /x/ which blocks trisyllabic shortening, and what he actually critiques is positing an underlying /x/ which is deleted in all surface forms. Here the “absolute” part of “absolute neutralization” could easily be referring to the fact that /x/ is not merely neutralized, but entirely deleted. However, his proposed remedy, the alternation condition, is stated more broadly:

(2) Alternation Condition (Kiparsky 1973:65): neutralization processes cannot apply to all occurrences of a morpheme.

This makes it reasonably clear to me that he intends “absolute” to refer to the fact the neutralization (which just happens to be deletion in this case) occurs in all cases (i.e., across the board).

It seems to me that Inkelas’s proposal not only conforms to (2), at least in simple cases, it actually provides a glimpse into how the child builds URs that conform to (2). This is a marked improvement.

While Inkelas seems to view her proposal as closely tied to OT and its assumptions, I do not necessarily agree: I would argue that any coherent theory of phonology will have some analogue to lexicon optimization, a point already anticipated in the SPE-era evaluation metric.

Inkelas and Orgun 1995

Inkelas and Orgun (1995; henceforth I&O) attempt to merge the previous proposals about underspecification into a larger theory of morphological structure. Their object of study is a partial sketch of Turkish morphophonology, and as such, interacts with Inkelas and colleagues’ earlier discussion of various phonological phenomena in Turkish.

While it is not all that germane to the Inkelasian theory of underspecification, I should briefly summarize the novel elements of their morphological theory. First, I&O propose that morphemes are “prespecified” for the levels in which they are introduced. Secondly, I&O propose that these levels may have different phonology. Neither of these proposals is controversial, and echo, e.g., Siegel (1979 [1974]). More interestingly, though, they propose that (partial) forms are subject only to the phonology of the levels at which they derive, a principle they call level economy. Level economy means that, for example, a form which lacks level 3 morphemes is not subject in any way to level 3 morphology, and when a level 2 morpheme is inserted, the resulting (possibly partial) form is not subject to level 1 morphology, and so on.

One case of relevance to underspecification is their new take on Turkish ternary voicing (I&O:776f.), discussed earlier. They claim that monosyllabic roots are all of the non-alternating variety: e.g., at/atı ‘horse’ or ad/adı ‘name’. Their analysis of this restriction is novel: they propose that a hypothetical /aD-/ would be identical to at/atı because of the interaction a between final-consonant invisibility, a minimality condition which prevents this from applying to monosyllables, and level prespecification to exempt certain stems from minimality and thus preventing final-consonant invisibility.

Later in the paper (op. cit.:787f.) it is revealed that this generalization has many exceptions. I&O search a Turkish dictionary for plosive-final monosyllables, and filtering out those not known to the second author, a native speaker of Turkish. In total, there are 4 monosyllables ending in a plosive which are voiced and non-alternating, 77 which are voiceless, and 17 which are alternating. I&O propose that the 17 alternating stems are prespecified as level 1 morphemes, which ultimately prevents them from undergoing devoicing at level 1.

I for one am not convinced there are any meaningful generalizations here. First, these frequencies could very well be historical accident (a point made in a different context by Inkelas et al. 1997:402), and there is no evidence children acquiring Turkish attend to them. Indeed, the other elements of their analysis (minimality and final-consonant invisibility) are either motivated by other exception-filled generalizations (e.g., their analysis of velar drop), and some of these are static conditions on URs which may also be ignored by children. Secondly, it is not clear why I&O derive the “exceptionality” of alternating final plosives from their morphological prespecification, whereas non-alternating voiced stems, which are much rarer, are instead treated phonologically. I&O write that in their analysis, “[n]onalternating voiced final plosives are exceptional among monosyllables…we account for them through the prespecification of [+voice]”. But by their analysis alternating final plosives are in the same sense “exceptional” in lacking a voice specification: their use of level prespecification solves a problem that I&O have themselves introduced by making feature-changing devoicing as a level 1 process.

In conclusion, I much prefer the simpler analysis of ternary voicing provided by Inkelas (1995), which does not make mention of final-consonant invisiblity, minimality, or level prespecification.

Inkelas et al. 1997

In our final paper, Inkelas et al. (1997) provide a detailed critique of morphemic exceptionality, which they call cophonology. Much of their discussion is based on ternary voicing in Turkish, and a proposed static generalization known as labial attraction; the second is the ternary voicing alternation.

You have already heard what both Inkelas and colleagues and I have to say about ternary voicing; it is interesting in this regard that they cite, but do not rehearse, the more complex (and IMO problematic) analysis proposed by I&O two years earlier.

But before I tell you what they have to say, I should briefly discuss labial attraction. As it happens, chapter 3 of my dissertation (Gorman 2013) actually discusses labial attraction in some detail. This is in the context of a larger critique of phonotactic theory and its relation to ordinary phonology: I am in this work claiming that many phonotactic patterns derive directly from the occulting effect of phonological processes, and I argue that others are not necessarily internalized by speakers. I first perform a simple statistical analysis of labial attraction in Turkish using a digital lexicon of the language (Inkelas et al. 2000). In the analysis, I compare the frequency of disharmonic …aCu… to harmonic …aCı… sequences, conditioned on whether the intervening consonant is labial or not. As it happens, there is significantly more disharmony when the intervening consonant is non-labial, suggesting the proported lexical trend is not a trend at all.⁴ Secondly, I statistically reanalyze the results of a nonce word judgment task conducted by Zimmer (1969).⁵ This reanalysis finds that Zimmer’s Turkish speakers fails to find a significant effect of labial harmony: that is, they do not prefer disharmonic, labial-attraction nonce words like pamuz over harmonic nonce words like pamız. Pace Lees and Zimmer, there is no evidence that labial attraction exists, and subsequent theorizing based on it is fruit of the poisonous tree. Indeed, Inkelas et al. are also skeptical at various points about labial attraction. For instance, they write that “because of Lexicon Optimization, the lexicon of Turkish will look the same whether or not Labial Attraction is actually part of the grammar” (op. cit.:412; emphasis theirs), and provide tableaux establishing this point. I agree.

Let us put this all aside for sake of argument. What do Inkelas et al. propose?

First, they again endorse the use of underspecification for ternary alternations proposed by I&C and Inkelas (1995), and the use of underspecification for predictable alternations and prespecification elsewhere as proposed by Inkelas (1995). They express some concern whether prespecification can be made consistent with certain types of synchronic chain shifts.

Secondly, Inkelas et al. also tentatively endorse the notion that “structure-changing” rules ought to be decomposed into structure-deleting and structure-inserting rules; this idea, which has a long precedent, plays a major role in our recent work on Substance-Free Logical Phonology (LP). They express some concern as to whether this reintroduces “conspiracies” where structure-deleting and structure-inserting rules have similar contexts. I for one disregard this concern because I don’t regard Kisseberth’s (1970) notion of conspiracy to be sufficiently formal, and phonologists after Kisseberth have engaged in a sort of motivated cognition in which their intuitions about what is or is not a conspiracy are unduly informed by what kinds of redundancies they are prepared to excise. Let us, for sake of argument, formalize this notion as related to the decomposition proposed by Inkelas et al., using the tools of LP. A definition follows. Let us suppose that there exist two rules of the following form, where A, C, and D are natural classes and B is a set of feature specifications.

(4) A ∖ B / C _ D
(5) A ⊔ B / C _ D

Then, if (4-5) are both present in some grammar and there is no evidence to establish that some other rule applies after (4) but before (5), we will say that (4-5) are in a conspiracy. Our experience applying LP to real data suggests such conspiracies do occur but are not the norm. For instance, consider Gorman & Reiss’s (2024) analysis of Cervara metaphony. First, I repeat Gorman & Reiss’s rules (20-21) as (6-7); these are clearly in a conspiracy as defined above.

(6) [-Low, +ATR] ∖ {-High} / _ C₀ [+High]
(7) [-Low, +ATR] ⊔ {-High} / _ C₀ [+High]

The same is true of Gorman & Reiss’s rules (24-25). But this is not always the case; rules (19), (23) and (26-28) are all unification rules which are dissimilar to any subtraction rule. And it should be noted that Gorman & Reiss’s analysis is essentially a translation of an earlier analysis (Danesi 2022) into LP, and was not engineered to reach this conclusion.

Third, Inkelas et al. rehearse, in longer form, Inkelas’s (1995) earlier argument against cophonologies, and adds in a problem I discussed in an earlier post: cophonologies have little to say about cases in which one segment in a morpheme is exceptional with respect to some process but another is not. Their example, Spanish diphthongization, is not the best one, but Rubach’s discussion of yer-deletion in Polish, discussed in that post, is more germane.

Endnotes

Interestingly, as far as I can tell, Joos, a structuralist in good standing at the time, does not seem to have noticed the apparent violation of biuniqueness nor does he draw attention to the “morphophonemic” nature of flapping forced by biuniqueness.
I grant, sake of argument, that this “if pharyngeal, voiceless” sub-condition, reminiscent of the largely-forgotten SPE-era angle-bracket notation, can be encoded within a single rule, but I would probably prefer to split (1b) into two rules: one for spirantization and one for pharyngeal voicing, and similarly for (1c).
I&C adopt moraic theory, but I suspect that one could do something quite similar with X-theory. I&C also note that intervocalic geminates occupy both coda and onset position and this can have a similar effect. This works so long as syllable position is determined (i.e., syllables are parsed) before the relevant rules apply; it doesn’t require that it be prespecified.
There are other comparisons I could have done here, but I thought this was the most salient comparison. This comparison controls for the frequency at which a is followed by a [+High, +Back] vowel in the following syllable, and thus directly focuses attention on labial attraction as a source of exceptions to harmony.
My reanalysis was necessary because while Zimmer reports that he conducted a statistical analysis of the data, he provides no useful details about his statistical procedures.

References

Bale, A., Papillon, M., and Reiss, C. 2014. Targeting underspecified segments: a formal analysis of feature-changing and feature-filling rules. Lingua 148: 240-253.
Bloomfield, L. 1939. Menomini morphophonemics. Travaux de Cercle Linguistique de Prague 8: 105-115.
Chomsky, N. 1951. Morphophonemics of Modern Hebrew. Master’s thesis, University of Pennsylvania.
Chomsky, N. 1964. Current Issues in Linguistic Theory. Mouton.
Chomsky, N. and Halle, M. 1965. Some controversial questions in phonological theory. Journal of Linguistics 1: 97-138.
Danesi, P. 2022. Contrast and phonological computation in prime learning: raising vowel harmonies analyzed with emergent primes in radical substance free phonology. Doctoral Dissertation, Université Côte d’Azur.
Gorman, K. 2013. Generative phonotactics. Doctoral dissertation, University of Pennsylvania.
Gorman, K., and Reiss, C. 2024. Metaphony in Substance Free Logical Phonology. Submitted. URL: https://lingbuzz.net/lingbuzz/008634.
Halle, M. 1959. The Sound Pattern of Russian. Mouton.
Hamp, E. 1953. Morphophonemes of the Keltic mutations. Language 27: 230-247.
Inkelas, S. and Cho, Y.-M. 1993. Inalterability as prespecification. Language 69: 529-574.
Inkelas, S., 1995. The consequences of optimization for underspecification. In Proceedings of the 25th Meeting of the North East Linguistic Society, pages 287-302.
Inkelas, S. and Orgun, C.O., 1995. Level ordering and economy in the lexical phonology of Turkish. Language 71, 763-793.
Inkelas, S., Orgun, C.O., and Zoll, C. 1997. The implications of lexical exceptions for the nature of grammar. In I. Roca (ed.), Derivations and Constraints in Phonology, pages 393-419. Clarendon Press.
Inkelas, S. Küntay, A., Orgun, C.O., and Sprouse, R. 2000. Turkish Electronic Living Lexicon (TELL). Turkic Languages 4: 253-275.
Joos, M. 1942. A phonological dilemma in Canadian English. Language 18(2): 141-144.
Keating, P. 1988. Underspecification in phonetics. Phonology 5:275-292.
Kiparsky, P. 1973. Phonological representations: Abstractness, opacity and global rules. In O. Fujimura (ed.), Three Dimensions of Linguistic Theory, pages 57-86. TEC.
Kisseberth, C. W. 1970. On the functional unity of phonological rules. Linguistic Inquiry 1(3): 291-306.
Lees, R. B. 1966a. On the interpretation of a Turkish vowel alternation. Anthropological Linguistics 8: 32-39.
Lees, R. B. 1966b. Turkish harmony and the description of assimilation. Türk Dili Araştırmaları Yıllığı Belletene: 279-297.
Siegel, D. 1979. Topics in English Morphology. Garland Publishing.
Steriade, D. 1995. Underspecification and markedness. In J. Goldsmith (ed.), The Handbook of Phonological Theory, pages 114-174. Blackwell.

Languages with two i’s

[I’ve edited this after the fact to add a new case study. -KBG.]

Doing some research on underspecification and its use as an exceptionality-generating device for our work on Logical Phonology, I have encountered an common pattern: there are quite a few languages, of wide genetic and geographic distribution, whose description suggests their phonological inventory contains two underlying segments ordinarily realized as [i(ː)] but showing different phonological behaviors. The following is a brief catalog followed by some discussion.

Barrow Inupiaq

I have already discussed this well-known case, which is discussed by Dresher (2009:§7.2.1), who in turn has the data from Kaplan (1981:§3.22). In a number of dialects of Eskimo-Aleut, a four-phoneme vowel system *i, *u, *ə, *a has been simplified to an ordinary three-vowel system by the merger of *ə into *i. However, in Barrow Inupiaq, there is a “strong i” (< *i) which triggers palatalization of a following coronal consonant whereas “weak i” (< *ə) does not.

(1) Barrow Inupiaq (Kaplan 1981:§3.22, his 27-29):

a. iglu ‘house’, iglulu ‘and a house’, iglunik ‘houses’
b. ini ‘place’, inilu ‘and a place’, ininik ‘places’
c. iki ‘wound’, ikiʎu ‘and a wound’, ikiɲik ‘wounds’

Presumably, the stem-final i in (1b) is weak and the one in (1c) is strong. One possible way to derive this (slightly different from what I said in my earlier post) is to treat palatalization as triggered by [−Back, +High] strong i, leave weak i underspecified for Back, and then use the following late redundancy rule to fill in the missing specification for weak i.

(2) [+High] ⊔ {−Back}

Given the definition of unification, this will non-vacuously apply to weak i, vacuously apply to strong i, and unification with [+High, +Back] /u/ will fail; /a/ does not meet the structural description because it is [−High].

Blackfoot

Charles Reiss (personal communication) brings my attention to a case in Blackfoot. Frantz (2017:§6B) describes the situation clearly:

The initial vowel of stem ipii ‘enter,’ unlike the initial vowel of itsiniki ‘tell a story,’ always causes a preceding k to be replaced by the affricate ks. We will speak of this phenomenon as breaking of k, and of the i which is involved as a breaking i. For any morpheme which begins with i we need to know whether that i is a breaking i or not; if it is a breaking i, then if it immediately follows a morpheme ending in k we know that the k will be replaced by ks.

Naturally enough, Frantz proposes that there are two i‘s in Blackfoot. Thus far, one would expect the analysis to be parallel to our analysis of palatalization in Barrow Inupiaq, with “catalytic” (breaking), fully-specified /i/ and “quiescent” (non-breaking) underspecified segment (which Frantz writes as /I/). But there is there is one wrinkle; a few pages later Frantz (op. cit.:§6D) reports that the second person prefix k- is “impervious to breaking”. Thus, it seems like we also need two k‘s: one “mutable” (breaking), underspecified segment we can write as /K/, and one “inalterable” (non-breaking), fully-specified /k/. Breaking then occurs only in the sequence /…K-i…/.

Czech

Anderson & Browne (1973; henceforth AB) discuss several vocalic alternations in two registers of Czech. I will focus on the literary language (“LL” for AB; henceforth just Czech) and put aside additional complications associated with their analysis of the common spoken register, which makes some possibly problematic simplifying assumptions for sake of argument.

The surface front vowels of Czech are [i, iː, ɛ, ɛː].¹ However, specific instances of these three vowels either trigger or do not trigger palatalization of the preceding consonant.² For example, the masculine animate nom.pl. -i [-i] is palatalizing, but the feminine nom.pl -y [-i] and the gen.pl. -ych [-ix] are not:

(3) sestřin [sɛstr̝in] ‘sister’s’, sestřini [sɛstr̝iɲi] masc.anim. nom.pl.,
sestřiny [sɛstr̝ini] fem. nom.pl., sestřinych [sɛstr̝inix] gen.pl.

The same is true of [ɛ]-initial suffixes. For example, the -ě [-ɛ] prep.sg. suffix is palatalizating, but the -ech [-ɛx] prep.pl. is not (note the -o suffix in the citation form is the neut. nom.sg.):

(4) okno [okno] ‘window’, okně [okɲɛ] prep.sg., oknech [oknɛx] prep.pl.

Czech [i, iː] are reflexes of both Old Czech i(ː) and ɨ(ː), the latter two merging into the former two respectively. This distinction is preserved in the writing system as [i, iː] from earlier ɨ(ː) is written <y, ý> respectively. AB adopt the sensible proposal that this distinction is also preserved phonologically. That is, they suggest non-palatalizing <y, ý> are underlyingly [+High, −Low, −Round, +Back] and palatalizing <i, í> are [+High, −Low, −Round, −Back]. While they do not sketch out a full analysis, presumably <y, ý> are fronted after palatalization has applied. Alternatively (and more in tune with the approach of Gorman and Reiss 2024), one might instead propose that <y, ý> are [+High, −Low, −Round] and underspecified for [Back]. Palatalization will be triggered by a following [−Back] vowel, and then the following unification rule will (non-vacuously) front <y, ý>:

(5) [−Low, −Round] ⊔ {−Back}

As written, this will also also handle the parallel behavior of non-palatalizing <ě>, which gives it the same featural representation as palatalizing <e> [ɛ]. Under either account, though, Czech has two i‘s (in long and short variants) and two e‘s too.

Hungarian

Charles Reiss (p.c.) draws my attention to the existence of a roughly 60 Hungarian noun stems (Siptár & Törkenczy 2000:§3.2.2; henceforth ST) which are apparent exceptions to harmony in that they have a front vowel but select for back-vowel allomorphs. For instance, “regular” víz ‘water’ has the harmonic dative víznek and ablative víztől, and so on. But there are some exceptions, which ST call “antiharmonic”.

(6) Antiharmonic stems (after ST:68, their 33):
a. híd ‘bridge’, hídnak dat.sg. (*hídnek), hídtól abl.sg. (*hídtől)
b. szid ‘scold’, szidhat ‘may scold’, szidó ‘scolding’
c. héj ‘crust’, héjnak dat.sg., héjtól abl.sg.

According to ST, most antiharmonic stems are in i or í as in (6ab), and just a few are in é as in (6c). While these are traditionally understood as lexical (morpheme-level) exceptions, one possible account is to treat antiharmonic í, i, or é as having a different underlying specification than harmonic í, i, and é.

There are a number of possible ways this might work out. According to ST, í, i, and é are not so much “harmonic” but rather “neutral”. They claim the front-harmonic suffix allomorphs (like –nek and –től) are the default allomorphs. Indeed, stems consisting of a non-neutral vowel like á [aː], a [ɒ], o [o] or ó [oː] followed by í, i, and é select the back allomorphs, and do so without exception, suggesting í, i, and é are simply transparent to harmony.

(6) Back-neutral stems (after ST: loc. cit., their 34b):
a. papír ‘paper’, papírnak dat.sg., papírtól abl.sg.
b. dózis ‘dose’, dózisnak dat.sg., dózistól abl.sg.
c. kávé ‘coffee’, kávénak dat.sg., kávétól abl.sg.

If we adopt ST’s perspective, then one possibility is that harmonic-neutral í, i, and é are specified [−Low, −Round] and antiharmonic vowels are [−Low, +Back, −Round]—keeping them distinct from underlying u [u], ú [uː], and ó [oː]—and become [−Back] later in the derivation. The following context-free rules, applied in order after harmony has been computed, ought to suffice.³

(7) [−Low, −Round] ∖ {+Back}
(8) [-Low, -Round] ⊔ {−Back}

(7) gives harmony-neutral and antiharmonic front vowels the same representation, applying non-vacuously only to the latter, and (8) insures that both are realized as [−Back] on the surface. The [-Round] condition is essential to exempt front round vowels—namely ü [y], ű [yː], ö [ø], and ő [øː]—and the [-Low] condition exempts low vowels, which are back-harmonic.

Kashaya

Buckley (1994) describes four unusual rules in Kashaya, given below in extensional (i.e., non-featural) notation.

(9) i -> a / m _
(10) i -> u / d _
(11) V -> a / q _
(12) V -> o / q^w _

While (9-10) appear to be quite regular processes, he notes that 3 out of 21 i-initial suffixes (the inchoative -ibic and the reflexives -iyic’ and -ic’) fail to undergo them. Similarly, most i‘s undergo (11-12) but not the previously mentioned i-initial suffixes. With (11) the result is […ki…] instead of *[…qa…]; with (12) the result is […qo…] instead of *[…q^wo…].

Buckley proposes that Kashaya has two i‘s—mutable /i/ and inalterable /î/—which are surface-identical except with respect to (9-12). He proposes that /î/ is [+High] whereas /i/ is underlyingly underspecified for it. This analysis can be easily translated into Logical Phonology. For instance, (9) can be respecified as follows.⁴

(13) [+Vocalic] ⊔ {-High, +Low} / [+Labial, +Nasal, …] _

(13) applies non-vacuously to only one segment: underspecified /i/. Both High and Low must be specified since Kashaya also has mid vowels /e, o/. Then a context-free redundancy rule can then be used to “merge” /i, î/, non-vacuously applying to any underspecified /i/ remaining after (13) and the Logical Phonology analogues of (10-12).

(14) [+Vocalic] ⊔ {+High, -Low}

Spanish

The final case concerns the so-called raising verbs of the Spanish 3rd conjugation. This is a well-known case and you can read my our take on it in Gorman & Reiss 2024 (§4) here. Briefly, we argue that the “raising” verbs have an underlying vowel underspecified for High which is realized as i except when it is followed by a [-Back, +High] vowel in the following syllable. Verbs with the underspecified i-like vowel, like the pedir-pido ‘ask for (s.t.)’ are thus distinct from verb stems with non-alternating i (e.g., vivir-vivo ‘live’) or e (sumergir–sumerjo ‘submerge’).

Discussion

Summarizing our analyses:

Barrow Inupiaq has two i‘s, one “catalytic” and one “quiescient” with respect to palatalization.
Blackfoot as two i’s, one catalytic and one quiescent with respect to breaking (and also two k‘s, one mutable and one inalterable).
Czech has two i‘s (ignoring phonemic length; and two e’s, also ignoring phonemic length), one catalytic and one quiescient with respect to palatalization.
Hungarian has two i‘s (ignoring phonemic length; and two long e’s), one catalytic and one quiescient with respect to vowel harmony.
Kashaya has two i’s, one catalytic and inalterable with respect to several processes, and one quiescent and mutable with respect to those same processes.
Spanish has two i‘s, one mutable and one inalterable with respect to i-dissimilation.

The approach to underspecification developed by Sharon Inkelas and colleagues, whose slogan is “prespecification as inalterability”, is well-equipped to handle Kashaya and Spanish, where i‘s are underspecified to generate mutability and prespecified to generate inalterability. However, it has nothing to say about Barrow Inupiaq, Blackfoot, Czech, or Hungarian, a point which Inkelas & Cho (1993:556, fn. 26) concede. One of the innovations of Logical Phonology is that it generalizes their treatment of targets to triggers, adding a new slogan—”prespecification as catalysis”—allowing for all five cases to receive a more-uniform, purely phonological treatment.

Is there something about i, and perhaps e and a in some occasions, which might account for its apparent tendency to either mutate or resist mutation, or to either trigger or fail to trigger mutation? Let us call these tendencies duality. First, it should be said that this generalization is based on a handful of reasonably-well characterized languages, and it is not clear that it is a meaningful tendency. But, granting the significance of this tendency for sake of argument, one might be tempted to derive the duality of i from some phonetic—acoustic-auditory or articulatory—property of high front vowels. One can imagine many just-so stories. For instance, /i/ is very commonly as the default and/or epenthetic vowel, and I have been told that epenthetic segments are longer on average than non-epenthetic segments. Perhaps its default or epenthetic status, or an associated shorter temporal duration, accounts for its duality, though it is not clear how. Or perhaps duality derives from something about its phonetic cues. Logical Phonology, however, is a strictly substance-free approach, and as such, such explanations, however interesting, must be extra-grammatical, of the sort studied by Juliette Blevins and John Ohala among others.

Endnotes

I have taken the liberty of adapting AB’s transcriptions into IPA. I also follow standard conventions in transcribing the mid front vowels as [ɛ(ː)] rather than their [e(ː)], though nothing depends on this.
I put aside two details here. First, there appears to be a separate rule of velar palatalization which is not conditioned on the palatalizing/non-palatalizing contrast under discussion. Secondly, palatalization triggered by palatalizing e is realized as [j] insertion when the preceding consonant is a plain consonant (e.g., /v/) lacking a palatal allophone.
In Logical Phonology, feature-changing processes are always expressed as a deletion rule followed by a unification rule, so the apparent redundancy in (7-8) is intentional. This separation allows us to dispense with the distinction between feature-filling and feature-changing rules, and has other desirable properties discussed elsewhere.
In this rule I am using an ellipsis for the specific features of /m/ since the exact specification of this segment is unimportant here.

References

Anderson, S. A., and Browne, W. 1973. On keeping exchange rules in Czech. Papers in Linguistics 6: 445-482.
Buckley, E. 1994. Prespecification of default features: the two /i/’s of Kashaya. In Proceedings of the 24th Annual Meeting of NELS, pages 17-30.
Dresher, B. E. 2009. The Contrastive Hierarchy in Phonology. Cambridge University Press.
Franz, D. G. 2017. Blackfoot Grammar. 3rd edition. University of Toronto Press.
Gorman, K., and Reiss, C. 2024. Metaphony in Substance Free Logical Phonology. Submitted. URL: https://lingbuzz.net/lingbuzz/008634.
Inkelas, S. and Cho, Y.-M. 1993. Inalterability as prespecification. Language 69: 529-574.
Kaplan, L. D. 1981. Phonological Issues in North Alaskan Inupiaq. Alaska Native Language Center.
Siptár, P., and Törkenczy, M. 2000. The Phonology of Hungarian. Oxford University Press.

Optimality Theory on exceptionality

[This post is part of a series on theories of lexical exceptionality.]

I now take a large jump in exceptionality theory from the late ’70s to the mid-aughts. (I am skipping over a characteristically ’90s approach, which I’ll cover in my final post.) I will focus on a particular approach—not the only one, but arguably the most robust one—to exceptionality in Optimality Theory (OT). This proposal is as old as OT itself, but is developed most clearly in Pater (2006), and Pater & Coetzee (2005) propose how it might be learned. I will also briefly discuss the application of the morpheme-specific approach to the yer patterns characteristic of Slavic languages by Gouskova and Rysling, and Rubach’s critique of this approach.

I will have very little to say about the cophonology approach to exceptionality that has also been sporadically entertained in OT. Cophonology holds that morphemes may have arbitrarily different constraint rankings. Pater (henceforth P) is quite critical of this approach throughout his 2006 paper: among other criticisms, he regards it as completely unconstrained. I agree: it makes few novel predictions, and would challenge cophonologists (if any exist in 2024) to consider how cophonology might be constrained so as to derive interesting predictions about exceptionality.

Indexed constraints

Even the earliest work in Optimality Theory supposed that some constraints might be specific to particularly grammatical categories or morphemes. This of course is a loosening of the idea that Con, the universal constraint family, is language-universal and finite, but it seems to be a necessary assumption. P claims that this device is powerful enough to handle all known instances of exceptionality in phonology. The basic idea is extremely simple: for every constraint X there may also exist indexed constraints of the form X(i) whose violations are only recorded when the violation occurs in the context of some morpheme i.¹ There are then two general schemas that produce interesting results.

(1) M(i) >> F >> M
(2) F(i) >> M >> F

Here M stands for markedness and F for faithfulness. As will be seen below, (1) has a close connection to the notions of mutability and catalysis introduced in my earlier post; (2) in turn has a close connection with quiescence and inalterability.

One of P’s goals is to demonstrate that this approach can be applied to Piro syncope. His proposal is not quite as detailed as one might wish, but it is still worth discussing and trying to fill in the gaps. For P, the general syncope pattern arises from the ranking Align-Suf-C >> Max; in prose, it is permissible to delete a segment if doing so brings the suffix in contact with a consonant. This also naturally derives the non-derived environment condition since it specifically mentions suffixhood. P derives the avoidance of tautomorphemic clusters, previously expressed with the VC_CV environment, with the markedness constraint *CCC. This gives us *CCC >> Align-Suf-C >> Max thus far. This should suffice for derivations whose roots are all mutable and catalytic.

For P, inalterable roots are distinguished from mutable ones by an undominated, indexed clone of Max which I’ll call Max(inalt), giving us a partial ranking like so.

(3) Max(inalt) >> Align-Suf-C >> Max

This is of course an instance of schema (2). Note that since the ranking without the indexing is just Align-Suf-C >> Max, it seemingly treats mutability as the default and inalterability as exceptional, a point I’ll return to shortly.

Quiescent roots in P’s analysis are distinguished from catalytic ones by a lexically specific clone of Align-Suf-C; here the lexically indexed one targets the catalytic suffixes, so we’ll write it Align-Suf-C(cat), giving us the following partial ranking.

(4) Align-Suf-C(cat) >> Max >> Align-Suf-C

This is an instance of schema (1). It is interesting to note that the Align constraint bridges the distinction between target and trigger, since the markedness is a property of the boundary itself. Note also that it also seems to treat quiescence as the default and catalysis as exceptional.

Putting this together we obtain the full ranking below.

(5) *CCC, Max(inalt) >> Align-Suf-C(cat) >> Max >> Align-Suf-C

P, unfortunately, does not take the time to compare his analysis to Kisseberth’s (1970) proposal, or to contrast it with Zonneveld’s (1978) critiques, which I discussed in detail in the earlier post. I do observe one potential improvement over Kisseberth. Recall that Kisseberth had trouble with the example /w-čokoruha-ha-nu-lu/ [wčokoruhahanru] ‘let’s harpoon it’ , because /-ha/ is quiescent and having a quiescent suffix in the left environment is predicted counterfactually to block deletion in /-nu/. As far as I can tell this is not a problem for P; the following suffix /-lu/ is on the Align-Suf-C(cat) lexical list and /-nu/ is not on the Max(inalt) list and that’s all that matters. Presumably, P gets this effect because of the joint operation of the two flavors of Align-Suf-C and *CCC means has properly localized the catalysis/quiescence component of the exceptionality. However, P’s analysis does not seem to generate the right-to-left application; it has no reason to favor the attested /n-xipa-lu-ne/ [nxipalne] ‘my sweet potato’ over *[nxiplune]. This reflects a general issue in OT in accounting for directional application.

As I mentioned above, P’s analysis of Piro treats mutability and quiescence as productive and inalterability and catalysis as exceptional. Indeed, it predicts mutability and quiescence in the absence of any indexing, and one might hypothesize that Piro speakers would treat a new suffix of the appropriate shape as mutable and quiescent. I know of no reason to suppose this is correct; for Matteson (1965), these are arbitrary and there is no obvious default, whereas my impression is that Kisseberth views mutability (like P) and catalysis (unlike P) as the default. This question of productivity is one that I’ll return to below as I consider how indexing might be learned.

Learning indexed constraints

Pater and Coetzee (2005, henceforth P&C) propose indexed constraint rankings can be learned using a variant of the Biased Constraint Demotion (BCD) algorithm developed earlier by Prince and Tesar (2004). Most of the details of that algorithm are not strictly relevant here; I will focus on the ones that are. BCD supposes that learners are able to accumulate UR/SR pairs and then use the current state of their constraint hierarchy to record them as a data structure called a called mark-data pair. These give, for each constraint violation, whether that violation prefers the actual SR or a non-optimal candidate. From a collection of these pairs it is possible to rank constraints via iterative demotion.² The presence of lexical exceptionality produces a case where it is not possible to for vanilla BCD to advance the demotion because a conflict exists: some morphemes favor one ranking whereas others favor another. P&C propose that in this scenario, indexed constraints will be introduced to resolve the conflict.

P&C are less than formal in specifying how this cloning process works, so let us consider how it might function. Their example, a toy, concerns syllable shape. They suppose that they are dealing with a language in which /CVC/ is marked (via NoCoda) but there are a few words of this shape which surface faithfully (via Max). They suppose that this results in a ranking paradox which cannot be resolved with the existing constraints. As stated, I have to disagree: their toy provides no motivation for NoCoda >> Max.³ Let us suppose, though, for sake of argument that there is some positive evidence, after all, for that ranking. Perhaps we have the following.

(6) Toy grammar (after P&C):
a. /kap/ – > [ka]
b. /gub/ -> [gu]
c. /net/ -> [net]
d. /mat/ -> [mat]

Let us also suppose that there is some positive evidence that /kap, gub/ are the correct URs so they are not changed to faithful URs via Lexicon Optimization. Then, (6ab) favor NoCoda >> Max but (6cd) favor Max >> NoCoda. P&C suppose this is resolved by cloning (i.e., generating an indexed variant of) Max, producing a variant for each faithfully-surfacing /CVC/ morpheme. If these morphemes are /net/ and /mat/, then we obtain the following partial ranking after BCD.

(7) Max(net), Max(mat) >> NoCoda >> Max

This is another instance of schema (2); there are just multiple indexed constraints in the highest stratum. Indeed, P&C imagine various mechanisms by which Max(net) and Max(mat) might be collapsed or conflated at a later stage of learning.

It is crucial to the P&C’s proposal that the child actual observes the exceptional morphemes both of (6cd) surfacing faithfully; however, it is not necessary to observe (6ab), just to observe some morphemes in which, like in (6ab), a coda consonant is deleted so as to trigger cloning. The critical sample for (7), then, is either (6acd) or (6bcd). It is not necessary to see both (6a) and (6b), but it is necessary to see both of (6cd). Thus, there is some very real sense in which this analysis treats coda deletion as the productive default and coda retention as exceptional behavior, much like how P’s analysis of Piro treated mutability and quiescence as productive. However, it seems like P&C could have instead adapted schema (1) and proposed that what is cloned is NoCoda, obtaining the following ranking.

(8) NoCoda(kap), NoCoda(gub) >> Max >> NoCoda

Then, for this analysis, the crucial sample is either (6abc) or (6abd), and there is a similar sense in which coda retention is now the default behavior.

P&C give no reason to prefer (7) over (8). Reading between the lines, I suspect they imagine that the relative frequency (i.e., number of morpheme types) which either retain or lose their coda is the crucial issue, and perhaps they would appeal to an informal “majority-rules” principle. That is, if forms like (6ab) are more frequent than those like (6cd) they would probably prefer (7) and would prefer (8) if the opposite is true. However, I think P&C should have taken up this question and explained what is cloned when. Indeed, there is an alternative possibility: perhaps cloning produces all of the following constraints in addition to Max and NoCoda.

(9) Max(kap), Max(gub), Max(net), Max(mat), NoCoda(kap), NoCoda(gub), NoCoda(net), NoCoda(mat)

While I am not sure, I think BCD would be able to proceed and would either converge on (7) or (8), depending on how it resolves apparent “ties”.

Another related issue, which also may lead to the proliferation of indexed constraints, is that P&C have little to say about how constraint cloning words in complex words. Perhaps the cloning module is able to localize the violation to particular morphemes. For instance, it seems plausible that one could inspect a Max violation, like the ones produced by Piro syncope, to determine which morpheme is unfaithful and thus mutable. However, if we wish to preserve P’s treatment of mutability as the default (and that inalterable morphemes have a high-ranked Max clone), we instead need to do something more complex: we need to determine that a certain morpheme does not violate Max (good so far), but also that under a counterfactual ranking of this constraint and its “antagonist” Align-Suf-C, would have done so; this may be something which can be read off of mark-data pairs, but I am not sure. Similarly, to preserve P’s treatment of quiescence as the default, we need to determine that a certain suffix has an Align-Suf-C violation (again, good so far), but also that under a counterfactual ranking of this constraint and its antagonist, it would have not done so.

While I am unsure if this counterfactual reasoning part of the equation can be done in general, I can think of least one case where the localization reasoning cannot be done: epenthesis at morpheme boundaries, as in the [-əd] allomorph of the English regular past. Here there is no sense in which the Dep violation can be identified to a particular morpheme. Indeed, Dep violations are defined by the absence of correspondence. This is a perhaps an unfortunate example for P&C’s approach. English has a number of few “semiweak” past tense forms (e.g. from Myers 1987: bit, bled, hid, met, sped, led, read, fed, lit, slid) which are characterized by a final dental consonant and shortening of the long nucleus of the present tense form. Given related pairs like keep-kept, one might suppose that these bear a regular /-d/ suffix, but fail to trigger epenthesis (thus *[baɪtəd], etc.). To make this work, we assume the following.

(10) Properties of semiweak pasts:
a. Verbs with semiweak pasts are exceptionally indexed to a high-ranking Dep constraint which dominates relevant syllable structure markedness constraints.
b. Verbs with semiweak pasts are exceptionally indexed to high-ranking markedness constraint(s) triggering “Shortening” (in the sense of Myers 1987)
c. A general (i.e., non-indexed) markedness constraint against hetero-voiced obstruent clusters dominates antagonistic voice faithfulness constraints.

The issue is this: how do children localize the failure of epenthesis in (10a) to the root and not the suffix, given that the counterfactual epenthetic segment is not an exponent of either, occurring rather at the boundary between the two? Should one reject the sketchy analysis given in (10), there are surely many other cases where correspondence alone is insufficient; for example, consider vowels which coalesce in hiatus.

The yers

I have again already gone on quite long, but before I stop I should briefly discuss the famous Slavic yers as they relate to this theory.

In a very interesting paper, Gouskova (2012) presents an analysis of the yers in modern Russian. In Russian, certain instances of the vowels e and o alternate with zero in certain contexts. These alternating vowels are termed yers in traditional Slavic grammar. A tradition, going back to early work by Lightner (1965), treats yers in Russian and other Slavic languages, as underlyingly distinct from non-alternating e and o, either featurally or, in later work, prosodically. For example, лев [lʲev] ‘lion’ has a genitive singular (gen.sg.) льва [lʲva] and мох [mox] ‘moss’ has a gen.sg. [mxa].

Gouskova (henceforth G) wishes to argue that yer patterns are better analyzed using indexed constraints, thus treating morphemes with yer alternations as exceptional rather than treating the yer segments as underspecified. In terms of the constraint indexing technology, G’s analysis is straightforward. Alternating vowels are underlyingly present in all cases, and their deletion is triggered by a high-ranked constraint *Mid (which disfavors mid vowels, naturally) which is indexed to target exactly those morphemes which contain yers. Additional phonotactic constraints relating to consonant sequences are used to prevent deletion that produces word-final consonant clusters. Roughly, then, the analysis is:

(11) *CC]_σ >> *Mid(yer morphemes) >> Max-V >> *Mid

As G writes (99-100, fn. 18): “In Russian, deletion is the exception rather than the rule: most morphemes do not have deletion, and neither do loanwords…”

It should be noted that G’s analysis departs from the traditional (“Lightnerian”) analysis in ways not directly to the question of localizing exceptionality (i.e., in the morpheme vs. the segment). For one, (11) seems to frame retention of a mid vowel as a default. In contrast, the traditional analysis does not seem to have any opinion on the matter. In that analysis, whether or not a mid vowel is alternating is a property of its underlying form, and should thus be arbitrary in the Saussurean sense. This is not to say that we expect to find yers in arbitrary contexts. There are historical reasons why yers are mostly found in the final syllable—this is the one of the few places where the historical sound change called Havlík’s Law, operating more or less blindly, could introduce synchronic yer/zero alternations in the first place (in many other contexts the yers were simply lost), and in other positions it is impossible to ascertain whether or not a mid vowel is a yer. Whether or not an alternative versions of the sound change could have produced an alternative-universe Russian where yers target the first syllable is an unknowable counterfactual given that we live in our universe, with our universe’s version of Havlík’s Law. Secondly, the traditional analysis (see Bailyn & Nevins 2008 for a recent exemplar) usually conditions the retention of yers on the presence of a yer (which may or may not be itself retained) in the following syllable. In contrast, G does not seem to posit yers for this purpose nor does she condition their retention on the presence of nearby yers. In the traditional analysis, these conditioning yers are motivated by the behavior of yers in prefixes and suffixes in derivational morphology, and much of this hinges on apparent cyclicity. G provides an appendix in which she attempts to handle several of these issues in her theory, but it remains to be seen whether this has been successful in dismissing all the concerns one might raise.

G provides a few arguments as to why the exceptional morpheme analysis is superior to the traditional analysis. G wishes to establish that mid vowels are in fact marked in Russian, so that yer deletion can take a something of a “free ride” on this constraint. As such, she claims that yer deletion is related to the reduction of mid vowels in unstressed syllables. But how do we know that these these facts are connected? And, if they are in fact connected, is it possible that there is an extra-grammatical explanation? For instance, there may be a “channel bias” in production and/or perception that disfavors faithful realization of mid vowels (and thus imposes a systematic bias in favor of reduction and deletion) compared to the more extreme phonemic vowels (in her analysis, /a, i, u/) which caused the actuation of both changes. Phenomenologically speaking, it is true that there are two ways in which certain Russian mid vowels are unfaithful, but this is just one of a infinite set of true statements about Russian phonology, and there is something “just so” about this one.

Before I conclude, let us now turn briefly to Polish. Like Russian, this language has mid-vowels which alternate with zero in certain contexts. (Unlike Russian, for whatever reason, the vast majority of alternating vowels are e; there are just three morphemes which have an alternating o.)

Rubach (2013, 2016), explicitly critiques constraint indexation using data from Polish. Rubach argues that G’s analysis cannot be generalized straightforwardly to Russian. He draws attention to stems that contain multiple mid vowels, only one of which is a yer (e.g., sfeter/sfetri ‘sweater’); and concludes that it is not necessarily possible to determine which (or both) should undergo deletion in an “exceptional” morpheme. The only mechanism with which one might handle this is a rather complex series of markedness constraints on consonant sequences. Unfortunately, Polish is quite permissive of complex consonant clusters and this mechanism cannot always be relied upon to deliver the correct answer. He also draws attention to the behavior of derivational morphology such as double diminuitives. In contrast, Rysling (2016) attempts to generalize G’s indexed constraint analysis of yers to Polish. However, her analysis differs from G’s analysis of Russian in that she derives the yers from epenthesis to avoid word-final consonant clusters. Furthermore, for Rysling, epenthesis, in the relevant phonotactic contexts (to a first approximation, certain C_C#), is the default, and failure to epenthesize is exceptional.⁵ Sadly, there is little interaction between the Rubach and Rysling papers (the latter briefly discusses the former’s 2013 paper), so I am not prepared to say whether Rysling’s radical revision addresses Rubach’s concerns with constraint indexation.

References

P and colleagues refer to these constraints as “lexically specific”, but in fact it seems the relevant structures are all morphemes, and never involve polymorphemic words or lexemes.
As far as I know, though, there is no proof of convergence, under any circumstances, for BCD.
Perhaps they are deriving this from the assumption that the initial state is M >> F, but without alternation evidence, BCD would rerank this as Max >> NoCoda and cloning would not be triggered.
A subsequent rule of obstruent voice assimilation, which is needed independently would give us [kɛpt] from /kip-d/, and so on.
Rysling seems to derive this proposal from an analysis of lexical statistics: she counts how many Polish nouns have yer alternations in the context …C_C# and compares this to non-alternating …CeC# and …CC#. It isn’t clear to me how the proposal follows from the statistics, though: non-epenthesis and epenthesis in …C_C# are about equally common in Polish, and their relative frequencies are not much different from what she finds in Russian.

References

Bailyn, J. F. and Nevins, A. 2008. Russian genitive plurals are impostors. In A. Bachrach and A. Nevins (ed.), Inflectional Identity, pages 237-270. Oxford University Press.
Gouskova, Maria. 2012. Unexceptional segments. Natural Language & Linguistic Theory 30: 79-133.
Kenstowicz, M. 1970. Lithuanian third person future. In J. R. Sadock and A. L. Vanek (ed.), Studies Presented to Robert B. Lees by His Students, pages 95-108. Linguistic Research.
Lightner, T. Segmental phonoloy of Modern Standard Russian. Doctoral dissertation, Massachusetts Institute of Technology.
Matteson, E. 1965. The Piro (Arawakan) Language. University of California Press.
Myers, S. 1987. Vowel shortening in English. Natural Language & Linguistic Theory 5(4): 485-518.
Pater, J. and Coetzee, A. W. 2005. Lexically specific constraints: gradience, learnability, and perception. In Proceedings of the Korea International Conference on Phonology, pages 85-119.
Pater, J. 2006. The locus of exceptionality: morpheme-specific phonology as constraint indexation. In University of Massachusetts Occasional Papers 32: Papers in Optimality Theory: 1-36.
Pater. J. 2009. Morpheme-specific phonology: constraint indexation and inconsistency resolution. In S. Parker (ed.), Phonological Argumentation: Essays on Evidence and Motivation, pages 123-154. Equinox.
Prince, A. and Tesar, B. 2004. Learning phonotactic distributions. In Kager, R., Pater, J. and Zonneveld, W. (ed.), Constraints in Phonological Acquisition, pages 245-291. Cambridge University Press.
Rubach, J. 2013. Exceptional segments in Polish. Natural Language & Linguistic Theory 31: 1139-1162.
Rysling, A. 2016. Polish yers revisited. Catalan Journal of Linguistics 15:121-143.
Zonneveld, W. 1978. A Formal Theory of Exceptions in Generative Phonology. Peter de Ridder.

Kisseberth & Zonneveld on exceptionality

[This post is part of a series on theories of lexical exceptionality.]

In a paper entitled simply “The treatment of exceptions”, Kisseberth (1970) proposes an interesting revision to the theory of exceptionality. Many readers may be familiar with the summary of this work given by Kenstowicz & Kisseberth 1977:§2.3 (henceforth K&K). Others may know it from the critique by Zonneveld (1978: ch. 3) or Zonneveld’s (1979) review of K&K’s book. I will discuss all of these in this post.

Kisseberth (1970)

A quick sidebar: Kisseberth’s paper is a fascinating scholarly artifact in that it probably could not be published in its current form today. (To be fair it was published in an otherwise-obscure journal, Papers in Linguistics.) For one, all the data is drawn from Matteson’s (1965) grammar of Piro;¹ the only other referenced work is SPE. Kisseberth (henceforth K) gives no page, section, or example numbers for the forms he cites. I have tried to track down some of the examples in Matteson’s book, and it is extremely difficult to find them. K gives no derivations, only a few URs, and the entire study hinges around a single rule. But it’s provocative stuff all the same.

K observes that Piro has a rule which syncopates certain stem-final vowels. He gives the following formulation:

(1) Vowel Drop:  V -> ∅ / VC __ + CV 

For example, [kama] ‘to make, form’ has a nominalization [kamlu] ‘handicraft’ with nominalizing suffix /-lu/, and [xipalu] ‘sweet potato’ has a possessed form /n-xipa-lu-ne/ [nxipalne] ‘my sweet potato’.² One might think that (1) is intended to be applied simultaneously, as this is the convention for rule application in SPE, but this would predict *[nxiplne], with a medial triconsonantal cluster. Left-to-right application gives *[nxiplune]; the only way to get the observed [nxipalne] is via right-to-left (RTL) application, which I’ll assume henceforth. As far as I know, the directionality issue has not been noticed in prior work.

Of course, there are exceptions of several types. (I am drawing additional data from the unpublished paper by CUNY graduate student Héctor González, henceforth G. I will not make any attempt to make González’s transcriptions or glosses comparable to those used by K, but doing so should be straightforward.)

One type is exemplified by /nama/ ‘mouth of’, which does not undergo Vowel Drop, as in /hi-nama-ya/ [hinamaya] ‘3sgmpssr-mouth.of-Obl.’ (G 5a); under RTL application we would expect *[hinmaya]. This is handled easily in the SPE exceptionality theory I reviewed a few weeks ago by marking /nama/ as [-Vowel Drop].

However, other apparent instances of exceptionality are not so easily handled. Consider two forms involving the verb root /nika/ ‘eat’. In /n-nika-nanɨ-m-ta/ [hnikananɨmta] ‘1sg-eat-Extns-Nondur-Vcl’ (G 5b) both vowels of the root satisfy (1) but do not undergo syncope. One might be tempted to mark this root as [-Vowel Drop], but it does undergo deletion in other derivations, such as in /nika-ya-pi/ [nikyapi] ‘eat-Appl-Instr.Nom’ (G 4d). Rather, it seems to be that the following /-nanɨ/ fails to trigger deletion. This is not easily handled in the SPE approach. K gives a number of similar examples involving the verbal theme suffixes /-ta/ and /-wa/, which also do not trigger syncope. If a morphemes vary in whether or not they undergo and whether or not they trigger Vowel Drop, one can imagine that these properties might cross-classify:

Mutable, catalytic The nominalizing suffix /-lu/, discussed above, is both mutable (i.e., undergoes syncope), and catalytic (triggers syncope) in /n-xipa-lu-ne/ [nxipalne].
Inalterable, catalytic I have not found any relevant examples in Piro; Kenstowicz & Kisseberth 1977 (118f.) present a hastily-described example from Slovak.
Mutable, quiescent /meyi-wa-lu/ [meyiwlu] ‘celebration’ shows that intranstive verb theme suffix /-wa/ is mutable but quiescent (does not trigger syncope; *[meywalu]).
Inalternable, quiescent /yimaka-le-ta-ni-wa-yi/ shows that the imperfective suffix /-wa/ (not to be confused with the homophonous intransitive /-wa/) is inalterable; /r-hina-wa/ [rɨnawa] ‘3-come-Impfv’ (G 6c) shows that it is quiescent (*[rɨnwa]).

According to G, there is also one additional category that does not fit into the above taxonomy: the elative suffix /-pa/ triggers deletion of the penultimate (rather than preceding) vowel, as in /r-hitaka-pa-nɨ-lo/ [rɨtkapanro] ‘3-put-Evl-Antic-3sgf’ (G 7a). Furthermore, /-pa/ appears to lose its catalytic property when it undergoes syncope, as in /cinanɨ-pa-yi/ [cinanɨpyi] ‘full-Elv-2sg’ (G 7c). Given the rather unexpected set of behaviors here, apparently confined to a single suffix, I wonder if this is the full story.

Having reviewed this data, I don’t have an abundance of confidence in it, particularly given K’s hasty presentation. However, K has identified something not obviously anticipated by the SPE theory. K’s proposal is a simple extension of the SPE theory; in addition to rule features for the target, we also need rule features for the context. For instance, inalterable, quiescent imperfective marker /-wa/,⁴ which neither undergoes nor triggers Vowel Drop, would be underlyingly [-rule Vowel Drop, –env. Vowel Drop]. Then, the rule interpretative procedure applies a rule R when its structural description is met, when the target is [+rule R], and when the all morphemes in the context are [+env. R].

Zonneveld (1978, 1979)

I have already gone on pretty long, but I should briefly discuss what subsequent writers have had to say about this proposal. Kenstowicz & Kisseberth (1977, henceforth K&K), perhaps unsurprisingly endorse the proposal, and provide some very hasty examples of how one might use this new mechanism. Zonneveld (henceforth Z), in turn, is quite critical of K’s theory. These criticisms are laid out in chapter 3 of Zonneveld 1978 (a published version of his doctoral dissertation), which reviews quite a bit of contemporary work dealing with this issue. The 1978 book chapter (about 120 typewritten pages in all) is a really good review; it is well organized and written, and full of useful quotations from the sources it reviews, and while it is somewhat dense it is hard to imagine how it could be made less so. Z reprises the criticisms of K’s theory briefly, and near verbatim, in his uncommonly-detailed review of K&K’s book (Zonneveld 1979). Z has several major criticisms of rule environment theory.

First, he draws attention to an example on where the conventions proposed by K will fail; I will spell this out in a bit more detail than Z does. The key example is /w-čokoruha-ha-nu-lu/ [wčokoruhahanru] ‘let’s harpoon it’. The anticipatory /-nu/ is mutable (but quiescent) and it is in the phonological context for syncope. To its left is the ‘sinister hortatory’ /-ha/, and this is known to be quiescent because it does not trigger deletion of the final vowel in /čokoruha/; cf. /čokoruha-kaka/ [čokoruhkaka] ‘to cause to harpoon’, which shows that the substring /…čokoruha-ha…/ does not undergo deletion because /-ha/ is quiescent rather than because /čokoruha/ is inalterable. To its right is the catalytic /-lu/. By K’s conventions, syncope should not apply to the /u/ in the anticipatory morpheme because /-ha/, in the left context, is [-env. Vowel Drop], but in fact it does. Z anticipates that one might want to introduce separate left and right context environment features: maybe /-ha/ is [+left env. Vowel Drop, -right env. Vowel Drop]. The following additional issues suggest the very idea is on the wrong track, though.

Seccondly, Z shows that rule environment features cause additional issues if one adopts the SPE conventions. The /-ta/ in /yona-ta-nawa/ [yonatnawa] ‘to paint oneself’ is presumably quiescent because it fails to trigger syncope in /yona/.⁵ Thus we would expect it to be lexically [-env. Vowel Drop], and for this specification to percolate to the segments /t/ and /a/. (I referred to this as Convention 2 in my previous post, and K adopts this convention.) However, it is a problem for this specification to be present on /t/, since that /t/ is itself in the left context for Vowel Drop, and this would counterfactually block its application to the second /a/! This is schematized below.

(2) Structural description matching for /yona-ta-nawa/

   VCVCV
yonatanawa

As a related point, Z points out that there many cases where under K’s proposal it is arbitrary whether one uses rule or environmental exception features. For instance, in the famous example obesity, the root-final s is part of the structural context so the root could be marked [-rule Trisyllabic Shortening], which would percolate to the focus e, or it could be marked [-env. Trisyllabic Shortening], which would percolate to the right-context s, or both; all three options would derive non-application. This is also schematized below.

(3) Structural description matching for obesity:

  VC-VCV
obes-ity

Z continues to argue that a theory that distinguishes between leftward and rightward contextual exceptionality also will not go through. Sadly, he does not provide a full analysis of the Piro facts in his preferred theory.

Z has a much more to say about the (then-)contemporary literature on rule exceptionality. For example, he discusses an idea, originally proposed by Harms (1968:119f.) and also exemplified by Kenstowicz (1970), that there are exceptions such that a rule applies to morphemes that do not meet its (phonologically defined) structural description. While he does seem to accept this, possible examples for such rules is quite thin on the ground, and the very idea seems to reflect the mania for minimizing rule descriptions and counting features that—and this is not just my opinion—polluted early generative phonology. If one rejects this frame, it is obvious that the effect desired can be simulated with two rules, applied in any order. The first will be a phonologically general one (with or without negative exceptions); the second will be the same change but targeting certain morphemes using whatever theory of exceptionality one prefers. Indeed, most examples of rules applying where their structural description is not met are already disjunctive, and I doubt whether such rules are really a single rule in the first place.

The ultimate theory Z settles on is one quite similar to that proposed by SPE. First, readjustment rules introduce rule features like [+R] and these handle simple exceptions of the obesity type. Z proposes further that such readjustment rules must be context-free, which clearly rules out using this mechanism for phonologically defined classes of negative exceptions; cf. (4-5) in my previous post. Secondly, Z proposes that so-called morphological features like Lightner’s [±Russian] will be used for deriving what we might now call “stratal” effects: morphemes that are exceptions to multiple rules. For instance, if we have three rules A, B, C that all [-Russian] morphemes are exceptions to, then context-free redundancy rules will introduce the following rule features.

(4)
[-Russian] -> {-A}
[-Russian] -> {-B}
[-Russian] -> {-C}

Z replays several arguments from Lightner about why morphological of this sort should be distinguished from rule features; I won’t repeat again them here. Finally, Z derives minor rules via readjustment rules triggered by so-called “alphabet” features. For instance, let us again consider umlauting English plurals like goose–geese. Z supposes, adding some detail to a sketchier portion of the SPE proposal, that morphemes targeted by umlaut are marked [+G] (where G is some arbitrary feature). There are two ways one could imagine doing this.

First, either the underlying form, /guːs/ perhaps, could be underlyingly [+G]. Then, let us assume that umlauting is simply fronting in the context of a Plural morphosyntactic feature, and that subsequent phonological adjustments (like the diphthongization in mouse–mice) are handled by later rules. Then it is possible to write this as follows:

(5) Umlaut (variant 1): [+Back, +G] -> {-Back} / __ [+Plural]

This rule is phonologically “context-free”, but its application is conditioned by the presence of the alphabet feature specification in the focus and the morphosyntactic feature in the context. I will take up the question of whether such rules are always phonologically context-free in a (much) later post.

I suspect that the analysis in (5) is the one Z has in mind, and it is also seems to be the orthodoxy in Distributed Morphology (henceforth DM); see, e.g., Embick & Marantz 2008 and particularly their (4) for a conceptually similar analysis of the English past tense. Applying their approach strictly would lead us to miss the generalization (if it is in fact a linguistically meaningful generalization) that umlauting plurals all have a null plural suffix. Umlauting plurals have an underlying feature [+G] (there is no “list” per se; it is just), but their rules of exponence also need to “list” these umlauting morphemes as exceptionally selecting the null plural rather than the regular /-z/. It seems to me this is not necessary because the rules of exponence for the plural maybe could be sensitive to the presence or absence of [+G]. This would greatly reduce the amount of “listing” necesssary. (I do not have an analysis of—and thus put aside—the other class of zero plurals in English, mass nouns like corn.)

(6) Rules of exponence for English noun plural (variant 1):

a. [+Plural] <=> ∅ / __ [+G]
b. <=> -ɹɘn / __ {√CHILD, …}
c. <=> -ɘn / __ {√OX, …}
d. <=> -z / __

Secondly and more elaborately, one could imagine that [+G] is inserted by—i.e., and perhaps, is the expression of—plurality for umlauting morphemes. In piece-based realizational theories like DM, affixes are said to expone (and thus delete) syntactic uninterpretable features. One possibility (which brings this closer to amorphous theories without completely discarding the idea of morphs) is to treat insertion of [+G] as an exponent of plurality.

(7) Rules of exponence for English noun plural (variant 2):

a. [+Plural] <=> {+G} / __ {√GOOSE, √FOOT, √MOUSE, …}
b. <=> -ɹɘn / __ {√CHILD}
c. <=> -ɘn / __ {√OX}
d. <=> -z / __

(7a) and (7b-d) implicate different types of computations—the former inserts an alphabet feature, the latter inserts vocabulary items—but I am supposing here that they can be put into competition. Under this alternative analysis, umlaut no longer requires a morphosyntactic context:

(8) Umlaut (variant 2): [+Back, +G] -> {-Back}

Beyond precedent, I do not see any reason to prefer analysis (5-6) over (7-8). Either can clearly derive what Lakoff called minor rules, though they differ in how exceptionality information is stored/propagated, and thus may have interesting consequences for how we relate the major/minor class distinction to theories of productivity. I have written enough for now, however, and I’ll have to return to that question and others another day.

Endnotes

I too will refer to this language as Piro, as do Matteson and Kisseberth. It should not be confused with unrelated language known as Piro Pueblo. Some subsequent work on this phenomenon refer to the language as Yine (and say it “was previously known as Piro”), though I also found another source that says that Yine is simply a major variety of Piro. I have been unable to figure out whether there’s a preferred endonym.
I am not prepared to rule out the possibility that /xipa/ is itself an exception (“inalterable”), but all evidence is consistent with RTL application.
In his endnote 2, K says the rule is even narrower than stated above, since it does not apply to monosyllabic roots. However, he might have failed to note that this condition is implicit in his rule, if we interpret (11) strictly as holding that the left context should be tautomorphemic. Piro requires syllables to be consonant-initial, so the minimal bisyllabic roots is CV.CV. Combining this observation with (1), we see that the shortest root which can undergo vowel deletion is also bisyllabic, since concatenating the left context and target gives us a bisyllabic VCV substring. In fact, things are more complicated because monosyllabic suffixes do undergo syncope; many examples are provided above. Clearly, the deleting vowel need not be tautomorphemic with the preceding vowel, contrary to what a strict reading of the “+” in (1) would seem to imply. According to González, syncope imposes no constraints on the morphological structure of its context except that it only applies in derived environments—CVCVCV trisyllables like /kanawa/ ‘canoe’ surfaces faithfully as [kanawa], not *[kanwa]—and is subject to lexical exceptionality discussed here.
K glosses this as ‘still, yet’.
As was the case with /xipa/ in endnote 2, we’d like to confirm that /yona/ is mutable rather than inalterable, but one does not simply walk into Matteson 1965.

References

Embick, D. and Marantz, A. 2008. Architecture and blocking. Linguistic Inquiry 39(1): 1-53.
González, H. 2023. An evolutionary account of vowel syncope in Yine. Ms., CUNY Graduate Center.
Harms, R. T. 1968. Introduction to Phonological Theory. Prentice-Hall.
Kenstowicz, M. 1970. Lithuanian third person future. In J. R. Sadock and A. L. Vanek (ed.), Studies Presented to Robert B. Lees by His Students, pages 95-108. Linguistic Research.
Kenstowicz, M. and Kisseberth, C. W. 1977. Topics in Phonological Theory. Academic Press.
Kisseberth, C. W. 1970. The treatment of exceptions. Papers in Linguistics 2: 44-58.
Matteson, E. 1965. The Piro (Arawakan) Language. University of California Press.
Zonneveld, W. 1978. A Formal Theory of Exceptions in Generative Phonology. Peter de Ridder.
Zonneveld, W. 1979. On the failure of hasty phonology: A review of Michael Kenstowicz and Charles Kisseberth, Topics in Phonological Theory. Lingua 47: 209-255.

Underspecification in Barrow Inupiaq

Dresher (2009:§7.2.1) discusses an interesting morphophonological puzzle from the Inuit (Canadian) and Inupiaq (Alaskan) dialects of Eskimo-Aleut. These dialects descend from a four-phoneme vowel system *i, *u, *ə, *a, but in most dialects *ə has merged into *i, yielding three surface vowels: [i, a, u]. However, in some dialects (including Barrow Inupiaq), there appears to be a covert contrast between two “flavors” of i: “strong i” triggers palatalization of a following coronal consonant whereas “weak i” does not.

(1) Barrow Inupiaq (Kaplan 1981:§3.22, his 27-29):

a. iglu ‘house’, iglulu ‘and a house’, iglunik ‘houses’
b. ini ‘place’, inilu ‘and a place’, ininik ‘places’
c. iki ‘wound’, ikiʎu ‘and a wound’, ikiɲik ‘wounds’

Presumably, the stem-final i in (1b) is weak and the one in (1c) is strong.

Following some prior work, Dresher supposes that there is an underlying contrast between weak and strong i. He posits the following featural specification:¹

(2) Features for Barrow Inupiaq vowels (to be revised):

strong i: [Coronal, -Low]
weak i: [-Low]
/u/: [Labial, -Low]
/a/: [+Low]

This analysis has a close relationship to the theory of underspecification used in Logical Substance-Free Phonology (henceforth, LP); I assume familiarity with the assumptions and operations of that theory, which have been discussed at length by Reiss and colleagues (Bale et al. 2020, Reiss 2021), including in an introductory textbook (Bale & Reiss 2018). Just a few modifications are needed, however.

For Dresher, who hypothesizes that non-contrastive features (computed using an algorithm he describes in detail; op. cit.:16) are the only ones which are phonologically active, it is not clear why strong i palatalizes coronal segments: clearly it is not that they are spreading the privative [Coronal], since that certainly would not trigger palatalization! One could, of course, adopt an analysis in which palatalization is not assimilatory. Alternatively, we could identify another feature specification which is characteristic of i and which might trigger palatalization of coronal consonants. Let us suppose this is in fact just [Palatal].² This gives us the following minimally-modified feature specification:

(2) Features for Barrow Inupiaq vowels (revised):

strong i: [Palatal, -Low]
weak i: [-Low]
/u/: [Labial, -Low]
/a/: [+Low]

According to Kaplan (§1.2), there are both plain and palatal coronal phonemes, so this seems to be a feature-changing process. Following the assumptions of LP that feature-changing processes derive from a deletion rule followed by an insertion rule, two rules are needed here; we give these below.

(3) [+Consonantal] {Coronal} / [Palatal] __
(4) [+Consonantal] ⊔ {Palatal} / [Palatal] __

Crucially, vowels other than strong i lack the [Palatal] specification to trigger (3-4).

Endnotes

Dresher’s analysis assumes privative features, but he notes elsewhere in the book that he usually adopts the features of his sources unless there is some relevant reason to dispute them.
If preferred, it is easy to translate the proposed analysis into one in which palatals are [-Back] and plain coronals are [+Back], à la Padgett (2003).

References

Bale, A., Papillon, M., and Reiss, C. 2014. Targeting underspecified segments: a formal analysis of feature-changing and feature-filling rules. Lingua 148: 240-253.
Bale, A., and Reiss, C. 2018. Phonology: a Formal Introduction. MIT Press.
Dresher, B. E. 2009. The Contrastive Hierarchy in Phonology. Cambridge University Press.
Kaplan, L. D. 1981. Phonological issues in North Alaskan Inupiaq. Alaska Native Language Center.
Padgett, J. 2003. Contrast and post-velar fronting in Russian. Natural Language and Linguistic Theory 21: 39-87.
Reiss, C. 2021. Towards a complete Logical Phonology model of intrasegmental changes. Glossa 6:107.