In his Phonology of Italian, Krämer (2009:§4.2.1) is interested in the productivity of velar palatlization before /i/-initial suffixes, such as the masculine noun plural /-i/. Palatalization obtains in, for example, in amico-amici [aˈmiːko, aˈmiːtʃi] ‘friend(s)’, but not in cuoco-cuochi [ˈkwɔːko, ˈkwɔːki] `cook(s)’. Krämer (henceforth K) further claims that non-palatalization has much higher type frequency.
K performs a small experiment in which ten adult native speakers are presented with nonce words in the singular and asked to complete sentence which requires them to form the /-i/ plural. Four subjects never palatalized; one palatalized all plurals; and five others produced a mix of the two strategies. Summarizing this result, Krämer (2012:125) concludes: “Thus, in Italian it is a personal decision whether velar palatalization is productive or not.”
I am not sure I agree. The most straightforward interpretation of this data, I think, is that the subjects used a mix of different task models. Some subjects may have been reasoning on whether palatalization is actually productive (a true “grammatical task model”), which for me means that the generalization is encoded (or not encoded, as seems more likely here) so as to apply to arbitrary words. Others may have been guessing based on form similarity to existing words (a “dictionary task model”), and others may have used a mix of the two strategies. It is perhaps not surprising that that adults can make use of the dictionary task model, because one can, with some conscious effort, think of phonemically or semantically related real words, and it’s easy to imagine deciding on whether or not to palatalize a nonce word based on the behavior of similar real words.
I think, unfortunately, that this is an unavoidable problem when wug-testing adults. I submit that the conscious analogizing abilities of adults are probably not relevant to questions of productivity and I think that because I don’t think that’s what productivity is. But I don’t know of any way to prevent adult participants from using a dictionary task model. Thus linguists and reviewers should be more skeptical about the utility of adult wug-tasks.
Schütze (2005) makes a similar point with what we might call wug–rating tasks. In such tasks, speakers are asked to assign a wellformedness rating (e.g., on a Likert scale) to candidate inflected forms of nonce words. Arguably, this setting encourage speakers to adopt a highly permissive variant of the dictionary task model, which might be framed as asking “could such a word ever have such a plural?” An answer to such questions are often interesting to the linguist, but I think it quite distinct from the question of productivity that K and others wish to study.
References
Krämer, M. 2009. The Phonology of Italian. Oxford University Press.
Krämer, M. 2012. Underlying Representations. Cambridge University Press.
Schütze, Carson. 2005. Thinking about what we are asking speakers to do. In Kepser, Stephan and Reis, Marga (ed.), Linguistic Evidence: Empirical, Theoretical, and Computational Perspectives, pages 457-485. De Gruyter Mouton.