Scaling laws for phonotactic complexity in spoken English language data

Publications: Contribution to journalArticlePeer Reviewed

Abstract

Two prominent statistical laws in language and other complex systems are Zipf's law and Heaps' law. We investigate the extent to which these two laws apply to the linguistic domain of phonotactics-that is, to sequences of sounds. We analyze phonotactic sequences with different lengths within words and across word boundaries taken from a corpus of spoken English (Buckeye). We demonstrate that the expected relationship between the two scaling laws can only be attested when boundary spanning phonotactic sequences are also taken into account. Furthermore, it is shown that Zipf's law exhibits both high goodness-of-fit and a high scaling coefficient if sequences of more than two sounds are considered. Our results support the notion that phonotactic cognition employs information about boundary spanning phonotactic sequences.

Original languageEnglish
Pages (from-to)693-704
Number of pages12
JournalLanguage and Speech
Volume64
Issue number3
DOIs
Publication statusPublished - Sept 2021

Austrian Fields of Science 2012

  • 602039 Phonology

Keywords

  • COMPRESSION
  • Heaps' law
  • SEGMENTATION
  • SPEECH
  • Zipf's law
  • diversity
  • inventory size
  • phonotactics
  • Heaps’ law
  • Zipf’s law

Fingerprint

Dive into the research topics of 'Scaling laws for phonotactic complexity in spoken English language data'. Together they form a unique fingerprint.

Cite this