Publications: M. Aylett
Publications
Aylett, M.P., Kimball, T., Andert, G. (2010) Scalable Mobile Implementation of High Quality Real Time Text to Speech Synthesis Fifth Workshop on Speech in Mobile and Pervasive Environments, Lisbon.
Aylett, M.P., King, S., Yamagishi, J. (2009) Speech Synthesis Without a Phone Inventory Interspeech 2009, Brighton,
2087-90
Aylett, M.P., Pidcock, C.J. (2009) The CereProc Blizzard Entry 2009: Some dumb algorithms that don't work Blizzard Challenge
Workshop, Edinburgh.
Andersson, J.S., Badino, L., Watts, O.S., Aylett, M.P. (2008) The
CSTR/Cereproc Blizzard Entry 2008: The Inconvenient Data.
(University of Edinburgh, UK / CereProc Ltd, UK), Blizzard Challenge
Workshop, Brisbane.
Aylett, M.P., Yamagishi, J., (2008) Combining Statistical Parametric Speech Synthesis and Unit-Selection for Automatic Voice Cloning. LangTech 2008, Rome.
Aylett, M.P., Pidcock, C.J., (2007) The CereVoice Characterful Speech Synthesiser SDK (Industrial Demo). IVA 2007, Paris, France, Proceedings. Lecture Notes in Computer Science 4722 Springer.
Aylett, M.P., Andersson, J.S., Badino, L., Pidcock, C.J. (2007)The Cerevoice Blizzard Entry 2007: Are Small Database Errors Worse than Compression Artifacts? Blizzard Challenge Workshop, Bonn.
Aylett, M.P, King, S. (2007) Single
Speaker Segmentation and Inventory Selection Using Dynamic Time
Warping Self Organization and Joint Multigram Mapping, Proceedings
of ISCA Speech Synthesis Workshop, Bonn 2007
Aylett, M.P., Pidcock, C.J., (2007) The
CereVoice Characterful Speech Synthesiser SDK, AISB 2007,
Newcastle. pp.174-8
Aylett, M.P., Pidcock, C.J., Fraser, M.E. (2006) The Cerevoice
Blizzard Entry 2006: A prototype Database Unit Selection Engine,
Blizzard Challenge Workshop, Pittsburgh.
Aylett, M., Turk, A. (2006) Language Redundancy
Predicts Syllabic Duration and the Spectral Characteristics of Vocalic
Syllable Nuclei, JASA, 119:3048-58
Aylett, M.P. (2006) Detecting
High Level Structure without Lexical Information. ICASSP 2006,
Toulouse.
Aylett, M.P. (2005) Extracting
the Acoustic Features of Interruption Points Using Non-Lexical
Prosodic Analysis. DISS 2005, Aix-en-Provence, 17-20
Aylett, M.P. (2005) Synthesising
Hyperarticulation in Unit Selection TTS. Interspeech 2005, Lisbon,
2521-24
Aylett, M., Turk, A. (2004) The Smooth Signal Redundancy
Hypothesis: A Functional Explanation for Relationships between
Redundancy, Prosodic Prominence and Duration in Spontaneous
Speech. Language and Speech, Volume 47(1), 31-56
Aylett, M.P. (2004) Merging Data Driven and Rule
Based Prosodic Models for Unit Selection TTS. Proceedings of ISCA
Speech Synthesis Workshop, Pitsburgh 2004, published online.
Bard, E. G., Aylett, M. P. (2004) Referential Form, Word Duration,
and Modeling the Listener in Spoken Dialogue. In John C. Trueswell and
Michael K. Tanenhaus, eds. Approaches to Studying World-Situated
Language Use: Bridging the Language-as-Product and Language-as-Action
Traditions. Cambridge, MA: MIT Press.
Aylett, M.P. (2003) Disfluency
and Speech Recognition Profile Factors. Proceedings of DiSS 03,
Disfluency in Spontaneous Speech Workshop, Göteborg University,
Sweden. Robert Eklund (ed.), Gothenburg Papers in Theoretical
Linguistics 89, ISSN 0349 1021, pp. 49-52.
Aylett, M.P., Fackrell, J. & Rutten P. (2003) My
Voice, Your Prosody: Sharing a Speaker Specific Prosody Model Across
Speakers in Unit Selection TTS. Eurospeech-2003 Geneva.
Aylett, M.P. (2002) Stochastic
Suprasegmentals: Relationships Between the Spectral Characteristics of
Vowels, Redundancy and Prosodic Structure. ICSLP-2002 Denver.
Bard, E.G., Lickley, R.J. & Aylett, M.P. (2001) Is
Disfluency Just Difficult? In Proceedings of DISS '01, An ISCA
Tutorial and Research Workshop, Edinburgh.
Aylett, M.P. (2001) Modelling
Care of Articulation with HMMs is Dangerous. In Proceedings of
Eurospeech-2001, Aalberg.
Bard, E.G., Sotillo C., Kelly M.L., Aylett, M.P. (2001) Taking
the Hit: Leaving some Lexical Competition to be Resolved
Post-Lexically. Language and Cognitive Processes, Volume 16, 5-6
p173-176
Aylett, M.P. (2000) Stochastic
Suprasegmentals - Relationships between Redundancy, Prosodic Structure
and Care of Articulation in Spontaneous Speech. PhD Thesis,
Department of Linguistics, University of Edinburgh.
Bard, E.G., Anderson, A.H., Sotillo, C., Aylett, M.,
Doherty-Sneddon, G., and Newlands, A. (2000) Controlling the
Intelligibility of Referring Expressions in Dialogue. Journal of
Memory and Language, Volume 42-1 p1-22.
Aylett, M.P. (2000) Stochastic
Suprasegmentals: Relationships between Redundancy, Prosodic Structure
and Care of Articulation in Spontaneous Speech. In Proceedings of
ICSLP-2000, Beijing.
Aylett, M.P. (2000) Modelling
clarity change in spontaneous speech. In R.J. Baddeley,
P.J.B. Hancock, and P.Foldiak, editors, Information Theory and the
Brain. Cambridge University Press, New York.
Bard, E.G. & Aylett, M.P. (1999) The
Dissociation of Deaccenting, Givenness and Syntactic Role in
Spontaneous Speech. In Proceedings of ICPhS-99, San Francisco.
Aylett, M.P. (1999) Stochastic
Suprasegmentals: Relationships between Redundancy, Prosodic Structure
and Syllabic Duration. In Proceedings of ICPhS-99, San
Francisco.
Bull, M.C. & Aylett, M.P. (1998) An
Analysis of the Timing of Turn-Taking in a Corpus of Goal-Orientated
Dialogue. In Proceedings of ICSLP-98 Sidney, Australia
(4)1175-8pp.
Aylett, M.P. & Bull M.C. (1998) The
Automatic Marking of Prominence in Spontaneous Speech Using Duration
and Part of Speech Information. In Proceedings of ICSLP-98 Sidney,
Australia (5)2123-6pp.
Aylett, M.P. (1998) Building
a Statistical Model of the Vowel Space for Phoneticians In
Proceedings of SST-98 ICSLP-98 Sidney, Australia.
Aylett, M. & Turk, A. (1998) Vowel
quality in spontaneous speech: What makes a good vowel? In
Proceedings of ICSLP-98 Sidney, Australia
Mayo, C., Aylett, M. & Ladd, D. (1997) Prosodic
Transcription of Glasgow English: An Evaluation Study of GlaToBI.
In Botinis, A., Kouroupetroglou, G. & Carayiannis, G. editors,
Proceedings of an ESCA Workshop: Intonation: Theory, Models and
Applications. Athens, Greece. ESCA and The University of Athens,
231-234pp.
RESEARCH INTERESTS:
Thesis: Prosodic structure, statistical redundancy and care of
articulation in spontaneous speech.
Speech Technology: Prosodic control in Unit Selection Synthesis. The use of sub-word units in concatenative speech synthesis. Voice cloning.
Dialogue: The relationships between dialogue structure, use of
reference and care of articulation.