EUSIPCO'2002 - Actes du colloque

> Home > Paper #625

Paper data Title: Time sequence information within a Gaussian mixture model Author(s): Stapert Robert, ACULAB Mason John, University of Wales Swansea Page numbers in the proceedings: Volume III pp 595-598 Session: Language and Speech Recognition Paper abstract This paper addresses the task of text independent speaker recognition and in particular looks at capturing time sequence information within the modelling process itself. A recent extension to the popular Gaussian mixture model (GMM) is the segmental mixture model (SMM), and its advantages are thought to be more pronounced as more and more training data becomes available. Here this idea is examined along with a hypothesis on model size, model complexity and their dependencies on the quantity of available training data. Experimental results on a 2000 speaker database show that an SMM does offer better recognition results than a GMM once a threshold in the amount of training data has been reached. Paper A PDF version is available here
[ Programme \| Find by author \| Find by keyword \| Find by paper code \| Contribution list \| About Eusipco'2002 \| Help \| Eusipco'2004 \| Home page]

Title:
Time sequence information within a Gaussian mixture model

Author(s):
Stapert Robert, ACULAB
Mason John, University of Wales Swansea

Page numbers in the proceedings:
Volume III pp 595-598

Session:
Language and Speech Recognition

This paper addresses the task of text independent speaker recognition and in particular looks at capturing time sequence information within the modelling process itself. A recent extension to the popular Gaussian mixture model (GMM) is the segmental mixture model (SMM), and its advantages are thought to be more pronounced as more and more training data becomes available. Here this idea is examined along with a hypothesis on model size, model complexity and their dependencies on the quantity of available training data. Experimental results on a 2000 speaker database show that an SMM does offer better recognition results than a GMM once a threshold in the amount of training data has been reached.