HEAL DSpace

Improvements to the equal-parameter BIC for speaker diarization

Αποθετήριο DSpace/Manakin

Εμφάνιση απλής εγγραφής

dc.contributor.author Stafylakis, T en
dc.contributor.author Anguera, X en
dc.date.accessioned 2014-03-01T02:52:42Z
dc.date.available 2014-03-01T02:52:42Z
dc.date.issued 2010 en
dc.identifier.uri https://dspace.lib.ntua.gr/xmlui/handle/123456789/36000
dc.relation.uri http://www.scopus.com/inward/record.url?eid=2-s2.0-79959826141&partnerID=40&md5=ae328b4b8cc21dc571842ce6d4427f1a en
dc.subject Bayesian Information Criterion en
dc.subject Clustering algorithms en
dc.subject Speaker diarization en
dc.subject.other Bayesian information criterion en
dc.subject.other Emission probabilities en
dc.subject.other Informative Priors en
dc.subject.other Mixture model en
dc.subject.other Mixture of Gaussians en
dc.subject.other Speaker diarization en
dc.subject.other State sequences en
dc.subject.other Statistical complexity en
dc.subject.other Speech communication en
dc.subject.other Clustering algorithms en
dc.title Improvements to the equal-parameter BIC for speaker diarization en
heal.type conferenceItem en
heal.publicationDate 2010 en
heal.abstract This paper discusses a set of modifications regarding the use of the Bayesian Information Criterion (BIC) for the speaker diarization task. We focus on the specific variant of the BIC that deploys models of equal - or roughly equal - statistical complexity under partitions of different number of speakers and we examine three modifications. Firstly, we investigate a way to deal with the permutation-invariance property of the estimators when dealing with mixture models, while the second is derived by attaching a weakly informative prior over the space of speaker-level state sequences. Finally, based on the recently proposed segmental-BIC approach, we examine its effectiveness when mixture of gaussians are used to model the emission probabilities of a speaker. The experiments are carried out using NIST rich transcription evaluation campaign for meeting data and show improvement over the baseline setting. © 2010 ISCA. en
heal.journalName Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010 en
dc.identifier.spage 314 en
dc.identifier.epage 317 en


Αρχεία σε αυτό το τεκμήριο

Αρχεία Μέγεθος Μορφότυπο Προβολή

Δεν υπάρχουν αρχεία που σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στην ακόλουθη συλλογή(ές)

Εμφάνιση απλής εγγραφής