dc.contributor.author |
Stafylakis, T |
en |
dc.contributor.author |
Anguera, X |
en |
dc.date.accessioned |
2014-03-01T02:52:42Z |
|
dc.date.available |
2014-03-01T02:52:42Z |
|
dc.date.issued |
2010 |
en |
dc.identifier.uri |
https://dspace.lib.ntua.gr/xmlui/handle/123456789/36000 |
|
dc.relation.uri |
http://www.scopus.com/inward/record.url?eid=2-s2.0-79959826141&partnerID=40&md5=ae328b4b8cc21dc571842ce6d4427f1a |
en |
dc.subject |
Bayesian Information Criterion |
en |
dc.subject |
Clustering algorithms |
en |
dc.subject |
Speaker diarization |
en |
dc.subject.other |
Bayesian information criterion |
en |
dc.subject.other |
Emission probabilities |
en |
dc.subject.other |
Informative Priors |
en |
dc.subject.other |
Mixture model |
en |
dc.subject.other |
Mixture of Gaussians |
en |
dc.subject.other |
Speaker diarization |
en |
dc.subject.other |
State sequences |
en |
dc.subject.other |
Statistical complexity |
en |
dc.subject.other |
Speech communication |
en |
dc.subject.other |
Clustering algorithms |
en |
dc.title |
Improvements to the equal-parameter BIC for speaker diarization |
en |
heal.type |
conferenceItem |
en |
heal.publicationDate |
2010 |
en |
heal.abstract |
This paper discusses a set of modifications regarding the use of the Bayesian Information Criterion (BIC) for the speaker diarization task. We focus on the specific variant of the BIC that deploys models of equal - or roughly equal - statistical complexity under partitions of different number of speakers and we examine three modifications. Firstly, we investigate a way to deal with the permutation-invariance property of the estimators when dealing with mixture models, while the second is derived by attaching a weakly informative prior over the space of speaker-level state sequences. Finally, based on the recently proposed segmental-BIC approach, we examine its effectiveness when mixture of gaussians are used to model the emission probabilities of a speaker. The experiments are carried out using NIST rich transcription evaluation campaign for meeting data and show improvement over the baseline setting. © 2010 ISCA. |
en |
heal.journalName |
Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010 |
en |
dc.identifier.spage |
314 |
en |
dc.identifier.epage |
317 |
en |