Closed-form expressions vs. BIC: A comparison for speaker clustering

Stafylakis, T; Anguera, X; Katsouros, V; Carayannis, G

dc.contributor.author	Stafylakis, T	en
dc.contributor.author	Anguera, X	en
dc.contributor.author	Katsouros, V	en
dc.contributor.author	Carayannis, G	en
dc.date.accessioned	2014-03-01T02:52:56Z
dc.date.available	2014-03-01T02:52:56Z
dc.date.issued	2011	en
dc.identifier.issn	15206149	en
dc.identifier.uri	https://dspace.lib.ntua.gr/xmlui/handle/123456789/36154
dc.subject	Bayesian methods	en
dc.subject	Clustering methods	en
dc.subject	Speaker recognition	en
dc.subject.other	Bayesian methods	en
dc.subject.other	Closed-form expression	en
dc.subject.other	Clustering methods	en
dc.subject.other	Conjugate prior	en
dc.subject.other	Exponential family	en
dc.subject.other	Hyper-parameter	en
dc.subject.other	Latent variable	en
dc.subject.other	Model parameters	en
dc.subject.other	Speaker clustering	en
dc.subject.other	Speaker recognition	en
dc.subject.other	Tuning parameter	en
dc.subject.other	Bayesian networks	en
dc.subject.other	Signal processing	en
dc.subject.other	Speech communication	en
dc.title	Closed-form expressions vs. BIC: A comparison for speaker clustering	en
heal.type	conferenceItem	en
heal.identifier.primary	10.1109/ICASSP.2011.5946924	en
heal.identifier.secondary	http://dx.doi.org/10.1109/ICASSP.2011.5946924	en
heal.identifier.secondary	5946924	en
heal.publicationDate	2011	en
heal.abstract	In this paper, the use of closed-form expressions is compared to the BIC approximation, with respect to speaker clustering. We first show that the particular BIC setting which is commonly used in this task, namely the approximation of the marginal - with respect to the model parameters - and conditional - with respect to the latent variables - likelihood, belongs to an exponential family, and hence admits a closed-form expression by attaching conjugate priors. We then formalize the role of the tuning parameter as a hyperparameter of the prior and finally we explain the several proposed setting - global, local and segmental - based on the strength of the prior. Experiments are carried out for the speaker clustering task and improvement over the BIC approximation is reported. © 2011 IEEE.	en
heal.journalName	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings	en
dc.identifier.doi	10.1109/ICASSP.2011.5946924	en
dc.identifier.spage	2228	en
dc.identifier.epage	2231	en