Nos partenaires



Accueil du site > Français > Evénements > Séminaires



L’IRIT étant localisé sur plusieurs sites, ses séminaires sont organisés et ont lieu soit à l’Université Toulouse 3 Paul Sabatier (UT3), l’Université Toulouse 1 Capitole (UT1), l’INP-ENSEEIHT ou l’Université Toulouse 2 Jean Jaurès (UT2J).


Latest Advances in Multi-Way Low-Rank Factorization: Nonnegativity, Positive Semidefiniteness, and Joint Diagonalization

Kazuyoshi YOSHII - Kyoto University & RIKEN (Japon)

Lundi 1 Octobre 2018, 15h00 - 16h00
Version PDF :


Nonnegative matrix factorization (NMF) is the most basic method of nonnegativity-based low-rank factorization of matrix data (e.g., time-frequency data). It has intensively been used for audio source separation and automatic music transcription because it decomposes the power spectrogram of a mixture signal into interpretable “parts" (e.g., musical notes) due to the nonnegativity constraints of all data. The naive "multi-way" extension of NMF is nonnegative tensor factorization (NTF) that can deal with tensor data (e.g., time-frequency-channel data). On the other hand, we proposed a more mathematically-fundamental "multi-dimensional" extension of NMF called positive semidefinite tensor factorization (PSDTF) [Yoshii+ 2013] that approximates each PSD matrix given as input as the weighted sum of basis PSD matrices in the same way that NMF approximates each nonnegative vector given as input as the weighted sum of basis nonnegative vectors. We further proposed a "multi-way" extension of PSDTF called correlated tensor factorization (CTF) [Yoshii 2018] that approximates a PSD matrix (covariance matrix over all elements of an input tensor) as the sum of Kronecker products of multiple sets of basis PSD matrices. To reduce the impractically-huge computational complexity of CTF, we recently proposed a fast approximation of CTF based on joint diagonalization of basis PSD matrices called independent low-rank tensor analysis (ILRTA) [Yoshii+ 2018]. Interestingly, we revealed that CTF and its counterpart ILRTA, which are ultimate multi-way covariance modeling frameworks, include as their special cases state-of-the-art single- and multi-channel source separation methods independently developed by different researchers such as Itakura-Saito NMF [Févotte+ 2009], transform-learning NMF (TL-NMF) [Fagot+ 2018], multi-channel NMF (MNMF) [Sawada+ 2013], independent low-rank matrix analysis (ILRMA) [Kitamura+ 2016], fast full-rank spatial covariance analysis (FastFCA) [Ito+ 2018], and independent positive semidefinite tensor analysis (IPSDTA) [Ikeshita 2018].

Short Bio: Dr. Kazuyoshi Yoshii received the Ph.D. degree in informatics from Kyoto University, Kyoto, Japan, in 2008. He is currently an Associate Professor at the Graduate School of Informatics, Kyoto University, Kyoto, Japan, and concurrently the Leader of the Sound Scene Understanding Team, RIKEN Center for Advanced Intelligence Project (AIP), Tokyo, Japan. His research interests include music analysis, audio signal processing, and machine learning.