Publications de Hervé BREDIN
Walid Karam, Hervé Bredin, Hanna Greige, Gérard Chollet, Chafic Mokbel
Talking-Face Identity Verification, Audiovisual Forgery, and Robustness Issues
Dans : EURASIP Journal on Advances in Signal Processing, Hindawi Publishing Corporation, Numéro spécial Recent Advances in Biometric Systems: A Signal Processing Perspective, Vol. 2009, (en ligne), avril 2009.
Accès : http://www.hindawi.com/journals/asp/2009/746481.html
BibTeXEnrique Argones Rúa, Hervé Bredin, Carmen Garcia Mateo, Gérard Chollet, Daniel González Jiménez
Audio-visual speech asynchrony detection using co-inertia analysis and coupled hidden markov models
Dans : Pattern Analysis and Applications Journal, Springer-Verlag, Heidelberg, Allemagne, Vol. 12 N. 3, p. 271-284, 2009.
Audio-Visual Speech Synchrony Measure: Application to Biometrics
Dans : EURASIP Journal on Advances in Signal Processing, Hindawi Publishing Corporation, Numéro spécial Knowledge-Assisted Media Analysis for Interactive Multimedia, Vol. 2007, Article ID 70186, (en ligne), 2007.
Joonas Kalda, Tanel Alumae, Martin Lebourdais, Hervé Bredin, Séverin Baroudi, Ricard Marxer
TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024
25th Interspeech Conference (Interspeech 2024), Sep 2024, Kos, Greece. pp.1635–1639, ⟨10.21437/interspeech.2024-2462⟩
Adrien Lafore, Clément Pagés, Leila Moudjari, Sebastião Quintas, Isabelle Ferrané, Hervé Bredin, Thomas Pellegrini, Farah Benamara, Jérôme Bertrand, Marie-Françoise Bertrand, Véronique Moriceau, Jérôme Farinas
Premier système IRIT-MyFamillyUp pour la compétition sur la reconnaissance des émotions Odyssey 2024
35èmes Journées d’Études sur la Parole (JEP 2024) 31ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN 2024) 26ème Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL 2024), Université Toulouse 3 Paul Sabatier; Université Toulouse Jean Jaurès, Jul 2024, Toulouse, France. pp.502-511
Joonas Kalda, Clément Pagés, Ricard Marxer, Tanel Alumäe, Hervé Bredin
PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings
The Speaker and Language Recognition Workshop (Odyssey 2024), Jun 2024, Quebec City, France. pp.115-122, ⟨10.21437/odyssey.2024-17⟩
Adrien Lafore, Clément Pagés, Leila Moudjari, Sebastião Quintas, Hervé Bredin, Thomas Pellegrini, Farah Benamara, Isabelle Ferrané, Jérôme Bertrand, Marie-Françoise Bertrand, Véronique Moriceau, Jérôme Farinas
IRIT-MFU Multi-modal systems for emotion classification for Odyssey 2024 challenge
Odyssey 2024: The Speaker and Language Recognition Workshop, Jun 2024, Québec, Canada. pp.296-302, ⟨10.21437/odyssey.2024-42⟩
Marvin Lavechin, Marianne Métais, Hadrien Titeux, Alodie Boissonnet, Jade Copet, Morgane Rivière, Elika Bergelson, Alejandrina Cristia, Emmanuel Dupoux, Hervé Bredin
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation
IEEE Automatic Speech Recognition and Understanding (ASRU 2023 ), IEEE, Dec 2023, Taipei, Taiwan. pp.1–7
pyannote.audio 2.1 speaker diarization pipeline: principle, benchmark, and recipe
24th INTERSPEECH Conference (INTERSPEECH 2023), Aug 2023, Dublin, Ireland. pp.1983-1987, ⟨10.21437/Interspeech.2023-105⟩
Powerset multi-class cross entropy loss for neural speaker diarization
24th INTERSPEECH Conference (INTERSPEECH 2023), Aug 2023, Dublin, Ireland. pp.3222-3226, ⟨10.21437/Interspeech.2023-205⟩
Marvin Lavechin, Yaya Sy, Hadrien Titeux, María Andrea Cruz Blandón, Okko Räsänen, Hervé Bredin, Emmanuel Dupoux, Alejandrina Cristia
BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models
INTERSPEECH 2023, Aug 2023, Dublin, Ireland. pp.4588-4592, ⟨10.21437/Interspeech.2023-978⟩
Juan Manuel Coria, Hervé Bredin, Sahar Ghannay, Sophie Rosset
Continual self-supervised domain adaptation for end-to-end speaker diarization
IEEE Spoken Language Technology Workshop (SLT 2022), IEEE Speech and Language Processing Technical Committee, Jan 2023, Doha, Qatar. à paraître
Paul Lerner, Juliette Bergoënd, Camille Guinaudeau, Hervé Bredin, Benjamin Maurice, Sharleyne Lefevre, Martin Bouteiller, Aman Berhe, Léo Galmant, Ruiqing Yin, Claude Barras
Bazinga! A Dataset for Multi-Party Dialogues Structuring
13th Conference on Language Resources and Evaluation (LREC 2022), European Language Resources Association (ELRA), Jun 2022, Marseille, France. pp.3434-3441
Accès: https://universite-paris-saclay.hal.science/hal-03737453
End-to-end speaker segmentation for overlap-aware resegmentation
Interspeech 2021, Aug 2021, Brno, Czech Republic
Philippe Ercolessi, Christine Senac, Hervé Bredin
StoViz: Story Vizualization of TV series (regular paper)
Dans : ACM International Conference on Multimedia (ACM Multimedia 2012), Nara, Japan, 29/10/12-02/11/12, ACM Digital Library, p. 1329-1330, 2012.
Philippe Ercolessi, Christine Senac, Hervé Bredin, Sandrine Mouysset
Hierarchical framework for TV series plot de-interlacing based on speakers, dialogues and images (regular paper)
Dans : ACM Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis, Nara, Japan, 29/10/12-02/11/12, Springer, p. 3-8, 2012.
Philippe Ercolessi, Christine Senac, Hervé Bredin
Toward plot de-interlacing in TV series using scenes clustering (regular paper)
Dans : International Workshop on Content-Based Multimedia Indexing (CBMI 2012), Annec, France, 27/06/12-29/06/12, IEEE : Institute of Electrical and Electronics Engineers, (support électronique), juin 2012.
Philippe Ercolessi, Christine Senac, Hervé Bredin, Philippe Joly
Summarizing Video Collection using Semantic Graph
Dans : Workshop IRIT/Kyushu Image et Multimedia, Toulouse – France, 24/11/11-25/11/11.
Accès : http://www.irit.fr/publis/SAMOVA/ercolessiWorkshop2011.pdf
BibTeXPhilippe Ercolessi, Christine Senac, Hervé Bredin, Philippe Joly
Segmenting TV series into scenes using speaker diarization (regular paper)
Dans : Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2011), Delft – Pays bas, 13/04/11-15/04/11, Delft University of Technology, (en ligne), avril 2011.
Résumé Accès : http://repository.tudelft.nl/view/conferencepapers/uuid%3A71be2da9-4a17-409a-9c6c-ee3f59ac71cf/
BibTeXHervé Bredin, Lionel Koenig, Jérôme Farinas
IRIT TRECVid 2010: Hidden Markov Models for Context-aware Late Fusion of Multiple Audio Classifiers
Dans : TREC Video Retrieval Evaluation, Gaithersburg, MD, USA, 15/11/10-17/11/10.
Accès : http://www.irit.fr/publis/SAMOVA/REPORT/IRIT_SIN_TrecVid2010.pdf
BibTeXPhilippe Ercolessi, Christine Senac, Hervé Bredin, Philippe Joly
Video Collection Summarization by Semantic Graph Comparison
Dans : Workshop on Visual Information Processing (EUVIP), Paris – France, 05/07/10-07/07/10.
Accès : http://www.irit.fr/publis/SAMOVA/ErcolessiEuvip2012.pdf
BibTeXHervé Bredin, Lionel Koenig, Hélène Lachambre, Elie El Khoury
IRIT @ TRECVid HLF 2009 Audio to the Rescue (regular paper)
Dans : TREC Video Retrieval Workshop (TRECVID 2009), Gaithersburg, MD, 16/11/09-17/11/09, National Institute of standards and Technology (NIST), (en ligne), novembre 2009.
Saman Cooray, Hervé Bredin, Noel O’Connor, Li-Qun Xu
An Interactive and Multi-Level Framework for Summarising User-Generated Videos (regular paper)
Dans : ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, 19/10/09-24/10/09, ACM Digital Library, (support électronique), 2009.
Benoît Fauve, Hervé Bredin, Walid Karam, Florian Verdet, Aurélien Mayoue, Gérard Chollet, Jean Hennebert, Richard Lewis, John Mason, Chafic Mokbel, Dijana Petrovska
Some Results from the BioSecure Talking-Face Evaluation Campaign
Dans : IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, USA, 30/03/08-04/04/08, IEEE : Institute of Electrical and Electronics Engineers, p. 4137-4140, 2008.
Making Talking-Face Authentication Robust to Deliberate Imposture
Dans : IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, USA, 30/03/08-04/04/08, IEEE : Institute of Electrical and Electronics Engineers, p. 1693-1696, 2008.
Audio-Visual Speech Synchrony Measure for Talking-Face Identity Verification
Dans : IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, USA, 15/04/07-20/04/07, Vol. II, IEEE : Institute of Electrical and Electronics Engineers, p. 233-236, 2007.
Hervé Bredin, Antonio Miguel, Ian Witten, Gérard Chollet
Detecting Replay Attacks in Audiovisual Identity Verification
Dans : IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, 14/05/06-19/05/06, Vol. I, IEEE : Institute of Electrical and Electronics Engineers, p. 621-624, 2006.
Hervé Bredin, Aurélien Mayoue, Gérard Chollet
Talking-Face Verification
Dans : Guide to Biometric Reference Systems and Performance Evaluation. Dijana Petrovska, Gérard Chollet, Bernadette Dorizzi (Eds.) , Springer, 10, p. 297-326, 2009.
Gérard Chollet, Rémi Landais, Thomas Hueber, Hervé Bredin, Chafic Mokbel, Patrick Perrot, Leila Zouari
Some Experiments in Audio-Visual Speech Processing
Dans : Advances in Nonlinear Speech Processing. Mohamed Chetouani (Eds.) , Springer-Verlag, p. 28-56, Vol. 4885/2007, LNCS, 2007.
Bouchra Abboud, Hervé Bredin, Guido Aversano , Gérard Chollet
Audio-visual Identity Verification: An Introductory Overview
Dans : Progress in Nonlinear Speech Processing. Yannis Stylianou, Marcos Faundez-Zanuy, Anna Eposito (Eds.) , Springer-Verlag, p. 118-134, Vol. 4391/2007, LNCS, 2007.
Vérification de l’identité d’un visage parlant. Apport de la mesure de synchronie audiovisuelle face aux tentatives délibérées d’imposture
Thèse de doctorat, Télécom ParisTech, 2007.
Philippe Ercolessi, Christine Senac, Hervé Bredin, Sandrine Mouysset
Vers un résumé automatique de séries télévisées basé sur une recherche multimodale d’histoires
Dans : Revue des Sciences et Technologies de l’Information, Hermès Science, Vol. 15 N. 2, pp. 41-66, 2012.
Marvin Lavechin, Maureen De Seyssel, Marianne Métais, Florian Metze, Abdelrahman Mohamed, Hervé Bredin, Emmanuel Dupoux, Alejandrina Cristia
Statistical learning models of early phonetic acquisition struggle with child-centered audio data
2023