53 documents

Journal articles

  • Alexandre Denis, Emmanuel Jeannot, Philippe Swartvagher, Samuel Thibault. Tracing task-based runtime systems: Feedbacks from the StarPU case. Concurrency and Computation: Practice and Experience, 2023, pp.24. ⟨10.1002/cpe.7920⟩. ⟨hal-04236246⟩
  • Maxime Gonthier, Loris Marchal, Samuel Thibault. Taming data locality for task scheduling under memory constraint in runtime systems. Future Generation Computer Systems, In press, ⟨10.1016/j.future.2023.01.024⟩. ⟨hal-03623220v2⟩
  • Mathieu Faverge, Nathalie Furmento, Abdou Guermouche, Gwenolé Lucas, Raymond Namyst, et al. Programming Heterogeneous Architectures Using Hierarchical Tasks. Concurrency and Computation: Practice and Experience, In press, 35 (25), ⟨10.1002/cpe.7811⟩. ⟨hal-04088833v2⟩
  • Alexandre Denis, Emmanuel Jeannot, Philippe Swartvagher. Predicting Performance of Communications and Computations under Memory Contention in Distributed HPC Systems. International Journal of Networking and Computing, 2023, Special Issue on Workshop on Advances in Parallel and Distributed Computational Models 2022, 13 (1), pp.30. ⟨hal-03871630⟩
  • Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche, Julien Herrmann, Antoine Jego. Task-based parallel programming for scalable matrix product algorithms. ACM Transactions on Mathematical Software, 2023, ⟨10.1145/3583560⟩. ⟨hal-03936659v2⟩
  • Loris Marchal, Thibault Marette, Grégoire Pichon, Frédéric Vivien. Trading Performance for Memory in Sparse Direct Solvers using Low-rank Compression. Future Generation Computer Systems, 2022, 130, pp.307-320. ⟨10.1016/j.future.2021.12.018⟩. ⟨hal-03517124⟩
  • Gabriel Bathie, Loris Marchal, Yves Robert, Samuel Thibault. Dynamic DAG Scheduling Under Memory Constraints for Shared-Memory Platforms. International Journal of Networking and Computing, 2021, pp.1-29. ⟨10.15803/ijnc.11.1_27⟩. ⟨hal-03029847⟩
  • Alfredo Buttari, Søren Hauberg, Costy Kodsi. Parallel QR factorization of block-tridiagonal matrices. SIAM Journal on Scientific Computing, 2020, 42 (6), pp.C313-C334. ⟨10.1137/19M1306166⟩. ⟨hal-02370953v2⟩

Conference papers

  • Maxime Gonthier, Elisabeth Larsson, Loris Marchal, Carl Nettelblad, Samuel Thibault. Data-Driven Locality-Aware Batch Scheduling. APDCM 2024 - 26th Workshop on Advances in Parallel and Distributed Computational Models, 38th IEEE International Parallel and Distributed Processing Symposium, May 2024, San Francisco, United States. ⟨hal-04500281⟩
  • Nathalie Furmento, Abdou Guermouche, Gwenolé Lucas, Thomas Morin, Samuel Thibault, et al. Optimizing Parallel System Efficiency: Dynamic Task Graph Adaptation with Recursive Tasks. WAMTA 2024 - Workshop on Asynchronous Many-Task Systems and Applications 2024, Feb 2024, Knoxville, United States. ⟨hal-04548787⟩
  • Alycia Lisito, Mathieu Faverge, Grégoire Pichon, Pierre Ramet. Enhancing sparse direct solver scalability through runtime system automatic data partition. WAMTA 2024 - Workshop on Asynchronous Many-Task Systems and Applications 2024, Feb 2024, Knoxville, United States. ⟨hal-04527103⟩
  • Emmanuel Agullo, Alfredo Buttari, Marek Felšöci, Guillaume Sylvand. Vers un solveur direct à base de tâches pour des systèmes linéaires FEM/BEM creux/denses. ComPAS 2023 - Conférence francophone d'informatique en Parallélisme, Architecture et Système, LISTIC : Laboratoire d’Informatique, Systèmes, Traitement de l’Information et de la Connaissance, de l’Université Savoie Mont Blanc, Jul 2023, Annecy, France. ⟨hal-04178064⟩
  • Olivier Beaumont, Jean-Alexandre Collin, Lionel Eyraud-Dubois, Mathieu Vérité. Data Distribution Schemes for Dense Linear Algebra Factorizations on Any Number of Nodes. IPDPS 2023 - 37th IEEE International Parallel & Distributed Processing Symposium, IEEE, May 2023, St. Petersburg, Florida, United States. ⟨hal-04013708⟩
  • Emmanuel Agullo, Alfredo Buttari, Olivier Coulaud, Lionel Eyraud-Dubois, Mathieu Faverge, et al. On the Arithmetic Intensity of Distributed-Memory Dense Matrix Multiplication Involving a Symmetric Input Matrix (SYMM). IPDPS 2023 - 37th International Parallel and Distributed Processing Symposium, IEEE, May 2023, St. Petersburg, FL, United States. pp.357-367. ⟨hal-04093162⟩
  • Olivier Beaumont, Philippe Duchon, Lionel Eyraud-Dubois, Julien Langou, Mathieu Vérité. Symmetric Block-Cyclic Distribution: Fewer Communications Leads to Faster Dense Cholesky Factorization. SC 2022 - Supercomputing, Nov 2022, Dallas, Texas, United States. ⟨hal-03768910⟩
  • Mathieu Faverge, Nathalie Furmento, Abdou Guermouche, Gwenolé Lucas, Raymond Namyst, et al. Programming Heterogeneous Architectures Using Hierarchical Tasks. HeteroPar 2022 - twentieth international workshop, Aug 2022, Glasgow, United Kingdom. pp.12. ⟨hal-03789625⟩
  • Olivier Beaumont, Lionel Eyraud-Dubois, Mathieu Vérité, Julien Langou. I/O-Optimal Algorithms for Symmetric Linear Algebra Kernels. ACM Symposium on Parallelism in Algorithms and Architectures, Association for Computing Machinery : SIGACT, SIGARCH, Jul 2022, Philadelphia, United States. ⟨hal-03580531⟩
  • Philippe Swartvagher. Interactions entre calculs et communications au sein des systèmes HPC distribués : évaluation et modélisation. COMPAS 2022 - Conférence francophone d'informatique en Parallélisme, Architecture et Système, Jul 2022, Amiens, France. ⟨hal-03719612⟩
  • Mathieu Faverge, Nathalie Furmento, Abdou Guermouche, Gwenolé Lucas, Samuel Thibault, et al. Programmation des architectures hétérogènes à l'aide de tâches hiérarchiques. COMPAS 2022 - Conférence francophone d'informatique en Parallélisme, Architecture et Système, Jul 2022, Amiens, France. ⟨hal-03773486⟩
  • Alexandre Denis, Emmanuel Jeannot, Philippe Swartvagher. Modeling Memory Contention between Communications and Computations in Distributed HPC Systems. IPDPS - 2022 - IEEE International Parallel and Distributed Processing Symposium Workshops, May 2022, Lyon / Virtual, France. pp.10, ⟨10.1109/IPDPSW55747.2022.00086⟩. ⟨hal-03682199⟩
  • Maxime Gonthier, Loris Marchal, Samuel Thibault. Memory-Aware Scheduling of Tasks Sharing Data on Multiple GPUs with Dynamic Runtime Systems. IPDPS 2022 - 36th IEEE International Parallel & Distributed Processing Symposium, May 2022, Lyon, France. pp.1-11, ⟨10.1109/IPDPS53621.2022.00073⟩. ⟨hal-03552243⟩
  • Emmanuel Agullo, Marek Felšöci, Guillaume Sylvand. Direct solution of larger coupled sparse/dense linear systems using low-rank compression on single-node multi-core machines in an industrial context. IPDPS 2022 - 36th IEEE International Parallel and Distributed Processing Symposium, May 2022, Lyon, France. pp.11, ⟨10.1109/IPDPS53621.2022.00012⟩. ⟨hal-03774145⟩
  • Maxime Gonthier, Loris Marchal, Samuel Thibault. Locality-Aware Scheduling of Independent Tasks for Runtime Systems. COLOC 2021 - 5th workshop on data locality - 27th International European Conference on Parallel and Distributed Computing, Aug 2021, Lisbon, Portugal. pp.1-12, ⟨10.1007/978-3-031-06156-1_1⟩. ⟨hal-03290998⟩
  • Alexandre Denis, Emmanuel Jeannot, Philippe Swartvagher. Interferences between Communications and Computations in Distributed HPC Systems. ICPP 2021 - 50th International Conference on Parallel Processing, Aug 2021, Chicago / Virtual, United States. pp.11, ⟨10.1145/3472456.3473516⟩. ⟨hal-03290121⟩
  • Emmanuel Agullo, Marek Felšöci, Guillaume Sylvand. Comparison of coupled solvers for FEM/BEM linear systems arising from discretization of aeroacoustic problems. COMPAS 2021 - Conférence francophone d'informatique en Parallélisme, Architecture et Système, Jul 2021, Lyon / Virtual, France. ⟨hal-03264472⟩
  • Philippe Swartvagher. Interactions entre calculs et communications au sein des systèmes HPC distribués. COMPAS 2021 - Conférence francophone d'informatique en Parallélisme, Architecture et Système, Jul 2021, Lyon, France. ⟨hal-03290074⟩
  • Lionel Eyraud-Dubois, Cristiana Bentes. Algorithms for Preemptive Co-scheduling of Kernels on GPUs. HiPC 2020: 27th IEEE International Conference on High Performance Computing, Data, and Analytics, Dec 2020, Pune / Virtual, India. ⟨hal-03148711⟩
  • Olivier Beaumont, Lionel Eyraud-Dubois, Mathieu Vérité. 2D Static Resource Allocation for Compressed Linear Algebra and Communication Constraints. HiPC 2020: 27th IEEE International Conference on High Performance Computing, Data, and Analytics, Dec 2020, (virtual), India. ⟨hal-02900244v2⟩
  • Alexandre Denis, Emmanuel Jeannot, Philippe Swartvagher, Samuel Thibault. Using Dynamic Broadcasts to improve Task-Based Runtime Performances. Euro-Par - 26th International European Conference on Parallel and Distributed Computing, Rzadca and Malawski, Aug 2020, Warsaw, Poland. ⟨10.1007/978-3-030-57675-2_28⟩. ⟨hal-02872765⟩
  • Olivier Beaumont, Julien Langou, Willy Quach, Alena Shilova. A Makespan Lower Bound for the Scheduling of the Tiled Cholesky Factorization based on ALAP Schedule. EuroPar 2020 - 26th International European Conference on Parallel and Distributed Computing, Aug 2020, Warsaw / Virtual, Poland. ⟨hal-02487920⟩
  • Changjiang Gou, Ali Al Zoobi, Anne Benoit, Mathieu Faverge, Loris Marchal, et al. Improving mapping for sparse direct solvers: A trade-off between data locality and load balancing. EuroPar 2020 - 26th International European Conference on Parallel and Distributed Computing, Aug 2020, Warsaw / Virtual, Poland. pp.1-16. ⟨hal-02973315⟩
  • Philippe Swartvagher. Amélioration des performances de supports d'exécution à tâches à l'aide de broadcasts dynamiques. COMPAS 2020 - Conférence francophone d'informatique en Parallélisme, Architecture et Système, Jun 2020, Lyon, France. ⟨hal-02580626⟩
  • Rocío Carratalá-Sáez, Mathieu Faverge, Grégoire Pichon, Guillaume Sylvand, Enrique S Quintana-Ortí. Tiled Algorithms for Efficient Task-Parallel H-Matrix Solvers. PDSEC 2020 - 21st IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing, May 2020, New Orleans, United States. pp.1-10. ⟨hal-02513433⟩
  • Gabriel Bathie, Loris Marchal, Yves Robert, Samuel Thibault. Revisiting dynamic DAG scheduling under memory constraints for shared-memory platforms. IPDPS - 2020 - IEEE International Parallel and Distributed Processing Symposium Workshops, May 2020, New Orleans / Virtual, United States. pp.1-10, ⟨10.1109/IPDPSW50202.2020.00102⟩. ⟨hal-03024626⟩
  • Grégoire Pichon, Mathieu Faverge, Pierre Ramet. Recent Developments Around the Block Low-Rank PaStiX Solver. PP 2020 - SIAM Conference on Parallel Processing for Scientific Computing, Feb 2020, Seattle, United States. ⟨hal-03140189⟩
  • Alexandre Denis. Scalability of the NewMadeleine Communication Library for Large Numbers of MPI Point-to-Point Requests. CCGrid 2019 - 19th Annual IEEE/ACM International Symposium in Cluster, Cloud, and Grid Computing, May 2019, Larnaca, Cyprus. ⟨hal-02103700⟩

Poster communications

  • Maxime Gonthier, Loris Marchal, Samuel Thibault. Memory-Aware Scheduling Of Tasks Sharing Data On Multiple GPUs. ISC 2023 - ISC High Performance 2023, May 2023, Hamburg, Germany. Lecture Notes in Computer Science. ⟨hal-04090595⟩
  • Maxime Gonthier, Samuel Thibault, Loris Marchal. Memory-Aware Scheduling Of Tasks Sharing Data On Multiple GPUs. HiPEAC ACACES 2022 - 18th International Summer School on Advanced Computer Architecture and Compilation for High-performance Embedded Systems, Jul 2022, Fiuggi, Italy. ⟨hal-04090607⟩
  • Maxime Gonthier, Loris Marchal, Samuel Thibault. Locality-Aware Scheduling Of Independent Tasks For Runtime Systems. HiPEAC ACACES 2021, Sep 2021, Fiuggi, Italy. ⟨hal-04090604⟩
  • Philippe Swartvagher. Interferences between Communications and Computations in Distributed HPC Systems. Euro-Par - 27th International European Conference on Parallel and Distributed Computing, Aug 2021, Lisbon / Virtual, Portugal. Euro-Par 2021: Parallel Processing Workshops. ⟨hal-03333852⟩
  • Philippe Swartvagher. Interferences between Communications and Computations in Distributed HPC Systems. Journée de l'École Doctorale Mathématiques et Informatique, May 2021, Bordeaux, France. ⟨hal-03292004⟩

Preprints, Working Papers

  • Maxime Gonthier, Samuel Thibault, Loris Marchal. A generic scheduler to foster data locality for GPU and out-of-core task-based applications. 2023. ⟨hal-04146714⟩

Reports

  • Mathieu Faverge, Nathalie Furmento, Abdou Guermouche, Gwenolé Lucas, Raymond Namyst, et al. Programming Heterogeneous Architectures Using Hierarchical Tasks. [Research Report] RR-9466, Inria Bordeaux Sud-Ouest. 2022, pp.21. ⟨hal-03609275v2⟩
  • Alexandre Denis, Emmanuel Jeannot, Philippe Swartvagher. Modeling Memory Contention between Communications and Computations in Distributed HPC Systems (Extended Version). [Research Report] RR-9451, INRIA Bordeaux, TADAAM team. 2022, pp.34. ⟨hal-03564751v2⟩
  • Emmanuel Agullo, Marek Felšöci, Guillaume Sylvand. Direct solution of larger coupled sparse/dense linear systems using low-rank compression on single-node multi-core machines in an industrial context. [Research Report] RR-9453, Inria Bordeaux Sud-Ouest. 2022, pp.25. ⟨hal-03557692⟩
  • Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche, Julien Herrmann, Antoine Jego. Task-Based Parallel Programming for Scalable Algorithms: application to Matrix Multiplication. [Research Report] RR-9461, Inria Bordeaux - Sud-Ouest. 2022, pp.29. ⟨hal-03588491v2⟩
  • Emmanuel Agullo, Marek Felšöci, Guillaume Sylvand. A comparison of selected solvers for coupled FEM/BEM linear systems arising from discretization of aeroacoustic problems: literate and reproducible environment. [Technical Report] RT-0513, Inria Bordeaux Sud-Ouest. 2021, pp.100. ⟨hal-03263620⟩
  • Maxime Gonthier, Loris Marchal, Samuel Thibault. Locality-Aware Scheduling of Independent Tasks for Runtime Systems. [Research Report] RR-9394, Inria Grenoble - Rhône-Alpes. 2021, pp.21. ⟨hal-03144290v7⟩
  • Loris Marchal, Thibault Marette, Grégoire Pichon, Frédéric Vivien. Trading Performance for Memory in Sparse Direct Solvers using Low-rank Compression. [Research Report] RR-9368, INRIA. 2020. ⟨hal-02976233⟩

Theses

  • Antoine Jego. Advanced task-based programming models for scalable linear algebra operations. Other [cs.OH]. Institut National Polytechnique de Toulouse - INPT, 2023. English. ⟨NNT : 2023INPT0107⟩. ⟨tel-04440126⟩
  • Gwenolé Lucas. On the Use of Hierarchical Task for Heterogeneous Architectures. Distributed, Parallel, and Cluster Computing [cs.DC]. Université de Bordeaux, 2023. English. ⟨NNT : 2023BORD0231⟩. ⟨tel-04316145⟩
  • Maxime Gonthier. Scheduling Under Memory Constraint in Task-based Runtime Systems. Distributed, Parallel, and Cluster Computing [cs.DC]. École normale supérieure de Lyon - ENS Lyon, 2023. English. ⟨NNT : 2023ENSL0061⟩. ⟨tel-04260094⟩
  • Philippe Swartvagher. On the Interactions between HPC Task-based Runtime Systems and Communication Libraries. Data Structures and Algorithms [cs.DS]. Université de Bordeaux, 2022. English. ⟨NNT : 2022BORD0322⟩. ⟨tel-03989856⟩