Publicaciones en las que colabora con RAFAEL RODRÍGUEZ SÁNCHEZ (17)

2024

  1. Experiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors

    International Journal of High Performance Computing Applications, Vol. 38, Núm. 2, pp. 55-68

2022

  1. NUMA-Aware Dense Matrix Factorizations and Inversion with Look-Ahead on Multicore Processors

    Proceedings - Symposium on Computer Architecture and High Performance Computing

  2. QR Factorization Using Malleable BLAS on Multicore Processors

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

2021

  1. A New Generation of Task-Parallel Algorithms for Matrix Inversion in Many-Threaded CPUs

    Proceedings of the 12th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2021

  2. Low precision matrix multiplication for efficient deep learning in NVIDIA Carmel processors

    Journal of Supercomputing, Vol. 77, Núm. 10, pp. 11257-11269

  3. Scalable Hybrid Loop- And Task-Parallel Matrix Inversion for Multicore Processors

    2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021

2020

  1. Integration and exploitation of intra-routine malleability in BLIS

    Journal of Supercomputing, Vol. 76, Núm. 4, pp. 2860-2875

  2. Programming parallel dense matrix factorizations with look-ahead and OpenMP

    Cluster Computing, Vol. 23, Núm. 1, pp. 359-375

2016

  1. Architecture-aware configuration and scheduling of matrix multiplication on asymmetric multicore processors

    Cluster Computing, Vol. 19, Núm. 3, pp. 1037-1051

  2. Refactoring conventional task schedulers to exploit asymmetric ARM big.LITTLE architectures in dense linear algebra

    Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016

2015

  1. Time and energy modeling of high-performance Level-3 BLAS on x86 architectures

    Simulation Modelling Practice and Theory, Vol. 55, pp. 77-94