Publications in collaboration with researchers from Universidad Politécnica de Valencia (26)

2022

  1. Anatomy of the BLIS Family of Algorithms for Matrix Multiplication

    Proceedings - 30th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2022

  2. NUMA-Aware Dense Matrix Factorizations and Inversion with Look-Ahead on Multicore Processors

    Proceedings - Symposium on Computer Architecture and High Performance Computing

  3. QR Factorization Using Malleable BLAS on Multicore Processors

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

2021

  1. A New Generation of Task-Parallel Algorithms for Matrix Inversion in Many-Threaded CPUs

    Proceedings of the 12th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2021

  2. Low precision matrix multiplication for efficient deep learning in NVIDIA Carmel processors

    Journal of Supercomputing, Vol. 77, No. 10, pp. 11257-11269

  3. Scalable Hybrid Loop- And Task-Parallel Matrix Inversion for Multicore Processors

    2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021

2020

  1. Integration and exploitation of intra-routine malleability in BLIS

    Journal of Supercomputing, Vol. 76, No. 4, pp. 2860-2875

  2. Programming parallel dense matrix factorizations with look-ahead and OpenMP

    Cluster Computing, Vol. 23, No. 1, pp. 359-375

2018

  1. Optimized Fundamental Signal Processing Operations for Energy Minimization on Heterogeneous Mobile Devices

    IEEE Transactions on Circuits and Systems I: Regular Papers, Vol. 65, No. 5, pp. 1614-1627

2017

  1. Solving Weighted Least Squares (WLS) problems on ARM-based architectures

    Journal of Supercomputing, Vol. 73, No. 1, pp. 530-542

2015

  1. Time and energy modeling of high-performance Level-3 BLAS on x86 architectures

    Simulation Modelling Practice and Theory, Vol. 55, pp. 77-94

  2. Vectorization of binaural sound virtualization on the ARM Cortex-A15 architecture

    2015 23rd European Signal Processing Conference, EUSIPCO 2015