Publicaciones en colaboración con investigadores/as de Universitat Politècnica de Catalunya (20)

2022

  1. NUMA-Aware Dense Matrix Factorizations and Inversion with Look-Ahead on Multicore Processors

    Proceedings - Symposium on Computer Architecture and High Performance Computing

2021

  1. A New Generation of Task-Parallel Algorithms for Matrix Inversion in Many-Threaded CPUs

    Proceedings of the 12th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2021

2020

  1. Towards an auto-tuned and task-based spmv (lass library)

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  2. sLASs: A fully automatic auto-tuned linear algebra library based on OpenMP extensions implemented in OmpSs (LASs Library)

    Journal of Parallel and Distributed Computing, Vol. 138, pp. 153-171

2019

  1. A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization with Partial Pivoting

    IEEE Access, Vol. 7, pp. 17617-17633

  2. Accelerating conjugate gradient using OmpSs

    Proceedings - 2019 20th International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2019

  3. BLAS-3 Optimized by OmpSs Regions (LASs Library)

    Proceedings - 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2019

  4. Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD

    Numerical Algorithms, Vol. 80, Núm. 2, pp. 635-660

2017

  1. DSPONE48: A methodology for automatically synthesize HDL focus on the reuse of DSP slices

    Journal of Parallel and Distributed Computing, Vol. 106, pp. 132-142

  2. Reduction to tridiagonal form for symmetric eigenproblems on asymmetric multicore processors

    Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2017

  3. Static versus dynamic task scheduling of the LU factorization on arm big. little architectures

    Proceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2017

2010

  1. Extending OpenMP to survive the heterogeneous multi-core era

    International Journal of Parallel Programming

2009

  1. A proposal to extend the OpenMP tasking model for heterogeneous architectures

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)