FRANCISCO DANIEL
IGUAL PEÑA
Profesor titular de universidad
SANDRA
CATALÁN PALLARÉS
Profesora ayudante doctora
Publicaciones en las que colabora con SANDRA CATALÁN PALLARÉS (16)
2024
-
Experiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors
International Journal of High Performance Computing Applications, Vol. 38, Núm. 2, pp. 55-68
-
Inference with Transformer Encoders on ARM and RISC-V Multicore Processors
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
2023
-
Automatic Generation of Micro-kernels for Performance Portability of Matrix Multiplication on RISC-V Vector Processors
ACM International Conference Proceeding Series
-
Fine-grain task-parallel algorithms for matrix factorizations and inversion on many-threaded CPUs
Concurrency and Computation: Practice and Experience
-
Programming parallel dense matrix factorizations and inversion for new-generation NUMA architectures
Journal of Parallel and Distributed Computing, Vol. 175, pp. 51-65
2022
-
NUMA-Aware Dense Matrix Factorizations and Inversion with Look-Ahead on Multicore Processors
Proceedings - Symposium on Computer Architecture and High Performance Computing
-
QR Factorization Using Malleable BLAS on Multicore Processors
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
2021
-
A New Generation of Task-Parallel Algorithms for Matrix Inversion in Many-Threaded CPUs
Proceedings of the 12th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2021
-
Scalable Hybrid Loop- And Task-Parallel Matrix Inversion for Multicore Processors
2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021
2020
-
Programming parallel dense matrix factorizations with look-ahead and OpenMP
Cluster Computing, Vol. 23, Núm. 1, pp. 359-375
2018
-
Multi-threaded dense linear algebra libraries for low-power asymmetric multicore processors
Journal of Computational Science, Vol. 25, pp. 140-151
2017
-
Revisiting conventional task schedulers to exploit asymmetry in multi-core architectures for dense linear algebra operations
Parallel Computing, Vol. 68, pp. 59-76
-
Time and energy modeling of a high-performance multi-threaded Cholesky factorization
Journal of Supercomputing, Vol. 73, Núm. 1, pp. 139-151
2016
-
Architecture-aware configuration and scheduling of matrix multiplication on asymmetric multicore processors
Cluster Computing, Vol. 19, Núm. 3, pp. 1037-1051
-
Refactoring conventional task schedulers to exploit asymmetric ARM big.LITTLE architectures in dense linear algebra
Proceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016
2015
-
Time and energy modeling of high-performance Level-3 BLAS on x86 architectures
Simulation Modelling Practice and Theory, Vol. 55, pp. 77-94