SANDRA
CATALÁN PALLARÉS
Forscherin bis um 2022
Publikationen (45) Publikationen von SANDRA CATALÁN PALLARÉS
2024
-
Experiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors
International Journal of High Performance Computing Applications, Vol. 38, Núm. 2, pp. 55-68
-
Inference with Transformer Encoders on ARM and RISC-V Multicore Processors
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
-
Parallel GEMM-based convolutions for deep learning on multicore ARM and RISC-V architectures
Journal of Systems Architecture, Vol. 153
2023
-
Automatic Generation of Micro-kernels for Performance Portability of Matrix Multiplication on RISC-V Vector Processors
ACM International Conference Proceeding Series
-
Fine-grain task-parallel algorithms for matrix factorizations and inversion on many-threaded CPUs
Concurrency and Computation: Practice and Experience
-
Programming parallel dense matrix factorizations and inversion for new-generation NUMA architectures
Journal of Parallel and Distributed Computing, Vol. 175, pp. 51-65
2022
-
NUMA-Aware Dense Matrix Factorizations and Inversion with Look-Ahead on Multicore Processors
Proceedings - Symposium on Computer Architecture and High Performance Computing
-
QR Factorization Using Malleable BLAS on Multicore Processors
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
2021
-
A New Generation of Task-Parallel Algorithms for Matrix Inversion in Many-Threaded CPUs
Proceedings of the 12th International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2021
-
Leveraging teaching on demand: Approaching HPC to undergrads
Journal of Parallel and Distributed Computing, Vol. 156, pp. 148-162
-
Scalable Hybrid Loop- And Task-Parallel Matrix Inversion for Multicore Processors
2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021
2020
-
Programming parallel dense matrix factorizations with look-ahead and OpenMP
Cluster Computing, Vol. 23, Núm. 1, pp. 359-375
-
Towards an auto-tuned and task-based spmv (lass library)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
-
sLASs: A fully automatic auto-tuned linear algebra library based on OpenMP extensions implemented in OmpSs (LASs Library)
Journal of Parallel and Distributed Computing, Vol. 138, pp. 153-171
2019
-
A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization with Partial Pivoting
IEEE Access, Vol. 7, pp. 17617-17633
-
Accelerating conjugate gradient using OmpSs
Proceedings - 2019 20th International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2019
-
BLAS-3 Optimized by OmpSs Regions (LASs Library)
Proceedings - 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2019
-
Dynamic look-ahead in the reduction to band form for the singular value decomposition
Parallel Computing, Vol. 81, pp. 22-31
-
Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD
Numerical Algorithms, Vol. 80, Núm. 2, pp. 635-660
-
Tasking in accelerators: Performance evaluation
Proceedings - 2019 20th International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2019