Informática
Facultad
University of Texas at Austin
Austin, Estados UnidosPublicaciones en colaboración con investigadores/as de University of Texas at Austin (17)
2022
-
Algorithm 1022: Efficient Algorithms for Computing a Rank-Revealing UTV Factorization on Parallel Computing Architectures
ACM Transactions on Mathematical Software, Vol. 48, Núm. 2
2019
-
A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization with Partial Pivoting
IEEE Access, Vol. 7, pp. 17617-17633
-
Enhanced Limited Magnitude Error Correcting Codes for Multilevel Cell Main Memories
IEEE Transactions on Nanotechnology, Vol. 18, pp. 1023-1026
2016
-
Analytical modeling is enough for high-performance BLIS
ACM Transactions on Mathematical Software, Vol. 43, Núm. 2
-
The BLIS framework: Experiments in portability
ACM Transactions on Mathematical Software, Vol. 42, Núm. 2
2013
-
Implementing triple adjacent Error Correction in double error correction Orthogonal Latin Squares Codes
Proceedings - IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems
-
Scheduling algorithms-by-blocks on small clusters
Concurrency Computation Practice and Experience, Vol. 25, Núm. 3, pp. 367-384
2012
-
A runtime system for programming out-of-core maatrix algorithms-by-tiles on multithreaded architectures
ACM Transactions on Mathematical Software, Vol. 38, Núm. 4
-
Level-3 BLAS on a GPU: Picking the low hanging fruit
AIP Conference Proceedings
-
Level-3 BLAS on the TI C6678 multi-core DSP
Proceedings - Symposium on Computer Architecture and High Performance Computing
-
The FLAME approach: From dense linear algebra algorithms to high-performance multi-accelerator implementations
Journal of Parallel and Distributed Computing, Vol. 72, Núm. 9, pp. 1134-1143
-
Unleashing the high-performance and low-power of multi-core DSPs for general-purpose HPC
International Conference for High Performance Computing, Networking, Storage and Analysis, SC
2011
-
Power-aware dense linear algebra implementations on multi-core and many-core processors
3rd Many-Core Applications Research Community Symposium, MARC 2011
2010
-
Retargeting PLAPACK to clusters with hardware accelerators
Proceedings of the 2010 International Conference on High Performance Computing and Simulation, HPCS 2010
2009
-
Out-of-core solution of linear systems on graphics processors
International Journal of Parallel, Emergent and Distributed Systems, Vol. 24, Núm. 6, pp. 521-538
-
Solving dense linear systems on platforms with multiple hardware accelerators
ACM SIGPLAN Notices, Vol. 44, Núm. 4, pp. 121-129
-
Solving dense linear systems on platforms with multiple hardware accelerators
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP