Publications

Buttari, A., J. Dongarra, J. Kurzak, J. Langou, J. Langou, P. Luszczek, and S. Tomov, “Exploiting Mixed Precision Floating Point Hardware in Scientific Computations,” in High Performance Computing and Grids in Action (to appear), Amsterdam, IOS Press, 2007.

Buttari, A., J. Langou, J. Kurzak, and J. Dongarra, “A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures,” Parallel Computing, vol. 35, pp. 38-53, 2009.

Buttari, A., J. Langou, J. Kurzak, and J. Dongarra, “A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures,” University of Tennessee Computer Science Technical Report, no. UT-CS-07-600 (also LAPACK Working Note 191), January 2007.

Buttari, A., P. Luszczek, J. Kurzak, J. Dongarra, and G. Bosilca, “SCOP3: A Rough Guide to Scientific Computing On the PlayStation 3,” University of Tennessee Computer Science Dept. Technical Report, UT-CS-07-595, 2007.

Buttari, A., J. Langou, J. Kurzak, and J. Dongarra, “Parallel Tiled QR Factorization for Multicore Architectures,” Concurrency and Computation: Practice and Experience, vol. 20, pp. 1573-1590, January 2008.

Buttari, A., J. Dongarra, J. Langou, J. Langou, P. Luszczek, and J. Kurzak, “Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems,” International Journal of High Performance Computing Applications (to appear), August 2007.

Buttari, A., J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, and S. Tomov, “The Impact of Multicore on Math Software,” PARA 2006, Umea, Sweden, June 2006.

Buttari, A., J. Dongarra, and J. Kurzak, “Limitations of the Playstation 3 for High Performance Cluster Computing,” University of Tennessee Computer Science Technical Report, UT-CS-07-597 (also LAPACK Working Note 185), 2007.

Buttari, A., J. Dongarra, J. Kurzak, P. Luszczek, and S. Tomov, “Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy,” ACM Transactions on Mathematical Software, vol. 34, no. 4, pp. 17-22, 2008.

Buttari, A., J. Langou, J. Kurzak, and J. Dongarra, “Parallel Tiled QR Factorization for Multicore Architectures,” University of Tennessee Computer Science Dept. Technical Report, UT-CS-07-598 (also LAPACK Working Note 190), 2007.

Buttari, A., J. Dongarra, J. Kurzak, J. Langou, J. Langou, P. Luszczek, and S. Tomov, “Exploiting Mixed Precision Floating Point Hardware in Scientific Computations,” in High Performance Computing and Grids in Action, Amsterdam, IOS Press, January 2008.

Buttari, A., J. Langou, J. Kurzak, and J. Dongarra, “A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures,” Parallel Computing (to appear), 2010.

Buttari, A., J. Dongarra, P. Husbands, J. Kurzak, and K. Yelick, “Multithreading for synchronization tolerance in matrix factorization,” Journal of Physics: Conference Series, SciDAC 2007, vol. 78, no. 2007, January 2007.

Buttari, A., V. Eijkhout, J. Langou, and S. Filippone, “Performance Optimization and Modeling of Blocked Sparse Kernels,” ICL Technical Report, no. ICL-UT-04-05, 2004.

Bujanovic, Z., and Z. Drmac, “New Robust ScaLAPACK Routine for Computing the QR Factorization with Column Pivoting,” LAPACK Working Note, no. LAWN 296, ICL-UT-19-14: University of Tennessee, October 2019.

8th International Conference on Computational Science (ICCS), Proceedings Parts I, II, and III, Lecture Notes in Computer Science, vol. 5101, Krakow, Poland, Springer Berlin, January 2008.

Browne, S., J. Dongarra, and A. Trefethen, “Numerical Libraries and Tools for Scalable Parallel Cluster Computing,” IEEE Cluster Computing BOF at SC99, Portland, Oregon, January 1999.

Browne, S., J. Dongarra, N. Garner, K. London, and P. Mucci, “A Scalable Cross-Platform Infrastructure for Application Performance Tuning Using Hardware Counters,” Proceedings of SuperComputing 2000 (SC'00), Dallas, TX, November 2000.

Browne, S., J. Dongarra, N. Garner, K. London, and P. Mucci, “A Portable Programming Interface for Performance Evaluation on Modern Processors,” University of Tennessee Computer Science Technical Report, UT-CS-00-444, July 2000.

Browne, S., C. Deane, G. Ho, and P. Mucci, “PAPI: A Portable Interface to Hardware Performance Counters,” Proceedings of Department of Defense HPCMP Users Group Conference, June 1999.

Browne, S., J. Dongarra, J. Horner, P. McMahan, and S. Wells, “National HPCC Software Exchange (NHSE): Uniting the High Performance Computing and Communications Community,” D-Lib Magazine, January 1998.

Browne, S., J. Dongarra, N. Garner, G. Ho, and P. Mucci, “A Portable Programming Interface for Performance Evaluation on Modern Processors,” The International Journal of High Performance Computing Applications, vol. 14, no. 3, pp. 189-204, September 2000.

Browne, S., J. Dongarra, and A. Trefethen, “Numerical Libraries and Tools for Scalable Parallel Cluster Computing,” International Journal of High Performance Applications and Supercomputing, vol. 15, no. 2, pp. 175-180, October 2002.

Browne, S., P. McMahan, and S. Wells, “Repository in a Box Toolkit for Software and Resource Sharing,” University of Tennessee Computer Science Department Technical Report, no. ICL-UT-05-05, 2001.

Brown, C., A. Abdelfattah, S. Tomov, and J. Dongarra, hipMAGMA v2.0: Zenodo, July 2020.

Brown, C., A. Abdelfattah, S. Tomov, and J. Dongarra, “Design, Optimization, and Benchmarking of Dense Linear Algebra Algorithms on AMD GPUs,” 2020 IEEE High Performance Extreme Computing Virtual Conference: IEEE, September 2020.

Brown, C., A. Abdelfattah, S. Tomov, and J. Dongarra, hipMAGMA v1.0: Zenodo, March 2020.

Brown, C., A. Abdelfattah, S. Tomov, and J. Dongarra, “Design, Optimization, and Benchmarking of Dense Linear Algebra Algorithms on AMD GPUs,” Innovative Computing Laboratory Technical Report, no. ICL-UT-20-12: University of Tennessee, August 2020.

Brown, J., A. Abdelfattah, V. Barra, V. Dobrev, Y. Dudouit, P. Fischer, T. Kolev, D. Medina, M. Min, T. Ratnayaka, et al., CEED ECP Milestone Report: Public release of CEED 2.0: Zenodo, April 2019.

Brown, J., A. Abdelfattah, V. Barra, N. Beams, J-S. Camier, V. Dobrev, Y. Dudouit, L. Ghaffari, T. Kolev, D. Medina, et al., “libCEED: Fast algebra for high-order element-based discretizations,” Journal of Open Source Software, vol. 6, no. 63, pp. 2945, 2021.

Brady, T., A. Lastovetsky, K. Seymour, M. Guidolin, and J. Dongarra, “SmartGridRPC: The new RPC model for high performance Grid Computing and Its Implementation in SmartGridSolve,” Concurrency and Computation: Practice and Experience (to appear), January 2010.

Bouteiller, A., G. Bosilca, and J. Dongarra, “Redesigning the Message Logging Model for High Performance,” International Supercomputer Conference (ISC 2008), Dresden, Germany, January 2008.

Bouteiller, A., T. Herault, G. Bosilca, and J. Dongarra, “Correlated Set Coordination in Fault Tolerant Message Logging Protocols,” Proceedings of 17th International Conference, Euro-Par 2011, Part II, vol. 6853, Bordeaux, France, Springer, pp. 51-64, August 2011.

Bouteiller, A., G. Bosilca, and J. Dongarra, “Redesigning the Message Logging Model for High Performance,” Concurrency and Computation: Practice and Experience (online version), June 2010.

Bouteiller, A., F. Cappello, J. Dongarra, A. Guermouche, T. Herault, and Y. Robert, “Multi-criteria checkpointing strategies: optimizing response-time versus resource utilization,” University of Tennessee Computer Science Technical Report, no. ICL-UT-13-01, February 2013.

Bouteiller, A., and G. Bosilca, “Implicit Actions and Non-blocking Failure Recovery with MPI,” 2022 IEEE/ACM 12th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS), Dallas, TX, USA, IEEE, January 2023, 2022.

Bouteiller, A., T. Herault, G. Bosilca, and J. Dongarra, “Correlated Set Coordination in Fault Tolerant Message Logging Protocols,” Concurrency and Computation: Practice and Experience, vol. 25, issue 4, pp. 572-585, March 2013.

Bouteiller, A., T. Ropars, G. Bosilca, C. Morin, and J. Dongarra, “Reasons for a Pessimistic or Optimistic Message Logging Protocol in MPI Uncoordinated Failure Recovery,” CLUSTER '09, New Orleans, IEEE, August 2009.

Bouteiller, A., G. Bosilca, and J. Dongarra, “Retrospect: Deterministic Replay of MPI Applications for Interactive Distributed Debugging,” Accepted for Euro PVM/MPI 2007: Springer, September 2007.

Bouteiller, A., S. Pophale, S. Boehm, M. B. Baker, and M G. Venkata, “Evaluating Contexts in OpenSHMEM-X Reference Implementation,” OpenSHMEM and Related Technologies. Big Compute and Big Data Convergence, Cham, Springer International Publishing, pp. 50–62, 2018.

Bouteiller, A., G. Bosilca, T. Herault, and J. Dongarra, “Data Movement Interfaces to Support Dataflow Runtimes,” Innovative Computing Laboratory Technical Report, no. ICL-UT-18-03: University of Tennessee, May 2018.

Bouteiller, A., T. Herault, G. Bosilca, P. Du, and J. Dongarra, “Algorithm-based Fault Tolerance for Dense Matrix Factorizations, Multiple Failures, and Accuracy,” ACM Transactions on Parallel Computing, vol. 1, issue 2, no. 10, pp. 10:1-10:28, January 2015.

Bouteiller, A., G. Bosilca, and J. Dongarra, “Plan B: Interruption of Ongoing MPI Operations to Support Failure Recovery,” 22nd European MPI Users' Group Meeting, Bordeaux, France, ACM, September 2015.

Bouteiller, A., and F. Desprez, “Fault Tolerance Management for a Hierarchical GridRPC Middleware,” 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), Lyon, France, January 2008.

Bouteiller, A., F. Cappello, J. Dongarra, A. Guermouche, T. Herault, and Y. Robert, “Multi-criteria Checkpointing Strategies: Response-Time versus Resource Utilization,” Euro-Par 2013, Aachen, Germany, Springer, August 2013.

Bouteiller, A., G. Bosilca, and M G. Venkata, “Surviving Errors with OpenSHMEM,” OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, Baltimore, MD, USA, Springer International Publishing, pp. 66–81, 2016.

Bouteiller, A., T. Herault, and G. Bosilca, “A Multithreaded Communication Substrate for OpenSHMEM,” 8th International Conference on Partitioned Global Address Space Programming Models (PGAS), Eugene, OR, October 2014.

Boulet, P., J. Dongarra, Y. Robert, and F. Vivien, “Static Tiling for Heterogeneous Computing Platforms,” Parallel Computing, vol. 25, no. 5, pp. 547-568, January 1999.

Boulet, P., J. Dongarra, F. Rastello, Y. Robert, and F. Vivien, “Algorithmic Issues on Heterogeneous Computing Platforms,” Parallel Processing Letters, vol. 9, no. 2, pp. 197-213, January 1999.

Anzt, H., M. Casas, C. I. Malossi, E. S. Quintana-Ortí, F. Scheidegger, and S. Zhuang, “Approximate Computing for Scientific Applications,” Approximate Computing Techniques, 322: Springer International Publishing, pp. 415-465, January 2022.