Publications

Export 997 results:
Journal Article
Bosilca, G., A. Bouteiller, A. Guermouche, T. Herault, Y. Robert, P. Sens, and J. Dongarra, A Failure Detector for HPC Platforms,” The International Journal of High Performance Computing Applications, vol. 32, issue 1, pp. 139–158, January 2018.  (1.04 MB)
Abdelfattah, A., A. Haidar, S. Tomov, and J. Dongarra, Factorization and Inversion of a Million Matrices using GPUs: Challenges and Countermeasures,” Procedia Computer Science, vol. 108, pp. 606–615, June 2017.
Bland, W., P. Du, A. Bouteiller, T. Herault, G. Bosilca, and J. Dongarra, Extending the scope of the Checkpoint-on-Failure protocol for forward recovery in standard MPI,” Concurrency and Computation: Practice and Experience, July 2013.  (3.89 MB)
Langou, J., J. Langou, P. Luszczek, J. Kurzak, A. Buttari, and J. Dongarra, Exploiting the Performance of 32 bit Floating Point Arithmetic in Obtaining 64 bit Accuracy,” University of Tennessee Computer Science Tech Report, no. UT-CS-06-574, LAPACK Working Note #175, April 2006.  (221.39 KB)
Buttari, A., J. Dongarra, J. Kurzak, J. Langou, J. Langou, P. Luszczek, and S. Tomov, Exploiting Mixed Precision Floating Point Hardware in Scientific Computations,” In High Performance Computing and Grids in Action (to appear), Amsterdam, IOS Press, 00-2007.  (122.01 KB)
Buttari, A., J. Dongarra, J. Kurzak, J. Langou, J. Langou, P. Luszczek, and S. Tomov, Exploiting Mixed Precision Floating Point Hardware in Scientific Computations,” in High Performance Computing and Grids in Action, Amsterdam, IOS Press, January 2008.  (92.95 KB)
Fischer, M., and J. Dongarra, Experiences with Windows 95/NT as a Cluster Computing Platform for Parallel Computing,” Parallel and Distributed Computing Practices, Special Issue: Cluster Computing, vol. 2, no. 2: Nova Science Publishers, USA, pp. 119-128, February 1999.  (164.04 KB)
Anzt, H., B. Haugen, J. Kurzak, P. Luszczek, and J. Dongarra, Experiences in autotuning matrix multiplication for energy minimization on GPUs,” Concurrency in Computation: Practice and Experience, vol. 27, issue 17, pp. 5096-5113, December 2015.  (1.98 MB)
Anzt, H., B. Haugen, J. Kurzak, P. Luszczek, and J. Dongarra, Experiences in Autotuning Matrix Multiplication for Energy Minimization on GPUs,” Concurrency and Computation: Practice and Experience, vol. 27, issue 17, pp. 5096 - 5113, Oct-December 2015.  (1.99 MB)
Bland, W., A. Bouteiller, T. Herault, J. Hursey, G. Bosilca, and J. Dongarra, An evaluation of User-Level Failure Mitigation support in MPI,” Computing, vol. 95, issue 12, pp. 1171-1184, December 2013.  (311.23 KB)
Graham, R. L., R. Brightwell, B. Barrett, G. Bosilca, and J. Pjesivac–Grbovic, An Evaluation of Open MPI's Matching Transport Layer on the Cray XT,” EuroPVM/MPI 2007, September 2007.  (369.01 KB)
M. Lopez, G., V. Larrea, W. Joubert, O. Hernandez, A. Haidar, S. Tomov, and J. Dongarra, Evaluation of Directive-based Performance Portable Programming Models,” International Journal of High Performance Computing and Networking (IJHPCN), vol. (In Press), 2017.
M. Lopez, G., W. Joubert, V. Larrea, O. Hernandez, A. Haidar, S. Tomov, and J. Dongarra, Evaluation of Directive-Based Performance Portable Programming Models,” International Journal of High Performance Computing and Networking (to appear), 2019.
Jagode, H., A. Danalis, R. Hoque, M. Faverge, and J. Dongarra, Evaluation of Dataflow Programming Models for Electronic Structure Theory,” Concurrency and Computation: Practice and Experience: Special Issue on Parallel and Distributed Algorithms, vol. 2018, issue e4490, pp. 1–20, May 2018.  (1.69 MB)
Gabriel, E., G. Fagg, and J. Dongarra, Evaluating The Performance Of MPI-2 Dynamic Communicators And One-Sided Communication,” Lecture Notes in Computer Science, Recent Advances in Parallel Virtual Machine and Message Passing Interface, 10th European PVM/MPI User's Group Meeting, vol. 2840, Venice, Italy, Springer-Verlag, Berlin, pp. 88-97, September 2003.  (254.08 KB)
Hadri, B., H. Ltaeif, E. Agullo, and J. Dongarra, Enhancing Parallelism of Tile QR Factorization for Multicore Architectures,” Submitted to Transaction on Parallel and Distributed Systems, December 2009.  (464.23 KB)
Ltaeif, H., P. Luszczek, and J. Dongarra, Enhancing Parallelism of Tile Bidiagonal Transformation on Multicore Architectures using Tree Reduction,” Lecture Notes in Computer Science, vol. 7203, pp. 661-670, September 2012.  (185.77 KB)
Hiroyasu, T., M. Miki, S. Ogura, K. Aoi, T. Yoshida, Y. Okamoto, and J. Dongarra, Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover,” Special Issue on Biological Applications of Genetic and Evolutionary Computation (submitted), March 2003.  (438.68 KB)
Lively, C., X. Wu, V. Taylor, S. Moore, H-C. Chang, and K. Cameron, Energy and performance characteristics of different parallel implementations of scientific applications on multicore systems,” International Journal of High Performance Computing Applications, vol. 25, no. 3, pp. 342-350, 00-2011.  (467.18 KB)
Li, Y., A. YarKhan, J. Dongarra, K. Seymour, and A. Hurault, Enabling Workflows in GridSolve: Request Sequencing and Service Trading,” Journal of Supercomputing, vol. 64, issue 3, pp. 1133-1152, June 2013.  (821.29 KB)
Dongarra, J., and S. Moore, Empirical Performance Tuning of Dense Linear Algebra Software,” in Performance Tuning of Scientific Applications (to appear), 00-2010.
Baboulin, M., D. Becker, G. Bosilca, A. Danalis, and J. Dongarra, An Efficient Distributed Randomized Algorithm for Solving Large Dense Symmetric Indefinite Linear Systems,” Parallel Computing, vol. 40, issue 7, pp. 213-223, July 2014.  (1.42 MB)
Benoit, A., S. K. Raina, and Y. Robert, Efficient Checkpoint/Verification Patterns,” International Journal on High Performance Computing Applications, July 2015.  (392.76 KB)
Voemel, C., S. Tomov, and J. Dongarra, Divide & Conquer on Hybrid GPU-Accelerated Multicore Systems,” SIAM Journal on Scientific Computing (submitted), August 2010.
Voemel, C., S. Tomov, and J. Dongarra, Divide and Conquer on Hybrid GPU-Accelerated Multicore Systems,” SIAM Journal on Scientific Computing, vol. 34(2), pp. C70-C82, April 2012.
Yamazaki, I., A. Ida, R. Yokota, and J. Dongarra, Distributed-Memory Lattice H-Matrix Factorization,” The International Journal of High Performance Computing Applications, August 2019.  (1.14 MB)
Boehmann, T. B., Distributed Storage in RIB,” ICL Tech Report, no. ICL-UT-03-01, March 2003.  (213.02 KB)
Hiroyasu, T., M. Miki, M. Sano, H. Shimosaka, S. Tsutsui, and J. Dongarra, Distributed Probablistic Model-Building Genetic Algorithm,” Lecture Notes in Computer Science, vol. 2723: Springer-Verlag, Heidelberg, pp. 1015-1028, January 2003.  (288.91 KB)
Dongarra, J., Z. Chen, G. Bosilca, and J. Langou, Disaster Survival Guide in Petascale Computing: An Algorithmic Approach,” in Petascale Computing: Algorithms and Applications (to appear): Chapman & Hall - CRC Press, 00-2007.  (260.18 KB)
Kurzak, J., P. Luszczek, I. Yamazaki, Y. Robert, and J. Dongarra, Design and Implementation of the PULSAR Programming System for Large Scale Computing,” Supercomputing Frontiers and Innovations, vol. 4, issue 1, 2017.
D'Azevedo, E., and J. Dongarra, The Design and Implementation of the Parallel Out of Core ScaLAPACK LU, QR, and Cholesky Factorization Routines,” Concurrency: Practice and Experience, vol. 12, no. 15, pp. 1481-1493, January 2000.  (374.18 KB)
Casanova, H., J. Plank, M. Beck, and J. Dongarra, Deploying Fault-tolerance and Task Migration with NetSolve,” Future Generation Computer Systems, vol. 15, no. 5-6: Elsevier, pp. 745-755, October 1999.  (236 KB)
Kurzak, J., H. Ltaeif, J. Dongarra, and R. M. Badia, Dependency-Driven Scheduling of Dense Matrix Factorizations on Shared-Memory Systems,” PPAM 2009, Poland, September 2009.
Bosilca, G., A. Bouteiller, A. Danalis, T. Herault, P. Luszczek, and J. Dongarra, Dense Linear Algebra on Distributed Heterogeneous Hardware with a Symbolic DAG Approach,” Scalable Computing and Communications: Theory and Practice: John Wiley & Sons, pp. 699-735, March 2013.  (1.01 MB)
Dongarra, J., J. Kurzak, P. Luszczek, and S. Tomov, Dense Linear Algebra on Accelerated Multicore Hardware,” High Performance Scientific Computing: Algorithms and Applications, London, UK, Springer-Verlag, 00-2012.
Pjesivac–Grbovic, J., G. Bosilca, G. Fagg, T. Angskun, and J. Dongarra, Decision Trees and MPI Collective Algorithm Selection Problem,” Euro-Par 2007, Rennes, France, Springer, pp. 105–115, August 2007.  (552.94 KB)
Dongarra, J., R. Graybill, W. Harrod, R. Lucas, E. Lusk, P. Luszczek, J. McMahon, A. Snavely, J. Vetter, K. Yelick, et al., DARPA's HPCS Program: History, Models, Tools, Languages,” in Advances in Computers, vol. 72: Elsevier, January 2008.  (3.61 MB)
Bosilca, G., A. Bouteiller, A. Danalis, T. Herault, P. Lemariner, and J. Dongarra, DAGuE: A generic distributed DAG Engine for High Performance Computing.,” Parallel Computing, vol. 38, no. 1-2: Elsevier, pp. 27-51, 00-2012.  (830.85 KB)
Gruetzmacher, T., T. Cojean, G. Flegar, F. Göbel, and H. Anzt, A Customized Precision Format Based on Mantissa Segmentation for Accelerating Sparse Linear Algebra,” Concurrency and Computation: Practice and Experience, vol. 40319, issue 262, January 2019.
Mellor-Crummey, J., P. Beckman, J. Dongarra, B. Miller, and K. Yelick, Creating Software Technology to Harness the Power of Leadership-class Computing Systems,” DOE SciDAC Review (to appear), June 2007.  (617.02 KB)
Agarwal, P., R. A.. Alexander, E.. Apra, S. Balay, A. S. Bland, J. Colgan, E. D'Azevedo, J. Dongarra, T. Dunigan, M. Fahey, et al., Cray X1 Evaluation Status Report,” Oak Ridge National Laboratory Report, vol. /-2004/13, January 2004.  (817.33 KB)
Aupy, G., A. Benoit, B. Goglin, L. Pottier, and Y. Robert, Co-scheduling HPC workloads on cache-partitioned CMP platforms,” Int. Journal of High Performance Computing Applications, To appear, 2019.  (930.28 KB)
Aupy, G., A. Benoit, S. Dai, L. Pottier, P. Raghavan, Y. Robert, and M. Shantharam, Co-Scheduling Amdhal Applications on Cache-Partitioned Systems,” International Journal of High Performance Computing Applications, vol. 32, issue 1, pp. 123–138, January 2018.  (672.52 KB)
Bouteiller, A., T. Herault, G. Bosilca, and J. Dongarra, Correlated Set Coordination in Fault Tolerant Message Logging Protocols,” Concurrency and Computation: Practice and Experience, vol. 25, issue 4, pp. 572-585, March 2013.  (636.68 KB)
Benoit, A., A. Cavelan, F. Cappello, P. Raghavan, Y. Robert, and H. Sun, Coping with Silent and Fail-Stop Errors at Scale by Combining Replication and Checkpointing,” Journal of Parallel and Distributed Computing, vol. 122, pp. 209–225, December 2018.  (837 KB)
Arnold, D., S. Vadhiyar, and J. Dongarra, On the Convergence of Computational and Data Grids,” Parallel Processing Letters, vol. 11, no. 2-3, pp. 187-202, January 2001.  (213.35 KB)
Lemariner, P., G. Bosilca, C. Coti, T. Herault, and J. Dongarra, Constructing Resilient Communication Infrastructure for Runtime Environments,” ParCo 2009, Lyon France, September 2009.
Bosilca, G., C. Coti, T. Herault, P. Lemariner, and J. Dongarra, Constructing Resiliant Communication Infrastructure for Runtime Environments in Advances in Parallel Computing,” Advances in Parallel Computing - Parallel Computing: From Multicores and GPU's to Petascale, vol. 19, pp. 441-451, 2010.
Tomov, S., J. Langou, A. Canning, L-W. Wang, and J. Dongarra, Conjugate-Gradient Eigenvalue Solvers in Computing Electronic Properties of Nanostructure Architectures,” International Journal of Computational Science and Engineering (to appear), January 2005.  (428.21 KB)
Tomov, S., J. Langou, J. Dongarra, A. Canning, and L-W. Wang, Conjugate-Gradient Eigenvalue Solvers in Computing Electronic Properties of Nanostructure Architectures,” International Journal of Computational Science and Engineering, vol. 2, no. 3/4, pp. 205-212, 00-2006.  (428.21 KB)

Pages