Publications

H
Wu, W., A. Bouteiller, G. Bosilca, M. Faverge, and J. Dongarra, “Hierarchical DAG scheduling for Hybrid Distributed Systems,” 29th IEEE International Parallel & Distributed Processing Symposium (IPDPS), Hyderabad, India, IEEE, May 2015.
Dongarra, J., M. Faverge, T. Herault, J. Langou, and Y. Robert, “Hierarchical QR factorization algorithms for multi-core cluster systems,” IPDPS 2012, the 26th IEEE International Parallel and Distributed Processing Symposium, Shanghai, China, IEEE Computer Society Press, May 2012.
Dongarra, J., M. Faverge, T. Herault, J. Langou, and Y. Robert, “Hierarchical QR factorization algorithms for multi-core cluster systems,” University of Tennessee Computer Science Technical Report (also Lawn 257), no. UT-CS-11-684, October 2011.
Dongarra, J., M. Faverge, T. Herault, M. Jacquelin, J. Langou, and Y. Robert, “Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems,” Parallel Computing, vol. 39, issue 4-5, pp. 212-232, May 2013.
Ma, T., G. Bosilca, A. Bouteiller, and J. Dongarra, “HierKNEM: An Adaptive Framework for Kernel-Assisted and Topology-Aware Collective Communications on Many-core Clusters,” IPDPS 2012 (Best Paper), Shanghai, China, May 2012.
Ltaeif, H., P. Luszczek, and J. Dongarra, “High Performance Bidiagonal Reduction using Tile Algorithms on Homogeneous Multicore Architectures,” University of Tennessee Computer Science Technical Report, UT-CS-11-673, (also Lawn 247), May 2011.
Ltaeif, H., P. Luszczek, and J. Dongarra, “High Performance Bidiagonal Reduction using Tile Algorithms on Homogeneous Multicore Architectures,” ACM Transactions on Mathematical Software (TOMS), vol. 39, issue 3, no. 16, 2013.
Palma, J., J. Dongarra, and V. Hernández, “High Performance Computing for Computational Science,” Lecture Notes in Computer Science, vol. 2565, VECPAR 2002, 5th International Conference June 26-28, 2002, Springer-Verlag, Berlin, January 2003.
Dongarra, J., and A. J. van der Steen, “High Performance Computing Systems: Status and Outlook,” Acta Numerica, vol. 21, Cambridge, UK, Cambridge University Press, pp. 379-474, May 2012.
Dongarra, J., H. Meuer, H. D. Simon, and E. Strohmaier, “High Performance Computing Today,” FOMMS 2000: Foundations of Molecular Modeling and Simulation Conference (to appear), January 2000.
Dongarra, J., H. Meuer, H. D. Simon, and E. Strohmaier, “High Performance Computing Trends,” HERMIS, vol. 2, pp. 155-163, November 2001.
Dongarra, J., “High Performance Computing Trends and Self Adapting Numerical Software,” Lecture Notes in Computer Science, High Performance Computing, 5th International Symposium ISHPC, vol. 2858, Tokyo-Odaiba, Japan, Springer-Verlag, Heidelberg, pp. 1-9, January 2003.
Dongarra, J., “High Performance Computing Trends, Supercomputers, Clusters, and Grids,” Information Processing Society of Japan Symposium Series, vol. 2003, no. 14, pp. 55-58, January 2003.
Dongarra, J., M. Heroux, and P. Luszczek, “High Performance Conjugate Gradient Benchmark: A new Metric for Ranking High Performance Computing Systems,” International Journal of High Performance Computing Applications, vol. 30, issue 1, pp. 3-10, February 2016.
Du, P., P. Luszczek, and J. Dongarra, “High Performance Dense Linear System Solver with Resilience to Multiple Soft Errors,” ICCS 2012, Omaha, NE, June 2012.
Du, P., P. Luszczek, and J. Dongarra, “High Performance Dense Linear System Solver with Soft Error Resilience,” IEEE Cluster 2011, Austin, TX, September 2011.
Dongarra, J., and P. Luszczek, “High Performance Development for High End Computing with Python Language Wrapper (PLW),” International Journal for High Performance Computer Applications, vol. 21, no. 3, pp. 360-369, 2007.
Dongarra, J., M. Faverge, H. Ltaeif, and P. Luszczek, “High Performance Matrix Inversion Based on LU Factorization for Multicore Architectures,” Proceedings of MTAGS11, Seattle, WA, November 2011.
Yamazaki, I., S. Nooshabadi, S. Tomov, and J. Dongarra, “High Performance Realtime Convex Solver for Embedded Systems,” University of Tennessee Computer Science Technical Report, no. UT-EECS-16-745, October 2016.
Haidar, A., A. Abdelfattah, S. Tomov, and J. Dongarra, “High-performance Cholesky Factorization for GPU-only Execution,” Proceedings of the General Purpose GPUs (GPGPU-10), Austin, TX, ACM, February 2017.
Dongarra, J., N. J. Higham, M. R. Dennis, P. Glendinning, P. A. Martin, F. Santosa, and J. Tanner, “High-Performance Computing,” The Princeton Companion to Applied Mathematics, Princeton, New Jersey, Princeton University Press, pp. 839-842, 2015.
Dongarra, J., M. Heroux, and P. Luszczek, “High-Performance Conjugate-Gradient Benchmark: A New Metric for Ranking High-Performance Computing Systems,” The International Journal of High Performance Computing Applications, 2015.
Masliah, I., A. Abdelfattah, A. Haidar, S. Tomov, J. Falcou, and J. Dongarra, “High-performance Matrix-matrix Multiplications of Very Small Matrices,” 22nd International European Conference on Parallel and Distributed Computing (Euro-Par'16), Grenoble, France, Springer International Publishing, August 2016.
Abdelfattah, A., M. Baboulin, V. Dobrev, J. Dongarra, C. Earl, J. Falcou, A. Haidar, I. Karlin, T. Kolev, I. Masliah, et al., “High-Performance Tensor Contractions for GPUs,” University of Tennessee Computer Science Technical Report, no. UT-EECS-16-738: University of Tennessee, January 2016.
Abdelfattah, A., M. Baboulin, V. Dobrev, J. Dongarra, C. Earl, J. Falcou, A. Haidar, I. Karlin, T. Kolev, I. Masliah, et al., “High-Performance Tensor Contractions for GPUs,” International Conference on Computational Science (ICCS'16), San Diego, CA, June 2016.
Jagode, H., J. Dongarra, S. Alam, J. Vetter, W. Spear, and A. Maloney, “A Holistic Approach for Performance Measurement and Analysis for Petascale Applications,” ICCS 2009 Joint Workshop: Tools for Program Development and Analysis in Computational Science and Software Engineering for Large-Scale Computing, vol. 2009, Baton Rouge, Louisiana, Springer-Verlag Berlin Heidelberg 2009, pp. 686-695, May 2009.
Dongarra, J., and P. Luszczek, “How Elegant Code Evolves With Hardware: The Case Of Gaussian Elimination,” in Beautiful Code Leading Programmers Explain How They Think: O'Reilly Media, Inc., June 2007.
Dongarra, J., and P. Luszczek, “How Elegant Code Evolves With Hardware: The Case Of Gaussian Elimination,” in Beautiful Code Leading Programmers Explain How They Think (Chapter 14), pp. 243-282, January 2008.
Dongarra, J., and P. Luszczek, “HPC Challenge: Design, History, and Implementation Highlights,” On the Road to Exascale Computing: Contemporary Architectures in High Performance Computing (to appear): Chapman & Hall/CRC Press, 2012.
Dongarra, J., and P. Luszczek, “HPC Challenge: Design, History, and Implementation Highlights,” Contemporary High Performance Computing: From Petascale Toward Exascale, Boca Raton, FL, Taylor and Francis, 2013.
Luszczek, P., D. Bailey, J. Dongarra, J. Kepner, R. Lucas, R. Rabenseifner, and D. Takahashi, “The HPC Challenge (HPCC) Benchmark Suite,” SC06 Conference Tutorial, Tampa, Florida, IEEE, November 2006.
Haidar, A., J. Dongarra, K. Kabir, M. Gates, P. Luszczek, S. Tomov, and Y. Jia, “HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi,” Scientific Programming, vol. 23, issue 1, January 2015.
Dongarra, J., M. Heroux, and P. Luszczek, “HPCG Benchmark: a New Metric for Ranking High Performance Computing Systems,” University of Tennessee Computer Science Technical Report, no. UT-EECS-15-736: University of Tennessee, January 2015.
Dongarra, J., J. Demmel, P. Husbands, and P. Luszczek, “HPCS Library Study Effort,” University of Tennessee Computer Science Technical Report, UT-CS-08-617, January 2008.
Dongarra, J., “The HPL Benchmark: Past, Present & Future,” ISC High Performance, Frankfurt, Germany, July 2016.
Ltaeif, H., S. Tomov, R. Nath, and J. Dongarra, “Hybrid Multicore Cholesky Factorization with Multiple GPU Accelerators,” IEEE Transaction on Parallel and Distributed Systems (submitted), March 2010.
Lukarski, D., H. Anzt, S. Tomov, and J. Dongarra, “Hybrid Multi-Elimination ILU Preconditioners on GPUs,” International Heterogeneity in Computing Workshop (HCW), IPDPS 2014, Phoenix, AZ, IEEE, May 2014.
Agullo, E., C. Augonnet, J. Dongarra, H. Ltaeif, R. Namyst, S. Thibault, and S. Tomov, “A Hybridization Methodology for High-Performance Linear Algebra Software for GPUs,” in GPU Computing Gems, Jade Edition, vol. 2: Elsevier, pp. 473-484, 2011.
Dong, T., V. Dobrev, T. Kolev, R. Rieben, S. Tomov, and J. Dongarra, “Hydrodynamic Computation with Hybrid Programming on CPU-GPU Clusters,” University of Tennessee Computer Science Technical Report, no. UT-CS-13-714, July 2013.
I
Ma, T., A. Bouteiller, G. Bosilca, and J. Dongarra, “Impact of Kernel-Assisted MPI Communication over Scientific Applications: CPMD and FFTW,” 18th EuroMPI, Santorini, Greece, Springer, pp. 247-254, September 2011.
Buttari, A., J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, and S. Tomov, “The Impact of Multicore on Math Software,” PARA 2006, Umea, Sweden, June 2006.
Youseff, L., K. Seymour, H. You, J. Dongarra, and R. Wolski, “The Impact of Paravirtualized Memory Hierarchy on Linear Algebra Computational Kernels and Software,” ACM/IEEE International Symposium on High Performance Distributed Computing, Boston, MA, June 2008.
Kurzak, J., H. Anzt, M. Gates, and J. Dongarra, “Implementation and Tuning of Batched Cholesky Factorization and Solve for NVIDIA GPUs,” IEEE Transactions on Parallel and Distributed Systems, no. 1045-9219, November 2015.
Keller, R., G. Bosilca, G. Fagg, M. Resch, and J. Dongarra, “Implementation and Usage of the PERUSE-Interface in Open MPI,” Euro PVM/MPI 2006, Bonn, Germany, September 2006.
Kurzak, J., and J. Dongarra, “Implementation of Mixed Precision in Solving Systems of Linear Equations on the Cell Processor,” Concurrency and Computation: Practice and Experience, vol. 19, no. 10, pp. 1371-1385, July 2007.
Abdelfattah, A., M. Gates, J. Kurzak, P. Luszczek, and J. Dongarra, “Implementation of the C++ API for Batch BLAS,” SLATE Working Notes, no. 7, ICL-UT-18-04: Innovative Computing Laboratory, University of Tennessee, June 2018.
Kurzak, J., and J. Dongarra, “Implementation of the Mixed-Precision High Performance LINPACK Benchmark on the CELL Processor,” University of Tennessee Computer Science Tech Report, no. UT-CS-06-580, LAPACK Working Note #177, September 2006.
Kurzak, J., R. Nath, P. Du, and J. Dongarra, “An Implementation of the Tile QR Factorization for a GPU and Multiple CPUs,” Applied Parallel and Scientific Computing, vol. 7133, pp. 248-257, 2012.
Yamazaki, I., D. Becker, J. Dongarra, A. Druinsky, I. Peled, S. Toledo, G. Ballard, J. Demmel, and O. Schwartz, “Implementing a Blocked Aasen’s Algorithm with a Dynamic Scheduler on Multicore Architectures,” IPDPS 2013 (submitted), Boston, MA, 2013.
