Publications

Export 995 results:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
D
Dongarra, J., M. Gates, Y. Jia, K. Kabir, P. Luszczek, and S. Tomov, MAGMA MIC: Linear Algebra Library for Intel Xeon Phi Coprocessors , Salt Lake City, UT, The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC12), November 2012.  (6.4 MB)
Dongarra, J., M. Faverge, T. Herault, J. Langou, and Y. Robert, Hierarchical QR Factorization Algorithms for Multi-Core Cluster Systems,” IPDPS 2012, the 26th IEEE International Parallel and Distributed Processing Symposium, Shanghai, China, IEEE Computer Society Press, May 2012.  (405.71 KB)
Dongarra, J., S. Hammarling, N. Higham, S. Relton, P. Valero-Lara, and M. Zounon, The Design and Performance of Batched BLAS on Modern High-Performance Computing Systems,” International Conference on Computational Science (ICCS 2017), Zürich, Switzerland, Elsevier, June 2017.
Dongarra, J., and A. Lastovetsky, An Overview of Heterogeneous High Performance and Grid Computing,” Engineering the Grid (to appear): Nova Science Publishers, Inc., 00-2004.  (199.93 KB)
Dongarra, J., M. Gates, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, and I. Yamazaki, Accelerating Numerical Dense Linear Algebra Calculations with GPUs,” Numerical Computations with GPUs: Springer International Publishing, pp. 3-28, 2014.  (1.06 MB)
Dongarra, J., S. Moore, and A. Trefethen, Numerical Libraries and Tools for Scalable Parallel Cluster Computing,” International Journal of High Performance Applications and Supercomputing, vol. 15, no. 2, pp. 175-180, January 2001.  (37.38 KB)
Dongarra, J., A Not So Simple Matter of Software,” NCSA Access Online: NCSA, 00-2005.  (457.69 KB)
Dongarra, J., S. Moore, G. D. Peterson, S. Tomov, J. Allred, V. Natoli, and D. Richie, Exploring New Architectures in Accelerating CFD for Air Force Applications,” Proceedings of the DoD HPCMP User Group Conference, Seattle, Washington, January 2008.  (492.86 KB)
Dongarra, J., and V. Eijkhout, Numerical Linear Algebra Algorithms and Software,” Journal of Computational and Applied Mathematics, vol. 123, no. 1-2, pp. 489-514, October 1999.  (258.62 KB)
Dongarra, J., T. Herault, and Y. Robert, Fault Tolerance Techniques for High-performance Computing,” University of Tennessee Computer Science Technical Report (also LAWN 289), no. UT-EECS-15-734: University of Tennessee, May 2015.
Dongarra, J., H. Meuer, H. D. Simon, and E. Strohmaier, High Performance Computing Trends,” HERMIS, vol. 2, pp. 155-163, November 2001.
Dongarra, J., T. Herault, and Y. Robert, Revisiting the Double Checkpointing Algorithm,” 15th Workshop on Advances in Parallel and Distributed Computational Models, at the IEEE International Parallel & Distributed Processing Symposium, Boston, MA, May 2013.  (591.1 KB)
Dongarra, J., M. Heroux, and P. Luszczek, High Performance Conjugate Gradient Benchmark: A new Metric for Ranking High Performance Computing Systems,” International Journal of High Performance Computing Applications, vol. 30, issue 1, pp. 3 - 10, February 2016.  (277.51 KB)
Dongarra, J., High Performance Computing Trends, Supercomputers, Clusters, and Grids,” Information Processing Society of Japan Symposium Series, vol. 2003, no. 14, pp. 55-58, January 2003.
Dongarra, J., E. Jeannot, E. Saule, and Z. Shi, Bi-objective Scheduling Algorithms for Optimizing Makespan and Reliability on Heterogeneous Systems,” 19th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA) (submitted), San Diego, CA, June 2007.  (223.82 KB)
Dongarra, J., V. Eijkhout, and H. van der Vorst, An Iterative Solver Benchmark,” Scientific Programming (to appear), 00-2002.  (142.67 KB)
Dongarra, J., and P. Beckman, International Exascale Software Project Roadmap v1.0,” University of Tennessee Computer Science Technical Report, UT-CS-10-654, May 2010.  (719.74 KB)
Dongarra, J., T. Dong, M. Gates, A. Haidar, S. Tomov, and I. Yamazaki, MAGMA: A New Generation of Linear Algebra Library for GPU and Multicore Architectures , Salt Lake City, UT, The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC12), Presentation, November 2012.  (4.69 MB)
Dongarra, J., M. Gates, A. Haidar, J. Kurzak, P. Luszczek, P. Wu, I. Yamazaki, A. YarKhan, M. Abalenkovs, N. Bagherpour, et al., PLASMA: Parallel Linear Algebra Software for Multicore Using OpenMP,” ACM Transactions on Mathematical Software (to appear), 2019.  (7.5 MB)
Dongarra, J., P. Luszczek, and A. Petitet, The LINPACK Benchmark: Past, Present, and Future,” Concurrency: Practice and Experience, vol. 15, pp. 803-820, 00-2008.  (94.86 KB)
Dongarra, J., and V. Eijkhout, Finite-choice Algorithm Optimization in Conjugate Gradients (LAPACK Working Note 159),” University of Tennessee Computer Science Technical Report, UT-CS-03-502, January 2003.  (64.52 KB)
Dongarra, J., G. H. Golub, C. Moler, and K. Moore, Netlib and NA-Net: building a scientific computing community,” In IEEE Annals of the History of Computing (to appear), August 2007.  (352.71 KB)
Dongarra, J., V. Eijkhout, and P. Luszczek, Recursive approach in sparse matrix LU factorization,” Proceedings of 1st SGI Users Conference, Cracow, Poland (ACC Cyfronet UMM, 2000), pp. 409-418, January 2000.  (176.14 KB)
Dongarra, J., Report on the TianHe-2A System,” Innovative Computing Laboratory Technical Report, no. ICL-UT-17-04: University of Tennessee, September 2017.  (7.15 MB)
Dongarra, J., A. Maloney, S. Moore, P. Mucci, and S. Shende, Performance Instrumentation and Measurement for Terascale Systems,” ICCS 2003 Terascale Workshop, Melbourne, Australia, June 2003.  (5.36 MB)
Doolin, D., J. Dongarra, and K. Seymour, JLAPACK - Compiling LAPACK Fortran to Java,” Scientific Programming, vol. 7, no. 2, pp. 111-138, October 2002.  (307.46 KB)
Du, P., A. Bouteiller, G. Bosilca, T. Herault, and J. Dongarra, Algorithm-Based Fault Tolerance for Dense Matrix Factorization,” Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, New Orleans, LA, USA, ACM, pp. 225-234, February 2012.  (865.79 KB)
Du, P., P. Luszczek, and J. Dongarra, High Performance Dense Linear System Solver with Resilience to Multiple Soft Errors,” ICCS 2012, Omaha, NE, June 2012.  (1.27 MB)
Du, P., P. Luszczek, and J. Dongarra, High Performance Dense Linear System Solver with Soft Error Resilience,” IEEE Cluster 2011, Austin, TX, September 2011.  (1.27 MB)
Du, P., S. Tomov, and J. Dongarra, Providing GPU Capability to LU and QR within the ScaLAPACK Framework,” University of Tennessee Computer Science Technical Report (also LAWN 272), no. UT-CS-12-699, September 2012.  (7.48 MB)
Du, P., A. Bouteiller, G. Bosilca, T. Herault, and J. Dongarra, Algorithm-based Fault Tolerance for Dense Matrix Factorizations,” University of Tennessee Computer Science Technical Report, no. UT-CS-11-676, Knoxville, TN, August 2011.  (865.79 KB)
Du, P., R. Weber, P. Luszczek, S. Tomov, G. D. Peterson, and J. Dongarra, From CUDA to OpenCL: Towards a Performance-portable Solution for Multi-platform GPU Programming,” Parallel Computing, vol. 38, no. 8, pp. 391-407, August 2012.  (1.64 MB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Mixed-Tool Performance Analysis on Hybrid Multicore Architectures,” First International Workshop on Parallel Software Tools and Tool Infrastructures (PSTI 2010), San Diego, CA, September 2010.  (1.24 MB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System with GPGPU,” Journal of Computational Science, Seattle, WA, Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems at SC11, November 2011.  (965.88 KB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System with GPGPU,” Journal of Computational Science, vol. 4, issue 6, pp. 457–464, November 2013.  (995.45 KB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System,” University of Tennessee Computer Science Technical Report, no. UT-CS-11-675, Knoxville, TN, July 2011.  (1.39 MB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System,” UT-CS-11-675 (also LAPACK Working Note #252), no. ICL-CS-11-675, July 2011.  (1.39 MB)
Du, P., M. Parsons, E. Fuentes, S-L. Shaw, and J. Dongarra, Tuning Principal Component Analysis for GRASS GIS on Multi-core and GPU Architectures,” FOSS4G 2010, Barcelona, Spain, September 2010.  (1.57 MB)
Du, P., P. Luszczek, and J. Dongarra, OpenCL Evaluation for Numerical Linear Algebra Library Development,” Symposium on Application Accelerators in High-Performance Computing (SAAHPC '10), Knoxville, TN, July 2010.  (2.69 MB)
E
Eberius, D., T. Patinyasakdikul, and G. Bosilca, Using Software-Based Performance Counters to Expose Low-Level Open MPI Performance Information,” EuroMPI, Chicago, IL, ACM, September 2017.  (745.58 KB)
Eidson, T., J. Dongarra, and V. Eijkhout, Applying Aspect-Oriented Programming Concepts to a Component-based Programming Model,” IPDPS 2003, Workshop on NSF-Next Generation Software, Nice, France, March 2003.  (66.99 KB)
Eidson, T., V. Eijkhout, and J. Dongarra, Improvements in the Efficient Composition of Applications,” IPDPS 2004, NGS Workshop (to appear), Sante Fe, 00-2004.  (42.85 KB)
Eijkhout, V., E. Fuentes, T. Eidson, and J. Dongarra, The Component Structure of a Self-Adapting Numerical Software System,” International Journal of Parallel Programming, vol. 33, no. 2, June 2005.  (64.88 KB)
Eijkhout, V., and E. Fuentes, A Proposed Standard for Matrix Metadata,” Innovative Computing Laboratory Technical Report, no. ICL-UT-03-02, Submitted to ACM TOMS, November 2003.  (13.39 KB)
Eijkhout, V., Automatic Determination of Matrix-Blocks,” Lapack Working Note 151, University of Tennessee Computer Science Technical Report, no. UT-CS-01-458, January 2001.  (1.15 MB)
Eijkhout, V., The 'Weighted Modification' Incomplete Factorisation Method,” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-436, December 1999.  (198.71 KB)
Eijkhout, V., Polynomial Acceleration of Optimised Multi-grid Smoothers; Basic Theory,” ICL Technical Report, vol. 156, no. ICL-UT-02-03, January 2002.  (100.66 KB)
Eijkhout, V., On the Existence Problem of Incomplete Factorisation Methods,” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-435, December 1999.  (222.2 KB)
Eijkhout, V., Numerical Metadata API Reference,” Innovative Computing Laboratory Technical Report, February 2007.  (454.79 KB)
Elwasif, W., M. Beck, and J. Plank, IBP - Internet Backplane Protocol: Infrastructure for Distributed Storage (V O.2),” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-430, February 1999.  (37.72 KB)

Pages