Publications

Export 946 results:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
D
Dongarra, J., and P. Raghavan, A New Recursive Implementation of Sparse Cholesky Factorization,” Proceedings of 16th IMACS World Congress 2000 on Scientific Computing, Applications Mathematics and Simulation, Lausanne, Switzerland, August 2000.
Dongarra, J., G. H. Golub, E. Grosse, C. Moler, and K. Moore, Netlib and NA-Net: Building a Scientific Computing Community,” IEEE Annals of the History of Computing, vol. 30, no. 2, pp. 30-41, January 2008.  (352.71 KB)
Dongarra, J., J-F. Pineau, Y. Robert, Z. Shi, and F. Vivien, Revisiting Matrix Product on Master-Worker Platforms,” International Journal of Foundations of Computer Science (IJFCS) (accepted), 00-2007.  (248.66 KB)
Dongarra, J., Performance of Various Computers Using Standard Linear Equations Software (Linpack Benchmark Report),” University of Tennessee Computer Science Technical Report, no. CS-89-85, January 2001.  (6.42 MB)
Dongarra, J., S. Moore, P. Mucci, K. Seymour, and H. You, Accurate Cache and TLB Characterization Using hardware Counters,” Proceedings of ICCS 2004 (to appear), Krakow Poland, January 2004.  (167.1 KB)
Dongarra, J., M. Gates, J. Kurzak, P. Luszczek, and Y. Tsai, Autotuning Numerical Dense Linear Algebra for Batched Computation With GPU Hardware Accelerators,” Proceedings of the IEEE, vol. 106, issue 11, pp. 2040–2055, November 2018.
Dongarra, J., G. Fagg, R. Hempel, and D. W. Walker, Message Passing Software Systems,” Encyclopedia of Electrical and Engineering, Supplement 1: John Wiley & Sons, Inc., 00-2000.  (289.38 KB)
Dongarra, J., Performance of Various Computers Using Standard Linear Equations Software (Linpack Benchmark Report),” University of Tennessee Computer Science Technical Report, CS-89-85, January 2008.  (6.42 MB)
Doolin, D., J. Dongarra, and K. Seymour, JLAPACK - Compiling LAPACK Fortran to Java,” Scientific Programming, vol. 7, no. 2, pp. 111-138, October 2002.  (307.46 KB)
Du, P., R. Weber, P. Luszczek, S. Tomov, G. D. Peterson, and J. Dongarra, From CUDA to OpenCL: Towards a Performance-portable Solution for Multiplatform GPU Programming,” Parallel Computing (submitted), August 2010.
Du, P., P. Luszczek, and J. Dongarra, OpenCL Evaluation for Numerical Linear Algebra Library Development,” Symposium on Application Accelerators in High-Performance Computing (SAAHPC '10), Knoxville, TN, July 2010.  (2.69 MB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System with GPGPU,” Journal of Computational Science, vol. 4, issue 6, pp. 457–464, November 2013.  (995.45 KB)
Du, P., R. Weber, P. Luszczek, S. Tomov, G. D. Peterson, and J. Dongarra, From CUDA to OpenCL: Towards a Performance-portable Solution for Multi-platform GPU Programming,” Parallel Computing (submitted), 00-2010.
Du, P., A. Bouteiller, G. Bosilca, T. Herault, and J. Dongarra, Algorithm-Based Fault Tolerance for Dense Matrix Factorization,” Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, New Orleans, LA, USA, ACM, pp. 225-234, February 2012.  (865.79 KB)
Du, P., P. Luszczek, and J. Dongarra, High Performance Dense Linear System Solver with Soft Error Resilience,” IEEE Cluster 2011, Austin, TX, September 2011.  (1.27 MB)
Du, P., P. Luszczek, and J. Dongarra, High Performance Dense Linear System Solver with Resilience to Multiple Soft Errors,” ICCS 2012, Omaha, NE, June 2012.  (1.27 MB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System with GPGPU,” Journal of Computational Science, Seattle, WA, Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems at SC11, November 2011.  (965.88 KB)
Du, P., A. Bouteiller, G. Bosilca, T. Herault, and J. Dongarra, Algorithm-based Fault Tolerance for Dense Matrix Factorizations,” University of Tennessee Computer Science Technical Report, no. UT-CS-11-676, Knoxville, TN, August 2011.  (865.79 KB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Mixed-Tool Performance Analysis on Hybrid Multicore Architectures,” First International Workshop on Parallel Software Tools and Tool Infrastructures (PSTI 2010), San Diego, CA, September 2010.  (1.24 MB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System,” University of Tennessee Computer Science Technical Report, no. UT-CS-11-675, Knoxville, TN, July 2011.  (1.39 MB)
Du, P., P. Luszczek, S. Tomov, and J. Dongarra, Soft Error Resilient QR Factorization for Hybrid System,” UT-CS-11-675 (also LAPACK Working Note #252), no. ICL-CS-11-675, July 2011.  (1.39 MB)
Du, P., M. Parsons, E. Fuentes, S-L. Shaw, and J. Dongarra, Tuning Principal Component Analysis for GRASS GIS on Multi-core and GPU Architectures,” FOSS4G 2010, Barcelona, Spain, September 2010.  (1.57 MB)
Du, P., S. Tomov, and J. Dongarra, Providing GPU Capability to LU and QR within the ScaLAPACK Framework,” University of Tennessee Computer Science Technical Report (also LAWN 272), no. UT-CS-12-699, September 2012.  (7.48 MB)
Du, P., R. Weber, P. Luszczek, S. Tomov, G. D. Peterson, and J. Dongarra, From CUDA to OpenCL: Towards a Performance-portable Solution for Multi-platform GPU Programming,” Parallel Computing, vol. 38, no. 8, pp. 391-407, August 2012.  (1.64 MB)
E
Eberius, D., T. Patinyasakdikul, and G. Bosilca, Using Software-Based Performance Counters to Expose Low-Level Open MPI Performance Information,” EuroMPI, Chicago, IL, ACM, September 2017.  (745.58 KB)
Eidson, T., V. Eijkhout, and J. Dongarra, Improvements in the Efficient Composition of Applications,” IPDPS 2004, NGS Workshop (to appear), Sante Fe, 00-2004.  (42.85 KB)
Eidson, T., J. Dongarra, and V. Eijkhout, Applying Aspect-Oriented Programming Concepts to a Component-based Programming Model,” IPDPS 2003, Workshop on NSF-Next Generation Software, Nice, France, March 2003.  (66.99 KB)
Eijkhout, V., Numerical Metadata API Reference,” Innovative Computing Laboratory Technical Report, February 2007.  (454.79 KB)
Eijkhout, V., and E. Fuentes, A Proposed Standard for Matrix Metadata,” Innovative Computing Laboratory Technical Report, no. ICL-UT-03-02, Submitted to ACM TOMS, November 2003.  (13.39 KB)
Eijkhout, V., Automatic Determination of Matrix-Blocks,” Lapack Working Note 151, University of Tennessee Computer Science Technical Report, no. UT-CS-01-458, January 2001.  (1.15 MB)
Eijkhout, V., The 'Weighted Modification' Incomplete Factorisation Method,” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-436, December 1999.  (198.71 KB)
Eijkhout, V., E. Fuentes, T. Eidson, and J. Dongarra, The Component Structure of a Self-Adapting Numerical Software System,” International Journal of Parallel Programming, vol. 33, no. 2, June 2005.  (64.88 KB)
Eijkhout, V., Polynomial Acceleration of Optimised Multi-grid Smoothers; Basic Theory,” ICL Technical Report, vol. 156, no. ICL-UT-02-03, January 2002.  (100.66 KB)
Eijkhout, V., On the Existence Problem of Incomplete Factorisation Methods,” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-435, December 1999.  (222.2 KB)
Elwasif, W., M. Beck, and J. Plank, IBP - Internet Backplane Protocol: Infrastructure for Distributed Storage (V O.2),” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-430, February 1999.  (37.72 KB)
Emad, N., S. A. S. Fazeli, and J. Dongarra, An Asynchronous Algorithm on NetSolve Global Computing System,” PRiSM - Laboratoire de recherche en informatique, Université de Versailles St-Quentin Technical Report, March 2004.  (377.33 KB)
F
Fagg, G., E. Gabriel, Z. Chen, T. Angskun, G. Bosilca, J. Pjesivac–Grbovic, and J. Dongarra, Process Fault-Tolerance: Semantics, Design and Applications for High Performance Computing,” International Journal for High Performance Applications and Supercomputing (to appear), April 2004.  (186.9 KB)
Fagg, G., A. Bukovsky, and J. Dongarra, HARNESS and Fault Tolerant MPI,” Parallel Computing, vol. 27, no. 11, pp. 1479-1496, January 2001.  (164.2 KB)
Fagg, G., and J. Dongarra, Building and using a Fault Tolerant MPI implementation,” International Journal of High Performance Applications and Supercomputing (to appear), 00-2004.
Fagg, G., A. Bukovsky, and J. Dongarra, Fault Tolerant MPI for the HARNESS Meta-Computing System,” Proceedings of International Conference of Computational Science - ICCS 2001, Lecture Notes in Computer Science, vol. 2073, Berlin, Springer Verlag, pp. 355-366, 00-2001.
Fagg, G., and J. Dongarra, HARNESS Fault Tolerant MPI Design, Usage and Performance Issues,” Future Generation Computer Systems, vol. 18, no. 8, pp. 1127-1142, January 2002.  (403.41 KB)
Fagg, G., E. Gabriel, G. Bosilca, T. Angskun, Z. Chen, J. Pjesivac–Grbovic, K. London, and J. Dongarra, Extending the MPI Specification for Process Fault Tolerance on High Performance Computing Systems,” Proceedings of ISC2004 (to appear), Heidelberg, Germany, June 2004.  (548.38 KB)
Fagg, G., and J. Dongarra, FT-MPI: Fault Tolerant MPI, Supporting Dynamic Applications in a Dynamic World,” Lecture Notes in Computer Science: Proceedings of EuroPVM-MPI 2000, (Hungary: Springer Verlag, 2000), pp. V1908,346-353, January 2000.  (51.95 KB)
Fagg, G., T. Angskun, G. Bosilca, J. Pjesivac–Grbovic, and J. Dongarra, Scalable Fault Tolerant MPI: Extending the Recovery Algorithm,” Proceedings of 12th European Parallel Virtual Machine and Message Passing Interface Conference - Euro PVM/MPI, vol. 3666, Sorrento (Naples) , Italy, Springer-Verlag Berlin, pp. 67, September 2005.  (144.86 KB)
Fagg, G., E. Gabriel, Z. Chen, T. Angskun, G. Bosilca, A. Bukovsky, and J. Dongarra, Fault Tolerant Communication Library and Applications for High Performance Computing,” Los Alamos Computer Science Institute (LACSI) Symposium 2003 (presented), Santa Fe, NM, October 2003.  (146.05 KB)
Fagg, G., J. Pjesivac–Grbovic, G. Bosilca, T. Angskun, and J. Dongarra, Flexible collective communication tuning architecture applied to Open MPI,” 2006 Euro PVM/MPI (submitted), Bonn, Germany, January 2006.  (206.58 KB)
Fagg, G., E. Gabriel, and M. Resch, Parallel IO Support for Meta-Computing Applications: MPI_Connect IO Applied to PACX-MPI,” 8th European PVM/MPI User's Group Meeting, Lecture Notes in Computer Science, vol. 2131, Greece, Springer Verlag, Berlin, September 2001.  (129.3 KB)
Fagg, G., K. Moore, and J. Dongarra, Scalable Networked Information Processing Environment (SNIPE),” Journal on Future Generation Computer Systems, vol. 15, no. 5/6, pp. 595-605, January 1999.  (189.21 KB)
Fang, A., A. Cavelan, Y. Robert, and A. Chien, Resilience for Stencil Computations with Latent Errors,” International Conference on Parallel Processing (ICPP), Bristol, UK, IEEE Computer Society Press, August 2017.  (1.19 MB)
Faverge, M., J. Herrmann, J. Langou, B. Lowery, Y. Robert, and J. Dongarra, Mixing LU-QR Factorization Algorithms to Design High-Performance Dense Linear Algebra Solvers,” Journal of Parallel and Distributed Computing, vol. 85, pp. 32-46, November 2015.  (5.06 MB)

Pages