Publications

Export 995 results:
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
D
Demmel, J., and J. Dongarra, LAPACK 2005 Prospectus: Reliable and Scalable Software for Linear Algebra Computations on High End Computers : LAPACK Working Note 164, January 2005.  (172.59 KB)
Dempsey, B., and D. Weiss, Towards An Efficient, Scalable Replication Mechanism for the I2-DSI Project,” University of North Carolina School of Library and Information Science Technical Report, no. TR-1999-01, January 1999.
Dewolfs, D., J. Broeckhove, V. Sunderam, and G. Fagg, FT-MPI, Fault-Tolerant Metacomputing and Generic Name Services: A Case Study,” Lecture Notes in Computer Science, vol. 4192, no. ICL-UT-06-14: Springer Berlin / Heidelberg, pp. 133-140, 00-2006.  (362.44 KB)
Donfack, S., J. Dongarra, M. Faverge, M. Gates, J. Kurzak, P. Luszczek, and I. Yamazaki, A Survey of Recent Developments in Parallel Implementations of Gaussian Elimination,” Concurrency and Computation: Practice and Experience, vol. 27, issue 5, pp. 1292-1309, April 2015.  (783.45 KB)
Donfack, S., S. Tomov, and J. Dongarra, Dynamically balanced synchronization-avoiding LU factorization with multicore and GPUs,” University of Tennessee Computer Science Technical Report, no. ut-cs-13-713, July 2013.  (659.77 KB)
Donfack, S., S. Tomov, and J. Dongarra, Dynamically balanced synchronization-avoiding LU factorization with multicore and GPUs,” Fourth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), IPDPS 2014, May 2014.  (490.08 KB)
Donfack, S., J. Dongarra, M. Faverge, M. Gates, J. Kurzak, P. Luszczek, and I. Yamazaki, On Algorithmic Variants of Parallel Gaussian Elimination: Comparison of Implementations in Terms of Performance and Numerical Properties,” University of Tennessee Computer Science Technical Report, no. UT-CS-13-715, July 2013, 2012.  (358.98 KB)
Donfack, S., S. Tomov, and J. Dongarra, Performance evaluation of LU factorization through hardware counter measurements,” University of Tennessee Computer Science Technical Report, no. ut-cs-12-700, October 2012.  (794.82 KB)
Dong, T., V. Dobrev, T. Kolev, R. Rieben, S. Tomov, and J. Dongarra, A Step towards Energy Efficient Computing: Redesigning A Hydrodynamic Application on CPU-GPU,” IPDPS 2014, Phoenix, AZ, IEEE, May 2014.  (1.01 MB)
Dong, T., A. Haidar, S. Tomov, and J. Dongarra, A Fast Batched Cholesky Factorization on a GPU,” International Conference on Parallel Processing (ICPP-2014), Minneapolis, MN, September 2014.  (1.37 MB)
Dong, T., V. Dobrev, T. Kolev, R. Rieben, S. Tomov, and J. Dongarra, Hydrodynamic Computation with Hybrid Programming on CPU-GPU Clusters,” University of Tennessee Computer Science Technical Report, no. ut-cs-13-714, July 2013.  (866.68 KB)
Dong, T., A. Haidar, S. Tomov, and J. Dongarra, Accelerating the SVD Bi-Diagonalization of a Batch of Small Matrices using GPUs,” Journal of Computational Science, vol. 26, pp. 237–245, May 2018.
Dong, T., T. Kolev, R. Rieben, V. Dobrev, S. Tomov, and J. Dongarra, Acceleration of the BLAST Hydro Code on GPU,” Supercomputing '12 (poster), Salt Lake City, Utah, SC12, November 2012.
Dong, T., A. Haidar, P. Luszczek, J. Harris, S. Tomov, and J. Dongarra, LU Factorization of Small Matrices: Accelerating Batched DGETRF on the GPU,” 16th IEEE International Conference on High Performance Computing and Communications (HPCC), Paris, France, IEEE, August 2014.  (684.73 KB)
Dong, T., A. Haidar, S. Tomov, and J. Dongarra, Optimizing the SVD Bidiagonalization Process for a Batch of Small Matrices,” International Conference on Computational Science (ICCS 2017), Zurich, Switzerland, Procedia Computer Science, June 2017.  (364.95 KB)
Dongarra, J., M. Faverge, Y. Ishikawa, R. Namyst, F. Rue, and F. Trahay, EZTrace: a generic framework for performance analysis,” ICL Technical Report, no. ICL-UT-11-01, December 2010.
Dongarra, J., and P. Luszczek, Introduction to the HPCChallenge Benchmark Suite,” ICL Technical Report, no. ICL-UT-05-01, (Also appears as CS Dept. Tech Report UT-CS-05-544), January 2005.  (124.86 KB)
Dongarra, J., M. Faverge, H. Ltaeif, and P. Luszczek, High Performance Matrix Inversion Based on LU Factorization for Multicore Architectures,” Proceedings of MTAGS11, Seattle, WA, November 2011.  (879.49 KB)
Dongarra, J., G. Bosilca, R. Delmas, and J. Langou, Algorithmic Based Fault Tolerance Applied to High Performance Computing,” Journal of Parallel and Distributed Computing, vol. 69, pp. 410-416, 00-2009.  (313.55 KB)
Dongarra, J., M. Faverge, H. Ltaeif, and P. Luszczek, Achieving Numerical Accuracy and High Performance using Recursive Tile LU Factorization,” University of Tennessee Computer Science Technical Report (also as a LAWN), no. ICL-UT-11-08, September 2011.  (618.53 KB)
Dongarra, J., J-F. Pineau, Y. Robert, Z. Shi, and F. Vivien, Revisiting Matrix Product on Master-Worker Platforms,” International Journal of Foundations of Computer Science (IJFCS), vol. 19, no. 6, pp. 1317-1336, December 2008.  (248.66 KB)
Dongarra, J., V. Eijkhout, and P. Luszczek, Recursive approach in sparse matrix LU factorization,” Proceedings of 1st SGI Users Conference, Cracow, Poland (ACC Cyfronet UMM, 2000), pp. 409-418, January 2000.  (176.14 KB)
Dongarra, J., H. Meuer, H. D. Simon, and E. Strohmaier, Recent Trends in High Performance Computing,” in Birth of Numerical Analysis (to appear), 00-2009.
Dongarra, J., A. Maloney, S. Moore, P. Mucci, and S. Shende, Performance Instrumentation and Measurement for Terascale Systems,” ICCS 2003 Terascale Workshop, Melbourne, Australia, June 2003.  (5.36 MB)
Dongarra, J., V. Getov, and K. Walsh, The 30th Anniversary of the Supercomputing Conference: Bringing the Future Closer—Supercomputing History and the Immortality of Now,” Computer, vol. 51, issue 10, pp. 74–85, November 2018.
Dongarra, J., J-F. Pineau, Y. Robert, and F. Vivien, Matrix Product on Heterogeneous Master Worker Platforms,” 2008 PPoPP Conference, Salt Lake City, Utah, January 2008.
Dongarra, J., J. Demmel, J. Langou, and J. Langou, 2016 Dense Linear Algebra Software Packages Survey,” University of Tennessee Computer Science Technical Report, no. UT-EECS-16-744 / LAWN 290: University of Tennessee, September 2016.  (366.43 KB)
Dongarra, J., Performance of Various Computers Using Standard Linear Equations Software (Linpack Benchmark Report),” University of Tennessee Computer Science Dept. Technical Report CS-89-85, 00-2007.  (6.42 MB)
Dongarra, J., LINPACK on Future Manycore and GPu Based Systems,” PARA 2010, Reykjavik, Iceland, June 2010.
Dongarra, J., G. H. Golub, E. Grosse, C. Moler, and K. Moore, Netlib and NA-Net: Building a Scientific Computing Community,” IEEE Annals of the History of Computing, vol. 30, no. 2, pp. 30-41, January 2008.  (352.71 KB)
Dongarra, J., J-F. Pineau, Y. Robert, Z. Shi, and F. Vivien, Revisiting Matrix Product on Master-Worker Platforms,” International Journal of Foundations of Computer Science (IJFCS) (accepted), 00-2007.  (248.66 KB)
Dongarra, J., I. Duff, M. Gates, A. Haidar, S. Hammarling, N. J. Higham, J. Hogg, P. Valero Lara, P. Luszczek, M. Zounon, et al., Batched BLAS (Basic Linear Algebra Subprograms) 2018 Specification , July 2018.  (483.05 KB)
Dongarra, J., P. Beckman, and et al., The International Exascale Software Project Roadmap,” International Journal of High Performance Computing, vol. 25, no. 1, pp. 3-60, 00-2011.  (719.74 KB)
Dongarra, J., and P. Luszczek, Reducing the time to tune parallel dense linear algebra routines with partial execution and performance modelling,” University of Tennessee Computer Science Technical Report, no. UT-CS-10-661, October 2010.  (287.87 KB)
Dongarra, J., M. Gates, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, and I. Yamazaki, The Singular Value Decomposition: Anatomy of Optimizing an Algorithm for Extreme Scale,” SIAM Review, vol. 60, issue 4, pp. 808–865, November 2018.
Dongarra, J., Performance of Various Computers Using Standard Linear Equations Software (Linpack Benchmark Report),” University of Tennessee Computer Science Technical Report, CS-89-85, January 2008.  (6.42 MB)
Dongarra, J., Report on the Sunway TaihuLight System,” University of Tennessee Computer Science Technical Report, no. UT-EECS-16-742: University of Tennessee, June 2016.
Dongarra, J., V. Eijkhout, and H. van der Vorst, Iterative Solver Benchmark (LAPACK Working Note 152),” Scientific Programming, vol. 9, no. 4, pp. 223-231, 00-2001.  (168.05 KB)
Dongarra, J., Performance of Various Computers Using Standard Linear Equations Software (Linpack Benchmark Report),” University of Tennessee Computer Science Technical Report, no. CS-89-85, January 2001.  (6.42 MB)
Dongarra, J., and P. Luszczek, How Elegant Code Evolves With Hardware: The Case Of Gaussian Elimination,” in Beautiful Code Leading Programmers Explain How They Think (Chapter 14), pp. 243-282, January 2008.  (257 KB)
Dongarra, J., and P. Luszczek, HPC Challenge: Design, History, and Implementation Highlights,” Contemporary High Performance Computing: From Petascale Toward Exascale, Boca Raton, FL, Taylor and Francis, 2013.  (790.01 KB)
Dongarra, J., H. Meuer, H. D. Simon, and E. Strohmaier, Biannual Top-500 Computer Lists Track Changing Environments for Scientific Computing,” SIAM News, vol. 34, no. 9, October 2002.  (2.62 MB)
Dongarra, J., and P. Raghavan, A New Recursive Implementation of Sparse Cholesky Factorization,” Proceedings of 16th IMACS World Congress 2000 on Scientific Computing, Applications Mathematics and Simulation, Lausanne, Switzerland, August 2000.
Dongarra, J., G. Fagg, R. Hempel, and D. W. Walker, Message Passing Software Systems,” Encyclopedia of Electrical and Engineering, Supplement 1: John Wiley & Sons, Inc., 00-2000.  (289.38 KB)
Dongarra, J., S. Moore, P. Mucci, K. Seymour, and H. You, Accurate Cache and TLB Characterization Using hardware Counters,” Proceedings of ICCS 2004 (to appear), Krakow Poland, January 2004.  (167.1 KB)
Dongarra, J., T. Herault, and Y. Robert, Revisiting the Double Checkpointing Algorithm,” University of Tennessee Computer Science Technical Report (LAWN 274), no. ut-cs-13-705, January 2013.  (682.22 KB)
Dongarra, J., J. Kurzak, P. Luszczek, and I. Yamazaki, PULSAR Users’ Guide, Parallel Ultra-Light Systolic Array Runtime,” University of Tennessee EECS Technical Report, no. UT-EECS-14-733: University of Tennessee, November 2014.  (561.56 KB)
Dongarra, J., M. Faverge, H. Ltaeif, and P. Luszczek, Exploiting Fine-Grain Parallelism in Recursive LU Factorization,” Proceedings of PARCO'11, no. ICL-UT-11-04, Gent, Belgium, April 2011.
Dongarra, J., and S. Tomov, An Introduction to the MAGMA project - Acceleration of Dense Linear Algebra : NVIDIA Webinar, June 2010.
Dongarra, J., H. Meuer, and E. Strohmaier, Top500 Supercomputer Sites (14th edition),” University of Tennessee Computer Science Department Technical Report, no. UT-CS-99-434, November 1999.  (281.81 KB)

Pages