Publications

S
Song, F., S. Moore, and J. Dongarra, “Analytical Modeling for Affinity-Based Thread Scheduling on Multicore Platforms,” University of Tennessee Computer Science Technical Report, UT-CS-08-626, January 2008.
Song, F., H. Ltaief, B. Hadri, and J. Dongarra, “Scalable Tile Communication-Avoiding QR Factorization on Multicore Cluster Systems,” University of Tennessee Computer Science Technical Report, UT-CS-10-653, April 2010.
Solcà, R., A. Haidar, S. Tomov, J. Dongarra, and T. C. Schulthess, “A Novel Hybrid CPU-GPU Generalized Eigensolver for Electronic Structure Calculations Based on Fine Grained Memory Aware Tasks,” Supercomputing '12 (poster), Salt Lake City, Utah, November 2012.
Solcà, R., A. Kozhevnikov, A. Haidar, S. Tomov, T. C. Schulthess, and J. Dongarra, “Efficient Implementation Of Quantum Materials Simulations On Distributed CPU-GPU Systems,” The International Conference for High Performance Computing, Networking, Storage and Analysis (SC15), Austin, TX, ACM, November 2015.
“Proceedings of the International Conference on Computational Science,” ICCS 2010, Amsterdam, Elsevier, May 2010.
Sloot, P. M., D. Abramson, A. V. Bogdanov, J. Dongarra, A. Zomaya, and Y. Gorbachev, “Computational Science — ICCS 2003,” Lecture Notes in Computer Science, vol. 2657-2660, ICCS 2003, International Conference. Melbourne, Australia, Springer-Verlag, Berlin, June 2003.
Shipman, G. M., G. Bosilca, and A. B. Maccabe, “High Performance RDMA Protocols in HPC,” Euro PVM/MPI 2006, Bonn, Germany, September 2006.
Shimosaka, H., T. Hiroyasu, M. Miki, and J. Dongarra, “Optimization Problem Solving System Using GridRPC,” IEEE Transactions on Parallel and Distributed Systems (submitted), January 2005.
Shende, S., A. D. Malony, A. Morris, and F. Wolf, “Performance Profiling Overhead Compensation for MPI Programs,” In Proc. of the 12th European Parallel Virtual Machine and Message Passing Interface Conference: Springer LNCS, September 2005.
Shende, S., A. D. Malony, S. Moore, and D. Cronk, “Memory Leak Detection in Fortran Applications using TAU,” Proc. DoD HPCMP Users Group Conference (HPCMP-UGC'07), Pittsburgh, PA, IEEE Computer Society, January 2007.
Shamis, P., M. G. Venkata, M. G. Lopez, M. B. Baker, O. Hernandez, Y. Itigin, M. Dubman, G. Shainer, R. L. Graham, L. Liss, et al., “UCX: An Open Source Framework for HPC Network APIs and Beyond,” 2015 IEEE 23rd Annual Symposium on High-Performance Interconnects, Santa Clara, CA, USA, IEEE, pp. 40-43, 2015. DOI: 10.1109/HOTI.2015.13
Shaiek, H., S. Tomov, A. Ayala, A. Haidar, and J. Dongarra, “GPUDirect MPI Communications and Optimizations to Accelerate FFTs on Exascale Systems,” EuroMPI'19 Posters, Zurich, Switzerland, no. icl-ut-19-06: ICL, September 2019.
Seymour, K., H. You, and J. Dongarra, “A Comparison of Search Heuristics for Empirical Code Optimization,” The 3rd international Workshop on Automatic Performance Tuning, Tsukuba, Japan, October 2008.
Seymour, K., A. YarKhan, and J. Dongarra, “Transparent Cross-Platform Access to Software Services using GridSolve and GridRPC,” in Cloud Computing and Software Services: Theory and Techniques (to appear): CRC Press, 2009.
Seymour, K., H. Nakada, S. Matsuoka, J. Dongarra, C. Lee, and H. Casanova, “GridRPC: A Remote Procedure Call API for Grid Computing,” ICL Technical Report, no. ICL-UT-02-06, November 2002.
Seymour, K., A. YarKhan, S. Agrawal, and J. Dongarra, “NetSolve: Grid Enabling Scientific Computing Environments,” Grid Computing and New Frontiers of High Performance Processing, no. 14: Elsevier, 2005.
Seymour, K., and J. Dongarra, “Automatic Translation of Fortran to JVM Bytecode,” Joint ACM Java Grande - ISCOPE 2001 Conference (submitted), Stanford University, California, June 2001.
Seymour, K., H. You, and J. Dongarra, “ATLAS on the BlueGene/L – Preliminary Results,” ICL Technical Report, no. ICL-UT-06-10, January 2006.
Seymour, K., H. Nakada, S. Matsuoka, J. Dongarra, C. Lee, and H. Casanova, “Overview of GridRPC: A Remote Procedure Call API for Grid Computing,” Proceedings of the Third International Workshop on Grid Computing, pp. 274-278, January 2002.
Seymour, K., and J. Dongarra, “Automatic Translation of Fortran to JVM Bytecode,” Concurrency and Computation: Practice and Experience, vol. 15, no. 3-5, pp. 202-207, 2003.
Seo, S., A. Amer, P. Balaji, C. Bordage, G. Bosilca, A. Brooks, P. Carns, A. Castello, D. Genet, T. Herault, et al., “Argobots: A Lightweight Low-Level Threading and Tasking Framework,” IEEE Transactions on Parallel and Distributed Systems, October 2017. DOI: 10.1109/TPDS.2017.2766062
R
Roche, K., and J. Dongarra, “Deploying Parallel Numerical Library Routines to Cluster Computing in a Self Adapting Fashion,” Parallel Computing: Advances and Current Issues: Proceedings of the International Conference ParCo2001, London, England, Imperial College Press, January 2002.
Ribizel, T., and H. Anzt, “Approximate and Exact Selection on GPUs,” 2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Rio de Janeiro, Brazil, IEEE, 2019. DOI: 10.1109/IPDPSW.2019.00088
Reed, D., and J. Dongarra, “Exascale Computing and Big Data,” Communications of the ACM, vol. 58, no. 7: ACM, pp. 56-68, July 2015. DOI: 10.1145/2699414
Raman, G., and J. Dongarra, “Design and Implementation of NetSolve using DCOM as the Remoting Layer,” University of Tennessee Computer Science Department Technical Report, no. UT-CS-00-440, May 2000.
Ramakrishnan, L., D. Nurmi, A. Mandal, C. Koelbel, D. Gannon, M. Huang, Y-S. Kee, G. Obertelli, K. Thyagaraja, R. Wolski, et al., “VGrADS: Enabling e-Science Workflows on Grids and Clouds with Fault Tolerance,” SC’09 The International Conference for High Performance Computing, Networking, Storage and Analysis (to appear), Portland, OR, 2009.
P
Portillo, R., P. J. Teller, D. Cronk, and S. Moore, “Making Performance Analysis and Tuning Part of the Software Development Cycle,” Proceedings of DoD HPCMP UGC 2009, San Diego, CA, IEEE, June 2009.
Plank, J., M. Beck, J. Dongarra, R. Wolski, and H. Casanova, “Optimizing Performance and Reliability in Distributed Computing Systems Through Wide Spectrum Storage,” Proceedings of the IPDPS 2003, NGS Workshop, Nice, France, pp. 209, January 2003.
Pjesivac–Grbovic, J., T. Angskun, G. Bosilca, G. Fagg, E. Gabriel, and J. Dongarra, “Performance Analysis of MPI Collective Operations,” 4th International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS '05), Denver, Colorado, April 2005.
Pjesivac–Grbovic, J., G. Fagg, T. Angskun, G. Bosilca, and J. Dongarra, “MPI Collective Algorithm Selection and Quadtree Encoding,” Lecture Notes in Computer Science, vol. 4192, no. ICL-UT-06-13: Springer Berlin / Heidelberg, pp. 40-48, September 2006.
Pjesivac–Grbovic, J., T. Angskun, G. Bosilca, G. Fagg, E. Gabriel, and J. Dongarra, “Performance Analysis of MPI Collective Operations,” Cluster computing, vol. 10, no. 2: Springer Netherlands, pp. 127-143, June 2007.
Pjesivac–Grbovic, J., T. Angskun, G. Bosilca, G. Fagg, E. Gabriel, and J. Dongarra, “Performance Analysis of MPI Collective Operations,” Cluster Computing Journal (to appear), January 2005.
Pjesivac–Grbovic, J., G. Bosilca, G. Fagg, T. Angskun, and J. Dongarra, “Decision Trees and MPI Collective Algorithm Selection Problem,” Euro-Par 2007, Rennes, France, Springer, pp. 105–115, August 2007.
Pjesivac–Grbovic, J., G. Fagg, T. Angskun, G. Bosilca, and J. Dongarra, “MPI Collective Algorithm Selection and Quadtree Encoding,” ICL Technical Report, no. ICL-UT-06-11, 2006.
Pjesivac–Grbovic, J., G. Bosilca, G. Fagg, T. Angskun, and J. Dongarra, “MPI Collective Algorithm Selection and Quadtree Encoding,” Parallel Computing (Special Edition: EuroPVM/MPI 2006): Elsevier, 2007.
Petitet, A., H. Casanova, C. Whaley, J. Dongarra, and Y. Robert, “A Numerical Linear Algebra Problem Solving Environment Designer's Perspective (LAPACK Working Note 139),” SIAM Annual Meeting, Atlanta, GA, May 1999.
Petitet, A., and J. Dongarra, “Algorithmic Redistribution Methods for Block Cyclic Decompositions,” IEEE Transactions on Parallel and Distributed Computing, vol. 10, no. 12, pp. 201-220, October 2002.
Petitet, A., S. Blackford, J. Dongarra, B. Ellis, G. Fagg, K. Roche, and S. Vadhiyar, “Numerical Libraries and The Grid: The Grads Experiments with ScaLAPACK,” University of Tennessee Computer Science Technical Report, no. UT-CS-01-460, January 2001.
Petitet, A., S. Blackford, J. Dongarra, B. Ellis, G. Fagg, K. Roche, and S. Vadhiyar, “Numerical Libraries and The Grid,” International Journal of High Performance Applications and Supercomputing, vol. 15, no. 4, pp. 359-374, January 2001.
Petitet, A., H. Casanova, J. Dongarra, Y. Robert, and C. Whaley, “Parallel and Distributed Scientific Computing: A Numerical Linear Algebra Problem Solving Environment Designer's Perspective,” Handbook on Parallel and Distributed Processing, January 1999.
Patinyasakdikul, T., D. Eberius, G. Bosilca, and N. Hjelm, “Give MPI Threading a Fair Chance: A Study of Multithreaded MPI Designs,” IEEE Cluster, Albuquerque, NM, IEEE, September 2019.
Parker, S., J. Mellor-Crummey, D. H. Ahn, H. Jagode, H. Brunst, S. Shende, A. D. Malony, D. DelSignore, R. Tschuter, R. Castain, et al., “Performance Analysis and Debugging Tools at Scale,” Exascale Scientific Applications: Scalability and Performance Portability: Chapman & Hall / CRC Press, pp. 17-50, November 2017. DOI: 10.1201/b21930
Palma, J., J. Dongarra, and V. Hernández, “High Performance Computing for Computational Science,” Lecture Notes in Computer Science, vol. 2565, VECPAR 2002, 5th International Conference June 26-28, 2002, Springer-Verlag, Berlin, January 2003.
N
Nichols, D., K. Wong, S. Tomov, L. Ng, S. Chen, and A. Gessinger, “MagmaDNN: Accelerated Deep Learning Using MAGMA,” Practice and Experience in Advanced Research Computing (PEARC ’19), Chicago, IL, ACM, July 2019.
Nichols, D., N-S. Tomov, F. Betancourt, S. Tomov, K. Wong, and J. Dongarra, “MagmaDNN: Towards High-Performance Data Analytics and Machine Learning for Data-Driven Scientific Computing,” ISC High Performance, Frankfurt, Germany, Springer International Publishing, June 2019.
Ng, L., S. Chen, A. Gessinger, D. Nichols, S. Cheng, A. Meenasorna, K. Wong, S. Tomov, A. Haidar, E. D'Azevedo, et al., MagmaDNN 0.2 High-Performance Data Analytics for Manycore GPUs and CPUs: University of Tennessee, January 2019. DOI: 10.13140/RG.2.2.14906.64961
Ng, L., K. Wong, A. Haidar, S. Tomov, and J. Dongarra, MagmaDNN – High-Performance Data Analytics for Manycore GPUs and CPUs, Knoxville, TN, 2017 Summer Research Experiences for Undergraduate (REU), Presentation, December 2017.
Newburn, C. J., G. Bansal, M. Wood, L. Crivelli, J. Planas, A. Duran, P. Souza, L. Borges, P. Luszczek, S. Tomov, et al., “Heterogeneous Streaming,” The Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), IPDPS 2016, Chicago, IL, IEEE, May 2016.
Nelson, J., “Analyzing PAPI Performance on Virtual Machines,” VMWare Technical Journal, vol. Winter 2013, January 2014.
Nelson, J., “Analyzing PAPI Performance on Virtual Machines,” ICL Technical Report, no. ICL-UT-13-02, August 2013.