Publications

S
Seymour, K., and J. Dongarra, “Automatic Translation of Fortran to JVM Bytecode,” Concurrency and Computation: Practice and Experience, vol. 15, no. 3-5, pp. 202-207, 20March 00.  (185.8 KB)
Seymour, K., H. You, and J. Dongarra, “A Comparison of Search Heuristics for Empirical Code Optimization,” The 3rd international Workshop on Automatic Performance Tuning, Tsukuba, Japan, 20August 10.  (772.48 KB)
Seymour, K., A. YarKhan, and J. Dongarra, “Transparent Cross-Platform Access to Software Services using GridSolve and GridRPC,” in Cloud Computing and Software Services: Theory and Techniques (to appear): CRC Press, 20September 00.
Seymour, K., H. Nakada, S. Matsuoka, J. Dongarra, C. Lee, and H. Casanova, “GridRPC: A Remote Procedure Call API for Grid Computing,” ICL Technical Report, no. ICL-UT-02-06, 20February 11.  (287.73 KB)
Seymour, K., A. YarKhan, S. Agrawal, and J. Dongarra, “NetSolve: Grid Enabling Scientific Computing Environments,” Grid Computing and New Frontiers of High Performance Processing, no. 14: Elsevier, 20May 00.  (425 KB)
Seymour, K., and J. Dongarra, “Automatic Translation of Fortran to JVM Bytecode,” Joint ACM Java Grande - ISCOPE 2001 Conference (submitted), Stanford University, California, 20January 06.  (185.8 KB)
Shaiek, H., S. Tomov, A. Ayala, A. Haidar, and J. Dongarra, “GPUDirect MPI Communications and Optimizations to Accelerate FFTs on Exascale Systems,” EuroMPI'19 Posters, Zurich, Switzerland, no. icl-ut-19-06: ICL, 2019-09.  (2.25 MB)
Shamis, P., M. G. Venkata, M. G. Lopez, M. B. Baker, O. Hernandez, Y. Itigin, M. Dubman, G. Shainer, R. L. Graham, L. Liss, et al., “UCX: An Open Source Framework for HPC Network APIs and Beyond,” 2015 IEEE 23rd Annual Symposium on High-Performance Interconnects, Santa Clara, CA, USA, IEEE, pp. 40-43, Aug, 2015. DOI: 10.1109/HOTI.2015.13
Shende, S., A. Malony, A. Morris, and F. Wolf, “Performance Profiling Overhead Compensation for MPI Programs,” In Proc. of the 12th European Parallel Virtual Machine and Message Passing Interface Conference: Springer LNCS, 20May 09.  (220.26 KB)
Shende, S., A. Malony, S. Moore, and D. Cronk, “Memory Leak Detection in Fortran Applications using TAU,” Proc. DoD HPCMP Users Group Conference (HPCMP-UGC'07), Pittsburgh, PA, IEEE Computer Society, 20July 01.
Shimosaka, H., T. Hiroyasu, M. Miki, and J. Dongarra, “Optimization Problem Solving System Using GridRPC,” IEEE Transactions on Parallel and Distributed Systems (submitted), 20May 01.  (740.57 KB)
Shipman, G. M., G. Bosilca, and A. B. Maccabe, “High Performance RDMA Protocols in HPC,” Euro PVM/MPI 2006, Bonn, Germany, 20June 09.  (1.06 MB)
“Proceedings of the International Conference on Computational Science,” ICCS 2010, Amsterdam, Elsevier, 20October 05.
Sloot, P. M., D. Abramson, A. V. Bogdanov, J. Dongarra, A. Zomaya, and Y. Gorbachev, “Computational Science — ICCS 2003,” Lecture Notes in Computer Science, vol. 2657-2660, ICCS 2003, International Conference. Melbourne, Australia, Springer-Verlag, Berlin, 20March 06.
Solcà, R., A. Kozhevnikov, A. Haidar, S. Tomov, T. C. Schulthess, and J. Dongarra, “Efficient Implementation Of Quantum Materials Simulations On Distributed CPU-GPU Systems,” The International Conference for High Performance Computing, Networking, Storage and Analysis (SC15), Austin, TX, ACM, 2015-11.  (1.09 MB)
Solcà, R., A. Haidar, S. Tomov, J. Dongarra, and T. C. Schulthess, “A Novel Hybrid CPU-GPU Generalized Eigensolver for Electronic Structure Calculations Based on Fine Grained Memory Aware Tasks,” Supercomputing '12 (poster), Salt Lake City, Utah, 20December 11.
Song, F., F. Wolf, N. Bhatia, J. Dongarra, and S. Moore, “An Algebra for Cross-Experiment Performance Analysis,” 2004 International Conference on Parallel Processing (ICPP-04), Montreal, Quebec, Canada, 20April 08.  (166.12 KB)
Song, F., and J. Dongarra, “A Scalable Framework for Heterogeneous GPU-Based Clusters,” The 24th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2012), Pittsburgh, PA, USA, ACM, 20December 06.  (3.39 MB)
Song, F., S. Moore, and J. Dongarra, “Analytical Modeling and Optimization for Affinity Based Thread Scheduling on Multicore Systems,” IEEE Cluster 2009, New Orleans, 20September 08.  (395.53 KB)
Song, F., S. Tomov, and J. Dongarra, “Enabling and Scaling Matrix Computations on Heterogeneous Multi-Core and Multi-GPU Systems,” 26th ACM International Conference on Supercomputing (ICS 2012), San Servolo Island, Venice, Italy, ACM, 20December 06.  (5.88 MB)
Song, F., A. YarKhan, and J. Dongarra, “Dynamic Task Scheduling for Linear Algebra Algorithms on Distributed-Memory Multicore Systems,” International Conference for High Performance Computing, Networking, Storage, and Analysis (SC '09), Portland, OR, 20September 11.  (502.49 KB)
Song, F., S. Moore, and J. Dongarra, “Modeling of L2 Cache Behavior for Thread-Parallel Scientific Programs on Chip Multi-Processors,” University of Tennessee Computer Science Technical Report, no. UT-CS-06-583, 20June 01.  (652.93 KB)
Song, F., J. Dongarra, and S. Moore, “Experiments with Strassen's Algorithm: From Sequential to Parallel,” 18th IASTED International Conference on Parallel and Distributed Computing and Systems PDCS 2006 (submitted), Dallas, Texas, 20June 01.  (514.33 KB)
Song, F., and J. Dongarra, “Scaling Up Matrix Computations on Shared-Memory Manycore Systems with 1000 CPU Cores,” International conference on Supercomputing, Munich, Germany, ACM, pp. 333-342, 2014-06. DOI: 10.1145/2597652.2597670  (2.9 MB)
Song, F., S. Moore, and J. Dongarra, “Analytical Modeling for Affinity-Based Thread Scheduling on Multicore Platforms,” University of Tennessee Computer Science Technical Report, UT-CS-08-626, 20August 01.  (650.75 KB)
Song, F., H. Ltaief, B. Hadri, and J. Dongarra, “Scalable Tile Communication-Avoiding QR Factorization on Multicore Cluster Systems,” University of Tennessee Computer Science Technical Report, vol. –10-653, 20October 04.  (3.42 MB)
Song, F., H. Ltaief, B. Hadri, and J. Dongarra, “Scalable Tile Communication-Avoiding QR Factorization on Multicore Cluster Systems,” SC'10, New Orleans, LA, ACM SIGARCH/IEEE Computer Society, 20October 11.  (3.42 MB)
Song, F., S. Moore, and J. Dongarra, “A Scalable Non-blocking Multicast Scheme for Distributed DAG Scheduling,” The International Conference on Computational Science 2009 (ICCS 2009), vol. 5544, Baton Rouge, LA, pp. 195-204, 20September 05.  (228.45 KB)
Song, F., S. Moore, and J. Dongarra, “Feedback-Directed Thread Scheduling with Memory Considerations,” IEEE International Symposium on High Performance Distributed Computing, Monterey Bay, CA, 20July 06.  (297.24 KB)
Song, F., and J. Dongarra, “A Scalable Approach to Solving Dense Linear Algebra Problems on Hybrid CPU-GPU Systems,” Concurrency and Computation: Practice and Experience, vol. 27, issue 14, pp. 3702-3723, 2015-09. DOI: 10.1002/cpe.3403  (8.16 MB)
Song, F., S. Tomov, and J. Dongarra, “Efficient Support for Matrix Computations on Heterogeneous Multi-core and Multi-GPU Architectures,” University of Tennessee Computer Science Technical Report, UT-CS-11-668, (also Lawn 250), 20November 06.  (5.93 MB)
Song, F., and F. Wolf, “CUBE User Manual,” ICL Technical Report, no. ICL-UT-04-01, 20April 02.  (429.12 KB)
Song, F., S. Moore, and J. Dongarra, “L2 Cache Modeling for Scientific Applications on Chip Multi-Processors,” Proceedings of the 2007 International Conference on Parallel Processing, Xi'an, China, IEEE Computer Society, 20July 01.  (654.11 KB)
Sourbier, F., A. Haidar, L. Giraud, H. Ben-Hadj-Ali, S. Operto, and J. Virieux, “Three-dimensional parallel frequency-domain visco-acoustic wave modelling based on a hybrid direct/iterative solver,” Geophysical Prospecting (to appear), 20November 00.  (1.04 MB)
Steen, A. J. van der, and J. Dongarra, “Overview of High Performance Computers,” Handbook of Massive Data Sets: Kluwer Academic Publishers, pp. 791-852, 20January 01.  (442.71 KB)
Strohmaier, E., H. Meuer, J. Dongarra, and H. D. Simon, “The TOP500 List and Progress in High-Performance Computing,” IEEE Computer, vol. 48, issue 11, pp. 42-49, 2015-11. DOI: 10.1109/MC.2015.338
Strohmaier, E., J. Dongarra, H. Meuer, and H. D. Simon, “The Marketplace for High-Performance Computers,” Parallel Computing, vol. 25, no. 13-14, pp. 1517-1545, 20February 10.  (285.78 KB)
Sun, J., J. Fu, J. Drake, Q. Zhu, A. Haidar, M. Gates, S. Tomov, and J. Dongarra, “Computational Benefit of GPU Optimization for Atmospheric Chemistry Modeling,” Journal of Advances in Modeling Earth Systems, vol. 10, issue 8, pp. 1952–1969, 2018-08. DOI: 10.1029/2018MS001276
Supinski, B. R. de, J. K. Hollingsworth, S. Moore, and P. H. Worley, “Results of the PERI survey of SciDAC applications,” Journal of Physics: Conference Series, SciDAC 2007, vol. 78, no. 2007, 20July 01.  (692.83 KB)
Supinski, B. R. de, S. Alam, D. Bailey, L. Carrington, C. Daley, A. Dubey, T. Gamblin, D. Gunter, P. D. Hovland, H. Jagode, et al., “Modeling the Office of Science Ten Year Facilities Plan: The PERI Architecture Tiger Team,” SciDAC 2009, Journal of Physics: Conference Series, vol. 180(2009)012039, San Diego, California, IOP Publishing, 20September 07.  (906.39 KB)
T
Tang, Y., G. Fagg, and J. Dongarra, “Proposal of MPI operation level Checkpoint/Rollback and one implementation,” Proceedings of IEEE CCGrid 2006: IEEE Computer Society, 20June 01.  (277.27 KB)
Tang, C., A. Bouteiller, T. Herault, M. Gorentla Venkata, and G. Bosilca, “From MPI to OpenSHMEM: Porting LAMMPS,” OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies, Annapolis, MD, USA, Springer International Publishing, pp. 121–137, 2015. DOI: 10.1007/978-3-319-26428-8_8
Tang, Y., “Technical Comparison between several representative checkpoint/rollback solutions for MPI programs,” ICL Technical Report, no. ICL-UT-06-09, 20June 01.  (84.67 KB)
Terpstra, D., H. Jagode, H. You, and J. Dongarra, “Collecting Performance Data with PAPI-C,” Tools for High Performance Computing 2009, 3rd Parallel Tools Workshop, Dresden, Germany, Springer Berlin / Heidelberg, pp. 157-173, 20October 05. DOI: 10.1007/978-3-642-11261-4_11  (4.45 MB)
Tisseur, F., and J. Dongarra, “Parallelizing the Divide and Conquer Algorithm for the Symmetric Tridiagonal Eigenvalue Problem on Distributed Memory Architectures,” SIAM Journal on Scientific Computing, vol. 6, no. 20, pp. 2223-2236, 20February 10.  (321.36 KB)
Tomov, S., J. Dongarra, and M. Baboulin, “Towards Dense Linear Algebra for Hybrid GPU Accelerated Manycore Systems,” Parallel Computing, vol. 36, no. 5-6, pp. 232-240, 20October 00.  (606.41 KB)
Tomov, S., and J. Dongarra, “Accelerating the Reduction to Upper Hessenberg Form through Hybrid GPU-Based Computing,” University of Tennessee Computer Science Technical Report, UT-CS-09-642 (also LAPACK Working Note 219), 20September 05.  (2.37 MB)
Tomov, S., “Linear Algebra Software for High-Performance Computing (Part 2: Software for Hardware Accelerators and Coprocessors),” Frankfurt, Germany, ISC High Performance (ISC18), Tutorial Presentation, 2015-06.  (15.41 MB)
Tomov, S., A. Haidar, D. Schultz, and J. Dongarra, “Evaluation and Design of FFT for Distributed Accelerated Systems,” ECP WBS 2.3.3.09 Milestone Report, no. FFT-ECP ST-MS-10-1216: Innovative Computing Laboratory, University of Tennessee, 2018-10.  (7.53 MB)
Tomov, S., J. Langou, A. Canning, L-W. Wang, and J. Dongarra, “Conjugate-Gradient Eigenvalue Solvers in Computing Electronic Properties of Nanostructure Architectures,” International Journal of Computational Science and Engineering (to appear), 20May 01.  (428.21 KB)
