Publications

R
Raman, G., and J. Dongarra, “Design and Implementation of NetSolve using DCOM as the Remoting Layer,” University of Tennessee Computer Science Department Technical Report, no. UT-CS-00-440, May 2000.
Reed, D., and J. Dongarra, “Exascale Computing and Big Data,” Communications of the ACM, vol. 58, no. 7: ACM, pp. 56-68, July 2015. DOI: 10.1145/2699414
Ribizel, T., and H. Anzt, “Approximate and Exact Selection on GPUs,” 2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Rio de Janeiro, Brazil, IEEE, 2019. DOI: 10.1109/IPDPSW.2019.00088
Roche, K., and J. Dongarra, “Deploying Parallel Numerical Library Routines to Cluster Computing in a Self Adapting Fashion,” Parallel Computing: Advances and Current Issues: Proceedings of the International Conference ParCo2001, London, England, Imperial College Press, January 2002.
S
Seo, S., A. Amer, P. Balaji, C. Bordage, G. Bosilca, A. Brooks, P. Carns, A. Castello, D. Genet, T. Herault, et al., “Argobots: A Lightweight Low-Level Threading and Tasking Framework,” IEEE Transactions on Parallel and Distributed Systems, October 2017. DOI: 10.1109/TPDS.2017.2766062
Seymour, K., H. Nakada, S. Matsuoka, J. Dongarra, C. Lee, and H. Casanova, “Overview of GridRPC: A Remote Procedure Call API for Grid Computing,” Proceedings of the Third International Workshop on Grid Computing, pp. 274-278, January 2002.
Seymour, K., H. You, and J. Dongarra, “ATLAS on the BlueGene/L – Preliminary Results,” ICL Technical Report, no. ICL-UT-06-10, January 2006.
Seymour, K., and J. Dongarra, “Automatic Translation of Fortran to JVM Bytecode,” Concurrency and Computation: Practice and Experience, vol. 15, no. 3-5, pp. 202-207, 2003.
Seymour, K., H. You, and J. Dongarra, “A Comparison of Search Heuristics for Empirical Code Optimization,” The 3rd International Workshop on Automatic Performance Tuning, Tsukuba, Japan, October 2008.
Seymour, K., A. YarKhan, and J. Dongarra, “Transparent Cross-Platform Access to Software Services using GridSolve and GridRPC,” in Cloud Computing and Software Services: Theory and Techniques (to appear): CRC Press, 2009.
Seymour, K., H. Nakada, S. Matsuoka, J. Dongarra, C. Lee, and H. Casanova, “GridRPC: A Remote Procedure Call API for Grid Computing,” ICL Technical Report, no. ICL-UT-02-06, November 2002.
Seymour, K., A. YarKhan, S. Agrawal, and J. Dongarra, “NetSolve: Grid Enabling Scientific Computing Environments,” Grid Computing and New Frontiers of High Performance Processing, no. 14: Elsevier, 2005.
Seymour, K., and J. Dongarra, “Automatic Translation of Fortran to JVM Bytecode,” Joint ACM Java Grande - ISCOPE 2001 Conference (submitted), Stanford University, California, June 2001.
Shaiek, H., S. Tomov, A. Ayala, A. Haidar, and J. Dongarra, “GPUDirect MPI Communications and Optimizations to Accelerate FFTs on Exascale Systems,” EuroMPI'19 Posters, Zurich, Switzerland, no. ICL-UT-19-06: ICL, September 2019.
Shamis, P., M. G. Venkata, M. G. Lopez, M. B. Baker, O. Hernandez, Y. Itigin, M. Dubman, G. Shainer, R. L. Graham, L. Liss, et al., “UCX: An Open Source Framework for HPC Network APIs and Beyond,” 2015 IEEE 23rd Annual Symposium on High-Performance Interconnects, Santa Clara, CA, USA, IEEE, pp. 40-43, 2015. DOI: 10.1109/HOTI.2015.13
Shende, S., A. D. Malony, A. Morris, and F. Wolf, “Performance Profiling Overhead Compensation for MPI Programs,” In Proc. of the 12th European Parallel Virtual Machine and Message Passing Interface Conference: Springer LNCS, September 2005.
Shende, S., A. D. Malony, S. Moore, and D. Cronk, “Memory Leak Detection in Fortran Applications using TAU,” Proc. DoD HPCMP Users Group Conference (HPCMP-UGC'07), Pittsburgh, PA, IEEE Computer Society, January 2007.
Shimosaka, H., T. Hiroyasu, M. Miki, and J. Dongarra, “Optimization Problem Solving System Using GridRPC,” IEEE Transactions on Parallel and Distributed Systems (submitted), January 2005.
Shipman, G. M., G. Bosilca, and A. B. Maccabe, “High Performance RDMA Protocols in HPC,” Euro PVM/MPI 2006, Bonn, Germany, September 2006.
Sloot, P. M., D. Abramson, A. V. Bogdanov, J. Dongarra, A. Zomaya, and Y. Gorbachev, “Computational Science — ICCS 2003,” Lecture Notes in Computer Science, vol. 2657-2660, ICCS 2003, International Conference. Melbourne, Australia, Springer-Verlag, Berlin, June 2003.
“Proceedings of the International Conference on Computational Science,” ICCS 2010, Amsterdam, Elsevier, May 2010.
Solcà, R., A. Haidar, S. Tomov, J. Dongarra, and T. C. Schulthess, “A Novel Hybrid CPU-GPU Generalized Eigensolver for Electronic Structure Calculations Based on Fine Grained Memory Aware Tasks,” Supercomputing '12 (poster), Salt Lake City, Utah, November 2012.
Solcà, R., A. Kozhevnikov, A. Haidar, S. Tomov, T. C. Schulthess, and J. Dongarra, “Efficient Implementation Of Quantum Materials Simulations On Distributed CPU-GPU Systems,” The International Conference for High Performance Computing, Networking, Storage and Analysis (SC15), Austin, TX, ACM, November 2015.
Song, F., J. Dongarra, and S. Moore, “Experiments with Strassen's Algorithm: From Sequential to Parallel,” 18th IASTED International Conference on Parallel and Distributed Computing and Systems PDCS 2006 (submitted), Dallas, Texas, January 2006.
Song, F., and J. Dongarra, “Scaling Up Matrix Computations on Shared-Memory Manycore Systems with 1000 CPU Cores,” International Conference on Supercomputing, Munich, Germany, ACM, pp. 333-342, June 2014. DOI: 10.1145/2597652.2597670
Song, F., S. Moore, and J. Dongarra, “Analytical Modeling for Affinity-Based Thread Scheduling on Multicore Platforms,” University of Tennessee Computer Science Technical Report, UT-CS-08-626, January 2008.
Song, F., H. Ltaief, B. Hadri, and J. Dongarra, “Scalable Tile Communication-Avoiding QR Factorization on Multicore Cluster Systems,” University of Tennessee Computer Science Technical Report, vol. –10-653, April 2010.
Song, F., S. Moore, and J. Dongarra, “A Scalable Non-blocking Multicast Scheme for Distributed DAG Scheduling,” The International Conference on Computational Science 2009 (ICCS 2009), vol. 5544, Baton Rouge, LA, pp. 195-204, May 2009.
Song, F., H. Ltaief, B. Hadri, and J. Dongarra, “Scalable Tile Communication-Avoiding QR Factorization on Multicore Cluster Systems,” SC'10, New Orleans, LA, ACM SIGARCH/IEEE Computer Society, November 2010.
Song, F., S. Moore, and J. Dongarra, “Feedback-Directed Thread Scheduling with Memory Considerations,” IEEE International Symposium on High Performance Distributed Computing, Monterey Bay, CA, June 2007.
Song, F., and J. Dongarra, “A Scalable Approach to Solving Dense Linear Algebra Problems on Hybrid CPU-GPU Systems,” Concurrency and Computation: Practice and Experience, vol. 27, issue 14, pp. 3702-3723, September 2015. DOI: 10.1002/cpe.3403
Song, F., S. Tomov, and J. Dongarra, “Efficient Support for Matrix Computations on Heterogeneous Multi-core and Multi-GPU Architectures,” University of Tennessee Computer Science Technical Report, UT-CS-11-668, (also Lawn 250), June 2011.
Song, F., and F. Wolf, “CUBE User Manual,” ICL Technical Report, no. ICL-UT-04-01, February 2004.
Song, F., S. Moore, and J. Dongarra, “L2 Cache Modeling for Scientific Applications on Chip Multi-Processors,” Proceedings of the 2007 International Conference on Parallel Processing, Xi'an, China, IEEE Computer Society, January 2007.
Song, F., and J. Dongarra, “A Scalable Framework for Heterogeneous GPU-Based Clusters,” The 24th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2012), Pittsburgh, PA, USA, ACM, June 2012.
Song, F., F. Wolf, N. Bhatia, J. Dongarra, and S. Moore, “An Algebra for Cross-Experiment Performance Analysis,” 2004 International Conference on Parallel Processing (ICPP-04), Montreal, Quebec, Canada, August 2004.
Song, F., S. Tomov, and J. Dongarra, “Enabling and Scaling Matrix Computations on Heterogeneous Multi-Core and Multi-GPU Systems,” 26th ACM International Conference on Supercomputing (ICS 2012), San Servolo Island, Venice, Italy, ACM, June 2012.
Song, F., S. Moore, and J. Dongarra, “Analytical Modeling and Optimization for Affinity Based Thread Scheduling on Multicore Systems,” IEEE Cluster 2009, New Orleans, August 2009.
Song, F., A. YarKhan, and J. Dongarra, “Dynamic Task Scheduling for Linear Algebra Algorithms on Distributed-Memory Multicore Systems,” International Conference for High Performance Computing, Networking, Storage, and Analysis (SC '09), Portland, OR, November 2009.
Song, F., S. Moore, and J. Dongarra, “Modeling of L2 Cache Behavior for Thread-Parallel Scientific Programs on Chip Multi-Processors,” University of Tennessee Computer Science Technical Report, no. UT-CS-06-583, January 2006.
Sourbier, F., A. Haidar, L. Giraud, H. Ben-Hadj-Ali, S. Operto, and J. Virieux, “Three-dimensional parallel frequency-domain visco-acoustic wave modelling based on a hybrid direct/iterative solver,” Geophysical Prospecting (to appear), 2011.
Steen, A. J. van der, and J. Dongarra, “Overview of High Performance Computers,” Handbook of Massive Data Sets: Kluwer Academic Publishers, pp. 791-852, January 2001.
Strohmaier, E., H. Meuer, J. Dongarra, and H. D. Simon, “The TOP500 List and Progress in High-Performance Computing,” IEEE Computer, vol. 48, issue 11, pp. 42-49, November 2015. DOI: 10.1109/MC.2015.338
Strohmaier, E., J. Dongarra, H. Meuer, and H. D. Simon, “The Marketplace for High-Performance Computers,” Parallel Computing, vol. 25, no. 13-14, pp. 1517-1545, October 2002.
Sun, J., J. Fu, J. Drake, Q. Zhu, A. Haidar, M. Gates, S. Tomov, and J. Dongarra, “Computational Benefit of GPU Optimization for Atmospheric Chemistry Modeling,” Journal of Advances in Modeling Earth Systems, vol. 10, issue 8, pp. 1952–1969, August 2018. DOI: 10.1029/2018MS001276
Supinski, B. R. de, J. K. Hollingsworth, S. Moore, and P. H. Worley, “Results of the PERI survey of SciDAC applications,” Journal of Physics: Conference Series, SciDAC 2007, vol. 78, no. 2007, January 2007.
Supinski, B. R. de, S. Alam, D. Bailey, L. Carrington, C. Daley, A. Dubey, T. Gamblin, D. Gunter, P. D. Hovland, H. Jagode, et al., “Modeling the Office of Science Ten Year Facilities Plan: The PERI Architecture Tiger Team,” SciDAC 2009, Journal of Physics: Conference Series, vol. 180(2009)012039, San Diego, California, IOP Publishing, July 2009.
T
Tang, C., A. Bouteiller, T. Herault, M. G. Venkata, and G. Bosilca, “From MPI to OpenSHMEM: Porting LAMMPS,” OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies, Annapolis, MD, USA, Springer International Publishing, pp. 121–137, 2015. DOI: 10.1007/978-3-319-26428-8_8
Tang, Y., “Technical Comparison between several representative checkpoint/rollback solutions for MPI programs,” ICL Technical Report, no. ICL-UT-06-09, January 2006.
Tang, Y., G. Fagg, and J. Dongarra, “Proposal of MPI operation level Checkpoint/Rollback and one implementation,” Proceedings of IEEE CCGrid 2006: IEEE Computer Society, January 2006.
