Philip J. Mucci's Publications and White Papers

A more current list may be found on my ResearchGate profile.

PAPI 5: Measuring Power, Energy and the Cloud.
Vince Weaver, James Ralph, Tushar Mohan, Philip Mucci, Dan Terpstra, Heike McCraw, Matt Johnson, Kiran Kumar Kasichayanula, John Nelson, Shirley Moore
2013 IEEE International Symposium on Performance Analysis of Systems and Software
Austin, TX, April 21-23, 2013
[PDF]

PAPI-V: Performance Monitoring for Virtual Machines.
Vince Weaver, Tushar Mohan, Philip Mucci, Dan Terpstra, Heike McCraw, Matt Johnson, John Nelson, Shirley Moore
CloudTech-HPC 2012
Pittsburgh, PA, September 10-13, 2012
[PDF]

An Open Source performance tools software suite for scientific computing.
Philip J. Mucci, Tushar Mohan
Concurrency and Computation: Practice and Experience 22(2): 206-216 (2010)
[PDF]

Enabling Data Structure Oriented Performance Analysis with Hardware Performance Counter Support.
Karl Furlinger, Daniel Terpstra, Haihang You, Philip Mucci, Shirley Moore
Euro-Par Workshops 2008: 263-272
[PDF]

Benchmarking of Integrated OGSA-BES with the Grid Middleware
Fredrik Hedman, Morris Riedel, Philip Mucci, Gilbert Netzer, Ali Gholami, M. Shahbaz Memon, A. Shiraz Memon, Zeeshan Ali Shah
Euro-Par Workshops 2008: 113-122
[PDF]

A Black-Box Approach to Performance Analysis of Grid Middleware.
Per Alexius, B. Maryam Elahi, Fredrik Hedman, Philip Mucci, Gilbert Netzer, Zeeshan Ali Shah
Euro-Par Workshops 2007: 62-71
[PDF]

An Open Source Performance Tools Software Suite for Scientific Computing
Mohan, T., Mucci, P.
International Supercomputing Conference 2007, Dresden, Germany, June 2007.
[PDF]

PAPI users group
Mucci, Moore
SC 2006: 43

Analysis and Optimization of Yee_Bench using Hardware Performance Counters
Andersson, U., Mucci, P.
ParCo 2005: Parallel Computing 2005, Malaga, Spain, September, 2005.
[PDF]

PerfMiner: Cluster-Wide Collection, Storage and Presentation of Application Level Hardware Performance Data
Mucci, P., Ahlin, D., Danielsson, J., Ekman, P., Malinowski, L.
Euro-Par 2005: European Conference on Parallel Computers, Monte de Caparica, Portugal, August/September 2005.
[PDF]

Design Considerations for Shared Memory MPI Implementations on Linux NUMA Systems: An MPICH/MPICH2 Case Study
Ekman, P., Mucci, P.
AMD, July, 2005.
[PDF]

Memory Bandwidth and the Performance of Scientific Applications: A Study of the AMD Opteron Processor
Mucci, P.
AMD Technical Whitepaper, June 2004.
[PDF]

Accurate Cache and TLB Characterization Using Hardware Counters
Dongarra, J., Moore, S., Mucci, P., Seymour, K., You, H.
International Conference on Computational Science 2004, Krakow, Poland, June 2004.
[PDF]

Optimizing Cluster Applications with PAPI
Mucci, P., London, K.
ClusterWorld Magazine, May 2004.

Automating the Large-scale Collection and Analysis of Performance Data on Linux Clusters
Mucci, P., Dongarra, J., Moore, S., Song, F., Wolf, F.
Proceedings of the 5th LCI International Conference on Linux Clusters: The HPC Revolution, Austin, Texas, May 2004.
[PDF]

Performance Technologies for Peta-Scale Systems: A White Paper Prepared by the Performance Evaluation Research Center and Collaborators
D.H. Bailey, B. de Supinski, J. Dongarra, T. Dunigan, G. Gao, A. Hoisie, P. Hovland, J. Hollingsworth, D. Jefferson, C. Kamath, A. Malony, B. Norris, D. Quinlan, S. McKee, C. Mendes, S. Moore, D. Reed, A. Snavely, E. Strohmaier, J.S. Vetter, P. Worley
Prepared in Response to the Invitation to Submit White Papers to High End Computing Revitalization Task Force, 2003.
[PDF]

Production Quality Open Source Performance Tools: A White Paper Prepared by the Performance Evaluation Research Center and Collaborators
Jack Dongarra, Shirley Moore, Philip Mucci, Daniel Terpstra, Allen Malony, Sameer Shende, Jeffrey Hollingsworth, Barton Miller, Daniel Reed, Celso Mendes, Allan Snavely
Prepared in Response to the Request for Information for Open Source Software Development Acceleration
National Nuclear Security Administration, Advanced Simulation and Computing Initiative and the ASCI Pathforward Program, 2003.
[PDF]

Performance Instrumentation and Measurement for Terascale Systems
Dongarra, J., Malony, A., Moore, S., Mucci, P., Shende, S.
International Conference on Computational Science 2003, Melbourne, Australia, June 2003.
[PDF]

Experiences and Lessons Learned with a Portable Interface to Hardware Performance Counters
Dongarra, J., London, K., Moore, S., Mucci, P., Terpstra, D., You, H., Zhou, M.
IPDPS 2003, Nice, France, April 2003, and
Lecture Notes in Computer Science, Springer-Verlag, Heidelberg, Volume 2723, pp. 53-62, January 2003.
[PDF]

End-user Tools for Application Performance Analysis, Using Hardware Counters
London, K., Dongarra, J., Moore, S., Mucci, P., Seymour, K., Spencer, T.
International Conference on Parallel and Distributed Computing Systems, Dallas, TX, August 8-10, 2001.
[PDF]

Using PAPI for Hardware Performance Monitoring on Linux Systems
Dongarra, J., London, K., Moore, S., Mucci, P., Terpstra, D.
Conference on Linux Clusters: The HPC Revolution, Urbana, Illinois, June 25-27, 2001.
[PDF]

The PAPI Cross-Platform Interface to Hardware Performance Counters
London, K., Moore, S., Mucci, P., Seymour, K., Luczak, R.
Department of Defense Users' Group Conference Proceedings, Biloxi, Mississippi, June 18-21, 2001.
[PDF]

A Scalable Cross-Platform Infrastructure for Application Performance Tuning Using Hardware Counters
Browne, S., Dongarra, J., Garner, N., London, K., Mucci, P.
Proceedings of SuperComputing 2000 (SC'00), Dallas, TX, November 2000.
[PDF]

A Portable Programming Interface for Performance Evaluation on Modern Processors
Browne, S., Dongarra, J., Garner, N., Ho, G., Mucci, P.
The International Journal of High Performance Computing Applications, Volume 14, number 3, pp. 189-204, Fall 2000.
[PDF]

A Portable Programming Interface for Performance Evaluation on Modern Processors
Browne, S., Dongarra, J., Garner, N., London, K., Mucci, P.
UT Computer Science Technical Report #444, July 2000.
[PDF]

PAPI: A Portable Interface to Hardware Performance Counters
Browne, S., Deane, C., Ho, G., Mucci, P.
Proceedings of Department of Defense HPCMP Users Group Conference, June 1999.
[PDF]

Efficient Transport Independent Active Messaging Implementation for PVM
Mucci, P.
UT Computer Science Technical Report #399, August 1998.
[PDF]

Low Level Architectural Characterization Benchmarks for Parallel Computers
Mucci, P., London, K.
UT Computer Science Technical Report #394, July 1998.
[PDF]

Architectural Characterization of DoD MSRC HPC Platforms
Mucci, P., London, K.
D.o.D. HPC Users' Group Conference, July 1998.
[PDF]

The BLASBench Report
Mucci, P., London, K.
CEWES/ERDC MSRC/PET Technical Report 98-27, July 1998.
[PDF]

The MPBench Report
Mucci, P., London, K.
CEWES/ERDC MSRC/PET Technical Report 98-26, July 1998.
[PDF]

The CacheBench Report
Mucci, P., London, K.
CEWES/ERDC MSRC/PET Technical Report 98-25, July 1998.
[PDF]

Possibilities for Active Messaging in PVM
Dongarra, J., Mucci, P.
UT Computer Science Technical Report #277, February 1995.
[PDF]

A Test Suite for PVM
Do, M., Dongarra, J., Jeannot, E., Mucci, P.
UT Computer Science Technical Report #276, February 1995.
[PDF]