Publications

Export 66 results:
Filters: Author is Julien Langou  [Clear All Filters]
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 
T
Canning, A., J. Dongarra, J. Langou, O. Marques, S. Tomov, C. Voemel, and L-W. Wang, Towards bulk based preconditioning for quantum dot computations,” IEEE/ACM Proceedings of HPCNano SC06 (to appear), January 2006.  (172.46 KB)
S
Bosilca, G., Z. Chen, J. Dongarra, V. Eijkhout, G. Fagg, E. Fuentes, J. Langou, P. Luszczek, J. Pjesivac–Grbovic, K. Seymour, et al., Self Adapting Numerical Software SANS Effort,” IBM Journal of Research and Development, vol. 50, no. 2/3, pp. 223-238, January 2006.  (357.53 KB)
R
Giraud, L., J. Langou, M. Rozložník, and J. van den Eshof, Rounding Error Analysis of the Classical Gram-Schmidt Orthogonalization Process,” Numerische Mathematik, vol. 101, no. 1, pp. 87-100, January 2005.  (157.48 KB)
Gustavson, F. G., J. Wasniewski, J. Dongarra, and J. Langou, Rectangular Full Packed Format for Cholesky's Algorithm: Factorization, Solution and Inversion,” ACM Transactions on Mathematical Software (TOMS), vol. 37, no. 2, April 2010.  (896.03 KB)
Gustavson, F. G., J. Wasniewski, J. Dongarra, and J. Langou, Rectangular Full Packed Format for Cholesky's Algorithm: Factorization, Solution and Inversion,” ACM TOMS (to appear), 00 2009.  (896.03 KB)
Gustavson, F. G., J. Wasniewski, J. Dongarra, and J. Langou, Rectangular Full Packed Format for Cholesky’s Algorithm: Factorization, Solution, and Inversion,” ACM Transactions on Mathematical Software (TOMS), vol. 37, no. 2, Atlanta, GA, April 2010.  (896.03 KB)
Langou, J., Z. Chen, G. Bosilca, and J. Dongarra, Recovery Patterns for Iterative Methods in a Parallel Unstable Environment,” SIAM SISC (to appear), May 2007.  (241.36 KB)
Bosilca, G., Z. Chen, J. Dongarra, and J. Langou, Recovery Patterns for Iterative Methods in a Parallel Unstable Environment,” University of Tennessee Computer Science Department Technical Report, UT-CS-04-538, 00 2005.  (241.36 KB)
Bosilca, G., Z. Chen, J. Dongarra, and J. Langou, Recovery Patterns for Iterative Methods in a Parallel Unstable Environment,” ICL Technical Report, no. ICL-UT-04-04, January 2004.  (241.36 KB)
Q
Agullo, E., C. Coti, J. Dongarra, T. Herault, and J. Langou, QR Factorization of Tall and Skinny Matrices in a Grid Computing Environment,” 24th IEEE International Parallel and Distributed Processing Symposium (also LAWN 224), Atlanta, GA, April 2010.  (261.55 KB)
Agullo, E., C. Coti, T. Herault, J. Langou, S. Peyronnet, A.. Rezmerita, F. Cappello, and J. Dongarra, QCG-OMPI: MPI Applications on Grids.,” Future Generation Computer Systems, vol. 27, no. 4, pp. 435-369, January 2011.  (1.48 MB)
Agullo, E., C. Coti, T. Herault, J. Langou, S. Peyronnet, A.. Rezmerita, F. Cappello, and J. Dongarra, QCG-OMPI: MPI Applications on Grids,” Future Generation Computer Systems, vol. 27, no. 4, pp. 357-369, March 2010.  (1.48 MB)
P
Demmel, J., J. Dongarra, J. Langou, J. Langou, P. Luszczek, and M. Mahoney, Prospectus for the Next LAPACK and ScaLAPACK Libraries: Basic ALgebra LIbraries for Sustainable Technology with Interdisciplinary Collaboration (BALLISTIC),” LAPACK Working Notes, no. 297, ICL-UT-20-07: University of Tennessee.  (1.41 MB)
Demmel, J., J. Dongarra, B.. Parlett, W. Kahan, M. Gu, D. Bindel, Y. Hida, X. Li, O. Marques, J. E. Riedy, et al., Prospectus for the Next LAPACK and ScaLAPACK Libraries,” PARA 2006, Umea, Sweden, June 2006.  (460.11 KB)
Demmel, J., J. Dongarra, B.. Parlett, W. Kahan, M. Gu, D. Bindel, Y. Hida, X. Li, O. Marques, J. E. Riedy, et al., Prospectus for the Next LAPACK and ScaLAPACK Libraries,” PARA 2006, Umea, Sweden, June 2006.  (460.11 KB)
Langou, J., and J. Dongarra, The Problem with the Linpack Benchmark Matrix Generator,” International Journal of High Performance Computing Applications, vol. 23, no. 1, pp. 5-14, 00 2009.  (136.41 KB)
Dongarra, J., and J. Langou, The Problem with the Linpack Benchmark Matrix Generator,” University of Tennessee Computer Science Technical Report, UT-CS-08-621 (also LAPACK Working Note 206), June 2008.  (136.41 KB)
Zunger, A., A. Franceschetti, G. Bester, W. B. Jones, K. Kim, P. A. Graf, L-W. Wang, A. Canning, O. Marques, C. Voemel, et al., Predicting the electronic properties of 3D, million-atom semiconductor nanostructure architectures,” J. Phys.: Conf. Ser. 46, vol. :101088/1742-6596/46/1/040, pp. 292-298, January 2006.  (644.1 KB)
Buttari, A., V. Eijkhout, J. Langou, and S. Filippone, Performance Optimization and Modeling of Blocked Sparse Kernels,” ICL Technical Report, no. ICL-UT-04-05, 00 2004.  (229.58 KB)
Canning, A., J. Dongarra, J. Langou, O. Marques, S. Tomov, C. Voemel, and L-W. Wang, Performance evaluation of eigensolvers in nano-structure computations,” IEEE/ACM Proceedings of HPCNano SC06 (to appear), January 2006.  (120.61 KB)
Buttari, A., J. Langou, J. Kurzak, and J. Dongarra, Parallel Tiled QR Factorization for Multicore Architectures,” Concurrency and Computation: Practice and Experience, vol. 20, pp. 1573-1590, January 2008.  (277.92 KB)
Buttari, A., J. Langou, J. Kurzak, and J. Dongarra, Parallel Tiled QR Factorization for Multicore Architectures,” University of Tennessee Computer Science Dept. Technical Report, UT-CS-07-598 (also LAPACK Working Note 190), 00 2007.  (277.92 KB)
Giraud, L., J. Langou, and G.. Sylvand, On the Parallel Solution of Large Industrial Wave Propagation Problems,” Journal of Computational Acoustics (to appear), January 2005.  (1.08 MB)
Buttari, A., J. Dongarra, J. Kurzak, and J. Langou, Parallel Dense Linear Algebra Software in the Multicore Era,” in Cyberinfrastructure Technologies and Applications: Nova Science Publishers, Inc., pp. 9-24, 00 2009.
M
Kurzak, J., P. Luszczek, A. YarKhan, M. Faverge, J. Langou, H. Bouwmeester, and J. Dongarra, Multithreading in the PLASMA Library,” Multi and Many-Core Processing: Architecture, Programming, Algorithms, & Applications: Taylor & Francis, 00 2013.  (536.28 KB)
Faverge, M., J. Herrmann, J. Langou, B. Lowery, Y. Robert, and J. Dongarra, Mixing LU-QR Factorization Algorithms to Design High-Performance Dense Linear Algebra Solvers,” Journal of Parallel and Distributed Computing, vol. 85, pp. 32-46, November 2015.  (5.06 MB)
Buttari, A., J. Dongarra, J. Langou, J. Langou, P. Luszczek, and J. Kurzak, Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems,” International Journal of High Performance Computer Applications (to appear), August 2007.  (157.4 KB)
L
Agullo, E., C. Augonnet, J. Dongarra, M. Faverge, J. Langou, H. Ltaeif, and S. Tomov, LU Factorization for Accelerator-Based Systems,” IEEE/ACS AICCSA 2011, Sharm-El-Sheikh, Egypt, December 2011.  (234.86 KB)
Gustavson, F. G., J. Wasniewski, J. Dongarra, J. Herrero, and J. Langou, Level-3 Cholesky Factorization Routines Improve Performance of Many Cholesky Algorithms,” ACM Transactions on Mathematical Software (TOMS), vol. 39, issue 2, February 2013.  (439.46 KB)
Bai, Z., J. Demmel, J. Dongarra, J. Langou, and J. Wang, LAPACK,” Handbook of Linear Algebra, Second, Boca Raton, FL, CRC Press, 2013.  (223.21 KB)
I
Canning, A., J. Dongarra, J. Langou, O. Marques, S. Tomov, C. Voemel, and L-W. Wang, Interior State Computation of Nano Structures,” PARA 2008, 9th International Workshop on State-of-the-Art in Scientific and Parallel Computing, Trondheim, Norway, May 2008.  (137.12 KB)
Buttari, A., J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, and S. Tomov, The Impact of Multicore on Math Software,” PARA 2006, Umea, Sweden, June 2006.  (223.53 KB)
H
Langou, J., B. Hoffman, and B. King, How LAPACK library enables Microsoft Visual Studio support with CMake and LAPACKE,” University of Tennessee Computer Science Technical Report (also LAWN 270), no. UT-CS-12-698, July 2012.  (501.53 KB)
Dongarra, J., M. Faverge, T. Herault, M. Jacquelin, J. Langou, and Y. Robert, Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems,” Parallel Computing, vol. 39, issue 4-5, pp. 212-232, May 2013.  (1.43 MB)
Dongarra, J., M. Faverge, T. Herault, J. Langou, and Y. Robert, Hierarchical QR Factorization Algorithms for Multi-Core Cluster Systems,” IPDPS 2012, the 26th IEEE International Parallel and Distributed Processing Symposium, Shanghai, China, IEEE Computer Society Press, May 2012.  (405.71 KB)
Dongarra, J., M. Faverge, T. Herault, J. Langou, and Y. Robert, Hierarchical QR Factorization Algorithms for Multi-Core Cluster Systems,” University of Tennessee Computer Science Technical Report (also Lawn 257), no. UT-CS-11-684, October 2011.  (405.71 KB)
Bosilca, G., J. Dongarra, G. Fagg, and J. Langou, Hash Functions for Datatype Signatures in MPI,” Proceedings of 12th European Parallel Virtual Machine and Message Passing Interface Conference - Euro PVM/MPI, vol. 3666, Sorrento (Naples), Italy, Springer-Verlag Berlin, pp. 76-83, September 2005.  (304.2 KB)
F
Bosilca, G., A. Bouteiller, A. Danalis, M. Faverge, A. Haidar, T. Herault, J. Kurzak, J. Langou, P. Lemariner, H. Ltaeif, et al., Flexible Development of Dense Linear Algebra Algorithms on Massively Parallel Architectures with DPLASMA,” Proceedings of the Workshops of the 25th IEEE International Symposium on Parallel and Distributed Processing (IPDPS 2011 Workshops), Anchorage, Alaska, USA, IEEE, pp. 1432-1441, May 2011.  (1.26 MB)
Chen, Z., G. Fagg, E. Gabriel, J. Langou, T. Angskun, G. Bosilca, and J. Dongarra, Fault Tolerant High Performance Computing by a Coding Approach,” Proceedings of ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (to appear), Chicago, Illinois, January 2005.  (209.37 KB)
E
Langou, J., J. Langou, P. Luszczek, J. Kurzak, A. Buttari, and J. Dongarra, Exploiting the Performance of 32 bit Floating Point Arithmetic in Obtaining 64 bit Accuracy,” University of Tennessee Computer Science Tech Report, no. UT-CS-06-574, LAPACK Working Note #175, April 2006.  (221.39 KB)
Langou, J., J. Langou, P. Luszczek, J. Kurzak, A. Buttari, and J. Dongarra, Exploiting the Performance of 32 bit Floating Point Arithmetic in Obtaining 64 bit Accuracy,” University of Tennessee Computer Science Tech Report, no. UT-CS-06-574, LAPACK Working Note #175, April 2006.  (221.39 KB)
Buttari, A., J. Dongarra, J. Kurzak, J. Langou, J. Langou, P. Luszczek, and S. Tomov, Exploiting Mixed Precision Floating Point Hardware in Scientific Computations,” in High Performance Computing and Grids in Action, Amsterdam, IOS Press, January 2008.  (92.95 KB)
Buttari, A., J. Dongarra, J. Kurzak, J. Langou, J. Langou, P. Luszczek, and S. Tomov, Exploiting Mixed Precision Floating Point Hardware in Scientific Computations,” in High Performance Computing and Grids in Action, Amsterdam, IOS Press, January 2008.  (92.95 KB)
Buttari, A., J. Dongarra, J. Kurzak, J. Langou, J. Langou, P. Luszczek, and S. Tomov, Exploiting Mixed Precision Floating Point Hardware in Scientific Computations,” In High Performance Computing and Grids in Action (to appear), Amsterdam, IOS Press, 00 2007.  (122.01 KB)
D
Bosilca, G., A. Bouteiller, A. Danalis, M. Faverge, A. Haidar, T. Herault, J. Kurzak, J. Langou, P. Lemariner, H. Ltaeif, et al., Distributed-Memory Task Execution and Dependence Tracking within DAGuE and the DPLASMA Project,” Innovative Computing Laboratory Technical Report, no. ICL-UT-10-02, 00 2010.  (400.75 KB)
Bosilca, G., A. Bouteiller, A. Danalis, M. Faverge, A. Haidar, T. Herault, J. Kurzak, J. Langou, P. Lemariner, H. Ltaeif, et al., Distributed Dense Numerical Linear Algebra Algorithms on Massively Parallel Architectures: DPLASMA,” University of Tennessee Computer Science Technical Report, UT-CS-10-660, September 2010.  (366.26 KB)
Dongarra, J., Z. Chen, G. Bosilca, and J. Langou, Disaster Survival Guide in Petascale Computing: An Algorithmic Approach,” in Petascale Computing: Algorithms and Applications (to appear): Chapman & Hall - CRC Press, 00 2007.  (260.18 KB)

Pages