2023-09-27T11:51:04Zhttps://ipsj.ixsq.nii.ac.jp/ej/?action=repository_oaipmhoai:ipsj.ixsq.nii.ac.jp:000751552023-04-27T10:00:04Z01164:02735:06337:06472
Implementation and performance evaluation of new inverse iteration algorithm with Householder transformation in terms of the compact WY representationImplementation and performance evaluation of new inverse iteration algorithm with Householder transformation in terms of the compact WY representationenghttp://id.nii.ac.jp/1001/00075155/Technical Reporthttps://ipsj.ixsq.nii.ac.jp/ej/?action=repository_action_common_download&item_id=75155&item_no=1&attribute_id=1&file_no=1Copyright (c) 2011 by the Information Processing Society of JapanGraduate school of Informatics, Kyoto UniversityGraduate school of Informatics, Kyoto UniversityGraduate school of Informatics, Kyoto UniversityHiroyuki, IshigamiKinji, KimuraYoshimasa, NakamuraA new inverse iteration algorithm that can be used to compute all the eigenvectors of a real symmetric tridiagonal matrix on parallel computers is developed. In the classical inverse iteration, the modified Gram-Schmidt orthogonalization is used, and this causes a bottleneck in parallel computing. In this paper, the use of the compact WY representation is proposed in the orthogonalization process of the inverse iteration with the Householder transformation. This change results in drastically reduced synchronization cost in parallel computing. The new algorithm is evaluated on a 32-core parallel computer, and it is shown that the algorithm is up to 7.46 times faster than the classical algorithm in computing all the eigenvectors of matrices with several thousand dimensions.A new inverse iteration algorithm that can be used to compute all the eigenvectors of a real symmetric tridiagonal matrix on parallel computers is developed. In the classical inverse iteration, the modified Gram-Schmidt orthogonalization is used, and this causes a bottleneck in parallel computing. In this paper, the use of the compact WY representation is proposed in the orthogonalization process of the inverse iteration with the Householder transformation. This change results in drastically reduced synchronization cost in parallel computing. The new algorithm is evaluated on a 32-core parallel computer, and it is shown that the algorithm is up to 7.46 times faster than the classical algorithm in computing all the eigenvectors of matrices with several thousand dimensions.AN10505667研究報告数理モデル化と問題解決（MPS）2011-MPS-848162011-07-112011-07-07