000 03402cam a2200457Ii 4500
001 u15142
003 SA-PMU
005 20210418124625.0
008 160905t20172017mauad b 001 0 eng d
040 _aYDX
_beng
_erda
_cYDX
_dOLC
_dOCLCO
_dOCLCF
_dTXI
_dBTCTA
_dWTU
_dIUL
_dMFS
019 _a957680278
_a964801939
_a972618997
_a972780043
_a972919162
_a973148803
_a978721114
_a978992806
_a979387894
020 _a0128119861
_qpaperback
020 _a9780128119860
_qpaperback
035 _a(OCoLC)978448582
_z(OCoLC)957680278
_z(OCoLC)964801939
_z(OCoLC)972618997
_z(OCoLC)972780043
_z(OCoLC)972919162
_z(OCoLC)973148803
_z(OCoLC)978721114
_z(OCoLC)978992806
_z(OCoLC)979387894
050 4 _aQA76.642
_b.K57 2017
082 0 4 _a004/.35
_223
100 1 _aKirk, David,
_d1960-
_eauthor.
245 1 0 _aProgramming massively parallel processors :
_ba hands-on approach /
_cDavid B. Kirk, Wen-mei W. Hwu.
250 _aThird edition.
264 1 _aCambridge, MA, United States :
_bMorgan Kaufmann,
_c[2017]
264 4 _c©2017
300 _axxii, 550 pages :
_billustrations, charts ;
_c24 cm.
336 _atext
_btxt
_2rdacontent
337 _aunmediated
_bn
_2rdamedia
338 _avolume
_bnc
_2rdacarrier
500 _aPrevious edition: 2013.
504 _aIncludes bibliographical references and index.
505 0 _aIntroduction -- Data parallel computing -- Scalable parallel execution -- Memory and data locality - Performance considerations -- Numerical considerations -- Parallel patterns: convolution -- Parallel patterns: prefix sum -- Parallel patterns: parallel histogram computation -- Parallel patterns: sparse matrix computation -- Parallel patterns: merge sort -- Parallel patterns: graph search -- CUDA dynamic parallelism -- Application case study: non-cartesian magnetic resonance imaging -- Application case study: molecular visualization and analysis -- Application case study: machine learning -- Parallel programming and computational thinking -- Programming a heterogeneous computing cluster -- Parallel programming with OpenACC -- More on CUDA and graphics processing unit computing -- Conclusion and outlook.
520 _aThis book shows both student and professional alike the basic concepts of parallel programming and GPU architecture, exploring, in detail, various techniques for constructing parallel programs. Case studies demonstrate the development process, detailing computational thinking and ending with effective and efficient parallel programs. Topics of performance, floating-point format, parallel patterns, and dynamic parallelism are covered in-depth. This edition contains updated coverage of CUDA, including coverage of newer libraries, such as CuDNN, moved content that has become less important to appendices, added two new chapters on parallel patterns, and updated case studies to reflect current industry practices.
650 0 _aParallel programming (Computer science)
650 0 _aParallel processing (Electronic computers)
650 0 _aMultiprocessors.
650 0 _aComputer architecture.
650 7 _aComputer architecture.
_2fast
_0(OCoLC)fst00872026
650 7 _aMultiprocessors.
_2fast
_0(OCoLC)fst01029068
650 7 _aParallel processing (Electronic computers)
_2fast
_0(OCoLC)fst01052928
650 7 _aParallel programming (Computer science)
_2fast
_0(OCoLC)fst01052939
700 1 _aHwu, Wen-mei,
_eauthor.
596 _a1
942 _cBOOK
999 _c8854
_d8854