MARC View

000			03402cam a2200457Ii 4500
001			u15142
003			SA-PMU
005			20210418124625.0
008			160905t20172017mauad b 001 0 eng d
040			_aYDX _beng _erda _cYDX _dOLC _dOCLCO _dOCLCF _dTXI _dBTCTA _dWTU _dIUL _dMFS
019			_a957680278 _a964801939 _a972618997 _a972780043 _a972919162 _a973148803 _a978721114 _a978992806 _a979387894
020			_a0128119861 _qpaperback
020			_a9780128119860 _qpaperback
035			_a(OCoLC)978448582 _z(OCoLC)957680278 _z(OCoLC)964801939 _z(OCoLC)972618997 _z(OCoLC)972780043 _z(OCoLC)972919162 _z(OCoLC)973148803 _z(OCoLC)978721114 _z(OCoLC)978992806 _z(OCoLC)979387894
050		4	_aQA76.642 _b.K57 2017
082	0	4	_a004/.35 _223
100	1		_aKirk, David, _d1960- _eauthor.
245	1	0	_aProgramming massively parallel processors : _ba hands-on approach / _cDavid B. Kirk, Wen-mei W. Hwu.
250			_aThird edition.
264		1	_aCambridge, MA, United States : _bMorgan Kaufmann, _c[2017]
264		4	_c©2017
300			_axxii, 550 pages : _billustrations, charts ; _c24 cm.
336			_atext _btxt _2rdacontent
337			_aunmediated _bn _2rdamedia
338			_avolume _bnc _2rdacarrier
500			_aPrevious edition: 2013.
504			_aIncludes bibliographical references and index.
505	0		_aIntroduction -- Data parallel computing -- Scalable parallel execution -- Memory and data locality -- Performance considerations -- Numerical considerations -- Parallel patterns: convolution -- Parallel patterns: prefix sum -- Parallel patterns: parallel histogram computation -- Parallel patterns: sparse matrix computation -- Parallel patterns: merge sort -- Parallel patterns: graph search -- CUDA dynamic parallelism -- Application case study: non-cartesian magnetic resonance imaging -- Application case study: molecular visualization and analysis -- Application case study: machine learning -- Parallel programming and computational thinking -- Programming a heterogeneous computing cluster -- Parallel programming with OpenACC -- More on CUDA and graphics processing unit computing -- Conclusion and outlook.
520			_aThis book shows students and professionals alike the basic concepts of parallel programming and GPU architecture, exploring, in detail, various techniques for constructing parallel programs. Case studies demonstrate the development process, detailing computational thinking and ending with effective and efficient parallel programs. Topics of performance, floating-point format, parallel patterns, and dynamic parallelism are covered in depth. This edition updates the coverage of CUDA, including newer libraries such as cuDNN; moves content that has become less important to appendices; adds two new chapters on parallel patterns; and updates the case studies to reflect current industry practices.
650		0	_aParallel programming (Computer science)
650		0	_aParallel processing (Electronic computers)
650		0	_aMultiprocessors.
650		0	_aComputer architecture.
650		7	_aComputer architecture. _2fast _0(OCoLC)fst00872026
650		7	_aMultiprocessors. _2fast _0(OCoLC)fst01029068
650		7	_aParallel processing (Electronic computers) _2fast _0(OCoLC)fst01052928
650		7	_aParallel programming (Computer science) _2fast _0(OCoLC)fst01052939
700	1		_aHwu, Wen-mei, _eauthor.
596			_a1
942			_cBOOK
999			_c8854 _d8854