Roofline Performance Model Analysis
Books and Textbooks
Hennessy, J. L., & Patterson, D. A. (2019). Computer Architecture: A Quantitative Approach (6th ed.). Morgan Kaufmann.
- Chapter 4: Data-Level Parallelism in Vector, SIMD, and GPU Architectures
- Appendix B: Review of Memory Hierarchy
Williams, S., Waterman, A., & Patterson, D. (2009). Roofline: An insightful visual performance model for multicore architectures. Communications of the ACM, 52(4), 65–76.