Skip to content

Commit

Permalink
Chapter5. Cleanup
Browse files Browse the repository at this point in the history
  • Loading branch information
dbakhval authored and dbakhval committed Sep 20, 2023
1 parent ed12926 commit 33bfd77
Show file tree
Hide file tree
Showing 2 changed files with 0 additions and 5 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -7,8 +7,3 @@
- scenario 4: there is a new shiny library that claims to be faster than the one you currently have integrated in your project; you've decided to compare their performance.
- scenario 5: you were asked to analyze performance of an unfamiliar code; you want to know how frequently a certain branch is taken and how many iterations the loop is doing.
2. Run the application that you're working with on a daily basis. Practice doing performance analysis using approaches we discussed in this chapter. Collect raw counts for various CPU performance events, find hotspots, collect roofline data, generate and study the compiler optimization report fot the hot function(s) in your program.

[TODO]: Coding exercise?
Coding exercise: write a microbenchmark that will demonstrate the effect of FMA unrolling.
At the end of this chapter we offer you a coding exercise to write a microbenchmark that will demonstrate the effect we explored in this study.
The main author of the book has done many experiments on this code with speedups ranging between 1.1x and 1.9x.
Binary file removed img/perf-analysis/FMA_tput_uica_orig.png
Binary file not shown.

0 comments on commit 33bfd77

Please sign in to comment.