Gprof Profiling via Binary Instrumentation
Results
Figures 2 and 3 show that, for the micro benchmark, our tool reproduces the -pg
results exactly: the call graphs match, and the call percentages are identical.

The second application compared was the LINPACK benchmark, introduced by Jack
Dongarra as a floating-point performance benchmark. It is far more complex than
our micro benchmark in that it actually does work. It is still not the most
complex benchmark we used, because it was written for far slower computers and
only runs for less [...]

Figures 4 and 5 show that, for the LINPACK benchmark, our tool performs well,
though the percentages are not a perfect match. The call graphs from the -pg
version and our tool match, but the call percentages differ slightly. We
attribute this difference to machine load and to the difference in cost between
the original mcount and our modified mcount.

The third application compared was llloop, the most complex benchmark used. It
does real work, makes many function calls, and runs for a long time, making it
the most strenuous test for our tool.
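
The comparison applied to each benchmark in this section has two parts: first
check that both profiles contain the same call graph edges, then measure how far
the per-edge percentages drift apart. A minimal sketch of that check follows. It
assumes each profile has already been reduced to a mapping from (caller, callee)
to a percentage; the function name `compare_profiles` and all numbers are
hypothetical and are not taken from our tool or from the measured results.

```python
def compare_profiles(reference, candidate):
    """Compare two profiles given as {(caller, callee): percent} mappings.

    Returns (same_graph, max_diff): whether both profiles contain exactly the
    same call edges, and the largest absolute percentage difference over the
    edges they share.
    """
    ref_edges = set(reference)
    cand_edges = set(candidate)
    same_graph = ref_edges == cand_edges

    # Only edges present in both profiles have a percentage to compare.
    shared = ref_edges & cand_edges
    max_diff = max((abs(reference[e] - candidate[e]) for e in shared), default=0.0)
    return same_graph, max_diff


# Hypothetical values for illustration only; not measurements from this paper.
pg_profile = {("main", "work"): 75.0, ("main", "setup"): 25.0}
tool_profile = {("main", "work"): 73.5, ("main", "setup"): 26.5}

same_graph, max_diff = compare_profiles(pg_profile, tool_profile)
print(same_graph, max_diff)
```

Under this sketch, the micro benchmark corresponds to matching edges with a
maximum difference of zero, and LINPACK to matching edges with a small nonzero
maximum difference.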