Software profiling allows you to inspect the performance of a piece of software, seeing where the bottlenecks are and how they can be improved. gperftools is a software profiling package. It contains a CPU profiler, thread-caching malloc library, memory leak detection tool, memory allocation profiler and pprof (discussed later). More information about gperftools and its components can be found on the project’s GitHub and documentation page.
It is strongly recommended to install libunwind before trying to configure or install gperftools.
If you’re using Ubuntu you can use the following command to install gperftools:
apt install google-perftools libgoogle-perftools-dev
Alternatively, you can use the following commands to install it:
$ git clone https://github.com/gperftools/gperftools
$ cd gperftools
$ git tag -l # checkout the latest release version
$ git checkout <gperftools-X.x>
$ ./autogen.sh
$ ./configure --prefix=<your/desired/install/dir>
$ make
$ make install
This will install gperftools in your/desired/install/dir/lib, which is the path you should use when specifying where gperftools is, if necessary.
If you install gperftools to a directory that is not one of the standard library directories, and that therefore cannot be found by the -l compiler flag, you will need to specify its path with the GPERF_INSTALL_DIR option at build time.
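Before passing a directory to GPERF_INSTALL_DIR, it can help to confirm that the libraries are actually there. The following is a minimal sketch and not part of gperftools: the function name check_gperf_libs and the default directory /opt/gperftools/lib are hypothetical, and the library file names are the ones used with LD_PRELOAD further down this page.

```shell
# Hypothetical default; replace with <your/desired/install/dir>/lib.
GPERF_LIB_DIR="${GPERF_LIB_DIR:-/opt/gperftools/lib}"

# Report which gperftools libraries exist in the given directory.
check_gperf_libs() {
    lib_dir="$1"
    for lib in libprofiler.so libtcmalloc.so; do
        if [ -e "$lib_dir/$lib" ]; then
            echo "found: $lib_dir/$lib"
        else
            echo "missing: $lib_dir/$lib"
        fi
    done
}

check_gperf_libs "$GPERF_LIB_DIR"
```

If either library is reported as missing, check the --prefix that was given to ./configure.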
Further information about installing gperftools is here.
pprof is a tool for visualising and analysing profiling data. An older version of pprof comes bundled with gperftools, but the newer Go version has several benefits, among them nicer-looking graphs and additional options that make it easier to look through specific sections of a program.
Link gperftools Libraries When Compiling
The library needed to run the CPU profiler should be linked into the ACTS project using the following build option:
Similarly, to enable the memory profiler the following build option should be used:
Alternative to Recompiling
Alternatively, you can avoid rebuilding the project by pointing the LD_PRELOAD environment variable to the profiler library for CPU profiling:
LD_PRELOAD="<path/to/libprofiler.so>" <other_options> <path/to/binary> <binary_flags>
You can do the same thing with the tcmalloc library for memory profiling:
LD_PRELOAD="<path/to/libtcmalloc.so>" <other_options> <path/to/binary> <binary_flags>
The LD_PRELOAD method is not recommended by the developers of gperftools, so using the build options is preferable. Both CPU and memory profiling can be enabled at the same time, but note that turning on memory profiling (or the heap checker) will affect performance.
Specify multiple libraries to load with LD_PRELOAD using a space-separated list.
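As a minimal sketch of the space-separated form, the snippet below builds an LD_PRELOAD value from both libraries. The paths under /usr/lib and the names PROFILER_LIB, TCMALLOC_LIB, PRELOAD_LIST and run_preloaded are assumptions for illustration; substitute the locations from your own installation.

```shell
# Hypothetical library locations; substitute the paths from your installation.
PROFILER_LIB="${PROFILER_LIB:-/usr/lib/libprofiler.so}"
TCMALLOC_LIB="${TCMALLOC_LIB:-/usr/lib/libtcmalloc.so}"

# Join the two paths with a single space to form the LD_PRELOAD value.
PRELOAD_LIST="$PROFILER_LIB $TCMALLOC_LIB"

# Run any command with both libraries preloaded, e.g.
#   run_preloaded <path/to/binary> <binary_flags>
run_preloaded() {
    env LD_PRELOAD="$PRELOAD_LIST" "$@"
}

# Show the value that run_preloaded passes to the dynamic linker.
echo "LD_PRELOAD=$PRELOAD_LIST"
```

The value is quoted where it is assigned so that the shell keeps both paths in one variable rather than treating the second path as a command.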
Note that these steps don’t turn on profiling, they only enable it to work. The following section details how to turn it on.
Produce a CPU Profile
To turn on CPU profiling when running an executable, define the CPUPROFILE environment variable when executing the program:
CPUPROFILE=<path/to/profile> <path/to/binary> [binary args]
This variable specifies where the profile will be written. There are additional environment variables that modify the behaviour of the profiler; these are described in the gperftools CPU profiler documentation.
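One of those additional variables is CPUPROFILE_FREQUENCY, which sets the number of samples taken per second (gperftools defaults to 100). The sketch below wraps both variables in a helper; the name profile_cpu and the path /tmp/example.prof are hypothetical, and a stand-in command is used so that the snippet runs without a profiled binary.

```shell
# Run a command with the CPU profiler switched on.
#   $1 = profile output path, $2 = samples per second, rest = command to run.
# A profile is only written if the command has libprofiler linked or preloaded.
profile_cpu() {
    profile_path="$1"
    frequency="$2"
    shift 2
    CPUPROFILE="$profile_path" CPUPROFILE_FREQUENCY="$frequency" "$@"
}

# Stand-in command that echoes the variables it receives; replace it with
# <path/to/binary> [binary args] for a real run.
profile_cpu /tmp/example.prof 250 sh -c 'echo "$CPUPROFILE $CPUPROFILE_FREQUENCY"'
# prints: /tmp/example.prof 250
```

Setting the variables as a prefix to the command keeps them out of the rest of the shell session, so later commands are not profiled by accident.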
Produce a Memory Profile
To turn on memory profiling use the following command:
HEAPPROFILE=<path/to/profile> <path/to/binary> [binary args]
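With gperftools, the value of HEAPPROFILE acts as a file name prefix: heap dumps are written as <prefix>.0001.heap, <prefix>.0002.heap and so on as the program runs. The following sketch finds the most recent dump for a prefix; the helper name latest_heap_dump and the prefix /tmp/example.hprof are hypothetical.

```shell
# Print the newest heap dump written for a HEAPPROFILE prefix, or nothing if
# no dump exists yet.
latest_heap_dump() {
    latest=""
    # Dump numbers are zero-padded, so glob order matches dump order.
    for dump in "$1".*.heap; do
        # An unmatched glob is left as-is, so skip names that do not exist.
        if [ -e "$dump" ]; then
            latest="$dump"
        fi
    done
    if [ -n "$latest" ]; then
        echo "$latest"
    fi
}

# Hypothetical prefix; use the value you passed as HEAPPROFILE.
latest_heap_dump "${HEAP_PREFIX:-/tmp/example.hprof}"
```

The file that this prints is the one to pass to pprof as the profile.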
Run the Heap Checker
To check for memory leaks with the heap checker, run the following command:
PPROF_PATH=<path/to/pprof> HEAPCHECK=normal <path/to/binary> [binary args]
The CPU profiler, memory profiler and heap checker can be used in tandem.
View Profile as a Graph
A graphical representation of a profile can be produced using:
pprof -pdf <path/to/binary> <path/to/profile> > <path/to/pdf>
path/to/binary is the binary that was used to produce the profile in the first place.
Other output formats are available.
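As a sketch of how other formats can be requested, the helper below passes a format flag to pprof in the same way as -pdf above. It assumes the Go version of pprof, which accepts flags such as -svg, -png, -dot and -text; the graphical formats also need Graphviz installed. The names PPROF and render_profile are hypothetical.

```shell
# pprof executable to use; override PPROF if it is not on your PATH.
PPROF="${PPROF:-pprof}"

# Render a profile in the given format and write it to a file.
#   $1 = format flag without the dash (pdf, svg, png, dot, text)
#   $2 = binary, $3 = profile, $4 = output file
render_profile() {
    "$PPROF" "-$1" "$2" "$3" > "$4"
}

# Example calls (placeholders as elsewhere on this page):
#   render_profile svg  <path/to/binary> <path/to/profile> profile.svg
#   render_profile text <path/to/binary> <path/to/profile> profile.txt
```

The text format is a quick way to inspect a profile on a machine without a display.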
The following opens the graph in your web browser:
pprof -web <path/to/binary> <path/to/profile>
To launch pprof in interactive mode use the following command:
pprof <path/to/binary> <path/to/profile>
The following command will display the top x entries by the current sorting criteria:
To view the statistics of a function line by line use:
Various options can be specified to filter, sort and set the granularity of entries. There are also a number of other commands available. Read more about pprof here.
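The top and line-by-line views can also be produced without entering interactive mode, which is convenient in scripts. The sketch below assumes the Go version of pprof and its -top, -nodecount and -list flags; the bundled gperftools version may spell these differently. The names PPROF, pprof_top and pprof_list are hypothetical.

```shell
# pprof executable to use; override PPROF if it is not on your PATH.
PPROF="${PPROF:-pprof}"

# Print the top N entries of a profile as text.
#   $1 = number of entries, $2 = binary, $3 = profile
pprof_top() {
    "$PPROF" -top -nodecount="$1" "$2" "$3"
}

# Print line-by-line statistics for functions matching a regular expression.
#   $1 = function name regex, $2 = binary, $3 = profile
pprof_list() {
    "$PPROF" -list="$1" "$2" "$3"
}

# Example calls (placeholders as elsewhere on this page):
#   pprof_top 10 <path/to/binary> <path/to/profile>
#   pprof_list '<function name>' <path/to/binary> <path/to/profile>
```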