viztracer
A debugging and profiling tool that can trace and visualize python code execution
Top Related Projects
🚴 Call stack profiler for Python. Shows you why your code is slow!
Sampling profiler for Python programs
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Line-by-line profiling for Python
Monitor Memory usage of Python code
Quick Overview
VizTracer is a low-overhead profiling tool for Python that can trace and visualize your Python program's execution. It generates an interactive HTML report that allows you to explore the program's performance and behavior, including function calls, time spent in each function, and system events.
Pros
- Low overhead, suitable for production environments
- Generates detailed, interactive visualizations of program execution
- Supports multi-threading and multi-processing
- Integrates with existing Python debugging and profiling tools
Cons
- May slightly impact performance when tracing is enabled
- Large trace files can be generated for long-running programs
- Learning curve for interpreting complex visualizations
- Limited support for certain Python implementations (e.g., PyPy)
Code Examples
- Basic usage:
from viztracer import VizTracer
tracer = VizTracer()
tracer.start()
# Your code here
def fibonacci(n):
if n <= 1:
return n
return fibonacci(n-1) + fibonacci(n-2)
fibonacci(10)
tracer.stop()
tracer.save() # Saves the report as 'result.html'
- Tracing only specific functions:
from viztracer import VizTracer
def trace_function(func):
def wrapper(*args, **kwargs):
tracer = VizTracer(output_file=f"{func.__name__}.html")
tracer.start()
result = func(*args, **kwargs)
tracer.stop()
tracer.save()
return result
return wrapper
@trace_function
def expensive_operation():
# Your code here
pass
expensive_operation()
- Using VizTracer as a context manager:
from viztracer import VizTracer
with VizTracer(output_file="context_trace.html"):
# Your code here
for i in range(10000):
_ = i ** 2
Getting Started
-
Install VizTracer:
pip install viztracer
-
Add tracing to your Python script:
from viztracer import VizTracer tracer = VizTracer() tracer.start() # Your code here tracer.stop() tracer.save() # Saves the report as 'result.html'
-
Run your script as usual.
-
Open the generated HTML file (e.g., 'result.html') in a web browser to view the visualization.
Competitor Comparisons
🚴 Call stack profiler for Python. Shows you why your code is slow!
Pros of Pyinstrument
- Simpler setup and usage, requiring minimal code changes
- Provides a more intuitive flame graph visualization
- Lower overhead, suitable for production environments
Cons of Pyinstrument
- Less detailed profiling information compared to VizTracer
- Limited support for asynchronous code profiling
- Fewer customization options for output and data collection
Code Comparison
Pyinstrument:
from pyinstrument import Profiler
profiler = Profiler()
profiler.start()
# Your code here
profiler.stop()
profiler.print()
VizTracer:
from viztracer import VizTracer
tracer = VizTracer()
tracer.start()
# Your code here
tracer.stop()
tracer.save()
Both tools offer straightforward integration, but Pyinstrument requires slightly less setup. VizTracer provides more advanced features and customization options, while Pyinstrument focuses on simplicity and ease of use. The choice between the two depends on the specific profiling needs and the complexity of the project being analyzed.
Sampling profiler for Python programs
Pros of py-spy
- Low-overhead sampling profiler, minimal impact on program performance
- Can profile running Python processes without code modifications
- Supports profiling multi-threaded Python programs
Cons of py-spy
- Limited visualization options compared to VizTracer
- Doesn't provide as detailed function-level tracing
- May require root access for some features on certain systems
Code Comparison
VizTracer:
from viztracer import VizTracer
tracer = VizTracer()
tracer.start()
# Your code here
tracer.stop()
tracer.save("result.json")
py-spy:
# No code changes required
# Run from command line:
# py-spy record -o profile.svg -- python your_script.py
VizTracer requires code modification to instrument the program, while py-spy can profile existing processes without changes. VizTracer offers more detailed tracing and visualization options, but py-spy provides a simpler, lower-overhead sampling approach. VizTracer is better suited for in-depth analysis of specific code sections, while py-spy excels at getting quick performance overviews of entire programs or live processes.
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Pros of Scalene
- Provides detailed memory profiling, including per-line memory usage
- Offers CPU and GPU profiling capabilities
- Generates interactive HTML reports for easy analysis
Cons of Scalene
- May have higher overhead compared to VizTracer for certain workloads
- Requires Python 3.6+ and only works on Unix-like systems (Linux, macOS)
Code Comparison
VizTracer:
from viztracer import VizTracer
tracer = VizTracer()
tracer.start()
# Your code here
tracer.stop()
tracer.save("result.json")
Scalene:
# No code changes required
# Run your script with:
# python -m scalene your_script.py
Summary
Scalene focuses on comprehensive profiling, including memory usage and GPU profiling, making it suitable for complex applications where resource usage is critical. It generates detailed reports but may have higher overhead.
VizTracer emphasizes low-overhead tracing and visualization, making it ideal for performance analysis of large-scale applications. It requires minimal code changes but may not provide as detailed memory profiling as Scalene.
Choose Scalene for in-depth resource profiling, especially for memory-intensive applications. Opt for VizTracer when you need a lightweight solution for tracing and visualizing program execution.
Line-by-line profiling for Python
Pros of line_profiler
- Provides detailed line-by-line profiling information
- Lightweight and focused on a specific task
- Easy to integrate with existing Python projects
Cons of line_profiler
- Limited to line-level profiling, lacking broader performance insights
- Requires manual decoration of functions to profile
- Less visual representation of profiling data
Code Comparison
line_profiler:
@profile
def my_function():
# Function code here
pass
# Run the profiler
lprofiler = LineProfiler()
lprofiler.add_function(my_function)
lprofiler.run('my_function()')
VizTracer:
from viztracer import VizTracer
tracer = VizTracer()
tracer.start()
# Code to profile
tracer.stop()
tracer.save()
VizTracer offers a more comprehensive approach to profiling, capturing function calls, arguments, and return values. It provides a visual timeline of program execution, making it easier to identify performance bottlenecks. However, line_profiler excels at providing detailed line-by-line timing information for specific functions, which can be crucial for optimizing critical code sections.
While line_profiler requires explicit function decoration, VizTracer can profile entire programs with minimal code changes. This makes VizTracer more suitable for large-scale profiling, while line_profiler is ideal for targeted optimization of specific functions.
Monitor Memory usage of Python code
Pros of memory_profiler
- Focused specifically on memory usage profiling
- Provides line-by-line memory consumption analysis
- Can be used as a command-line tool or as a Python module
Cons of memory_profiler
- Limited to memory profiling, doesn't provide comprehensive performance analysis
- May have higher overhead compared to VizTracer for large-scale applications
- Lacks advanced visualization features for complex program flows
Code Comparison
memory_profiler:
from memory_profiler import profile
@profile
def my_func():
a = [1] * (10 ** 6)
b = [2] * (2 * 10 ** 7)
del b
return a
VizTracer:
from viztracer import VizTracer
tracer = VizTracer()
tracer.start()
# Your code here
tracer.stop()
tracer.save()
memory_profiler is more straightforward for memory-specific profiling, while VizTracer offers a broader range of profiling capabilities with its tracing approach. VizTracer provides more comprehensive performance insights but may require more setup for memory-specific analysis.
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual CopilotREADME
VizTracer
VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.
The front-end UI is powered by Perfetto. Use "AWSD" to zoom/navigate. More help can be found in "Support - Controls".
Highlights
- Detailed function entry/exit information on timeline with source code
- Super easy to use, no source code change for most features, no package dependency
- Low overhead, probably the fastest tracer in the market
- Supports threading, multiprocessing, subprocess, async and PyTorch
- Powerful front-end, able to render GB-level trace smoothly
- Works on Linux/MacOS/Windows
Install
The preferred way to install VizTracer is via pip
pip install viztracer
Basic Usage
Command Line
# Instead of "python3 my_script.py arg1 arg2"
viztracer my_script.py arg1 arg2
A result.json
file will be generated, which you can open with vizviewer
# You can display all the files in a directory and open them in browser too
vizviewer ./
# For very large trace files, try external trace processor
vizviewer --use_external_processor result.json
vizviewer will host an HTTP server on http://localhost:9001
. You can also open your browser and use that address.
If you do not want vizviewer to open the webbrowser automatically, you can use
vizviewer --server_only result.json
If you just need to bring up the trace report once, and do not want the persistent server, use
vizviewer --once result.json
vizviewer result.json
A VS Code Extension is available to make your life even easier.
Add --open
to open the reports right after tracing
viztracer --open my_script.py arg1 arg2
viztracer -o result.html --open my_script.py arg1 arg2
modules and console scripts(like flask
) are supported as well
viztracer -m your_module
viztracer flask run
Inline
You can also manually start/stop VizTracer in your script as well.
from viztracer import VizTracer
tracer = VizTracer()
tracer.start()
# Something happens here
tracer.stop()
tracer.save() # also takes output_file as an optional argument
Or, you can do it with with
statement
with VizTracer(output_file="optional.json") as tracer:
# Something happens here
Jupyter
If you are using Jupyter, you can use viztracer cell magics.
# You need to load the extension first
%load_ext viztracer
%%viztracer
# Your code after
A VizTracer Report
button will appear after the cell and you can click it to view the results
PyTorch
VizTracer can log native calls and GPU events of PyTorch (based on torch.profiler
) with
--log_torch
.
with VizTracer(log_torch=True) as tracer:
# Your torch code
viztracer --log_torch your_model.py
Advanced Usage
Trace Filter
VizTracer can filter out the data you don't want to reduce overhead and keep info of a longer time period before you dump the log.
Extra Logs without Code Change
VizTracer can log extra information without changing your source code
- Any Variable/Attribute with RegEx
- Function Entry
- Variables in Specified Function
- Garbage Collector Operation
- Function Input Arguments
- Function Return Value
- Audit Events
- Raised Exceptions
Add Custom Event
VizTracer supports inserting custom events while the program is running. This works like a print debug, but you can know when this print happens while looking at trace data.
Misc
Multi Thread Support
For Python3.12+, VizTracer supports Python-level multi-thread tracing without the need to do any modification to your code.
For versions before 3.12, VizTracer supports python native threading
module. Just start VizTracer
before you create threads and it will just work.
For other multi-thread scenarios, you can use enable_thread_tracing()
to notice VizTracer about the thread to trace it.
Refer to multi thread docs for details
Multi Process Support
VizTracer supports subprocess
, multiprocessing
, os.fork()
, concurrent.futures
, and loky
out of the box.
For more general multi-process cases, VizTracer can support with some extra steps.
Refer to multi process docs for details
Async Support
VizTracer supports asyncio
natively, but could enhance the report by using --log_async
.
Refer to async docs for details
Flamegraph
Perfetto supports native flamegraph, just select slices on the UI and choose "Slice Flamegraph".
Remote attach
VizTracer supports remote attach to an arbitrary Python process to trace it, as long as viztracer is importable
Refer to remote attach docs
JSON alternative
VizTracer needs to dump the internal data to json format. It is recommended for the users to install orjson
, which is much faster than the builtin json
library. VizTracer will try to import orjson
and fall back to the builtin json
library if orjson
does not exist.
Performance
VizTracer puts in a lot of effort to achieve low overhead. The actual performance impact largely depends on your application. For typical codebases, the overhead is expected to be below 1x. If your code has infrequent function calls, the overhead could be minimal.
Detailed explanation
The overhead introduced by VizTracer is basically a fixed amount of time during function entry and exit, so the more time spent on
function entries and exits, the more overhead will be observed. A pure recursive fib
function could suffer 3x-4x overhead
on Python3.11+ (when the Python call is optimized, before that Python call was slower so the overhead ratio would be less).
In the real life scenario, your code should not spend too much time on function calls (they don't really do anything useful), so the overhead would be much smaller.
Many techniques are applied to minimize the overall overhead during code execution to reduce the inevitable skew introduced by
VizTracer (the report saving part is not as critical). For example, VizTracer tries to use the CPU timestamp counter instead of
a syscall to get the time when available. On Python 3.12+, VizTracer uses sys.monitoring
which has less overhead than
sys.setprofile
. All of the efforts made it observably faster than cProfile
, the Python stdlib profiler.
However, VizTracer is a tracer, which means it has to record every single function entry and exit, so it can't be as fast as the sampling profilers - they are not the same thing. With the extra overhead, VizTracer provides a lot more information than normal sampling profilers.
Documentation
For full documentation, please see https://viztracer.readthedocs.io/en/stable
Bugs/Requests
Please send bug reports and feature requests through github issue tracker. VizTracer is currently under development now and it's open to any constructive suggestions.
License
Copyright 2020-2024 Tian Gao.
Distributed under the terms of the Apache 2.0 license.
Top Related Projects
🚴 Call stack profiler for Python. Shows you why your code is slow!
Sampling profiler for Python programs
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Line-by-line profiling for Python
Monitor Memory usage of Python code
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual Copilot