netdata
The fastest path to AI-powered full stack observability, even for lean teams.
Top Related Projects
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
The Prometheus monitoring system and time series database.
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
:tropical_fish: Beats - Lightweight shippers for Elasticsearch & Logstash
Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
Real-time monitoring of IT components and services, such as networks, servers, VMs, applications and the cloud.
Quick Overview
Netdata is an open-source, real-time performance and health monitoring system for systems, hardware, containers, and applications. It provides detailed insights and visualizations for various metrics, allowing users to diagnose and troubleshoot issues quickly. Netdata is designed to be lightweight, scalable, and easy to install and use.
Pros
- Extremely low resource usage, making it suitable for production environments
- Automatic configuration and real-time visualization of thousands of metrics
- Highly customizable with a wide range of plugins and integrations
- Free and open-source with an active community
Cons
- Can be overwhelming for beginners due to the vast amount of information presented
- Limited long-term data storage capabilities out of the box
- Some advanced features may require additional setup or external tools
- Documentation can be extensive and sometimes difficult to navigate
Getting Started
To install Netdata on most Linux systems, you can use the following one-line installation script:
bash <(curl -Ss https://my-netdata.io/kickstart.sh)
After installation, Netdata will automatically start collecting and visualizing data. You can access the web interface by opening a web browser and navigating to:
http://localhost:19999
To view Netdata metrics from the command line, you can use the netdatacli
tool:
sudo netdatacli ping
sudo netdatacli info
sudo netdatacli dump
For more advanced configuration and usage, refer to the official Netdata documentation.
Competitor Comparisons
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
Pros of Grafana
- More flexible and customizable dashboards
- Supports a wider range of data sources
- Better suited for complex, multi-source visualizations
Cons of Grafana
- Steeper learning curve and more complex setup
- Requires more system resources
- Less out-of-the-box functionality for real-time monitoring
Code Comparison
Netdata configuration (netdata.conf):
[global]
update every = 1
memory mode = save
history = 3600
Grafana dashboard JSON snippet:
{
"datasource": "Prometheus",
"fieldConfig": {
"defaults": {
"color": {"mode": "palette-classic"},
"custom": {"axisLabel": "", "axisPlacement": "auto"}
}
}
}
Netdata focuses on simplicity and real-time monitoring, with minimal configuration required. Grafana offers more advanced customization options but requires more setup and knowledge to utilize fully. Netdata excels in immediate, out-of-the-box functionality, while Grafana shines in creating complex, tailored dashboards across various data sources.
The Prometheus monitoring system and time series database.
Pros of Prometheus
- More flexible and scalable for large, distributed systems
- Powerful query language (PromQL) for complex data analysis
- Extensive ecosystem with many integrations and exporters
Cons of Prometheus
- Steeper learning curve and more complex setup
- Requires additional components for long-term storage
- Less out-of-the-box visualizations compared to Netdata
Code Comparison
Prometheus configuration (prometheus.yml):
global:
scrape_interval: 15s
scrape_configs:
- job_name: 'example'
static_configs:
- targets: ['localhost:9090']
Netdata configuration (netdata.conf):
[global]
update every = 1
memory mode = save
history = 3600
port = 19999
Both Prometheus and Netdata are powerful monitoring solutions, but they serve different use cases. Prometheus excels in complex, distributed environments with its flexible data model and query language. Netdata offers a more user-friendly experience with real-time monitoring and pre-configured dashboards. The choice between the two depends on the specific requirements of your infrastructure and monitoring needs.
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
Pros of Telegraf
- More flexible and extensible plugin system, supporting a wide range of input and output plugins
- Better suited for large-scale, distributed environments
- Integrates seamlessly with the InfluxDB ecosystem
Cons of Telegraf
- Steeper learning curve and more complex configuration
- Higher resource consumption, especially in large deployments
- Less out-of-the-box visualization capabilities compared to Netdata
Code Comparison
Telegraf configuration (TOML):
[[inputs.cpu]]
percpu = true
totalcpu = true
collect_cpu_time = false
report_active = false
Netdata configuration (YAML):
plugin:
proc:
/proc/stat: yes
/proc/uptime: yes
/proc/loadavg: yes
Both Netdata and Telegraf are powerful monitoring and metrics collection tools. Netdata excels in ease of use and real-time visualization, while Telegraf offers more flexibility and integration options. Netdata is ideal for quick setup and immediate insights, whereas Telegraf is better suited for complex, customized monitoring solutions in larger environments. The choice between the two depends on specific use cases, existing infrastructure, and desired level of customization.
:tropical_fish: Beats - Lightweight shippers for Elasticsearch & Logstash
Pros of Beats
- Part of the Elastic Stack ecosystem, integrating seamlessly with Elasticsearch and Kibana
- Modular architecture with specialized data shippers for different use cases (e.g., Filebeat, Metricbeat)
- Extensive documentation and enterprise support options
Cons of Beats
- Steeper learning curve, especially when configuring complex data pipelines
- Requires additional setup and infrastructure (Elasticsearch, Kibana) for full functionality
- Can be resource-intensive for large-scale deployments
Code Comparison
Netdata configuration example:
[global]
update every = 1s
memory mode = ram
Beats (Metricbeat) configuration example:
metricbeat.modules:
- module: system
metricsets: ["cpu", "memory"]
period: 10s
Key Differences
- Netdata focuses on real-time monitoring and visualization, while Beats is primarily for data shipping and ingestion
- Netdata offers a more user-friendly, out-of-the-box experience, whereas Beats provides greater flexibility and customization
- Netdata is designed for single-node monitoring, while Beats excels in distributed environments and centralized logging
Use Cases
- Choose Netdata for quick setup and real-time system monitoring on individual nodes
- Opt for Beats when building a comprehensive logging and monitoring solution within the Elastic Stack ecosystem
Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
Pros of Glances
- Lightweight and cross-platform, suitable for various operating systems
- Supports both CLI and web-based interfaces, offering flexibility in monitoring
- Extensible through plugins, allowing customization for specific monitoring needs
Cons of Glances
- Less detailed metrics and visualizations compared to Netdata's extensive offerings
- Smaller community and fewer integrations with third-party tools and services
- Limited long-term data storage and historical analysis capabilities
Code Comparison
Glances (Python):
from glances.outputs.glances_curses import GlancesCursesClient
class GlancesStandalone(GlancesCursesClient):
def __init__(self, config=None, args=None):
GlancesCursesClient.__init__(self, config=config, args=args)
Netdata (C):
int main(int argc, char **argv) {
netdata_ready = 0;
program_name = "netdata";
program_version = VERSION;
error_log_limit_unlimited();
netdata_init_debug(argc, argv);
}
Both projects use different programming languages, with Glances primarily written in Python and Netdata in C. This affects performance characteristics and deployment considerations. Glances' Python codebase may be more accessible for contributions, while Netdata's C implementation potentially offers better performance for high-frequency data collection and processing.
Real-time monitoring of IT components and services, such as networks, servers, VMs, applications and the cloud.
Pros of Zabbix
- More comprehensive enterprise-level monitoring solution
- Highly scalable for large infrastructures
- Extensive customization and templating capabilities
Cons of Zabbix
- Steeper learning curve and more complex setup
- Requires more resources to run effectively
- Less real-time data collection compared to Netdata
Code Comparison
Zabbix (agent configuration):
Server=zabbix.example.com
ServerActive=zabbix.example.com
Hostname=myhost
Netdata (health configuration):
template: disk_space_usage
on: disk.space
calc: $used * 100 / ($avail + $used)
every: 1m
warn: $this > 80
crit: $this > 90
Key Differences
- Netdata focuses on real-time monitoring with a lightweight agent, while Zabbix offers a more comprehensive monitoring and alerting platform.
- Zabbix provides more advanced features for enterprise environments, including inventory management and service-level monitoring.
- Netdata excels in ease of setup and immediate visibility, while Zabbix offers more depth in long-term data analysis and reporting.
- Zabbix uses a centralized database for data storage, whereas Netdata primarily relies on in-memory and on-disk round-robin databases for short-term data retention.
Both tools have their strengths, with Netdata being ideal for quick deployment and real-time insights, and Zabbix better suited for complex, large-scale monitoring needs.
Convert
designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual CopilotREADME
X-Ray Vision for your infrastructure!
Every Metric, Every Second. No BS.
Visit our Home Page
MENU: WHO WE ARE | KEY FEATURES | GETTING STARTED | HOW IT WORKS | FAQ | DOCS | COMMUNITY | CONTRIBUTE | LICENSE
[!WARNING] People get addicted to Netdata. Once you use it on your systems, there's no going back.
WHO WE ARE
Netdata is an open-source, real-time infrastructure monitoring platform. Monitor, detect, and act across your entire infrastructure.
Core Advantages:
- Instant Insights â With Netdata you can access per-second metrics and visualizations.
- Zero Configuration â You can deploy immediately without complex setup.
- ML-Powered â You can detect anomalies, predict issues, and automate analysis.
- Efficient â You can monitor with minimal resource usage and maximum scalability.
- Secure & Distributed â You can keep your data local with no central collection needed.
With Netdata, you get real-time, per-second updates. Clear insights at a glance, no complexity.
All heroes have a great origin story. Click to discover ours.
In 2013, at the company where Costa Tsaousis was COO, a significant percentage of their cloud-based transactions failed silently, severely impacting business performance.
Costa and his team tried every troubleshooting tool available at the time. None could identify the root cause. As Costa later wrote:
â*I couldnât believe that monitoring systems provide so few metrics and with such low resolution, scale so badly, and cost so much to run.*â
Frustrated, he decided to build his own monitoring tool, starting from scratch.
That decision led to countless late nights and weekends. It also sparked a fundamental shift in how infrastructure monitoring and troubleshooting are approached, both in method and in cost.
Most Energy-Efficient Monitoring Tool
According to the University of Amsterdam study, Netdata is the most energy-efficient tool for monitoring Docker-based systems. The study also shows Netdata excels in CPU usage, RAM usage, and execution time compared to other monitoring solutions.
Key Features
Feature | Description | What Makes It Unique |
---|---|---|
Real-Time | Per-second data collection and processing | Works in a beat â click and see results instantly |
Zero-Configuration | Automatic detection and discovery | Auto-discovers everything on the nodes it runs |
ML-Powered | Unsupervised anomaly detection | Trains multiple ML models per metric at the edge |
Long-Term Retention | High-performance storage | ~0.5 bytes per sample with tiered storage for archiving |
Advanced Visualization | Rich, interactive dashboards | Slice and dice data without query language |
Extreme Scalability | Native horizontal scaling | Parent-Child centralization with multi-million samples/s |
Complete Visibility | From infrastructure to applications | Simplifies operations and eliminates silos |
Edge-Based | Processing at your premises | Distributes code instead of centralizing data |
[!NOTE]
Want to put Netdata to the test against Prometheus? Explore the full comparison.
Netdata Ecosystem
This three-part architecture enables you to scale from single nodes to complex multi-cloud environments:
Component | Description | License |
---|---|---|
Netdata Agent | ⢠Core monitoring engine ⢠Handles collection, storage, ML, alerts, exports ⢠Runs on servers, cloud, K8s, IoT ⢠Zero production impact | GPL v3+ |
Netdata Cloud | ⢠Enterprise features ⢠User management, RBAC, horizontal scaling ⢠Centralized alerts ⢠Free community tier ⢠No metric storage centralization | |
Netdata UI | ⢠Dashboards and visualizations ⢠Free to use ⢠Included in standard packages ⢠Latest version via CDN | NCUL1 |
What You Can Monitor
With Netdata you can monitor all these components across platforms:
Component | Linux | FreeBSD | macOS | Windows |
---|---|---|---|---|
System Resources CPU, Memory and system shared resources | Full | Yes | Yes | Yes |
Storage Disks, Mount points, Filesystems, RAID arrays | Full | Yes | Yes | Yes |
Network Network Interfaces, Protocols, Firewall, etc | Full | Yes | Yes | Yes |
Hardware & Sensors Fans, Temperatures, Controllers, GPUs, etc | Full | Some | Some | Some |
O/S Services Resources, Performance and Status | Yessystemd | - | - | - |
Processes Resources, Performance, OOM, and more | Yes | Yes | Yes | Yes |
System and Application Logs | Yessystemd -journal | - | - | YesWindows Event Log , ETW |
Network Connections Live TCP and UDP sockets per PID | Yes | - | - | - |
Containers Docker/containerd, LXC/LXD, Kubernetes, etc | Yes | - | - | - |
VMs (from the host) KVM, qemu, libvirt, Proxmox, etc | Yescgroups | - | - | YesHyper-V |
Synthetic Checks Test APIs, TCP ports, Ping, Certificates, etc | Yes | Yes | Yes | Yes |
Packaged Applications nginx, apache, postgres, redis, mongodb, and hundreds more | Yes | Yes | Yes | Yes |
Cloud Provider Infrastructure AWS, GCP, Azure, and more | Yes | Yes | Yes | Yes |
Custom Applications OpenMetrics, StatsD and soon OpenTelemetry | Yes | Yes | Yes | Yes |
On Linux, you can continuously monitor all kernel features and hardware sensors for errors, including Intel/AMD/Nvidia GPUs, PCI AER, RAM EDAC, IPMI, S.M.A.R.T, Intel RAPL, NVMe, fans, power supplies, and voltage readings.
Getting Started
You can install Netdata on all major operating systems. To begin:
1. Install Netdata
Choose your platform and follow the installation guide:
[!NOTE] You can access the Netdata UI at
http://localhost:19999
(orhttp://NODE:19999
if remote).
2. Configure Collectors
Netdata auto-discovers most metrics, but you can manually configure some collectors:
3. Configure Alerts
You can use hundreds of built-in alerts and integrate with:
email
, Slack
, Telegram
, PagerDuty
, Discord
, Microsoft Teams
, and more.
[!NOTE]
Email alerts work by default if there's a configured MTA.
4. Configure Parents
You can centralize dashboards, alerts, and storage with Netdata Parents:
[!NOTE]
You can use Netdata Parents for central dashboards, longer retention, and alert configuration.
5. Connect to Netdata Cloud
Sign in to Netdata Cloud and connect your nodes for:
- Access from anywhere
- Horizontal scalability and multi-node dashboards
- UI configuration for alerts and data collection
- Role-based access control
- Free tier available
[!NOTE]
Netdata Cloud is optional. Your data stays in your infrastructure.
Live Demo Sites
See Netdata in action
FRANKFURT |
NEWYORK |
ATLANTA |
SANFRANCISCO |
TORONTO |
SINGAPORE |
BANGALORE
These demo clusters run with default configuration and show real monitoring data.
Choose the instance closest to you for the best performance.
How It Works
With Netdata you can run a modular pipeline for metrics collection, processing, and visualization.
flowchart TB
A[Netdata Agent]:::mainNode
A1(Collect):::green --> A
A2(Store):::green --> A
A3(Learn):::green --> A
A4(Detect):::green --> A
A5(Check):::green --> A
A6(Stream):::green --> A
A7(Archive):::green --> A
A8(Query):::green --> A
A9(Score):::green --> A
classDef green fill:#bbf3bb,stroke:#333,stroke-width:1px,color:#000
classDef mainNode fill:#f0f0f0,stroke:#333,stroke-width:1px,color:#333
With each Agent you can:
- Collect â Gather metrics from systems, containers, apps, logs, APIs, and synthetic checks.
- Store â Save metrics to a high-efficiency, tiered time-series database.
- Learn â Train ML models per metric using recent behavior.
- Detect â Identify anomalies using trained ML models.
- Check â Evaluate metrics against pre-set or custom alert rules.
- Stream â Send metrics to Netdata Parents in real time.
- Archive â Export metrics to Prometheus, InfluxDB, OpenTSDB, Graphite, and others.
- Query â Access metrics via an API for dashboards or third-party tools.
- Score â Use a scoring engine to find patterns and correlations across metrics.
[!NOTE]
Learn more: Netdata's architecture
Agent Capabilities
With the Netdata Agent, you can use these core capabilities out-of-the-box:
Capability | Description |
---|---|
Comprehensive Collection | ⢠800+ integrations ⢠Systems, containers, VMs, hardware sensors ⢠OpenMetrics, StatsD, and logs ⢠OpenTelemetry support coming soon |
Performance & Precision | ⢠Per-second collection ⢠Real-time visualization with 1-second latency ⢠High-resolution metrics |
Edge-Based ML | ⢠ML models trained at the edge ⢠Automatic anomaly detection per metric ⢠Pattern recognition based on historical behavior |
Advanced Log Management | ⢠Direct systemd-journald and Windows Event Log integration ⢠Process logs at the edge ⢠Rich log visualization |
Observability Pipeline | ⢠Parent-Child relationships ⢠Flexible centralization ⢠Multi-level replication and retention |
Automated Visualization | ⢠NIDL data model ⢠Auto-generated dashboards ⢠No query language needed |
Smart Alerting | ⢠Pre-configured alerts ⢠Multiple notification methods ⢠Proactive detection |
Low Maintenance | ⢠Auto-detection ⢠Zero-touch ML ⢠Easy scalability ⢠CI/CD friendly |
Open & Extensible | ⢠Modular architecture ⢠Easy to customize ⢠Integrates with existing tools |
CNCF Membership
Netdata actively supports and is a member of the Cloud Native Computing Foundation (CNCF).
It is one of the most starred projects in the CNCF landscape.
FAQ
Is Netdata secure?
Yes. Netdata follows OpenSSF best practices, has a security-first design, and is regularly audited by the community.
Does Netdata use a lot of resources?
No. Even with ML and per-second metrics, Netdata uses minimal resources.
- ~5% CPU and 150MiB RAM by default on production systems
- <1% CPU and ~100MiB RAM when ML and alerts are disabled and using ephemeral storage
- Parents scale to millions of metrics per second with appropriate hardware
You can use the Netdata Monitoring section in the dashboard to inspect its resource usage.
How much data retention is possible?
As much as your disk allows.
With Netdata you can use tiered retention:
- Tier 0: per-second resolution
- Tier 1: per-minute resolution
- Tier 2: per-hour resolution
These are queried automatically based on the zoom level.
Can Netdata scale to many servers?
Yes. With Netdata you can:
- Scale horizontally with many Agents
- Scale vertically with powerful Parents
- Scale infinitely via Netdata Cloud
You can use Netdata Cloud to merge many independent infrastructures into one logical view.
Is disk I/O a concern?
No. Netdata minimizes disk usage:
- Metrics are flushed to disk every 17 minutes, spread out evenly
- Uses direct I/O and compression (ZSTD)
- Can run entirely in RAM or stream to a Parent
You can use
alloc
orram
mode for no disk writes.
How is Netdata different from Prometheus + Grafana?
With Netdata you get a complete monitoring solutionânot just tools.
- No manual setup or dashboards needed
- Built-in ML, alerts, dashboards, and correlations
- More efficient and easier to deploy
How is Netdata different from commercial SaaS tools?
With Netdata you can store all metrics on your infrastructureâno sampling, no aggregation, no loss.
- High-resolution metrics by default
- ML per metric, not shared models
- Unlimited scalability without skyrocketing cost
Can Netdata run alongside Nagios, Zabbix, etc.?
Yes. You can use Netdata together with traditional tools.
With Netdata you get:
- Real-time, high-resolution monitoring
- Zero configuration and auto-generated dashboards
- Anomaly detection and advanced visualization
What if I feel overwhelmed?
You can start small:
- Use the dashboard's table of contents and search
- Explore anomaly scoring ("AR" toggle)
- Create custom dashboards in Netdata Cloud
Do I have to use Netdata Cloud?
No. Netdata Cloud is optional.
Netdata works without it, but with Cloud you can:
- Access remotely with SSO
- Save dashboard customizations
- Configure alerts centrally
- Collaborate with role-based access
What telemetry does Netdata collect?
Anonymous telemetry helps improve the product. You can disable it:
- Add
--disable-telemetry
to the installer, or - Create
/etc/netdata/.opt-out-from-anonymous-statistics
and restart Netdata
Telemetry helps us understand usage, not track users. No private data is collected.
Who uses Netdata?
You'll join users including:
- Major companies (Amazon, ABN AMRO Bank, Facebook, Google, IBM, Intel, Netflix, Samsung)
- Universities (NYU, Columbia, Seoul National, UCL)
- Government organizations worldwide
- Infrastructure-intensive organizations
- Technology operators
- Startups and freelancers
- SysAdmins and DevOps professionals
:book: Documentation
Visit Netdata Learn for full documentation and guides.
[!NOTE]
Includes deployment, configuration, alerting, exporting, troubleshooting, and more.
:tada: Community
Join the Netdata community:
[!NOTE]
Code of Conduct
Follow us on: Twitter | Reddit | YouTube | LinkedIn
:pray: Contribute
We welcome your contributions.
Ways you help us stay sharp:
- Share best practices and monitoring insights
- Report issues or missing features
- Improve documentation
- Develop new integrations or collectors
- Help users in forums and chats
[!NOTE]
Contribution guide
:scroll: License
The Netdata ecosystem includes:
- Netdata Agent â Open-source core (GPLv3+). Includes data collection, storage, ML, alerting, APIs and redistributes several other open-source tools and libraries.
- Netdata UI â Closed-source but free to use with Netdata Agent and Cloud. Delivered via CDN. It integrates third-party open-source components.
- Netdata Cloud â Closed-source, with free and paid tiers. Adds remote access, SSO, scalability.
Top Related Projects
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
The Prometheus monitoring system and time series database.
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
:tropical_fish: Beats - Lightweight shippers for Elasticsearch & Logstash
Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac OS and Windows operating systems.
Real-time monitoring of IT components and services, such as networks, servers, VMs, applications and the cloud.
Convert
designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual Copilot