Top Related Projects
Quick Overview
Vector is a high-performance observability data pipeline that collects, transforms, and routes logs, metrics, and traces to various destinations. It's designed to be lightweight, fast, and reliable, making it an excellent choice for organizations looking to streamline their observability infrastructure.
Pros
- High performance and low resource usage
- Extensive list of supported sources and destinations
- Flexible data transformation capabilities
- Written in Rust, providing memory safety and concurrency benefits
Cons
- Steeper learning curve compared to some alternatives
- Configuration can be complex for advanced use cases
- Limited built-in visualization capabilities
- Younger project compared to some established alternatives
Getting Started
To get started with Vector, follow these steps:
1. Install Vector:
curl --proto '=https' --tlsv1.2 -sSf https://sh.vector.dev | sh
2. Create a basic configuration file (e.g., vector.yaml):
sources:
  file:
    type: file
    include:
      - /var/log/**/*.log

transforms:
  parse_logs:
    type: remap
    inputs:
      - file
    source: |
      . = parse_json!(.message)

sinks:
  console:
    type: console
    inputs:
      - parse_logs
    encoding:
      codec: json
3. Run Vector:
vector --config vector.yaml
This configuration collects logs from files, parses JSON messages, and outputs the results to the console. Adjust the sources, transforms, and sinks to suit your specific needs and data sources.
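Before starting Vector, you can sanity-check the configuration. This is a quick illustrative step; the exact subcommand behavior can vary slightly between Vector releases:
vector validate vector.yaml
If validation passes, run Vector as shown above and watch the console sink for parsed events.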
Competitor Comparisons
Fluentd: Unified Logging Layer (project under CNCF)
Pros of Fluentd
- Mature ecosystem with extensive plugin support
- Strong community and widespread adoption
- Flexible configuration options for complex log routing
Cons of Fluentd
- Higher resource consumption, especially memory usage
- Slower performance for high-volume data processing
- Ruby-based, which can be less efficient than compiled languages
Code Comparison
Fluentd configuration:
<source>
@type http
port 8888
</source>
<match **>
@type file
path /var/log/fluent/myapp
</match>
Vector configuration:
[sources.http_input]
type = "http"
address = "0.0.0.0:8888"
[sinks.file_output]
type = "file"
inputs = ["http_input"]
path = "/var/log/vector/myapp.log"
encoding.codec = "text"
Both configurations set up an HTTP input and file output, but Vector's TOML format is more concise and easier to read. Fluentd's configuration offers more flexibility with its tag-based routing system, while Vector's pipeline is more straightforward and performance-oriented.
Vector generally outperforms Fluentd in terms of resource efficiency and processing speed, making it a better choice for high-volume data processing. However, Fluentd's extensive plugin ecosystem and mature community support make it a versatile option for complex log management scenarios.
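If you rely on Fluentd's tag-based routing, the closest Vector equivalent is the route transform, which splits a stream on VRL conditions. The sketch below is illustrative rather than a drop-in translation: it assumes the http_input source from the example above, and the route names (errors, other), the .level field, and the file path are placeholders for your own data:
# Split the HTTP stream on a VRL condition (field name is illustrative)
[transforms.split_by_level]
type = "route"
inputs = ["http_input"]
route.errors = '.level == "error"'
route.other = '.level != "error"'

# Only the "errors" route is written to this file
[sinks.error_file]
type = "file"
inputs = ["split_by_level.errors"]
path = "/var/log/vector/errors.log"
encoding.codec = "ndjson"
Each named route becomes its own output (split_by_level.errors, split_by_level.other) that downstream transforms and sinks can subscribe to individually.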
Logstash - transport and process your logs, events, or other data
Pros of Logstash
- Mature ecosystem with extensive plugin support
- Tight integration with Elasticsearch and Kibana
- Strong community support and documentation
Cons of Logstash
- Higher resource consumption
- Slower processing speed for high-volume data
- More complex configuration syntax
Code Comparison
Logstash configuration:
input {
file {
path => "/var/log/syslog"
start_position => "beginning"
}
}
filter {
grok {
match => { "message" => "%{SYSLOGLINE}" }
}
}
output {
elasticsearch {
hosts => ["localhost:9200"]
}
}
Vector configuration:
[sources.syslog]
type = "file"
include = ["/var/log/syslog"]
[transforms.parse_syslog]
type = "remap"
inputs = ["syslog"]
source = '''
. = parse_syslog!(.message)
'''
[sinks.elasticsearch]
type = "elasticsearch"
inputs = ["parse_syslog"]
host = "http://localhost:9200"
Vector offers a more concise configuration syntax and generally provides better performance for high-volume data processing. However, Logstash benefits from its mature ecosystem and tight integration with the Elastic Stack. The choice between the two depends on specific use cases and existing infrastructure.
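If your pipelines depend on Logstash-style grok patterns rather than the dedicated syslog parser, VRL also exposes grok parsing. A minimal sketch, assuming your messages match the illustrative pattern below (the transform name, pattern, and field names are examples, not part of the comparison above):
# Grok parsing inside a remap transform
[transforms.parse_with_grok]
type = "remap"
inputs = ["syslog"]
source = '''
. = parse_grok!(.message, "%{TIMESTAMP_ISO8601:timestamp} %{LOGLEVEL:level} %{GREEDYDATA:msg}")
'''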
Loki: Like Prometheus, but for logs.
Pros of Loki
- Tightly integrated with Grafana for visualization and querying
- Efficient storage and indexing optimized for logs
- Supports LogQL, a powerful query language for log analysis
Cons of Loki
- More focused on log aggregation, less versatile for general data processing
- Steeper learning curve for advanced features and configurations
- Limited built-in data transformation capabilities
Code Comparison
Loki configuration example:
auth_enabled: false
server:
http_listen_port: 3100
ingester:
lifecycler:
address: 127.0.0.1
ring:
kvstore:
store: inmemory
replication_factor: 1
final_sleep: 0s
chunk_idle_period: 5m
chunk_retain_period: 30s
Vector configuration example:
[sources.logs]
type = "file"
include = ["/var/log/**/*.log"]
[transforms.parse_logs]
type = "remap"
inputs = ["logs"]
source = '''
. = parse_json!(.message)
'''
[sinks.loki]
type = "loki"
inputs = ["parse_logs"]
endpoint = "http://localhost:3100"
Vector offers more flexibility in data processing and transformation, while Loki excels in log-specific features and Grafana integration. Vector's configuration is more straightforward for general data pipelines, whereas Loki's setup is tailored for log management systems.
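In practice the Loki sink usually also needs labels (Loki indexes streams by label) and an encoding; the example above is abbreviated. A fuller sketch, where the label names and values are illustrative choices rather than required defaults:
# Loki indexes by label, so attach at least one
[sinks.loki]
type = "loki"
inputs = ["parse_logs"]
endpoint = "http://localhost:3100"
encoding.codec = "json"
labels.job = "vector"
labels.host = "{{ host }}"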
Telegraf: Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
Pros of Telegraf
- Wider range of supported input plugins and integrations
- Longer history and larger community, resulting in more resources and documentation
- Native integration with InfluxDB for seamless time-series data storage
Cons of Telegraf
- Less focus on high-performance data processing compared to Vector
- Configuration can be more complex, especially for advanced use cases
- Written in Go, which may have higher resource usage than Vector's Rust implementation
Code Comparison
Telegraf configuration example:
[[inputs.cpu]]
percpu = true
totalcpu = true
collect_cpu_time = false
report_active = false
[[outputs.influxdb]]
urls = ["http://localhost:8086"]
database = "telegraf"
Vector configuration example:
[sources.cpu_metrics]
type = "host_metrics"
collectors = ["cpu"]
[sinks.influxdb_output]
type = "influxdb"
inputs = ["cpu_metrics"]
endpoint = "http://localhost:8086"
bucket = "vector"
Both configurations collect CPU metrics and send them to InfluxDB, but Vector's syntax is generally more concise and intuitive. Telegraf offers more granular control over CPU metric collection, while Vector's configuration is simpler and easier to read at a glance.
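If you need more control than the minimal example above, the host_metrics source exposes a few knobs of its own. The sketch below is a variant of that source; the collector list, interval, and namespace are arbitrary illustrative values:
# Collect CPU and load metrics every 15s under a custom namespace
[sources.cpu_metrics]
type = "host_metrics"
collectors = ["cpu", "load"]
scrape_interval_secs = 15
namespace = "host"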
README
Quickstart • Docs • Guides • Integrations • Chat • Download • Rust Crate Docs
What is Vector?
Vector is a high-performance, end-to-end (agent & aggregator) observability data pipeline that puts you in control of your observability data. Collect, transform, and route all your logs and metrics to any vendors you want today and any other vendors you may want tomorrow. Vector enables dramatic cost reduction, novel data enrichment, and data security where you need it, not where it is most convenient for your vendors. Additionally, it is open source and up to 10x faster than every alternative in the space.
To get started, follow our quickstart guide or install Vector.
Principles
- Reliable - Built in Rust, Vector's primary design goal is reliability.
- End-to-end - Deploys as an agent or aggregator. Vector is a complete platform.
- Unified - Logs, metrics (beta), and traces (coming soon). One tool for all of your data.
Use cases
- Reduce total observability costs.
- Transition vendors without disrupting workflows.
- Enhance data quality and improve insights.
- Consolidate agents and eliminate agent fatigue.
- Improve overall observability performance and reliability.
Community
- Vector is relied on by startups and enterprises like Atlassian, T-Mobile, Comcast, Zendesk, Discord, Fastly, CVS, Trivago, Tuple, Douban, Visa, Mambu, Blockfi, Claranet, Instacart, Forcepoint, and many more.
- Vector is downloaded over 100,000 times per day.
- Vector's largest user processes over 30TB daily.
- Vector has over 100 contributors and growing.
Documentation
About
Setup
- Quickstart
- Installation - operating systems, package managers, platforms (Kubernetes), manual
- Deployment - roles, topologies
Reference
- Configuration
- Sources - docker_logs, file, http, journald, kafka, socket, and many more...
- Transforms - dedupe, filter, geoip, log_to_metric, lua, remap, and many more...
- Sinks - aws_cloudwatch_logs, aws_s3, clickhouse, elasticsearch, gcp_cloud_storage, and many more...
- Unit tests
- Remap Language
- API
- CLI
Administration
Resources
- Community - chat, calendar, @vectordotdev
- Releases
- Policies - Code of Conduct, Privacy, Releases, Security, Versioning
Comparisons
Performance
The following performance tests demonstrate baseline performance between common protocols with the exception of the Regex Parsing test.
Test | Vector | Filebeat | FluentBit | FluentD | Logstash | Splunk UF | Splunk HF |
---|---|---|---|---|---|---|---|
TCP to Blackhole | 86 MiB/s | n/a | 64.4 MiB/s | 27.7 MiB/s | 40.6 MiB/s | n/a | n/a |
File to TCP | 76.7 MiB/s | 7.8 MiB/s | 35 MiB/s | 26.1 MiB/s | 3.1 MiB/s | 40.1 MiB/s | 39 MiB/s |
Regex Parsing | 13.2 MiB/s | n/a | 20.5 MiB/s | 2.6 MiB/s | 4.6 MiB/s | n/a | 7.8 MiB/s |
TCP to HTTP | 26.7 MiB/s | n/a | 19.6 MiB/s | <1 MiB/s | 2.7 MiB/s | n/a | n/a |
TCP to TCP | 69.9 MiB/s | 5 MiB/s | 67.1 MiB/s | 3.9 MiB/s | 10 MiB/s | 70.4 MiB/s | 7.6 MiB/s |
To learn more about our performance tests, please see the Vector test harness.
Correctness
The following correctness tests are not exhaustive, but they demonstrate fundamental differences in quality and attention to detail:
Test | Vector | Filebeat | FluentBit | FluentD | Logstash | Splunk UF | Splunk HF |
---|---|---|---|---|---|---|---|
Disk Buffer Persistence | â | â | â | â | â | ||
File Rotate (create) | â | â | â | â | â | â | â |
File Rotate (copytruncate) | â | â | â | ||||
File Truncation | â | â | â | â | â | â | â |
Process (SIGHUP) | â | â | â | â | |||
JSON (wrapped) | â | â | â | â | â | â | â |
To learn more about our correctness tests, please see the Vector test harness.
Features
Vector is an end-to-end, unified, open data platform.
Vector | Beats | Fluentbit | Fluentd | Logstash | Splunk UF | Splunk HF | Telegraf | |
---|---|---|---|---|---|---|---|---|
End-to-end | â | â | ||||||
Agent | â | â | â | â | â | |||
Aggregator | â | â | â | â | â | |||
Unified | â | â | ||||||
Logs | â | â | â | â | â | â | â | â |
Metrics | â | â | â | â | â | â | â | â |
Traces | ð§ | |||||||
Open | â | â | â | â | ||||
Open-source | â | â | â | â | â | â | ||
Vendor-neutral | â | â | â | â | ||||
Reliability | â | |||||||
Memory-safe | â | â | ||||||
Delivery guarantees | â | â | â | |||||
Multi-core | â | â | â | â | â | â | â | â |
â = Not interoperable, metrics are represented as structured logs
Developed with ❤️ by Datadog - Security Policy - Privacy Policy