yugabyte-db

YugabyteDB - the cloud native distributed SQL database for mission-critical applications.

9,664

1,155

9,664

7,146

View on GitHub

Top Related Projects

cockroach

31,141

CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.

tidb

38,817

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

vitess

19,919

Vitess is a database clustering system for horizontal scaling of MySQL.

scylladb

14,641

NoSQL data store using the Seastar framework, compatible with Apache Cassandra and Amazon DynamoDB

Quick Overview

YugabyteDB is an open-source, high-performance distributed SQL database designed for global, internet-scale applications. It combines the scalability of NoSQL databases with the consistency and SQL features of traditional relational databases, offering a unique solution for modern cloud-native applications.

Pros

Highly scalable and distributed architecture
Strong consistency and ACID compliance
PostgreSQL-compatible, supporting standard SQL
Multi-region and multi-cloud deployment capabilities

Cons

Relatively new compared to established databases
Limited ecosystem and third-party tool support
Steeper learning curve for teams unfamiliar with distributed systems
Resource-intensive for smaller applications

Code Examples

Connecting to YugabyteDB using Python:

from yugabyte import YugabyteConnection

conn = YugabyteConnection('host=127.0.0.1 port=5433 dbname=yugabyte user=yugabyte')
cursor = conn.cursor()

Creating a table and inserting data:

CREATE TABLE users (
  id INT PRIMARY KEY,
  name TEXT,
  email TEXT
);

INSERT INTO users (id, name, email) VALUES
  (1, 'John Doe', 'john@example.com'),
  (2, 'Jane Smith', 'jane@example.com');

Performing a distributed transaction:

with conn.transaction():
    cursor.execute("UPDATE accounts SET balance = balance - 100 WHERE id = 1")
    cursor.execute("UPDATE accounts SET balance = balance + 100 WHERE id = 2")

Getting Started

To get started with YugabyteDB:

Install YugabyteDB:

wget https://downloads.yugabyte.com/yugabyte-2.13.1.0-linux.tar.gz
tar xvfz yugabyte-2.13.1.0-linux.tar.gz
cd yugabyte-2.13.1.0/

Start a local cluster:
```
./bin/yugabyted start
```
Connect using ysqlsh:
```
./bin/ysqlsh
```

Create a database and table:

CREATE DATABASE myapp;
\c myapp
CREATE TABLE users (id INT PRIMARY KEY, name TEXT);

Insert and query data:

INSERT INTO users VALUES (1, 'Alice');
SELECT * FROM users;

Competitor Comparisons

cockroach

31,141

CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.

Pros of CockroachDB

Better support for global, multi-region deployments with advanced geo-partitioning features
More mature and battle-tested in production environments
Stronger consistency guarantees with serializable isolation level by default

Cons of CockroachDB

Higher resource consumption, especially for smaller deployments
Steeper learning curve due to more complex architecture and configuration options
Less flexible in terms of storage engine options (only RocksDB)

Code Comparison

CockroachDB (SQL syntax):

CREATE TABLE users (
  id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
  name STRING,
  created_at TIMESTAMP DEFAULT current_timestamp()
);

YugabyteDB (SQL syntax):

CREATE TABLE users (
  id UUID PRIMARY KEY DEFAULT uuid_generate_v4(),
  name TEXT,
  created_at TIMESTAMP DEFAULT now()
);

Both databases use similar SQL syntax, with minor differences in function names and data types. CockroachDB uses STRING for text data, while YugabyteDB uses TEXT. The UUID generation functions also have slightly different names.

Overall, CockroachDB and YugabyteDB are both distributed SQL databases with similar goals, but CockroachDB has a slight edge in maturity and global deployment features, while YugabyteDB offers more flexibility in terms of storage engines and potentially lower resource usage for smaller deployments.

tidb

38,817

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

Pros of TiDB

More mature project with a larger community and ecosystem
Better support for distributed OLAP workloads
More advanced query optimizer for complex SQL queries

Cons of TiDB

Steeper learning curve and more complex architecture
Less compatibility with PostgreSQL ecosystem
Higher resource requirements for small-scale deployments

Code Comparison

TiDB (Go):

func (s *tikvStore) Begin() (kv.Transaction, error) {
    txn, err := newTiKVTxn(s)
    if err != nil {
        return nil, errors.Trace(err)
    }
    return txn, nil
}

YugabyteDB (C++):

Status YBClient::OpenTable(const string& table_name,
                           shared_ptr<YBTable>* table) {
  return OpenTable(table_name, {}, table);
}

Both projects implement distributed SQL databases, but TiDB focuses on scalability and HTAP workloads, while YugabyteDB emphasizes PostgreSQL compatibility and ease of use. TiDB uses a more complex architecture with separate storage and computation layers, while YugabyteDB has a more integrated approach. TiDB is written primarily in Go, whereas YugabyteDB uses C++ for its core components.

cassandra

9,303

Apache Cassandra®

Pros of Cassandra

Mature and battle-tested with a large community and extensive ecosystem
Highly scalable and designed for massive distributed deployments
Strong support for multi-datacenter replication and tunable consistency levels

Cons of Cassandra

Limited support for ACID transactions and complex queries
Steep learning curve and complex configuration process
Lacks built-in SQL support, requiring the use of CQL or additional tools

Code Comparison

Cassandra query (CQL):

SELECT * FROM users
WHERE user_id = 123
AND timestamp > '2023-01-01'
LIMIT 10;

YugabyteDB query (YSQL):

SELECT * FROM users
WHERE user_id = 123
AND timestamp > '2023-01-01'
LIMIT 10;

Key Differences

YugabyteDB offers PostgreSQL-compatible SQL support (YSQL) in addition to Cassandra-compatible APIs
YugabyteDB provides stronger consistency guarantees and ACID transactions out of the box
Cassandra has a longer track record and more extensive production deployments
YugabyteDB aims to combine the scalability of Cassandra with the ease of use and features of traditional RDBMSs

Both databases excel in distributed environments, but YugabyteDB offers a more familiar SQL experience and stronger consistency, while Cassandra provides unparalleled scalability and a proven track record in large-scale deployments.

vitess

19,919

Vitess is a database clustering system for horizontal scaling of MySQL.

Pros of Vitess

Designed for horizontal scaling of MySQL databases, making it ideal for large-scale deployments
Provides advanced sharding capabilities, allowing for efficient data distribution
Offers seamless integration with Kubernetes for orchestration and management

Cons of Vitess

Steeper learning curve due to its complex architecture and components
Limited support for non-MySQL databases, focusing primarily on MySQL compatibility
May introduce additional latency in some scenarios due to its proxy-based architecture

Code Comparison

Vitess (VTGate query execution):

func (e *Executor) Execute(ctx context.Context, session *vtgatepb.Session, sql string, bindVariables map[string]*querypb.BindVariable) (*sqltypes.Result, error) {
    // Query execution logic
}

YugabyteDB (SQL execution):

Status PgSession::ExecuteStatements(const string& query_string,
                                    StatementExecutedCallback cb) {
  // SQL execution logic
}

Both projects implement query execution, but Vitess focuses on routing and proxying MySQL queries, while YugabyteDB directly executes SQL statements in its distributed database engine.

scylladb

14,641

NoSQL data store using the Seastar framework, compatible with Apache Cassandra and Amazon DynamoDB

Pros of ScyllaDB

Higher performance and lower latency due to its C++ implementation and shared-nothing architecture
Better resource utilization and scalability, especially for large datasets and high-throughput workloads
More mature and battle-tested in production environments

Cons of ScyllaDB

Less flexible in terms of consistency models compared to YugabyteDB's tunable consistency
Limited support for distributed ACID transactions across multiple partitions
Narrower ecosystem integration and fewer enterprise features than YugabyteDB

Code Comparison

ScyllaDB (CQL):

CREATE TABLE users (
  user_id UUID PRIMARY KEY,
  name TEXT,
  email TEXT
);

YugabyteDB (YSQL):

CREATE TABLE users (
  user_id UUID PRIMARY KEY,
  name TEXT,
  email TEXT
);

Both databases support SQL-like syntax, with ScyllaDB using Cassandra Query Language (CQL) and YugabyteDB offering PostgreSQL-compatible YSQL. The example above shows a simple table creation, which is nearly identical in both systems. However, YugabyteDB's YSQL provides more advanced SQL features and better compatibility with existing PostgreSQL applications.

Convert designs to code with AI

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

README

What is YugabyteDB?

YugabyteDB is a PostgreSQL-compatible, high-performance, cloud-native, distributed SQL database. It combines the benefits of traditional relational databases with the scalability of NoSQL systems, making it suitable for applications that require both transactional consistency and the ability to handle large amounts of data. It is best suited for cloud-native OLTP (that is, real-time, business-critical) applications that need absolute data correctness and require at least one of the following: scalability, high tolerance to failures, or globally-distributed deployments.

Core Features
Get Started
Build Apps
Current Roadmap
Recent features
Architecture
Need Help?
Contribute
License
Read More

Core Features

Powerful RDBMS capabilities Yugabyte SQL (YSQL for short) reuses the PostgreSQL query layer (similar to Amazon Aurora PostgreSQL), thereby supporting most of its features (datatypes, queries, expressions, operators and functions, stored procedures, triggers, extensions, and so on).
Distributed transactions The transaction design is based on the Google Spanner architecture. Strong consistency of writes is achieved by using Raft consensus for replication and cluster-wide distributed ACID transactions using hybrid logical clocks. Snapshot, serializable and read committed isolation levels are supported. Reads (queries) have strong consistency by default, but can be tuned dynamically to read from followers and read replicas.
Continuous availability YugabyteDB is extremely resilient to common outages with native failover and repair. YugabyteDB can be configured to tolerate disk, rack, node, zone, region, and cloud failures automatically. For a typical deployment where a YugabyteDB cluster is deployed in one region across multiple zones on a public cloud, the RPO is 0 (meaning no data is lost on failure) and the RTO is 3 seconds (meaning the data being served by the failed node is available in 3 seconds).
Horizontal scalability Scaling a YugabyteDB cluster to achieve more IOPS or data storage is as simple as adding nodes to the cluster.
Geo-distributed, multi-cloud YugabyteDB can be deployed in public clouds and natively inside Kubernetes. It supports deployments that span three or more fault domains, such as multi-zone, multi-rack, multi-region, and multi-cloud deployments. It also supports xCluster asynchronous replication with unidirectional master-slave and bidirectional multi-master configurations in two-region deployments. Read replicas are also a supported to serve (stale) data with low latencies.
Multi API design The YugabyteDB query layer is built to be extensible. Currently, YugabyteDB supports two distributed SQL APIs: Yugabyte SQL (YSQL), a fully relational API that re-uses the PostgreSQL query layer, and Yugabyte Cloud QL (YCQL), a semi-relational SQL-like API with documents/indexing support with Apache Cassandra QL roots.
100% open source YugabyteDB is fully open-source under the Apache 2.0 license. The open-source version has powerful enterprise features such as distributed backups, encryption of data at rest, in-flight TLS encryption, change data capture, read replicas, and more.

YugabyteDB was created with several key design goals in mind, aiming to address the challenges faced by modern, cloud-native applications while maintaining the familiarity and power of traditional relational databases. Read more about these in our Design goals.

Get Started

Quick Start
Try running a real-world demo application:
- Microservices-oriented e-commerce app
- Streaming IoT app with Kafka and Spark Streaming

Can't find what you're looking for? Have a question? Post your questions or comments on our Community Slack or Forum.

Build Applications

YugabyteDB supports many languages and client drivers, including Java, Go, NodeJS, Python, and more. For a complete list, including examples, see Drivers and ORMs.

Current Roadmap

The following is a list of some of the key features being worked on for upcoming releases.

Feature	Details
PostgreSQL 15 Compatibility	For latest features, new PostgreSQL extensions, performance, and community fixes.
PostgreSQL Publication/Replication slot API in CDC	PostgreSQL has a huge community that needs a PG-compatible API to set up and consume database changes.
Bitmap scan	Bitmap Scan support for using Index Scans, remote filter and enhanced Cost Model.
Cost based optimizer(CBO)	Efficient query plans based on statistics (such as table size, number of rows) and data distribution.
Parallel query execution	Higher query performance by splitting a single query for execution across different CPU cores.
pgvector extension	Support for vector data types, enabling efficient storage and querying of high-dimensional vectors.
Connection Management	Server side connection management enabling upto 30K connections per node

Refer to roadmap tracker for the list of all items in the current roadmap.

Recently released features

v2.25 (Preview) - Jan, 2025

v2.25 is the current Preview release. This includes features under active development and is recommended for development and testing only. For the full list of features and improvements in this release, see Release notes - v2.25. Here are some of the prominent features.

PostgreSQL 15 Support

As part of this release, we have upgraded our PostgreSQL fork from version 11.2 to 15.0, enabling you to leverage the many key capabilities introduced in PostgreSQL between these two versions. This upgrade brings YSQL API support for numerous features, including stored generated columns, foreign keys on partitioned tables, and non-distinct NULLs in unique indexes. It also introduces query execution optimizations like incremental sort and memoization, along with various observability and security enhancements.

Query Diagnostics

This feature significantly simplifies tuning poorly performing SQL queries by allowing you to capture and export detailed diagnostic information, including bind variables and constants, pg_stat_statements statistics, schema details, active session history, and execution plans.

Active session history

In addition, the Active Session History, which provides real-time and historical views of system activity, is now enabled by default.

v2024.2 (Stable) - Dec, 2024

v2024.2 is the current stable release. Stable releases undergo rigorous testing for a longer period of time and are ready for production use. For the full list of features and improvements in this release, see Release notes - v2024.2. Here are some of the prominent features.

Yugabyte Kubernetes Operator

The Yugabyte Kubernetes Operator is a powerful tool designed to automate deploying, scaling, and managing YugabyteDB clusters in Kubernetes environments. It streamlines database operations, reducing manual effort for developers and operators. For more information, refer to the YugabyteDB Kubernetes Operator GitHub project.

Active session history

Get real-time and historical views of system activity by sampling session activity in the database. Use this feature to analyze and troubleshoot performance issues.

pg_partman extension

Use the pg_partman extension to create and manage both time- and serial-based (aka range-based) table partition sets. pg_partman is often used in combination with pg_cron for data lifecycle management, and specifically for managing data aging, retention, and expiration.

Colocated tables with tablespaces

Starting this release, you can create colocated tables with tablespaces. With this enhancement, you can now take advantage of colocated tables for geo-distributed use cases, eliminating the need for trade-offs between distributing data across specific regions.

Architecture

Review detailed architecture in our Docs.

Need Help?

You can ask questions, find answers, and help others on our Community Slack, Forum, Stack Overflow, as well as Twitter @Yugabyte.
Use GitHub issues to report issues or request new features.
To troubleshoot YugabyteDB and cluster/node-level issues, refer to Troubleshooting documentation.

Contribute

As an open-source project with a strong focus on the user community, we welcome contributions as GitHub pull requests. See our Contributor Guides to get going. Discussions and RFCs for features happen on the design discussions section of our Forum.

License

Source code in this repository is variously licensed under the Apache License 2.0 and the Polyform Free Trial License 1.0.0. A copy of each license can be found in the licenses directory.

The build produces two sets of binaries:

The entire database with all its features (including the enterprise ones) is licensed under the Apache License 2.0
The binaries that contain -managed in the artifact and help run a managed service are licensed under the Polyform Free Trial License 1.0.0.

By default, the build options generate only the Apache License 2.0 binaries.

To see our updates, go to the Distributed SQL Blog.
For in-depth design and architecture details, see our design specs.
Tech Talks and Videos.
See how YugabyteDB compares with other databases.

Top Related Projects

Convert designs to code with AI

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

Top Related Projects

Quick Overview

Pros

Cons

Code Examples

Getting Started

Competitor Comparisons

Pros of CockroachDB

Cons of CockroachDB

Code Comparison

Pros of TiDB

Cons of TiDB

Code Comparison

Pros of Cassandra

Cons of Cassandra

Code Comparison

Key Differences

Pros of Vitess

Cons of Vitess

Code Comparison

Pros of ScyllaDB

Cons of ScyllaDB

Code Comparison

Convert designs to code with AI

README

What is YugabyteDB?

Core Features

Get Started

Build Applications

Current Roadmap

Recently released features

v2.25 (Preview) - Jan, 2025

v2024.2 (Stable) - Dec, 2024

Architecture

Need Help?

Contribute

License

Read More

Top Related Projects

Convert designs to code with AI