protobuf

Protocol Buffers - Google's data interchange format

67,402

15,688

67,402

304

View on GitHub View on NPM

Top Related Projects

flatbuffers

24,121

FlatBuffers: Memory Efficient Serialization Library

capnproto

12,274

Cap'n Proto serialization/RPC system - core tools and C++ library

msgpack

7,169

MessagePack is an extremely efficient object serialization library. It's like JSON, but very fast and small.

avro

3,054

Apache Avro is a data serialization system.

grpc

42,942

C++ based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

Quick Overview

Protocol Buffers (protobuf) is Google's language-neutral, platform-neutral, extensible mechanism for serializing structured data. It's a method of encoding structured data in an efficient yet extensible format, designed for use in developing programs to communicate with each other over a network or for storing data.

Pros

Efficient serialization and deserialization, resulting in smaller message sizes and faster processing
Language and platform agnostic, supporting a wide range of programming languages and environments
Backward and forward compatibility, allowing easy schema evolution
Strong typing and auto-generated code, reducing errors and development time

Cons

Steeper learning curve compared to simpler formats like JSON
Less human-readable than text-based formats
Requires compilation step for schema changes
Limited support for dynamic structures compared to some alternatives

Code Examples

Defining a message in .proto file:

syntax = "proto3";

message Person {
  string name = 1;
  int32 age = 2;
  repeated string hobbies = 3;
}

Serializing data in Python:

import person_pb2

person = person_pb2.Person()
person.name = "Alice"
person.age = 30
person.hobbies.extend(["reading", "hiking"])

serialized_data = person.SerializeToString()

Deserializing data in C++:

#include "person.pb.h"

Person person;
person.ParseFromString(serialized_data);

std::cout << "Name: " << person.name() << std::endl;
std::cout << "Age: " << person.age() << std::endl;

Getting Started

Install Protocol Buffers compiler:

# For Ubuntu/Debian
sudo apt-get install protobuf-compiler

# For macOS
brew install protobuf

Define your message in a .proto file (e.g., person.proto)

Compile the .proto file:

protoc --cpp_out=. person.proto  # For C++
protoc --python_out=. person.proto  # For Python

Include the generated files in your project and start using Protocol Buffers in your code.

Competitor Comparisons

flatbuffers

24,121

FlatBuffers: Memory Efficient Serialization Library

Pros of Flatbuffers

Zero-copy deserialization, resulting in faster parsing and lower memory usage
More compact binary format, leading to smaller file sizes
Better support for schema evolution and backwards compatibility

Cons of Flatbuffers

Less mature ecosystem and fewer language bindings compared to Protocol Buffers
Steeper learning curve and more complex API
Limited support for certain data types (e.g., maps)

Code Comparison

Protobuf:

message Person {
  string name = 1;
  int32 age = 2;
  string email = 3;
}

Flatbuffers:

table Person {
  name:string;
  age:int32;
  email:string;
}

root_type Person;

Both examples define a simple "Person" message/table with name, age, and email fields. The syntax is similar, but Flatbuffers uses a table keyword instead of message and requires a root_type declaration.

Protobuf uses numbered fields for backwards compatibility, while Flatbuffers achieves this through careful schema design. Flatbuffers' approach allows for more flexible evolution but requires more attention from developers.

Overall, Flatbuffers offers performance advantages at the cost of increased complexity, while Protocol Buffers provides a more mature ecosystem with broader language support.

capnproto

12,274

Cap'n Proto serialization/RPC system - core tools and C++ library

Pros of Cap'n Proto

Zero-copy deserialization, resulting in faster parsing
Support for object capabilities and RPC
More compact binary representation

Cons of Cap'n Proto

Less widespread adoption compared to Protocol Buffers
Fewer language bindings available
Steeper learning curve for developers new to the concept

Code Comparison

Cap'n Proto:

struct Person {
  name @0 :Text;
  age @1 :UInt32;
  email @2 :Text;
}

Protocol Buffers:

message Person {
  string name = 1;
  uint32 age = 2;
  string email = 3;
}

Both Cap'n Proto and Protocol Buffers use a similar schema definition syntax. However, Cap'n Proto uses the @ symbol to denote field numbers, while Protocol Buffers uses =.

Cap'n Proto focuses on zero-copy deserialization and a more compact binary representation, which can lead to better performance in certain scenarios. It also includes built-in support for object capabilities and RPC.

Protocol Buffers, on the other hand, has wider adoption and more language bindings available. It's generally considered easier to learn and integrate into existing projects, especially for developers already familiar with similar serialization formats.

The choice between the two depends on specific project requirements, performance needs, and the development team's familiarity with each technology.

thrift

10,671

Apache Thrift

Pros of Thrift

Supports a wider range of programming languages (20+)
Offers built-in RPC framework for easier service implementation
More flexible type system with custom types and containers

Cons of Thrift

Less widespread adoption compared to Protocol Buffers
Documentation and community support are not as extensive
Slightly more complex syntax and learning curve

Code Comparison

Thrift IDL:

struct Person {
  1: string name
  2: i32 age
  3: bool is_active
}

Protocol Buffers:

message Person {
  string name = 1;
  int32 age = 2;
  bool is_active = 3;
}

Both Thrift and Protocol Buffers are popular serialization frameworks used for efficient data exchange between services. While Protocol Buffers is known for its simplicity and performance, Thrift offers more language support and built-in RPC capabilities. The choice between the two often depends on specific project requirements, existing infrastructure, and team preferences.

msgpack

7,169

MessagePack is an extremely efficient object serialization library. It's like JSON, but very fast and small.

Pros of msgpack

Simpler to use, with no need for schema definitions
Smaller serialized data size in many cases
Supports a wider range of programming languages

Cons of msgpack

Lacks built-in versioning and backward compatibility features
No native support for complex data structures like enums or nested objects
Less efficient for large datasets with repetitive structures

Code Comparison

msgpack:

import msgpack

data = {"name": "John", "age": 30}
packed = msgpack.packb(data)
unpacked = msgpack.unpackb(packed)

protobuf:

import person_pb2

person = person_pb2.Person()
person.name = "John"
person.age = 30
serialized = person.SerializeToString()
deserialized = person_pb2.Person().ParseFromString(serialized)

msgpack offers a more straightforward approach with direct serialization of Python dictionaries, while protobuf requires defining a schema and generating code before use. protobuf provides stronger typing and better performance for large, complex datasets, but msgpack is often simpler for small-scale projects or when working with dynamic data structures. The choice between the two depends on specific project requirements, such as data size, complexity, and the need for schema evolution.

avro

3,054

Apache Avro is a data serialization system.

Pros of Avro

Dynamic typing: Avro supports schema evolution, allowing easier updates to data structures
Compact binary format: Avro's binary encoding is generally more space-efficient
Built-in support for complex data types and nested structures

Cons of Avro

Less widespread adoption compared to Protocol Buffers
Limited language support outside of Java ecosystem
Steeper learning curve for developers new to the format

Code Comparison

Avro schema definition:

{
  "type": "record",
  "name": "User",
  "fields": [
    {"name": "name", "type": "string"},
    {"name": "age", "type": "int"}
  ]
}

Protocol Buffers definition:

message User {
  string name = 1;
  int32 age = 2;
}

Both Avro and Protocol Buffers offer efficient serialization formats for structured data. Avro excels in schema evolution and compact encoding, making it suitable for big data processing and systems with frequently changing data structures. Protocol Buffers, on the other hand, benefits from wider language support and a simpler learning curve, making it a popular choice for microservices and RPC systems. The choice between the two depends on specific project requirements, ecosystem compatibility, and developer familiarity.

grpc

42,942

C++ based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

Pros of gRPC

Built-in support for streaming, both client-side and server-side
Automatic code generation for multiple languages
Integrated authentication and load balancing features

Cons of gRPC

Steeper learning curve compared to simpler protocols
Limited browser support without additional tools or proxies
Potentially more complex setup and configuration

Code Comparison

protobuf:

syntax = "proto3";

message Person {
  string name = 1;
  int32 age = 2;
}

gRPC:

syntax = "proto3";

service Greeter {
  rpc SayHello (HelloRequest) returns (HelloReply) {}
}

message HelloRequest {
  string name = 1;
}

message HelloReply {
  string message = 1;
}

Summary

protobuf is a data serialization format, while gRPC is a complete RPC framework built on top of protobuf. gRPC offers more features for building distributed systems, including streaming and code generation. However, it comes with increased complexity and a steeper learning curve. protobuf is simpler and more focused on efficient data serialization, making it easier to integrate into existing systems. The choice between the two depends on the specific requirements of your project and the level of functionality needed for inter-service communication.

Convert designs to code with AI

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

README

Protocol Buffers - Google's data interchange format

Overview

Protocol Buffers (a.k.a., protobuf) are Google's language-neutral, platform-neutral, extensible mechanism for serializing structured data. You can learn more about it in protobuf's documentation.

This README file contains protobuf installation instructions. To install protobuf, you need to install the protocol compiler (used to compile .proto files) and the protobuf runtime for your chosen programming language.

Working With Protobuf Source Code

Most users will find working from supported releases to be the easiest path.

If you choose to work from the head revision of the main branch your build will occasionally be broken by source-incompatible changes and insufficiently-tested (and therefore broken) behavior.

If you are using C++ or otherwise need to build protobuf from source as a part of your project, you should pin to a release commit on a release branch.

This is because even release branches can experience some instability in between release commits.

Bazel with Bzlmod

Protobuf supports Bzlmod with Bazel 7 +. Users should specify a dependency on protobuf in their MODULE.bazel file as follows.

bazel_dep(name = "protobuf", version = <VERSION>)

Users can optionally override the repo name, such as for compatibility with WORKSPACE.

bazel_dep(name = "protobuf", version = <VERSION>, repo_name = "com_google_protobuf")

Bazel with WORKSPACE

Users can also add the following to their legacy WORKSPACE file.

Note that with the release of 30.x there are a few more load statements to properly set up rules_java and rules_python.

http_archive(
    name = "com_google_protobuf",
    strip_prefix = "protobuf-VERSION",
    sha256 = ...,
    url = ...,
)

load("@com_google_protobuf//:protobuf_deps.bzl", "protobuf_deps")

protobuf_deps()

load("@rules_java//java:rules_java_deps.bzl", "rules_java_dependencies")

rules_java_dependencies()

load("@rules_java//java:repositories.bzl", "rules_java_toolchains")

rules_java_toolchains()

load("@rules_python//python:repositories.bzl", "py_repositories")

py_repositories()

Protobuf Compiler Installation

The protobuf compiler is written in C++. If you are using C++, please follow the C++ Installation Instructions to install protoc along with the C++ runtime.

For non-C++ users, the simplest way to install the protocol compiler is to download a pre-built binary from our GitHub release page.

In the downloads section of each release, you can find pre-built binaries in zip packages: protoc-$VERSION-$PLATFORM.zip. It contains the protoc binary as well as a set of standard .proto files distributed along with protobuf.

If you are looking for an old version that is not available in the release page, check out the Maven repository.

These pre-built binaries are only provided for released versions. If you want to use the github main version at HEAD, or you need to modify protobuf code, or you are using C++, it's recommended to build your own protoc binary from source.

If you would like to build protoc binary from source, see the C++ Installation Instructions.

Protobuf Runtime Installation

Protobuf supports several different programming languages. For each programming language, you can find instructions in the corresponding source directory about how to install protobuf runtime for that specific language:

Language	Source
C++ (include C++ runtime and protoc)	src
Java	java
Python	python
Objective-C	objectivec
C#	csharp
Ruby	ruby
Go	protocolbuffers/protobuf-go
PHP	php
Dart	dart-lang/protobuf
JavaScript	protocolbuffers/protobuf-javascript

Quick Start

The best way to learn how to use protobuf is to follow the tutorials in our developer guide.

If you want to learn from code examples, take a look at the examples in the examples directory.

Documentation

The complete documentation is available at the Protocol Buffers doc site.

Support Policy

Read about our version support policy to stay current on support timeframes for the language libraries.

Developer Community

To be alerted to upcoming changes in Protocol Buffers and connect with protobuf developers and users, join the Google Group.

Top Related Projects

Convert designs to code with AI

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

Top Related Projects

Quick Overview

Pros

Cons

Code Examples

Getting Started

Competitor Comparisons

Pros of Flatbuffers

Cons of Flatbuffers

Code Comparison

Pros of Cap'n Proto

Cons of Cap'n Proto

Code Comparison

Pros of Thrift

Cons of Thrift

Code Comparison

Pros of msgpack

Cons of msgpack

Code Comparison

Pros of Avro

Cons of Avro

Code Comparison

Pros of gRPC

Cons of gRPC

Code Comparison

Summary

Convert designs to code with AI

README

Protocol Buffers - Google's data interchange format

Overview

Working With Protobuf Source Code

Bazel with Bzlmod

Bazel with WORKSPACE

Protobuf Compiler Installation

Protobuf Runtime Installation

Quick Start

Documentation

Support Policy

Developer Community

Top Related Projects

Convert designs to code with AI

NPM DownloadsLast 30 Days