Top Related Projects
FlatBuffers: Memory Efficient Serialization Library
Cap'n Proto serialization/RPC system - core tools and C++ library
Apache Thrift
MessagePack is an extremely efficient object serialization library. It's like JSON, but very fast and small.
Apache Avro is a data serialization system.
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
Quick Overview
Protocol Buffers (protobuf) is Google's language-neutral, platform-neutral, extensible mechanism for serializing structured data. It's a method of encoding structured data in an efficient yet extensible format, designed for use in developing programs to communicate with each other over a network or for storing data.
Pros
- Efficient serialization and deserialization, resulting in smaller message sizes and faster processing
- Language and platform agnostic, supporting a wide range of programming languages and environments
- Backward and forward compatibility, allowing easy schema evolution
- Strong typing and auto-generated code, reducing errors and development time
Cons
- Steeper learning curve compared to simpler formats like JSON
- Less human-readable than text-based formats
- Requires compilation step for schema changes
- Limited support for dynamic structures compared to some alternatives
Code Examples
- Defining a message in .proto file:
syntax = "proto3";
message Person {
string name = 1;
int32 age = 2;
repeated string hobbies = 3;
}
- Serializing data in Python:
import person_pb2
person = person_pb2.Person()
person.name = "Alice"
person.age = 30
person.hobbies.extend(["reading", "hiking"])
serialized_data = person.SerializeToString()
- Deserializing data in C++:
#include "person.pb.h"
Person person;
person.ParseFromString(serialized_data);
std::cout << "Name: " << person.name() << std::endl;
std::cout << "Age: " << person.age() << std::endl;
Getting Started
-
Install Protocol Buffers compiler:
# For Ubuntu/Debian sudo apt-get install protobuf-compiler # For macOS brew install protobuf
-
Define your message in a .proto file (e.g.,
person.proto
) -
Compile the .proto file:
protoc --cpp_out=. person.proto # For C++ protoc --python_out=. person.proto # For Python
-
Include the generated files in your project and start using Protocol Buffers in your code.
Competitor Comparisons
FlatBuffers: Memory Efficient Serialization Library
Pros of Flatbuffers
- Zero-copy deserialization, resulting in faster parsing and lower memory usage
- More compact binary format, leading to smaller file sizes
- Better support for schema evolution and backwards compatibility
Cons of Flatbuffers
- Less mature ecosystem and fewer language bindings compared to Protocol Buffers
- Steeper learning curve and more complex API
- Limited support for certain data types (e.g., maps)
Code Comparison
Protobuf:
message Person {
string name = 1;
int32 age = 2;
string email = 3;
}
Flatbuffers:
table Person {
name:string;
age:int32;
email:string;
}
root_type Person;
Both examples define a simple "Person" message/table with name, age, and email fields. The syntax is similar, but Flatbuffers uses a table
keyword instead of message
and requires a root_type
declaration.
Protobuf uses numbered fields for backwards compatibility, while Flatbuffers achieves this through careful schema design. Flatbuffers' approach allows for more flexible evolution but requires more attention from developers.
Overall, Flatbuffers offers performance advantages at the cost of increased complexity, while Protocol Buffers provides a more mature ecosystem with broader language support.
Cap'n Proto serialization/RPC system - core tools and C++ library
Pros of Cap'n Proto
- Zero-copy deserialization, resulting in faster parsing
- Support for object capabilities and RPC
- More compact binary representation
Cons of Cap'n Proto
- Less widespread adoption compared to Protocol Buffers
- Fewer language bindings available
- Steeper learning curve for developers new to the concept
Code Comparison
Cap'n Proto:
struct Person {
name @0 :Text;
age @1 :UInt32;
email @2 :Text;
}
Protocol Buffers:
message Person {
string name = 1;
uint32 age = 2;
string email = 3;
}
Both Cap'n Proto and Protocol Buffers use a similar schema definition syntax. However, Cap'n Proto uses the @
symbol to denote field numbers, while Protocol Buffers uses =
.
Cap'n Proto focuses on zero-copy deserialization and a more compact binary representation, which can lead to better performance in certain scenarios. It also includes built-in support for object capabilities and RPC.
Protocol Buffers, on the other hand, has wider adoption and more language bindings available. It's generally considered easier to learn and integrate into existing projects, especially for developers already familiar with similar serialization formats.
The choice between the two depends on specific project requirements, performance needs, and the development team's familiarity with each technology.
Apache Thrift
Pros of Thrift
- Supports a wider range of programming languages (20+)
- Offers built-in RPC framework for easier service implementation
- More flexible type system with custom types and containers
Cons of Thrift
- Less widespread adoption compared to Protocol Buffers
- Documentation and community support are not as extensive
- Slightly more complex syntax and learning curve
Code Comparison
Thrift IDL:
struct Person {
1: string name
2: i32 age
3: bool is_active
}
Protocol Buffers:
message Person {
string name = 1;
int32 age = 2;
bool is_active = 3;
}
Both Thrift and Protocol Buffers are popular serialization frameworks used for efficient data exchange between services. While Protocol Buffers is known for its simplicity and performance, Thrift offers more language support and built-in RPC capabilities. The choice between the two often depends on specific project requirements, existing infrastructure, and team preferences.
MessagePack is an extremely efficient object serialization library. It's like JSON, but very fast and small.
Pros of msgpack
- Simpler to use, with no need for schema definitions
- Smaller serialized data size in many cases
- Supports a wider range of programming languages
Cons of msgpack
- Lacks built-in versioning and backward compatibility features
- No native support for complex data structures like enums or nested objects
- Less efficient for large datasets with repetitive structures
Code Comparison
msgpack:
import msgpack
data = {"name": "John", "age": 30}
packed = msgpack.packb(data)
unpacked = msgpack.unpackb(packed)
protobuf:
import person_pb2
person = person_pb2.Person()
person.name = "John"
person.age = 30
serialized = person.SerializeToString()
deserialized = person_pb2.Person().ParseFromString(serialized)
msgpack offers a more straightforward approach with direct serialization of Python dictionaries, while protobuf requires defining a schema and generating code before use. protobuf provides stronger typing and better performance for large, complex datasets, but msgpack is often simpler for small-scale projects or when working with dynamic data structures. The choice between the two depends on specific project requirements, such as data size, complexity, and the need for schema evolution.
Apache Avro is a data serialization system.
Pros of Avro
- Dynamic typing: Avro supports schema evolution, allowing easier updates to data structures
- Compact binary format: Avro's binary encoding is generally more space-efficient
- Built-in support for complex data types and nested structures
Cons of Avro
- Less widespread adoption compared to Protocol Buffers
- Limited language support outside of Java ecosystem
- Steeper learning curve for developers new to the format
Code Comparison
Avro schema definition:
{
"type": "record",
"name": "User",
"fields": [
{"name": "name", "type": "string"},
{"name": "age", "type": "int"}
]
}
Protocol Buffers definition:
message User {
string name = 1;
int32 age = 2;
}
Both Avro and Protocol Buffers offer efficient serialization formats for structured data. Avro excels in schema evolution and compact encoding, making it suitable for big data processing and systems with frequently changing data structures. Protocol Buffers, on the other hand, benefits from wider language support and a simpler learning curve, making it a popular choice for microservices and RPC systems. The choice between the two depends on specific project requirements, ecosystem compatibility, and developer familiarity.
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
Pros of gRPC
- Built-in support for streaming, both client-side and server-side
- Automatic code generation for multiple languages
- Integrated authentication and load balancing features
Cons of gRPC
- Steeper learning curve compared to simpler protocols
- Limited browser support without additional tools or proxies
- Potentially more complex setup and configuration
Code Comparison
protobuf:
syntax = "proto3";
message Person {
string name = 1;
int32 age = 2;
}
gRPC:
syntax = "proto3";
service Greeter {
rpc SayHello (HelloRequest) returns (HelloReply) {}
}
message HelloRequest {
string name = 1;
}
message HelloReply {
string message = 1;
}
Summary
protobuf is a data serialization format, while gRPC is a complete RPC framework built on top of protobuf. gRPC offers more features for building distributed systems, including streaming and code generation. However, it comes with increased complexity and a steeper learning curve. protobuf is simpler and more focused on efficient data serialization, making it easier to integrate into existing systems. The choice between the two depends on the specific requirements of your project and the level of functionality needed for inter-service communication.
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual CopilotREADME
Protocol Buffers - Google's data interchange format
Copyright 2023 Google LLC
Overview
Protocol Buffers (a.k.a., protobuf) are Google's language-neutral, platform-neutral, extensible mechanism for serializing structured data. You can learn more about it in protobuf's documentation.
This README file contains protobuf installation instructions. To install protobuf, you need to install the protocol compiler (used to compile .proto files) and the protobuf runtime for your chosen programming language.
Working With Protobuf Source Code
Most users will find working from supported releases to be the easiest path.
If you choose to work from the head revision of the main branch your build will occasionally be broken by source-incompatible changes and insufficiently-tested (and therefore broken) behavior.
If you are using C++ or otherwise need to build protobuf from source as a part of your project, you should pin to a release commit on a release branch.
This is because even release branches can experience some instability in between release commits.
Bazel with Bzlmod
Protobuf supports Bzlmod with Bazel 7 +. Users should specify a dependency on protobuf in their MODULE.bazel file as follows.
bazel_dep(name = "protobuf", version = <VERSION>)
Users can optionally override the repo name, such as for compatibility with WORKSPACE.
bazel_dep(name = "protobuf", version = <VERSION>, repo_name = "com_google_protobuf")
Bazel with WORKSPACE
Users can also add the following to their legacy WORKSPACE file.
Note that the protobuf_extra_deps.bzl
is added in the v30.x
release.
http_archive(
name = "com_google_protobuf",
strip_prefix = "protobuf-VERSION",
sha256 = ...,
url = ...,
)
load("@com_google_protobuf//:protobuf_deps.bzl", "protobuf_deps")
protobuf_deps()
load("@com_google_protobuf//:protobuf_extra_deps.bzl", "protobuf_extra_deps")
protobuf_extra_deps();
Protobuf Compiler Installation
The protobuf compiler is written in C++. If you are using C++, please follow the C++ Installation Instructions to install protoc along with the C++ runtime.
For non-C++ users, the simplest way to install the protocol compiler is to download a pre-built binary from our GitHub release page.
In the downloads section of each release, you can find pre-built binaries in
zip packages: protoc-$VERSION-$PLATFORM.zip
. It contains the protoc binary
as well as a set of standard .proto
files distributed along with protobuf.
If you are looking for an old version that is not available in the release page, check out the Maven repository.
These pre-built binaries are only provided for released versions. If you want to use the github main version at HEAD, or you need to modify protobuf code, or you are using C++, it's recommended to build your own protoc binary from source.
If you would like to build protoc binary from source, see the C++ Installation Instructions.
Protobuf Runtime Installation
Protobuf supports several different programming languages. For each programming language, you can find instructions in the corresponding source directory about how to install protobuf runtime for that specific language:
Language | Source |
---|---|
C++ (include C++ runtime and protoc) | src |
Java | java |
Python | python |
Objective-C | objectivec |
C# | csharp |
Ruby | ruby |
Go | protocolbuffers/protobuf-go |
PHP | php |
Dart | dart-lang/protobuf |
JavaScript | protocolbuffers/protobuf-javascript |
Quick Start
The best way to learn how to use protobuf is to follow the tutorials in our developer guide.
If you want to learn from code examples, take a look at the examples in the examples directory.
Documentation
The complete documentation is available at the Protocol Buffers doc site.
Support Policy
Read about our version support policy to stay current on support timeframes for the language libraries.
Developer Community
To be alerted to upcoming changes in Protocol Buffers and connect with protobuf developers and users, join the Google Group.
Top Related Projects
FlatBuffers: Memory Efficient Serialization Library
Cap'n Proto serialization/RPC system - core tools and C++ library
Apache Thrift
MessagePack is an extremely efficient object serialization library. It's like JSON, but very fast and small.
Apache Avro is a data serialization system.
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual Copilot