Data-driven protobuf toolkit for Elixir. Runtime usage needs no code generation or build step. A schema is a plain Elixir data structure produced by compile/1.

For stable schemas, see PB.Schema to embed a compiled schema in a module at Elixir compile time while keeping the runtime schema-map API available.

Usage

# Generate a descriptor set from your .proto files:
#   protoc --descriptor_set_out=schema.pb your.proto

{:ok, descriptor_set} = PB.decode_descriptor_set(File.read!("schema.pb"))
schema = PB.compile(descriptor_set)

{:ok, binary} = PB.encode(%{name: "hello"}, schema, :"my.package.MyMessage")
{:ok, decoded} = PB.decode(binary, schema, :"my.package.MyMessage")

Schema

Message names, enum names, service names, and extension names are fully-qualified atoms (e.g. :"my.package.MyMessage").

The map returned by compile/1 is an internal representation. Its structure is not part of the public API and may change without notice. To enumerate or look up messages, enums, services, and extensions, use the introspection helpers in PB.Schema (list_messages/1, fetch_message/2, message!/2, etc.), which return stable %PB.Schema.Info{} structs.

Services

Service definitions from .proto files are extracted by compile/1 and exposed through PB.Schema.list_services/1, fetch_service/2, and service!/2, including method input/output types and streaming flags.

Reserved map keys

PB stores non-protobuf metadata under dunder atom keys that cannot collide with real .proto field names. These atoms are the stable contract for both encode input and decode output:

  • :__unknown_fields__ — preserved unknown wire fields, as a list of %PB.UnknownField{} structs.
  • :__extensions__ — known extension field values, keyed by fully-qualified extension name.
  • :__message_name__ — optional message-name metadata produced by decode/3,4 when message_names: :root is set. When supplied on input to encode or validation APIs, it must match the message being processed.

Types

Maps, oneofs, packed repeated fields, and all scalar types (including sint32/sint64 zigzag, sfixed, float/double with NaN/infinity) are supported.

Summary

Types

Resolved field cardinality surfaced by schema introspection.

Decoded protobuf descriptor set (output of decode_descriptor_set/1, input to compile/1).

Fully-qualified protobuf enum name (an atom).

Structured runtime encode/decode error.

Fully-qualified protobuf extension field name (an atom).

Field name atom as it appears in PB message maps.

Protobuf field number.

Fully-qualified protobuf message name (an atom, e.g. :"my.pkg.Person").

Resolved protobuf presence semantics surfaced by schema introspection.

Compile-time projection entries for the :projections option.

Projection options accepted within a :projections entry.

Prepared compiled PB schema, as returned by compile/2.

Schema source accepted by public encode/decode/normalize/validate/JSON APIs.

Fully-qualified protobuf service name (an atom).

Functions

Compiles a decoded descriptor set (from decode_descriptor_set/1) into a schema that can be used with encode/3 and decode/3.

Decodes a protobuf binary into an Elixir value.

Decodes a protobuf binary into an Elixir map, returning the map or raising PB.Error.

Decodes a binary FileDescriptorSet (as produced by protoc --descriptor_set_out) and returns a decoded descriptor set suitable for compile/1.

Decodes a binary FileDescriptorSet, returning the decoded descriptor set or raising PB.Error.

Encodes data as a protobuf message, returning {:ok, binary}.

Encodes data as a protobuf message, returning a binary or raising PB.Error.

Encodes data as a protobuf message, returning {:ok, iodata}.

Encodes data as a protobuf message, returning iodata or raising PB.Error.

Compares two decoded message terms with schema-aware protobuf field semantics.

Normalizes data as a protobuf message, returning the canonical decoded value.

Normalizes data as a protobuf message, returning the normalized value or raising PB.Error.

Types

cardinality()

@type cardinality() :: :singular | :repeated | :map

Resolved field cardinality surfaced by schema introspection.

descriptor_set()

@type descriptor_set() :: map()

Decoded protobuf descriptor set (output of decode_descriptor_set/1, input to compile/1).

enum_name()

@type enum_name() :: atom()

Fully-qualified protobuf enum name (an atom).

error()

@type error() :: PB.Error.t()

Structured runtime encode/decode error.

extension_name()

@type extension_name() :: atom()

Fully-qualified protobuf extension field name (an atom).

field_name()

@type field_name() :: atom()

Field name atom as it appears in PB message maps.

field_number()

@type field_number() :: pos_integer()

Protobuf field number.

message_name()

@type message_name() :: atom()

Fully-qualified protobuf message name (an atom, e.g. :"my.pkg.Person").

presence()

@type presence() :: :implicit | :explicit | :required | :oneof | :none

Resolved protobuf presence semantics surfaced by schema introspection.

projection_entry()

@type projection_entry() :: {message_name(), projection_opts()}

Compile-time projection entries for the :projections option.

Each entry pairs a fully-qualified message name with a keyword list of projection options. Within an entry, :adapter is mutually exclusive with the structural options :struct, :unwrap, :preserved_unknown_fields, :extensions, and :oneofs.

projection_opts()

@type projection_opts() :: [
  adapter: PB.Adapter.t(),
  struct: module(),
  unwrap: true | atom(),
  preserved_unknown_fields: :drop | :reject | {:field, atom()},
  extensions: :reject | {:field, atom()},
  oneofs: [{atom(), [{:representation, :identity}]}]
]

Projection options accepted within a :projections entry.

schema()

@type schema() :: map()

Prepared compiled PB schema, as returned by compile/2.

The internal structure is not part of the public API and may change without notice. Use the PB.Schema introspection helpers (list_messages/1, fetch_message/2, …) to inspect a schema rather than reading the map directly.

schema_source()

@type schema_source() :: schema() | module()

Schema source accepted by public encode/decode/normalize/validate/JSON APIs.

Either a compiled schema map or a module that use PB.Schema. Module values are resolved by calling __pb_schema__/0.

service_name()

@type service_name() :: atom()

Fully-qualified protobuf service name (an atom).

Functions

compile(descriptor_set, opts \\ [])

@spec compile(
  descriptor_set(),
  keyword()
) :: schema()

Compiles a decoded descriptor set (from decode_descriptor_set/1) into a schema that can be used with encode/3 and decode/3.

Compilation has explicit phases: descriptor compilation produces a draft schema, validation annotation and projection compilation consume descriptor options, finalization removes those options, and one final preparation pass writes runtime metadata. Runtime encode/decode require the prepared schema shape returned by this function.

Important: The descriptor set is treated as trusted input. Field names, type names, and package names are converted to atoms, which are never garbage collected. Do not pass untrusted or user-controlled descriptor sets to this function, as it could exhaust the BEAM atom table.

Options

  • :projections — list of {fqn, opts} entries. Each entry attaches a message-level projection to a fully-qualified protobuf message name. Within an entry, opts is a keyword list with one of:

    • adapter: %PB.Adapter{} — app/proto conversion spec (mutually exclusive with the keys below).
    • struct: Module — project the message to an Elixir struct.
    • unwrap: field_name | true — project the message to its single field's value.

    • preserved_unknown_fields: :drop | :reject | {:field, atom} — controls how preserved unknown wire fields are handled in struct or unwrap representations.

    • extensions: :reject | {:field, atom} — controls how known extensions are handled in struct representations.

    • oneofs: [{oneof_name, [representation: :identity]}, ...] — declares identity-projected oneofs on the message.

    Structural projections may also be declared in proto source via the elixir.pb.v1 custom options, and that is the preferred place when you own the schema. Reserve compile-time :projections for adapters (which cannot be expressed in proto source) and for overriding schemas you do not own. Compile-time :projections entries conflict if they disagree with proto-source options for the same target.

decode(binary, schema, message_name, opts \\ [])

@spec decode(binary(), schema_source(), message_name(), keyword()) ::
  {:ok, term()} | {:error, error()}

Decodes a protobuf binary into an Elixir value.

For canonical messages, returns {:ok, map} where the map has atom keys matching field names. Only fields present in the binary are included — proto3 default values (zero, empty string, false) are omitted from the result. Repeated fields are lists, map fields are Elixir maps, and oneofs are {field_name, value} tuples under the oneof name key. For represented messages, decode returns the configured struct, identity oneof, or unwrapped value. For adapted root or nested messages, decode returns the adapter's app value. Unknown wire fields are preserved under the :__unknown_fields__ map key. Known protobuf extension fields are decoded under the :__extensions__ map key, keyed by fully-qualified extension name.

Options

  • :defaults — when true, populates missing fields with their proto3 default values (0, 0.0, false, "", [], %{} for maps). Singular message fields and oneofs are not populated. Defaults to false.
  • :message_names — controls whether decoded maps include message-name metadata under the :__message_name__ map key. nil omits metadata and is the default. :root annotates only the root decoded message map. If the root message decodes to a represented struct, unwrapped value, or adapter value, no message-name metadata is added.

Returns {:error, error} where error is a PB.Error.t() for unknown messages, malformed input, invalid input values, invalid options, and adapter failures. Errors include the operation, root message name, kind, field path, reason, and details.

decode!(binary, schema, message_name, opts \\ [])

@spec decode!(binary(), schema_source(), message_name(), keyword()) :: term()

Decodes a protobuf binary into an Elixir map, returning the map or raising PB.Error.

This is the bang variant of decode/4.

decode_descriptor_set(binary)

@spec decode_descriptor_set(binary()) :: {:ok, descriptor_set()} | {:error, error()}

Decodes a binary FileDescriptorSet (as produced by protoc --descriptor_set_out) and returns a decoded descriptor set suitable for compile/1.

Uses PB's bundled main schema, which knows about the well-known types, the CEL cel.expr descriptors, and the buf.validate extensions. Extensions defined in those protos (notably buf.validate.field, buf.validate.message, buf.validate.oneof) are therefore materialized into the decoded options under the :__extensions__ map key rather than dropped to unknown-field bytes.

decode_descriptor_set!(binary)

@spec decode_descriptor_set!(binary()) :: descriptor_set()

Decodes a binary FileDescriptorSet, returning the decoded descriptor set or raising PB.Error.

This is the bang variant of decode_descriptor_set/1.

encode(data, schema, message_name, opts \\ [])

@spec encode(term(), schema_source(), message_name(), keyword()) ::
  {:ok, binary()} | {:error, error()}

Encodes data as a protobuf message, returning {:ok, binary}.

For zero-copy paths that can consume iodata directly, use encode_iodata/4.

For canonical messages, data is a map with atom keys matching field names. Repeated fields use lists, map fields use Elixir maps, oneofs use {field_name, value} tuples wrapped under the oneof name key, and known extension fields may be supplied under the :__extensions__ map key keyed by fully-qualified extension name.

For represented messages, data uses the configured public term shape: structs, identity oneofs, or unwrapped single-field values. For adapted root or nested messages, data is the adapter's app value. PB projects these public terms to the protobuf field shape at each message boundary before wire encoding.

Field presence is controlled by map keys. For implicit-presence scalar and enum fields, nil is treated as the protobuf default and default values are elided. For repeated and map fields, nil is treated as the empty collection. For oneofs, nil is treated as no selected variant. Preserved unknown wire fields may be supplied under the :__unknown_fields__ map key and are emitted after known fields.

Options

  • :unknown_fields — controls how unknown map keys are handled. The default is :error, returning {:error, %PB.ValueError{kind: :unknown_field}}. Pass :ignore to preserve the old behavior of silently dropping unknown fields.

Returns {:error, error} where error is a PB.Error.t() for unknown messages, invalid input values, invalid options, and adapter failures. Errors include the operation, root message name, kind, field path, reason, and details.

encode!(data, schema, message_name, opts \\ [])

@spec encode!(term(), schema_source(), message_name(), keyword()) :: binary()

Encodes data as a protobuf message, returning a binary or raising PB.Error.

This is the bang variant of encode/4.

encode_iodata(data, schema, message_name, opts \\ [])

@spec encode_iodata(term(), schema_source(), message_name(), keyword()) ::
  {:ok, iodata()} | {:error, error()}

Encodes data as a protobuf message, returning {:ok, iodata}.

Same semantics as encode/4 but skips the final IO.iodata_to_binary/1. Useful for callers that pass the result directly to a socket, file, or framing layer that already accepts iodata.

encode_iodata!(data, schema, message_name, opts \\ [])

@spec encode_iodata!(term(), schema_source(), message_name(), keyword()) :: iodata()

Encodes data as a protobuf message, returning iodata or raising PB.Error.

This is the bang variant of encode_iodata/4.

message_equal?(left, right, schema, message_name)

@spec message_equal?(term(), term(), schema_source(), message_name()) :: boolean()

Compares two decoded message terms with schema-aware protobuf field semantics.

This compares declared fields through the compiled schema rather than relying on raw map equality. Explicit-presence fields must be present on both sides; implicit scalar, repeated, and map fields compare against their protobuf defaults when absent. Nested message fields are compared recursively and unknown fields compare by preserved wire bytes. Well-known types, including google.protobuf.Any, are treated as ordinary protobuf messages.

Represented structs, identity oneofs, unwrapped messages, and adapted values are projected through the same single-message boundary used by encode.

Both arguments are presumed to be valid terms for message_name. A false result therefore always means "valid messages that are not equal" — it never stands in for malformed input. If either side cannot be projected as message_name (adapter/struct/representation failure, at the root or a nested message), or message_name (or a nested type_name) is not present in schema, this raises a PB.Error struct (PB.SchemaError or PB.ValueError) with operation: :message_equal. Those are caller-contract failures rather than comparison results — the same class as passing an unknown message_name (%PB.SchemaError{kind: :unknown_message}).

normalize(data, schema, message_name, opts \\ [])

@spec normalize(term(), schema_source(), message_name(), keyword()) ::
  {:ok, term()} | {:error, error()}

Normalizes data as a protobuf message, returning the canonical decoded value.

Normalization validates and encodes data with the same rules as encode/4, then decodes the bytes with the same rules as decode/4. This produces the data shape PB would return after a protobuf round-trip: implicit scalar defaults are elided unless defaults: true is passed, nested message values are recursively normalized, enum values are canonicalized, and scalar range checks are enforced.

Options

  • :unknown_fields — forwarded to the encode step. Defaults to :error.
  • :defaults — forwarded to the decode step. Defaults to false.

Returns {:error, error} (a PB.Error struct) with operation: :normalize for invalid input, unknown messages or fields, invalid options, and adapter failures.

normalize!(data, schema, message_name, opts \\ [])

@spec normalize!(term(), schema_source(), message_name(), keyword()) :: term()

Normalizes data as a protobuf message, returning the normalized value or raising PB.Error.

This is the bang variant of normalize/4.