Skip to content

Commit

Permalink
docs: First pass at moving the Pub/Sub binding into this repo.
Browse files Browse the repository at this point in the history
Original source:
https://github.com/google/knative-gcp/blob/main/docs/spec/pubsub-protocol-binding.md

I've modified the text in a few places as the CloudEvents spec
wording has been tightened up since that doc was written:

- Event data isn't an attribute
- There are now three content modes: binary, structured and batch.
  (We could easily extend this spec to support batch, but we haven't
  done so yet.)
- The context attributes defined in the CloudEvents spec now have a
  name: "core context attributes"

Reading through this, it looks like we don't have a convention for
extension attributes, and we don't have any examples of them either.
We can approach that in a follow-up PR.
  • Loading branch information
jskeet committed Apr 19, 2023
1 parent 5c65bb4 commit c46f4c2
Showing 1 changed file with 316 additions and 0 deletions.
316 changes: 316 additions & 0 deletions docs/spec/pubsub.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,316 @@
# Google Cloud Pub/Sub Protocol Binding for CloudEvents - Version 1.0

## Abstract

The Google Cloud Pub/Sub Protocol Binding for CloudEvents defines how events are
mapped to Pub/Sub 1.1 request and response messages.

## Status of this document

This document is a working draft.

## Table of Contents

1. [Introduction](#1-introduction)

- 1.1. [Conformance](#11-conformance)
- 1.2. [Relation to Pub/Sub](#12-relation-to-pubsub)
- 1.3. [Content Modes](#13-content-modes)
- 1.4. [Event Formats](#14-event-formats)
- 1.5. [Security](#15-security)

2. [Use of CloudEvents Attributes](#2-use-of-cloudevents-attributes)

- 2.1. [datacontenttype Attribute](#21-datacontenttype-attribute)
- 2.2. [Data](#22-data)

3. [Pub/Sub Message Mapping](#3-pubsub-message-mapping)

- 3.1. [Binary Content Mode](#31-binary-content-mode)
- 3.2. [Structured Content Mode](#32-structured-content-mode)

4. [References](#4-references)

## 1. Introduction

[CloudEvents][ce] is a standardized and protocol-agnostic definition of the
structure and metadata description of events. This specification defines how the
elements defined in the CloudEvents specification are to be used in
[Pub/Sub][pubsub] requests and response messages.

### 1.1. Conformance

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD",
"SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be
interpreted as described in [RFC2119][rfc2119].

### 1.2. Relation to Pub/Sub

Events can be produced using a Topic or consumed using Push or Pull based
Subscriptions.

### 1.3. Content Modes

The CloudEvents specification defines three content modes for
transferring events: _structured_, _binary_ and _batch_. The Pub/Sub
binding does not currently support batch mode. Every compliant
implementation SHOULD support both structured and binary modes.

In the _structured_ content mode, event metadata attributes and event data are
placed into the Pub/Sub message body section using an
[event format](#14-event-formats).

In the _binary_ content mode, the value of the event `data` attribute is placed
into the Pub/Sub payload data as-is, with the `datacontenttype` attribute value
declaring its media type; all other event attributes are mapped to Pub/Sub
attributes.

### 1.4. Event Formats

Event formats, used within the _structured_ content mode, define how an event
is expressed in a particular data format. All implementations of this specification
MUST support the [JSON event format][json-format], but MAY support any additional,
including proprietary, formats.

### 1.5. Security

This specification does not introduce any new security features for Pub/Sub, or
mandate specific existing features to be used.

## 2. Use of CloudEvents Attributes

This specification does not further define any of the [CloudEvents][ce] event
attributes.

This mapping is intentionally robust against changes, including the addition and
removal of event attributes, and also accommodates vendor extensions to the
event metadata.

### 2.1. datacontenttype Attribute

The `datacontenttype` attribute SHOULD contain a [RFC2046][rfc2046] compliant
media-type expression.

### 2.2. Data

The event data (also known as the event payload) SHOULD contain
opaque application data that is encoded as declared by the
`datacontenttype` attribute.

An application MAY hold the information in any in-memory
representation of its choosing, but as the value is transposed into
a Pub/Sub message as defined in this specification, the assumption
is that the event data is represented as a sequence of bytes.

For instance, if the declared `datacontenttype` is
`application/json;charset=utf-8`, the expectation is that the event
data is represented as [UTF-8][rfc3629] encoded JSON text Pub/Sub
message data.

## 3. Pub/Sub Message Mapping

With Pub/Sub, the content mode is chosen by the sender of the event. Protocol
usage patterns that might allow solicitation of events using a particular
content mode might be defined by an application, but are not defined here.

The receiver of the event can distinguish between the two content modes by
inspecting the [Pub/Sub Attribute `content-type`][content-type] of the [Pub/Sub
message][pubsub-message]. If the attribute is present and its value is prefixed
with the CloudEvents media type `application/cloudevents`, indicating the use of
a known [event format](#14-event-formats), the receiver SHOULD use _structured_
mode, otherwise it MUST default to _binary_ mode.

If a receiver finds a CloudEvents media type as per the above rule, but with an
event format that it cannot handle, for instance `application/cloudevents+avro`,
it MAY still treat the event as binary and forward it to another party as-is.

### 3.1. Binary Content Mode

The _binary_ content mode accommodates any shape of event data, and allows for
efficient transfer and without transcoding effort.

The _binary_ content mode can only be used if the metadata attributes fit within
the [limits of Pub/Sub messages][pubsub-message-quotas].

#### 3.1.1. content-type

For the _binary_ mode, the [Pub/Sub Attribute `content-type`][content-type], if
present, MUST be used as the CloudEvents `datacontenttype` attribute.

#### 3.1.2. Event Data Encoding

The [event data](#22-data) byte-sequence MUST be used as the value of the
Pub/Sub message data payload.

#### 3.1.3. Metadata Attributes

All [CloudEvents][ce] attributes MUST be individually mapped to and from
distinct Pub/Sub message attributes.

##### 3.1.3.1 Pub/Sub Attribute Names

The naming convention for the Pub/Sub attribute mapping of the
CloudEvents core context attributes is that each attribute name MUST
be prefixed with "ce-".

Examples:

* `time` maps to `ce-time`
* `id` maps to `ce-id`
* `specversion` maps to `ce-specversion`

Attribute names are lower-case.

##### 3.1.3.2 Pub/Sub Attribute Values

The value for each Pub/Sub attribute MUST be constructed from the respective
CloudEvents attribute type's [canonical string representation][ce-types].

#### 3.1.4 Examples

This example shows the _binary_ mode mapping of an event to a Pub/Sub Message:

```text
--- Attributes ---
{
"ce-specversion": "1.0",
"ce-type": "com.example.someevent",
"ce-time": "2020-03-10T03:56:24Z",
"ce-id": "1234-1234-1234",
"ce-source": "/mycontext/subcontext",
.... further attributes ...
"ce-datacontenttype": "application/json; charset=utf-8",
}
--- Data ---
{
... application data ...
}
```

This example show a publish request on the REST API of the above Pub/Sub
Message:

```text
POST https://pubsub.googleapis.com/v1/{topic}:publish HTTP/1.1
Content-Type: application/json; charset=utf-8
Authorization: Bearer ... token ...
{
"messages": [
{
"attributes": {
"ce-specversion": "1.0",
"ce-type": "com.example.someevent",
"ce-time": "2020-03-10T03:56:24Z",
"ce-id": "1234-1234-1234",
"ce-source": "/mycontext/subcontext",
.... further attributes ...
"ce-datacontenttype": "application/json; charset=utf-8",
},
"data": "... base64 encoded Data JSON (see above) ..."
}
]
}
```

### 3.2. Structured Content Mode

The _structured_ content mode keeps event metadata and data together in the
payload, allowing simple forwarding of the same event across multiple routing
hops, and across multiple transports.

#### 3.2.1. content-type

The [Pub/Sub Attribute `content-type`][content-type] MUST be set to the media
type of an [event format](#14-event-formats).

Example for the [JSON format][json-format]:

```text
"content-type": "application/cloudevents+json; charset=UTF-8"
```

#### 3.2.2. Event Data Encoding

The chosen [event format](#14-event-formats) defines how all
attributes and event data are represented.

The event metadata and data are then rendered in accordance with the event
format specification and the resulting data becomes the Pub/Sub message data.

#### 3.2.3. Metadata Attributes

Implementations MAY include the same Pub/Sub attributes as defined for the
[binary mode](#313-metadata-attributes).

All CloudEvents metadata attributes MUST be mapped into the payload, even if
they are also mapped into Pub/Sub attributes.

#### 3.2.4 Examples

This example shows the _structured_ mode mapping of an event to a Pub/Sub
Message:

```text
--- Attributes ---
{
"content-type": "application/cloudevents+json; charset=utf-8"
}
--- Data ---
{
"specversion": "1.0",
"type": "com.example.someevent",
"time": "2018-04-05T03:56:24Z",
"id": "1234-1234-1234",
"source": "/mycontext/subcontext",
"datacontenttype": "application/json; charset=utf-8",
.... further attributes ...
"data": {
... application data ...
}
}
```

This example show a publish request on the REST API of the above Pub/Sub
Message:

```text
POST https://pubsub.googleapis.com/v1/{topic}:publish HTTP/1.1
Content-Type: application/json; charset=utf-8
Authorization: Bearer ... token ...
{
"messages": [
{
"attributes": {
"content-type": "application/cloudevents+json; charset=utf-8"
},
"data": "... base64 encoded Data JSON (see above) ..."
}
]
}
```

## 4. References

- [PUBSUB][pubsub] Google Cloud Pub/Sub
- [PUBSUB-MESSAGE][pubsub-message] The Google Cloud Pub/Sub message
- [PUBSUB-MESSAGE-QUOTAS][pubsub-message-quotas] The Google Cloud Pub/Sub
message quotas
- [RFC2046][rfc2046] Multipurpose Internet Mail Extensions (MIME) Part Two:
Media Types
- [RFC2119][rfc2119] Key words for use in RFCs to Indicate Requirement Levels
- [RFC3629][rfc3629] UTF-8, a transformation format of ISO 10646
- [RFC3986][rfc3986] Uniform Resource Identifier (URI): Generic Syntax

[ce]: https://github.com/cloudevents/spec/blob/master/spec.md
[ce-types]: https://github.com/cloudevents/spec/blob/master/spec.md#type-system
[pubsub]: https://cloud.google.com/pubsub/
[pubsub-message]: https://cloud.google.com/pubsub/docs/reference/rest/v1/PubsubMessage
[pubsub-message-quotas]: https://cloud.google.com/pubsub/quotas
[json-format]: https://github.com/cloudevents/spec/blob/master/json-format.md
[content-type]: https://tools.ietf.org/html/rfc7231#section-3.1.1.5
[rfc2046]: https://tools.ietf.org/html/rfc2046
[rfc2119]: https://tools.ietf.org/html/rfc2119
[rfc3629]: https://tools.ietf.org/html/rfc3629
[rfc3986]: https://tools.ietf.org/html/rfc3986

0 comments on commit c46f4c2

Please sign in to comment.