Skip to content

Commit e90ac88

Browse files
docs: Adding some content about the push vs pull model (feast-dev#4385)
* docs: Adding some content about the push vs pull model Signed-off-by: Francisco Javier Arceo <farceo@redhat.com> * checking in progress...all of this was mostly autogenerated Signed-off-by: Francisco Javier Arceo <farceo@redhat.com> * updated Signed-off-by: Francisco Javier Arceo <farceo@redhat.com> * Update push-vs-pull-model.md * Update push-vs-pull-model.md * Update push-vs-pull-model.md --------- Signed-off-by: Francisco Javier Arceo <farceo@redhat.com>
1 parent 125fa49 commit e90ac88

File tree

4 files changed

+36
-1
lines changed

4 files changed

+36
-1
lines changed

docs/README.md

+11-1
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ Feast (**Fea**ture **St**ore) is a customizable operational data system that re-
66

77
Feast allows ML platform teams to:
88

9-
* **Make features consistently available for training and serving** by managing an _offline store_ (to process historical data for scale-out batch scoring or model training), a low-latency _online store_ (to power real-time prediction)_,_ and a battle-tested _feature server_ (to serve pre-computed features online).
9+
* **Make features consistently available for training and low-latency serving** by managing an _offline store_ (to process historical data for scale-out batch scoring or model training), a low-latency _online store_ (to power real-time prediction)_,_ and a battle-tested _feature server_ (to serve pre-computed features online).
1010
* **Avoid data leakage** by generating point-in-time correct feature sets so data scientists can focus on feature engineering rather than debugging error-prone dataset joining logic. This ensure that future feature values do not leak to models during training.
1111
* **Decouple ML from data infrastructure** by providing a single data access layer that abstracts feature storage from feature retrieval, ensuring models remain portable as you move from training models to serving models, from batch models to realtime models, and from one data infra system to another.
1212

@@ -16,6 +16,16 @@ Feast allows ML platform teams to:
1616

1717
![](assets/feast_marchitecture.png)
1818

19+
{% hint style="info" %}
20+
**Note:** Feast uses a push model for online serving. This means that the feature store pushes feature values to the
21+
online store, which reduces the latency of feature retrieval. This is more efficient than a pull model, where the model
22+
serving system must make a request to the feature store to retrieve feature values. See
23+
[this document](getting-started/architecture-and-components/push-vs-pull-model.md) for a more detailed discussion.
24+
{% endhint %}
25+
26+
{% hint style="info" %}
27+
{% endhint %}
28+
1929
## Who is Feast for?
2030

2131
Feast helps ML platform teams with DevOps experience productionize real-time models. Feast can also help these teams build towards a feature platform that improves collaboration between engineers and data scientists.

docs/SUMMARY.md

+1
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,7 @@
2020
* [Architecture](getting-started/architecture-and-components/README.md)
2121
* [Overview](getting-started/architecture-and-components/overview.md)
2222
* [Language](getting-started/architecture-and-components/language.md)
23+
* [Push vs Pull Model](getting-started/architecture-and-components/push-vs-pull-model.md)
2324
* [Registry](getting-started/architecture-and-components/registry.md)
2425
* [Offline store](getting-started/architecture-and-components/offline-store.md)
2526
* [Online store](getting-started/architecture-and-components/online-store.md)

docs/getting-started/architecture-and-components/README.md

+4
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,10 @@
88
[overview.md](overview.md)
99
{% endcontent-ref %}
1010

11+
{% content-ref url="push-vs-pull-model.md" %}
12+
[push-vs-pull-model.md](push-vs-pull-model.md)
13+
{% endcontent-ref %}
14+
1115
{% content-ref url="registry.md" %}
1216
[registry.md](registry.md)
1317
{% endcontent-ref %}
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,20 @@
1+
# Push vs Pull Model
2+
3+
Feast uses a [Push Model](https://en.wikipedia.org/wiki/Push_technology), i.e.,
4+
Data Producers push data to the feature store and Feast stores the feature values
5+
in the online store, to serve features in real-time.
6+
7+
In a [Pull Model](https://en.wikipedia.org/wiki/Pull_technology), Feast would
8+
pull data from the data producers at request time and store the feature values in
9+
the online store before serving them (storing them would actually be unneccessary).
10+
This approach would incur additional network latency as Feast would need to orchestrate
11+
a request to each data producer, which would mean the latency would be at least as long as
12+
your slowest call. So, in order to serve features as fast as possible, we push data to
13+
Feast and store the feature values in the online store.
14+
15+
The trade-off with the Push Model is that strong consistency is not gauranteed out
16+
of the box. Instead, stong consistency has to be explicitly designed for in orchestrating
17+
the updates to Feast and the client usage.
18+
19+
The significant advantage with this approach is that Feast is read-optimized for low-latency
20+
feature retrieval.

0 commit comments

Comments
 (0)