docs: Adding language recommendation #4266

franciscojavierarceo · 2024-06-11T01:44:47Z

What this PR does / why we need it:

This PR adds documentation about why we recommend using Feast with a Python Microservice.

Which issue(s) this PR fixes:

Fixes #4266

Signed-off-by: Francisco Javier Arceo <[email protected]>

docs/getting-started/architecture-and-components/language.md

tokoko · 2024-06-12T04:16:52Z

Here we go 😄 I'm obviously very conflicted about this. I think we might be conflating two points here:

1. Should I reimplement feature computation logic? To be honest, I think this is the part this text is addressing, rather than serving language choice. I'm very much on board here: you shouldn't reimplement, because of skew, effort and all the other reasons outlined in the doc. But if you manage to stick to "precomputation is the way" religiously (i.e. you're not using ODFVs), that also means you have effectively decoupled computation from serving, so the discussion about reimplementing features no longer has any bearing on the choice of serving environment language.
2. Should I run a feature server in something other than Python? If we disregard ODFVs for a minute, this is more of an implementation/maintenance burden for us than for the user. Of course, I'd also go with Python at this point in time because I'm not sure the others work at all, but if we make them work and reimplement the online store retrieval logic for most online store implementations in Java/Go, there's really no reason why someone shouldn't switch to them (other than ODFVs).

Executing ODFVs written in python in a non-python serving environment is admittedly very tricky, but also something that's on us to solve, not on the user. If we manage to find good ways to do it w/o requiring reimplementation (by optimizing transformation server, making arrow-based go-python interop work or by somehow leveraging substrait), that also becomes a non-issue for the user.

franciscojavierarceo · 2024-06-12T13:01:59Z

So in my last role I used ODFVs to create features and then persisted that output to a regular FV.

This was a simple hack to allow us to launch what we needed faster, but the real solution is to enable FVs to have feature computations equivalent to how ODFVs work with the Python mode.

The point is that feature transformations (i.e., precomputation) should happen in Python as well. ODFVs will still need to support some lightweight calculations (e.g., date differences from datetime.now()).
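As a rough illustration of the kind of lightweight request-time calculation mentioned above, here is a minimal sketch in plain Python. It is not Feast's API: the function name `days_since_signup`, the `signup_timestamp` input key, and the dict-of-lists input/output shape are all assumptions made for the example, and the `now` parameter exists only so the logic can be exercised deterministically.

```python
from datetime import datetime, timezone
from typing import Dict, List, Optional


def days_since_signup(
    inputs: Dict[str, List[datetime]],
    now: Optional[datetime] = None,
) -> Dict[str, List[int]]:
    """Hypothetical on-demand transformation: whole days elapsed since signup.

    `inputs` maps a feature name to a list of values, one per entity row.
    The only work done at request time is a date subtraction, which is the
    sort of cheap calculation that cannot be precomputed because it depends
    on the current time.
    """
    # Fall back to the current UTC time when no reference time is supplied.
    now = now or datetime.now(timezone.utc)
    return {
        "days_since_signup": [(now - ts).days for ts in inputs["signup_timestamp"]]
    }
```

Everything heavier than this (aggregations, joins, model-based features) would be precomputed and stored, so the request path stays a lookup plus a small calculation.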

Let me know what you think, and thanks for the feedback! I can definitely incorporate it.

tokoko · 2024-06-12T13:34:36Z

You're referring to the BatchFeatureView concept, right? (That's Tecton lingo; we still have to implement an alternative.) I think I agree there. When implemented, BatchFeatureViews should only support Python as far as Feast is concerned. Having said that, we should also keep supporting externally computed features with normal FeatureView objects (Feature Table in Tecton lingo). Those externally computed features might or might not be written in Python, of course; we don't really have any say over that. I don't think we disagree there, I just wanted to stress that none of this (except ODFVs) has any bearing on the serving environment.

franciscojavierarceo · 2024-06-12T20:30:13Z

yeah agreed with that. I'll clarify this in the doc more and I'll cut a PR to support Feature Transformations in regular feature views

EXPEbdodla · 2024-06-13T23:18:27Z

docs/getting-started/architecture-and-components/overview.md

-Java and Go Clients are also available for online feature retrieval.
+Java and Go Clients are also available for online feature retrieval. 
+
+In general, we recommend [using Python](language.md) for your Feature Store microservice.


The problem I see with this recommendation is online feature retrieval latency. Python has higher latency than the Go or Java options. Do you think it's better to mention the latency impact?

franciscojavierarceo · 2024-06-14

I tried to address this point in this statement:

> Precomputing features is the recommended optimal path to ensure low latency performance. Reducing feature serving to a lightweight database lookup is the ideal pattern, which means the marginal overhead of Python should be tolerable.

But I can be more explicit.

tokoko · 2024-06-14

That definitely should be a factor. Even if you precompute, there will be applications out there with low-latency requirements and high enough load that Python server performance itself might become a bottleneck. I guess we are sort of trying to address that by introducing asyncio in Python online retrieval, but even that might not be enough for some use cases.

franciscojavierarceo · 2024-06-14

While that is true in theory, in practice Python works very well at quite high scale, so my goal is to make it clear that we recommend Python.

Regardless, I see that this is in the overview section so I'll add this snippet in it.
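To make the two ideas in this thread concrete (serving reduced to a lookup of precomputed values, and asyncio used to overlap those lookups), here is a minimal sketch. It does not show Feast's implementation: `ONLINE_STORE`, `read_entity`, and `read_many` are hypothetical names, the store is an in-memory dict, and the `asyncio.sleep` call merely stands in for the network round trip a real online store would incur.

```python
import asyncio
from typing import Dict, List

# Hypothetical in-memory stand-in for an online store of precomputed features.
ONLINE_STORE: Dict[str, Dict[str, float]] = {
    "driver_1001": {"conv_rate": 0.42, "trips_today": 7},
    "driver_1002": {"conv_rate": 0.35, "trips_today": 3},
}


async def read_entity(entity_key: str) -> Dict[str, float]:
    """Simulate one non-blocking key lookup against the online store."""
    # Placeholder for network I/O; the event loop can serve other lookups meanwhile.
    await asyncio.sleep(0.01)
    # Unknown keys yield an empty feature dict rather than raising.
    return ONLINE_STORE.get(entity_key, {})


async def read_many(entity_keys: List[str]) -> List[Dict[str, float]]:
    """Issue all lookups concurrently; results come back in request order."""
    return await asyncio.gather(*(read_entity(key) for key in entity_keys))
```

A caller would run `asyncio.run(read_many(["driver_1001", "driver_1002"]))`. Because the lookups overlap, total wait time is roughly that of the slowest single lookup rather than the sum, which helps with I/O wait but not with CPU-bound work in the Python process itself.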

docs: Adding language recommendation

497b5dd

Signed-off-by: Francisco Javier Arceo <[email protected]>

franciscojavierarceo marked this pull request as ready for review June 11, 2024 01:49

franciscojavierarceo requested review from jeremyary, tokoko and HaoXuAI June 11, 2024 01:49

franciscojavierarceo added the ok-to-test label Jun 11, 2024

franciscojavierarceo requested a review from shuchu June 11, 2024 10:59

shuchu reviewed Jun 12, 2024

docs/getting-started/architecture-and-components/language.md (outdated)

HaoXuAI approved these changes Jun 13, 2024

tokoko mentioned this pull request Jun 13, 2024

Phase out Provider interface #4057

Open

EXPEbdodla reviewed Jun 13, 2024

Update language.md

e4a7ad0

franciscojavierarceo commented Jun 14, 2024

docs/getting-started/architecture-and-components/overview.md (resolved)

Update docs/getting-started/architecture-and-components/overview.md

fa1beb8

franciscojavierarceo commented Jun 14, 2024

docs/getting-started/faq.md (outdated)

franciscojavierarceo added 2 commits June 14, 2024 01:40

Update docs/getting-started/faq.md

a57a7f1

Update language.md

3a8b8b9

franciscojavierarceo commented Jun 15, 2024

docs/getting-started/architecture-and-components/overview.md (outdated)

Update docs/getting-started/architecture-and-components/overview.md

c12dc12

franciscojavierarceo enabled auto-merge (squash) June 15, 2024 09:49

franciscojavierarceo merged commit ae4fc6c into master Jun 15, 2024
17 checks passed

franciscojavierarceo mentioned this pull request Jul 10, 2024

Update docs to recommend Python as the recommended language for a service #4264

Closed

tokoko deleted the use-python branch July 16, 2024 12:08
