Async implementation of driver/adapter #171

elijahbenizzy · 2022-08-06T23:40:37Z

@skrawcz this is what I meant

Basically we pass the coroutine the entire way through.
When a function depends on other data, we create a coroutine that
awaits everything. Finally, we do a gather at the end.

I think this is optimal but will need to dig in a bit/look into
things.

Otherwise, nifty POC with very rough edges.

The biggest hack is that I'm using a cache to store coroutine values as python makes it very hard to access them.
I think we can do better, but the point is we don't have to mess with execution.

[Short description explaining the high-level reason for the pull request]

Changes

Testing

Notes

Checklist

Testing checklist

Python - local testing

python 3.6
python 3.7

hamilton/driver.py

elijahbenizzy · 2022-08-07T19:42:20Z

Remaining:

Tests
Determine how to release (experimental?)

Basically we pass the coroutine the entire way through. When a function depends on other data, we create a coroutine that awaits everything. Finally, we do a gather at the end. This involves no modification to the functiongraph code

hamilton/experimental/h_async.py

examples/async/fastapi_example.py

If we use tasks, we can await them twice. Furthermore, they'll begin scheduling earlier, and python handles keeping track of them. This is a much cleaner way to handle it.

examples/async/README.md

skrawcz · 2022-08-08T04:42:32Z

examples/async/fastapi_example.py

+        ad_hoc_utils.create_temporary_module(
+            pipeline,
+            computation1,
+            foo,
+            bar,
+            some_data,
+            computation2))


We should probably not do this in an example. This doesn't feel very "ad hoc".

Otherwise 💡, fastapi could be a possible code compilation target too.

hamilton/experimental/h_async.py

skrawcz · 2022-08-08T04:51:47Z

hamilton/experimental/h_async.py

+        if display_graph:
+            raise ValueError(f'display_graph=True is not supported for the async graph adapter. '
+                             f'Instead you should be using visualize_execution.')
+        return await await_dict_of_tasks({key: asyncio.create_task(process_value(memoized_computation[key])) for key in final_vars})


Suggested change

return await await_dict_of_tasks({key: asyncio.create_task(process_value(memoized_computation[key])) for key in final_vars})

task_dict = {key: asyncio.create_task(process_value(memoized_computation[key])) for key in final_vars}

return await await_dict_of_tasks(task_dict)

Also is process_value needed here? Seems a bit redundant with the result being wrapped in a task and that awaited?

process_value is needed but create_task is not I think...

okay. But this seems like an important line that should be at least two... break out the dict comprehension at least.

examples/async/fastapi_example.py

skrawcz · 2022-08-10T00:02:48Z

hamilton/experimental/h_async.py

+        callabl = node.callable
+
+        async def new_fn(fn=callabl, **fn_kwargs):
+            fn_kwargs = await await_dict_of_tasks({key: process_value(value) for key, value in fn_kwargs.items()})


can move this dict comprehension to its own line.

skrawcz

Just those two code nits, otherwise I think this looks good.

1. Fixes docstring formats 2. Removes redundant call to create_task 3. Breaks example into modules

elijahbenizzy force-pushed the async-prototype branch from 7908643 to 081db80 Compare August 6, 2022 23:42

skrawcz reviewed Aug 6, 2022

View reviewed changes

hamilton/driver.py Outdated Show resolved Hide resolved

skrawcz reviewed Aug 6, 2022

View reviewed changes

hamilton/driver.py Outdated Show resolved Hide resolved

skrawcz reviewed Aug 6, 2022

View reviewed changes

hamilton/driver.py Outdated Show resolved Hide resolved

elijahbenizzy force-pushed the async-prototype branch from 081db80 to de39759 Compare August 7, 2022 00:00

skrawcz reviewed Aug 7, 2022

View reviewed changes

hamilton/driver.py Outdated Show resolved Hide resolved

elijahbenizzy force-pushed the async-prototype branch 4 times, most recently from 018fd96 to a8667ca Compare August 7, 2022 19:36

elijahbenizzy changed the title ~~Rough POC that we can do async without touching function graph~~ Async implementation of driver/adapter Aug 7, 2022

elijahbenizzy force-pushed the async-prototype branch from a8667ca to f5c9826 Compare August 7, 2022 19:42

Adds async graphadapter + driver

9bff5c5

Basically we pass the coroutine the entire way through. When a function depends on other data, we create a coroutine that awaits everything. Finally, we do a gather at the end. This involves no modification to the functiongraph code

elijahbenizzy force-pushed the async-prototype branch from f5c9826 to 6910d33 Compare August 7, 2022 19:51

skrawcz reviewed Aug 7, 2022

View reviewed changes

hamilton/experimental/h_async.py Show resolved Hide resolved

skrawcz reviewed Aug 7, 2022

View reviewed changes

examples/async/fastapi_example.py Outdated Show resolved Hide resolved

Adds README

97967a8

elijahbenizzy force-pushed the async-prototype branch from 6910d33 to 97967a8 Compare August 7, 2022 23:24

elijahbenizzy added 2 commits August 7, 2022 16:58

Gets rid of the coroutine cache in favor of using tasks

0f8ddeb

If we use tasks, we can await them twice. Furthermore, they'll begin scheduling earlier, and python handles keeping track of them. This is a much cleaner way to handle it.

Adds tests for async graphadapter + Driver

c0aacc8

elijahbenizzy marked this pull request as ready for review August 8, 2022 00:59

elijahbenizzy mentioned this pull request Aug 8, 2022

Add asyncio based driver and related components #167

Closed

skrawcz suggested changes Aug 8, 2022

View reviewed changes

skrawcz reviewed Aug 10, 2022

View reviewed changes

skrawcz approved these changes Aug 10, 2022

View reviewed changes

elijahbenizzy force-pushed the async-prototype branch 2 times, most recently from ca4b36f to bc3a7e2 Compare August 10, 2022 00:14

Improvements for PR

e044a40

1. Fixes docstring formats 2. Removes redundant call to create_task 3. Breaks example into modules

elijahbenizzy force-pushed the async-prototype branch from bc3a7e2 to e044a40 Compare August 10, 2022 00:16

elijahbenizzy merged commit ca529c1 into main Aug 10, 2022

elijahbenizzy deleted the async-prototype branch August 10, 2022 01:02

skrawcz linked an issue Aug 22, 2022 that may be closed by this pull request

Add asyncio based driver and related components #167

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Async implementation of driver/adapter #171

Async implementation of driver/adapter #171

elijahbenizzy commented Aug 6, 2022 •

edited

Loading

elijahbenizzy commented Aug 7, 2022 •

edited

Loading

skrawcz Aug 8, 2022

skrawcz Aug 8, 2022

elijahbenizzy Aug 8, 2022

skrawcz Aug 10, 2022

skrawcz Aug 10, 2022 •

edited

Loading

skrawcz left a comment

	return await await_dict_of_tasks({key: asyncio.create_task(process_value(memoized_computation[key])) for key in final_vars})
	task_dict = {key: asyncio.create_task(process_value(memoized_computation[key])) for key in final_vars}
	return await await_dict_of_tasks(task_dict)

Async implementation of driver/adapter #171

Async implementation of driver/adapter #171

Conversation

elijahbenizzy commented Aug 6, 2022 • edited Loading

Changes

Testing

Notes

Checklist

Testing checklist

Python - local testing

elijahbenizzy commented Aug 7, 2022 • edited Loading

skrawcz Aug 8, 2022

Choose a reason for hiding this comment

skrawcz Aug 8, 2022

Choose a reason for hiding this comment

elijahbenizzy Aug 8, 2022

Choose a reason for hiding this comment

skrawcz Aug 10, 2022

Choose a reason for hiding this comment

skrawcz Aug 10, 2022 • edited Loading

Choose a reason for hiding this comment

skrawcz left a comment

Choose a reason for hiding this comment

elijahbenizzy commented Aug 6, 2022 •

edited

Loading

elijahbenizzy commented Aug 7, 2022 •

edited

Loading

skrawcz Aug 10, 2022 •

edited

Loading