[async] Partial SFG node GC (keep latest state writers/readers) #1915

xumingkuan · 2020-10-01T15:30:29Z

Related issue = #742

This PR keeps the latest state writers/readers in the SFG when all nodes are extracted to execute.

Most functions of the SFG need to be modified. This PR modifies

This PR doesn't modify

print()
dump_dot()

These functions don't need to be modified:

demote_activation()

This PR also tried to modify fuse() so that only tasks that are "near" in the topological order (i.e., position in nodes_ differ by < 512) are fused. This improves the complexity of a single invocation of fuse() from O(nm/64) to O(min(n, 512*4)*m/64 + nm/512)...

Benchmark: async_mgpcg.py, time of fuse() on kun: 0.746s (0.355s unaccounted) -> 1.334s (0.800s rebuild graph, 0.559s insert task)
(2.760s at commit c691248, 1.387s at commit da2a0f2 (0.920s rebuild graph), 1.243s at commit bc6f17f (0.704s rebuild graph))

[Click here for the format server]

xumingkuan · 2020-10-01T15:32:21Z

Do we need bool executed in the SFGNode?

yuanming-hu · 2020-10-01T19:04:23Z

Do we need bool executed in the SFGNode?

Sounds like a good idea!

…ng_node_id = -1 for other nodes

xumingkuan · 2020-10-02T09:08:40Z

taichi/program/state_flow_graph.cpp

-  for (int i = 1; i < (int)nodes_.size(); i++) {
+  for (int i = first_pending_task_index_; i < (int)nodes_.size(); i++) {


It's better to make use of get_pending_tasks(), but I make the changeset minimal here to avoid conflicts with #1907.

sorry, i merged #1907 and it has created mrege conflict now.. But is it possible to use get_pending_tasks() now?

xumingkuan · 2020-10-02T09:50:24Z

async_mgpcg.py fails in optimize_listgen now. Will debug later.

…time on rebuild_graph

codecov · 2020-10-02T11:52:24Z

Codecov Report

Merging #1915 into master will increase coverage by 0.02%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master    #1915      +/-   ##
==========================================
+ Coverage   43.83%   43.86%   +0.02%     
==========================================
  Files          45       45              
  Lines        6194     6199       +5     
  Branches     1100     1101       +1     
==========================================
+ Hits         2715     2719       +4     
- Misses       3310     3311       +1     
  Partials      169      169

Impacted Files	Coverage Δ
python/taichi/lang/impl.py	`66.84% <0.00%> (+0.17%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a1c6d7c...2b1b903. Read the comment docs.

# Conflicts: # taichi/program/state_flow_graph.cpp

k-ye

Thanks, most parts LGTM!

I have one suggestion. It took me some amount of time to wrap my head around how first_pending_task_index_ and pending_node_ids are maintained. IMHO, we only need to re-id pending node IDs when first_pending_task_index_ has been modified:

rebuild_graph(). Because we could remove empty tasks, first_pending_task_index_ might shrink. (* if first_pending_task_index_ doesn't change, does that imply we don't even have to re-id?)
mark_pending_tasks_as_executed(). After this, all the nodes_ should be in the executed state. (We don't have to do another re-id pass if we just do pending_node_id = -1 or mark_execution() as L112's for-loop goes.)

For other places that currently updates pending_node_id, i.e. get_pending_tasks(begin, end) and topo_sort_nodes(), I think we should instead verifies/asserts on these IDs. If they don't meet the verification, that could very well indicate a bug. WDYT?

taichi/program/state_flow_graph.cpp

taichi/program/state_flow_graph.h

k-ye

Great, LGTM!

taichi/program/state_flow_graph.cpp

k-ye · 2020-10-04T04:50:15Z

taichi/program/state_flow_graph.cpp

@@ -783,17 +916,16 @@ void StateFlowGraph::delete_nodes(

  nodes_ = std::move(new_nodes_);
  reid_nodes();
+  reid_pending_nodes();


BTW, can we assert that all the nodes in indices_to_delete are pending?

Maybe we can refactor indices_to_delete such that the pending_node_id instead of node_id is passed into the unordered_map?

Ah yeah, that should be better :) I saw that you have explicitly converted a pending index into a "global" one when populating indices_to_delete..

xumingkuan · 2020-10-04T06:09:27Z

Note: async_mgpcg.py doesn't need to delete any nodes in optimize_dead_store.

xumingkuan added 2 commits October 1, 2020 23:14

Separate SFG::extract() into rebuild_graph() and extract_to_execute()

6f4b828

Add declaration of SFG::fuse_range()

835ba5c

xumingkuan added 7 commits October 2, 2020 14:42

Add SFGNode::executed and update verify()

178ec38

modify topo_sort_nodes()

55900a5

Fix verify and topo sort

e329bf8

modify compute_transitive_closure()

53f0ceb

Modify fuse() (implement fuse_range())

1ddebe5

Fix compute_transitive_closure and let reid_pending_nodes() set pendi…

c56deb5

…ng_node_id = -1 for other nodes

Modify optimize_dead_store() and add some assertions

8e6f543

xumingkuan marked this pull request as ready for review October 2, 2020 08:53

Fix fuse_range()

b709931

xumingkuan commented Oct 2, 2020

View reviewed changes

xumingkuan changed the title ~~[async] Partial SFG node GC (keep latest state writers/readers)~~ [async] Partial SFG node GC and improve fusion speed Oct 2, 2020

xumingkuan added 4 commits October 2, 2020 19:17

Modify fuse() to make optimize_listgen() work?

f885652

[skip ci] code format

c691248

[skip ci] test

dbaf5e4

Change fuse_range's return type to std::unordered_set<int> to reduce …

da2a0f2

…time on rebuild_graph

Revert changes on logic of fuse() in this PR

0dc2afd

xumingkuan changed the title ~~[async] Partial SFG node GC and improve fusion speed~~ [async] Partial SFG node GC (keep latest state writers/readers) Oct 2, 2020

xumingkuan added 5 commits October 2, 2020 20:06

Merge branch 'master' into sfg-gc

2728352

# Conflicts: # taichi/program/state_flow_graph.cpp

Fix compilation error

f0ddd3c

use get_pending_tasks()

c7aa2ec

[skip ci] TI_AUTO_PROF for insert_task

159f6aa

[skip ci] TI_AUTO_PROF

bc6f17f

xumingkuan requested review from yuanming-hu and k-ye October 2, 2020 15:56

k-ye reviewed Oct 3, 2020

View reviewed changes

taichi/program/state_flow_graph.cpp Outdated Show resolved Hide resolved

taichi/program/state_flow_graph.cpp Outdated Show resolved Hide resolved

taichi/program/state_flow_graph.cpp Show resolved Hide resolved

taichi/program/state_flow_graph.h Outdated Show resolved Hide resolved

xumingkuan added 3 commits October 4, 2020 00:49

Apply review suggestions

556e707

Apply review suggestions and make SFG::verify() const

da674af

Fix tests: add reid_pending_nodes() to topo_sort_nodes()

38608ee

xumingkuan requested a review from k-ye October 3, 2020 17:28

k-ye approved these changes Oct 4, 2020

View reviewed changes

Fix L1017

5ff1ce5

update profiler name

2b1b903

yuanming-hu merged commit 16e6bc3 into taichi-dev:master Oct 4, 2020

yuanming-hu mentioned this pull request Oct 7, 2020

[release] v0.6.39 #1928

Merged

xumingkuan deleted the sfg-gc branch October 12, 2020 08:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[async] Partial SFG node GC (keep latest state writers/readers) #1915

[async] Partial SFG node GC (keep latest state writers/readers) #1915

xumingkuan commented Oct 1, 2020 •

edited

Loading

xumingkuan commented Oct 1, 2020

yuanming-hu commented Oct 1, 2020

xumingkuan Oct 2, 2020

k-ye Oct 2, 2020

xumingkuan commented Oct 2, 2020

codecov bot commented Oct 2, 2020 •

edited

Loading

k-ye left a comment •

edited

Loading

k-ye left a comment

k-ye Oct 4, 2020

xumingkuan Oct 4, 2020

k-ye Oct 4, 2020

xumingkuan commented Oct 4, 2020

		for (int i = 1; i < (int)nodes_.size(); i++) {
		for (int i = first_pending_task_index_; i < (int)nodes_.size(); i++) {

[async] Partial SFG node GC (keep latest state writers/readers) #1915

[async] Partial SFG node GC (keep latest state writers/readers) #1915

Conversation

xumingkuan commented Oct 1, 2020 • edited Loading

xumingkuan commented Oct 1, 2020

yuanming-hu commented Oct 1, 2020

xumingkuan Oct 2, 2020

Choose a reason for hiding this comment

k-ye Oct 2, 2020

Choose a reason for hiding this comment

xumingkuan commented Oct 2, 2020

codecov bot commented Oct 2, 2020 • edited Loading

Codecov Report

k-ye left a comment • edited Loading

Choose a reason for hiding this comment

k-ye left a comment

Choose a reason for hiding this comment

k-ye Oct 4, 2020

Choose a reason for hiding this comment

xumingkuan Oct 4, 2020

Choose a reason for hiding this comment

k-ye Oct 4, 2020

Choose a reason for hiding this comment

xumingkuan commented Oct 4, 2020

xumingkuan commented Oct 1, 2020 •

edited

Loading

codecov bot commented Oct 2, 2020 •

edited

Loading

k-ye left a comment •

edited

Loading