Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhancement/optimize edge all predicate #5481

Merged

Conversation

czpmango
Copy link
Contributor

@czpmango czpmango commented Apr 6, 2023

What type of PR is this?

  • bug
  • feature
  • enhancement

What problem(s) does this PR solve?

Push edge all predicate down to the storage layer to optimize performance.

Checklist:

Tests:

  • Unit test(positive and negative cases)
  • Function test
  • Performance test
  • N/A

Affects:

  • Documentation affected (Please add the label if documentation needs to be modified.)
  • Incompatibility (If it breaks the compatibility, please describe it and add the label.)
  • If it's needed to cherry-pick (If cherry-pick to some branches is required, please label the destination version(s).)
  • Performance impacted: Consumes more CPU/Memory

@czpmango czpmango added the ready-for-testing PR: ready for the CI test label Apr 6, 2023
@czpmango czpmango force-pushed the enhancement/optimize-edge-all-predicate branch 2 times, most recently from 4c5d5b1 to 6c86d3c Compare April 6, 2023 09:49
@czpmango czpmango marked this pull request as draft April 6, 2023 10:52
jievince
jievince previously approved these changes Apr 6, 2023
Copy link
Contributor

@jievince jievince left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@yixinglu
Copy link
Contributor

yixinglu commented Apr 7, 2023

plz solve the conflicts of this PR

@czpmango
Copy link
Contributor Author

czpmango commented Apr 7, 2023

plz solve the conflicts of this PR

Ok.

@czpmango czpmango force-pushed the enhancement/optimize-edge-all-predicate branch 2 times, most recently from f4133f8 to 9916f04 Compare April 7, 2023 02:35
@czpmango czpmango marked this pull request as ready for review April 7, 2023 06:00
@czpmango czpmango force-pushed the enhancement/optimize-edge-all-predicate branch from bfc7a1c to fe5fb95 Compare April 7, 2023 06:00
[Optimizer]Embed edge all predicate into Traverse

Move rules

fix expr util

small change

fix compile

rename rule

Split opt rule

Push edge all filter

fmt

smll fix

Add tck

fix tck

fmt

Add tck

Add tck

Add tck

tmp

fix

fix

Fix tck
@czpmango czpmango force-pushed the enhancement/optimize-edge-all-predicate branch from d1848d5 to 08002ff Compare April 7, 2023 07:11
@@ -62,8 +62,12 @@ Status TraverseExecutor::buildRequestVids() {
auto vidType = SchemaUtil::propTypeToValueType(metaVidType.get_type());
for (; iter->valid(); iter->next()) {
const auto& vid = src->eval(ctx(iter));
DCHECK_EQ(vid.type(), vidType)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why delete this?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

crash

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should check why vid type is unexpected.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ACK.

jievince
jievince previously approved these changes Apr 7, 2023
Copy link
Contributor

@yixinglu yixinglu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good job!

maybe you should test some scenarios which could not push the all predicates down, such as:

all( i in e where i.likeness>95 AND true);
all(i in e WHERE i.likeness>95 AND v.player.name =="");
all(i in e WHERE i.likeness>95 OR i.likeness < 50);
...

canBePushed_ = false;
return;
}
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

could you extract these checks into following functions, such as:

bool checkEdgeSchema(const std::string &sym) const;
bool checkTagSchema(const std::string &sym) const;

if so, you could be able to use these in following way:

if (!checkTagSchema(expr->sym())) return;
if (!checkEdgeSchema(expr->sym()) return;

| 15 | Filter | 11 | | {"condition": "((n.player.age>0)==true)"} |
| 11 | AppendVertices | 14 | | |
| 14 | Filter | 13 | | {"condition": "(size($e)>0)"} |
| 13 | Traverse | 1 | | {"edge filter": "(*.likeness>90)"} |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why the predicate is sometimes in edge filter and sometimes in filter fields of Traverse?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

0 step filter cannot push to storage layer.

| 53 | Filter | 52 | | {"condition": "(id($friend)!=\"Tony Parker\")"} |
| 52 | AppendVertices | 59 | | |
| 59 | Filter | 60 | | {"condition": "(id($person)==\"Tony Parker\")"} |
| 60 | Traverse | 2 | | {"filter": "((like.likeness>0) AND (like.likeness>1))"} |
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we could merge the condition into like.likeness>1 later.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yep. Expression normalization is not optimized yet.

@czpmango
Copy link
Contributor Author

czpmango commented Apr 7, 2023

all(i in e WHERE i.likeness>95 AND v.player.name =="");

ACK.

@yixinglu yixinglu requested a review from jievince April 7, 2023 09:39
dutor added a commit that referenced this pull request Apr 20, 2023
* refactor traverse output (#5464)

* refactor traverse output

* fix pruneproperties error & none_direct_dst

* fix test error

* fix shortest path

* Change the compaction filter logic to let periodic compaction go through custom compaction filter, to gc expired data (#5447)

* Push filter down cross join (#5473)

* fix comment

* push down filter through cross join

---------

Co-authored-by: Sophie <84560950+Sophie-Xie@users.noreply.github.com>

* Fix shortest path crash (#5472)

* fix crash of geo (#5475)

* fix crash of geo

* change log(fatal) to log(error)

* fix miss arg $GITHUB_OUTPUT (#5478)

* Split optimizer rules (#5470)

Fix compile

small rename

Fix tck

Fix tck

fmt

Fix tck

Fix tck

* Enhancement/optimize edge all predicate (#5481)

* fix eval contains filter on storaged (#5485)

* fix eval contains filter on storaged

* add tck case

* add tck case

* fix tck

* fix lint

* fix lint

* Fix expression util function (#5487)

fmt

Co-authored-by: Sophie <84560950+Sophie-Xie@users.noreply.github.com>

* fix ContainsFilter random fail (#5489)

* Fixed graphd startup issue (#5493)

* fix prunproperties (#5494)

* stop the pushing down of not expressions that are not rewritten to proper forms. (#5502)

* Fix edge all predicate with rank function (#5503)

Co-authored-by: Sophie <84560950+Sophie-Xie@users.noreply.github.com>

* rewrite param in subgraph & path (#5500)

* check param in subgraph

* rewrite param in path

---------

Co-authored-by: Sophie <84560950+Sophie-Xie@users.noreply.github.com>

* Fix concurrent bug about session count (#5496)

* Fix regex expression (#5507)

* Update requirements.txt (#5512)

Solidified tomli version to solve centos7 compatibility issues

* Update cluster id (#5514)

---------

Co-authored-by: jimingquan <mingquan.ji@vesoft.com>
Co-authored-by: Ryan <ydlu1987@gmail.com>
Co-authored-by: Yee <2520865+yixinglu@users.noreply.github.com>
Co-authored-by: jie.wang <38901892+jievince@users.noreply.github.com>
Co-authored-by: George <58841610+Shinji-IkariG@users.noreply.github.com>
Co-authored-by: kyle.cao <kyle.cao@vesoft.com>
Co-authored-by: codesigner <codesigner.huang@vesoft.com>
Co-authored-by: dutor <440396+dutor@users.noreply.github.com>
Co-authored-by: Cheng Xuntao <7731943+xtcyclist@users.noreply.github.com>
Co-authored-by: Yichen Wang <18348405+Aiee@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants