Settings for read optimized graph? #5403

porscheme · 2023-03-15T22:08:10Z

General Question

Our graph is read only...

We never do online UPSERTs.
We only update through SST files on a weekly cadence.

I wanted to know whether there are any read-optimized NebulaGraph or RocksDB settings.
Currently, graph walks are very slow.
Below is our query:

MATCH (p:Student)
WHERE id(p) IN [ ... ]

OPTIONAL MATCH (p)<-[:HAS_COURSE]-(a:CourseCodes)
OPTIONAL MATCH (p)<-[:STUDENT_HAS_SOCIAL]-(s:Social)
OPTIONAL MATCH (p)<-[:HAS_PROCEDURE]-(r:ProcedureCodes)

WITH
  p,
  COLLECT(DISTINCT a.CourseCodes.CodeId) as courses,
  COLLECT(DISTINCT s.Social.AttributeId) as social,
  COLLECT(DISTINCT r.ProcedureCodes.CodeId) as procedures

RETURN 
  p.Student.StudentId as Student, 
  p.Student.Gender as sex, 
  p.Student.Race as race, 
  p.Student.Ethnicity as ethnicity, 
  p.Student.MaritalStatus as maritalstatus,
  courses,
  procedures,
  social


porscheme · 2023-03-21T05:06:04Z

Anyone?

Sophie-Xie · 2023-03-27T07:26:26Z

@yixinglu Pls take a look, thanks.

yixinglu · 2023-03-27T07:51:02Z

Sorry for the late reply.

There are some options for performance tuning. You could try updating the following flags separately:

nebula-graphd.conf

--optimize_appendvertices=true
--max_job_size=10

nebula-storaged.conf

--query_concurrently=true
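
For anyone applying these flags across several hosts, here is a minimal Python sketch of one way to set them in a gflags-style config file's text. The helper name `set_flags` and the two dictionaries are hypothetical and are not part of NebulaGraph. It assumes each flag sits on its own `--name=value` line, as in the snippets above. The services typically need a restart before file changes take effect.

```python
import re

# Flags suggested above, grouped by the config file they belong to.
GRAPHD_FLAGS = {"optimize_appendvertices": "true", "max_job_size": "10"}
STORAGED_FLAGS = {"query_concurrently": "true"}


def set_flags(conf_text, updates):
    """Return conf_text with every flag in `updates` set to its new value.

    Existing `--name=value` lines are rewritten in place; flags that are
    missing are appended at the end. Comments and other lines are kept.
    """
    remaining = dict(updates)
    out = []
    for line in conf_text.splitlines():
        match = re.match(r"^--([A-Za-z0-9_]+)=", line.strip())
        if match and match.group(1) in remaining:
            name = match.group(1)
            out.append(f"--{name}={remaining.pop(name)}")
        else:
            out.append(line)
    # Append any flags that were not already present in the file.
    for name, value in remaining.items():
        out.append(f"--{name}={value}")
    return "\n".join(out) + "\n"
```

The function only transforms text, so reading and writing the actual files (whose paths depend on the installation) is left to the caller.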

In addition, you can profile the above query in your environment to see where the bottleneck is at runtime.
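
As a rough illustration of the profiling suggestion, the Python sketch below builds a `PROFILE` statement for a batch of VIDs. The helper `profile_statement` and the trimmed query template are hypothetical: the template keeps only the first `OPTIONAL MATCH` branch of the query above so that each branch could be profiled on its own, and it assumes the VIDs are plain ASCII strings. The resulting string would still have to be run through the console or a client.

```python
import json

# Trimmed variant of the query from this issue: one OPTIONAL MATCH branch only.
QUERY_TEMPLATE = """MATCH (p:Student)
WHERE id(p) IN [{vids}]
OPTIONAL MATCH (p)<-[:HAS_COURSE]-(a:CourseCodes)
WITH p, COLLECT(DISTINCT a.CourseCodes.CodeId) AS courses
RETURN p.Student.StudentId AS Student, courses"""


def profile_statement(vids):
    """Build a PROFILE statement for the given list of string VIDs."""
    # json.dumps yields a double-quoted, escaped string literal per VID.
    vid_list = ", ".join(json.dumps(v) for v in vids)
    return "PROFILE " + QUERY_TEMPLATE.format(vids=vid_list)
```

Comparing the execution plans of the individual branches is one way to see which traversal dominates the runtime.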

By the way, which version of NebulaGraph are you using? If possible, consider upgrading to the latest version, since MATCH performance has been improved there.

porscheme · 2023-03-28T05:56:29Z

Our cluster configuration:
version: v3.4.0
metad: 3 (16 cores, 128 GB, 2 TB SSD)
graphd: 3 (16 cores, 128 GB, 2 TB SSD)
storaged: 5 (16 cores, 128 GB, 2 TB SSD)
Replica Factor: 3
VID: String 40 character size

We did see a performance improvement after setting the flags below, but not enough. Is there anything else we can do?

nebula-graphd.conf

--optimize_appendvertices=true
--max_job_size=10

nebula-storaged.conf

# This is on by default in Nebula v3.4.0; we set it explicitly just in case
--query_concurrently=true

porscheme changed the title ~~Settings read optimized graph?~~ Settings for read optimized graph? Mar 15, 2023

wey-gu mentioned this issue Mar 18, 2023

Weekly Report 2023-03-17 vesoft-inc/nebula-community#392

Closed

Sophie-Xie added the type/question Type: question about the product label Mar 27, 2023

wey-gu mentioned this issue Mar 27, 2023

Very slow graph walk #5401

Closed
