[API] create a graph with additional edge properties #2521

seunghwak · 2022-08-09T17:27:51Z

Partially addresses #2479

We currently support only edge weights as an edge property. This PR's goal is to update cuGraph to support additional edge properties (currently only edge ID and type; eventually any arithmetic type or thrust tuple of arithmetic types).

This PR defines an API. There will be a separate PR for implementation.
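
To make the "any arithmetic type or tuple of arithmetic types" constraint concrete, here is a minimal sketch of a compile-time check. It is an illustration only: `std::tuple` stands in for `thrust::tuple` so the snippet builds without CUDA, and the names `is_tuple_of_arithmetic` and `is_valid_edge_property_v` are hypothetical, not taken from this PR.

```cpp
#include <tuple>
#include <type_traits>

// Hypothetical trait: true only for non-empty tuples whose elements are all arithmetic.
// std::tuple is an assumed stand-in for thrust::tuple.
template <typename T>
struct is_tuple_of_arithmetic : std::false_type {};

template <typename... Ts>
struct is_tuple_of_arithmetic<std::tuple<Ts...>>
  : std::bool_constant<(sizeof...(Ts) > 0) && (std::is_arithmetic_v<Ts> && ...)> {};

// Hypothetical helper: an edge property value type is either a single arithmetic
// type or a tuple of arithmetic types.
template <typename T>
inline constexpr bool is_valid_edge_property_v =
  std::is_arithmetic_v<T> || is_tuple_of_arithmetic<T>::value;
```

Under these assumptions, an edge ID and type pair such as `std::tuple<long, int>` passes the check, while a tuple containing a pointer does not.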

seunghwak · 2022-08-16T18:28:41Z

rerun tests

codecov-commenter · 2022-08-16T22:13:29Z

Codecov Report

❗ No coverage uploaded for pull request base (branch-22.10@cc05758).
The diff coverage is n/a.

@@               Coverage Diff               @@
##             branch-22.10    #2521   +/-   ##
===============================================
  Coverage                ?   61.11%           
===============================================
  Files                   ?      106           
  Lines                   ?     5634           
  Branches                ?        0           
===============================================
  Hits                    ?     3443           
  Misses                  ?     2191           
  Partials                ?        0

naimnv

It looks good and understandable to me.

naimnv · 2022-08-26T12:39:18Z

cpp/include/cugraph/edge_property.hpp

+  auto view() const
+  {
+    using const_value_iterator = decltype(get_dataframe_buffer_cbegin(buffers_[0]));
+
+    std::vector<const_value_iterator> edge_partition_value_firsts(buffers_.size());
+    std::vector<edge_type> edge_partition_edge_counts(buffers_.size());
+    for (size_t i = 0; i < edge_partition_value_firsts.size(); ++i) {
+      edge_partition_value_firsts[i] = get_dataframe_buffer_cbegin(buffers_[i]);
+      edge_partition_edge_counts[i]  = size_dataframe_buffer(buffers_[i]);
+    }
+
+    return detail::edge_property_view_t<edge_type, const_value_iterator>(
+      edge_partition_value_firsts, edge_partition_edge_counts);
+  }
+
+  auto mutable_view()
+  {
+    using value_iterator = decltype(get_dataframe_buffer_begin(buffers_[0]));
+
+    std::vector<value_iterator> edge_partition_value_firsts(buffers_.size());
+    std::vector<edge_type> edge_partition_edge_counts(buffers_.size());
+    for (size_t i = 0; i < edge_partition_value_firsts.size(); ++i) {
+      edge_partition_value_firsts[i] = get_dataframe_buffer_begin(buffers_[i]);
+      edge_partition_edge_counts[i]  = size_dataframe_buffer(buffers_[i]);
+    }
+
+    return detail::edge_property_view_t<edge_type, value_iterator>(edge_partition_value_firsts,
+                                                                   edge_partition_edge_counts);
+  }
+


@seunghwak Would it be a good idea to merge these two functions into one and use a flag to return a mutable or immutable view?

auto mutable_or_immutable_view(bool mutable)
{
  using value_iterator = mutable ? decltype(get_dataframe_buffer_begin(buffers_[0]))
                                 : decltype(get_dataframe_buffer_cbegin(buffers_[0]));

  std::vector<value_iterator> edge_partition_value_firsts(buffers_.size());
  std::vector<edge_type> edge_partition_edge_counts(buffers_.size());
  for (size_t i = 0; i < edge_partition_value_firsts.size(); ++i) {
    edge_partition_value_firsts[i] = mutable ? get_dataframe_buffer_begin(buffers_[i])
                                             : get_dataframe_buffer_cbegin(buffers_[i]);
    edge_partition_edge_counts[i]  = size_dataframe_buffer(buffers_[i]);
  }

  return detail::edge_property_view_t<edge_type, value_iterator>(
    edge_partition_value_firsts, edge_partition_edge_counts);
}

auto view() const { return mutable_or_immutable_view(false); }

auto mutable_view() { return mutable_or_immutable_view(true); }

This naming follows something of a RAPIDS convention (or, more precisely, cuDF's naming scheme):

https://github.com/rapidsai/cudf/blob/branch-22.10/cpp/include/cudf/column/column_device_view.cuh#L928
This convention has also leaked into cuCollections (https://github.com/NVIDIA/cuCollections/blob/dev/include/cuco/static_map.cuh#L772), which is not surprising given that the cuDF and cuCollections developers overlap.

I'm more inclined to stick with this convention unless there is a very strong reason not to.

I was rather wondering whether we could have one common function with a boolean flag indicating a constant or mutable view. Depending on the boolean value, the common function would return a constant or a mutable view.

`auto view() const {
  return common_function(false);
}

auto mutable_view() {
  return common_function(true);
}`

That common function need not be exposed publicly. The objective is to get rid of the nearly duplicate code.

I guess the code above won't compile unless mutable is a non-type template parameter (so that it can be evaluated at compile time).

using value_iterator = mutable? decltype(get_dataframe_buffer_begin(buffers_[0])):decltype(get_dataframe_buffer_cbegin(buffers_[0]));

edge_partition_value_firsts[i] = mutable?get_dataframe_buffer_begin(buffers_[i]): get_dataframe_buffer_cbegin(buffers_[i]);

some = A ? B : C; // B and C should have the same type to compile.

We would need to use std::conditional_t and if constexpr (mutable), but in that case mutable can't be a run-time variable.

If I create

template <bool mutable> auto mutable_or_immutable_view() { ... }

I guess the benefit in binary size is gone. I'm not sure avoiding code duplication is worth the added complexity.

I didn't think of that. It seems we'd better merge it as is.

naimnv · 2022-08-26T13:11:32Z

cpp/include/cugraph/edge_partition_edge_property_device_view.cuh

+namespace detail {
+
+template <typename edge_t, typename ValueIterator>
+class edge_partition_edge_property_device_view_t {


Would edge_property_partition_device_view_t read better?

So, there is edge_partition_endpoint_property_device_view_t as well.

This name implies a device view object for edge properties in an edge partition (and edge_partition_endpoint_property_device_view_t means a device view object for edge endpoint (source/destination) properties in an edge partition).

And there will be vertex_partition_vertex_property_device_view_t in the future as well (to support vertex masking)... These names should be considered in this context.

edge_partition or vertex_partition comes first to emphasize that these work on a vertex partition or an edge partition.

"edge_property_partition" or "edge_endpoint_partition" sounds somewhat like we are partitioning edge properties or edge endpoints instead of edges... Edge partitioning comes first, and edge properties or endpoint properties just follow that edge partitioning.

Based on this assumption, let me know if you have suggestions for better names.

According to what you described, it's probably best to adhere to this naming convention.

ChuckHastings · 2022-08-26T17:36:34Z

@gpucibot merge

initial edge property data structure definitions

0f38540

seunghwak requested a review from a team as a code owner August 9, 2022 17:27

seunghwak self-assigned this Aug 9, 2022

seunghwak added the feature request, 2 - In Progress, and non-breaking labels Aug 9, 2022

seunghwak added this to the 22.10 milestone Aug 9, 2022

seunghwak added 3 commits August 9, 2022 10:28

fix copyright year

540a6c2

define create_graph_from_edgelist function API that takes edge IDs in…

278a6d4

… addition

clang-format

6e828f5

seunghwak added 3 - Ready for Review and removed 2 - In Progress labels Aug 9, 2022

seunghwak requested a review from ChuckHastings August 9, 2022 20:00

seunghwak changed the title ~~[WIP] create a grpah with additional edge properties~~ [API] create a grpah with additional edge properties Aug 9, 2022

seunghwak added 4 commits August 11, 2022 12:10

Merge branch 'branch-22.10' of github.com:rapidsai/cugraph into fea_e…

869ac96

…dge_property

move edge_property.hpp

fe30d99

move file

e4dab0b

fix compile errors

4c21ab2

seunghwak added 2 commits August 18, 2022 13:36

update create_graph_from_egdelist to take edge ID & type pairs

d793c94

Merge branch 'branch-22.10' of github.com:rapidsai/cugraph into fea_e…

c3f3ced

…dge_property

seunghwak requested review from naimnv and jnke2016 August 23, 2022 15:54

ChuckHastings approved these changes Aug 23, 2022

ChuckHastings changed the title ~~[API] create a grpah with additional edge properties~~ [API] create a graph with additional edge properties Aug 23, 2022

Merge branch 'branch-22.10' of github.com:rapidsai/cugraph into fea_e…

06c9502

…dge_property

naimnv approved these changes Aug 26, 2022

rapids-bot bot merged commit 1f83c6b into rapidsai:branch-22.10 Aug 26, 2022

seunghwak deleted the fea_edge_property branch October 20, 2022 18:50
