[Bug] Fix bug that prevented custom tables/grids with column referral #439
Conversation
Thanks for fixing so quickly! 🚀 What "potential bugs down the road" do you imagine? I actually like the alternative solution approach - it's also the one we implemented on VizX. As far as I remember it did not lead to any major bugs, and the performance issue was more severe. Do we know how other tools deal with this? E.g. do they load the data in batches, or do they always load the entire dataset?
So off the top of my head I could imagine that someone refers not only to a column, but also to a specific row. That would then fail, as there are no rows present. This seems to be one case of a wider range of problems where people refer to or rely on the size of the data, or generally anything regarding the original DF they provide. One could argue that it is bad design to write a custom grid/table like that, but the confusion of the user showed that people simply do not expect the data to differ at any point from what they originally provide.
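The failure mode described here can be sketched roughly as follows. This is a minimal illustration, not actual Vizro code: the builder function and the column name are hypothetical.

```python
import pandas as pd

# Hypothetical custom table builder, sketching the failure mode described
# above. The function and column names are illustrative, not Vizro API.
def custom_table(data_frame: pd.DataFrame) -> str:
    # Column references resolve as long as the column exists in the schema,
    # even on a zero-row frame...
    species = data_frame["species"]
    # ...but indexing a specific row fails when there are no rows.
    return species.iloc[0]

df = pd.DataFrame({"species": ["setosa", "virginica"]})
print(custom_table(df))      # -> "setosa" on the real data

# custom_table(df.head(0))   # IndexError: zero rows, so row 0 does not exist
```

So a build phase that passes an empty `pd.DataFrame()` breaks any user code like this, which is the confusion the bug report describes.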
So AgGrid, for example, offers the option of enabling batch loading via infinite scroll, but I think that defeats the point a little. In principle we are not concerned with loading the entire data; that is what we do anyway once the page loads with all filters/parameters etc. It is simply this initial build, which immediately gets overwritten, that we want to avoid because it is "redundant". Ideally we would want to just create an entirely different loading component (like for
Thanks for clarifying!
I would like to review this so please don't merge yet 🙂
tl;dr: thanks @petar-qb and @maxschulz-COL for your great work on this. Hopefully longer term we will somehow have an entirely better system here but for now this looks good.
I prefer the solution @petar-qb suggested in https://github.com/mckinsey/vizro/pull/439/files#r1580918187, so it would be great if you could try it out. But the current solution is fine too.
I am not hugely concerned right now about the performance hit this incurs, but I am not keen on it either. In general I don't much like the "double loading" we currently have where we do everything once with an empty dataframe (or the real thing, like now) and then immediately override it with the real data. This PR unfortunately, but necessarily, adds another layer of unsatisfactoriness to that scheme 😬
That isn't meant as a criticism of this PR or the existing system, since I know the current on-page-load system has its merits and lots of good reasoning behind it. I tried to come up with some improvements before and couldn't. I just hope that as part of the actions v2 work we can somehow improve this scheme though 🤞
Description
Closes #435 and https://github.com/McK-Internal/vizro-internal/issues/747
We now send the full DF during the build phase (instead of an empty `pd.DataFrame`). This could have performance consequences for very large DFs, but in the grand scheme of things I think it is probably the better solution. An alternative would be to send an empty DF with the columns present, but that may lead to other bugs down the road.
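The alternative mentioned here (an empty DF that still carries the columns) could be sketched like this; the DataFrame contents are illustrative, and `df.head(0)` is just one way to keep the schema while dropping the rows:

```python
import pandas as pd

df = pd.DataFrame(
    {"sepal_length": [5.1, 4.9], "species": ["setosa", "setosa"]}
)

# Zero-row frame that preserves column names and dtypes, so column
# references in custom tables/grids still resolve, but no row data
# is serialized during the initial build.
empty_with_schema = df.head(0)

assert list(empty_with_schema.columns) == list(df.columns)
assert len(empty_with_schema) == 0
```

This avoids transferring the full dataset at build time, but as noted above it would still fail for user code that relies on rows being present.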
Screenshot