Can Marimo use unique default names for cells to serialize to a nicer Python file? #1954

danielhfrank · 2024-08-05T22:03:02Z

Description

Right now, when I work on a Marimo notebook, I'm delighted that the serialized format is a plain python file. However, by default, each cell is serialized as a function with name "__". This makes for rather funny-looking python files, and if you have any sort of linter in your codebase, it will certainly not be happy.

Suggested solution

I can see that if I create a new name for my cell, then it will be used in the serialized python file. I think it would be great if each cell had a unique name, so that the rendered python file inherits those names for the autogenerated functions.
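To make the linting concern concrete, here is a minimal sketch. It embeds a small notebook file in the shape described above (each unnamed cell serialized as a function called `__`, a renamed cell keeping its own name) and uses the standard `ast` module to count repeated top-level function names, which is the kind of redefinition many linters complain about. The sample source and the helper `duplicate_function_names` are hypothetical and written only for this illustration; the notebook source is parsed, never executed, so marimo does not need to be installed.

```python
import ast
from collections import Counter

# Hypothetical notebook source, in the shape described in this issue.
NOTEBOOK_SOURCE = '''
import marimo

app = marimo.App()


@app.cell
def __():
    import marimo as mo
    return (mo,)


@app.cell
def __(mo):
    mo.md("hello")
    return


@app.cell
def load_data():
    data = [1, 2, 3]
    return (data,)
'''


def duplicate_function_names(source: str) -> dict[str, int]:
    """Return top-level function names that are defined more than once."""
    tree = ast.parse(source)
    counts = Counter(
        node.name for node in tree.body if isinstance(node, ast.FunctionDef)
    )
    return {name: n for name, n in counts.items() if n > 1}


print(duplicate_function_names(NOTEBOOK_SOURCE))
```

Only the default name is reported as repeated; the manually named cell `load_data` is not.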

Alternative

No response

Additional context

I would be happy to contribute this change myself! I could use some pointers on whether the functionality I'm talking about is coming from where I linked above. If so, I can read up on contributing guidelines, and try to put it together!


akshayka · 2024-08-05T22:54:45Z

The issue is that we want to introduce as few names into the file as possible. For example, in the future we might make it so that functions defined in a notebook that don't depend on any other variables are serialized directly into the notebook, instead of being wrapped in an app.cell decorator. If cells had unique names, especially human-readable ones, there would be a chance that these names would conflict with user-defined function names.

I understand the concern about linting, though. I am open to considering making the cell names unique to appease linters, though the names likely won't be very readable.

dmadisetti · 2024-08-06T00:49:53Z

I think there was the suggestion on naming based on execution graph position.

But to appease the linters, users can manually give names, which is much more readable than anything that could be automated. Relatedly, this code cleanliness check could be incorporated in a lint command: #1543

akshayka · 2024-08-06T00:59:28Z

> I think there was the suggestion on naming based on execution graph position.

Right. But this will add additional constraints to function names, thinking ahead to when we add top-level functions (@app.function). Perhaps that's okay, because most users likely wouldn't create functions named _cell_0, _cell_1, ...?

> But to appease the linters, users can manually give names, which is much more readable than anything that could be automated.

Yes definitely. I suppose @danielhfrank's concern is that this is too much work?

dmadisetti · 2024-08-06T14:53:32Z

marimo lint --suggest-names could pass cells into a GPT and run a prompt to rename cells?

I think renaming to _cell_x gives a false sense of security that a default cell can be reliably used in an import. Or maybe names with a _ prefix are hidden in the app API?

danielhfrank · 2024-08-06T17:22:42Z

Glad to see that this has sparked some conversation, I appreciate the attention!

> But to appease the linters, users can manually give names, which is much more readable than anything that could be automated.

> Yes definitely. I suppose @danielhfrank's concern is that this is too much work?

Yes, indeed, I see that I can manually rename cells, but it's a lot of work - I think there could be a much smoother user experience if this were done automatically.

Having thought about it a bit, from my own perspective, I wouldn't really care if the function names were very human readable, or even if they reflected any sort of ordering. I think that a name like _cell_33c443 (some random hex identifier) would work well for my and others' use cases, and I expect would not interfere with any user code.
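As a rough sketch of the random-identifier idea, the snippet below generates names of the form `_cell_<6 hex digits>` with the standard `secrets` module and retries on the (unlikely) event of a collision with a name already in use. The function `random_cell_name` and the `taken` set are hypothetical names for this illustration, not part of marimo.

```python
import secrets


def random_cell_name(taken: set[str], nbytes: int = 3) -> str:
    """Return an unused name like _cell_33c443 and record it in `taken`."""
    while True:
        # token_hex(3) yields 6 lowercase hex characters.
        name = f"_cell_{secrets.token_hex(nbytes)}"
        if name not in taken:
            taken.add(name)
            return name


# Names already present in the file, e.g. user-defined functions.
used_names = {"load_data"}
print([random_cell_name(used_names) for _ in range(3)])
```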

> marimo lint --suggest-names could pass cells into a GPT and run a prompt to rename cells?

I think that a marimo lint command could help here if this didn't sound appealing as default behavior. Of course, my personal preference would be to change the defaults to unique names.

Thanks again for the consideration, and again, I would be happy to help contribute if you all think this is a constructive change!

ggggggggg · 2024-08-07T19:08:58Z

> Right. But this will add additional constraints to function names, thinking ahead to when we add top-level functions (@app.function). Perhaps that's okay, because most users likely wouldn't create functions named _cell_0, _cell_1, ...?

I think it's pretty reasonable to define some cell naming scheme and enforce that users are not allowed to name cells that conflict with that scheme. As long as it has a clear error message when someone does conflict with a clear action item to fix it, I don't think you'll get any pushback. Think of it like a keyword in a language, you just can't use them as variables, end of story.
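A minimal sketch of what such enforcement could look like, assuming the reserved scheme is `_cell_` followed by digits (the `_cell_0`, `_cell_1`, ... pattern floated earlier in this thread). The pattern, the exception class, and `validate_user_name` are all hypothetical and not marimo's actual behavior.

```python
import re

# Assumption: auto-generated names look like _cell_0, _cell_1, ...
RESERVED_CELL_NAME = re.compile(r"_cell_\d+")


class ReservedCellNameError(ValueError):
    """A user-chosen name collides with the reserved naming scheme."""


def validate_user_name(name: str) -> str:
    """Return the name unchanged, or raise with an actionable message."""
    if not name.isidentifier():
        raise ValueError(f"{name!r} is not a valid Python identifier.")
    if RESERVED_CELL_NAME.fullmatch(name):
        raise ReservedCellNameError(
            f"The name {name!r} is reserved for auto-generated cells. "
            "Choose a name that does not match '_cell_<number>'."
        )
    return name


for candidate in ["load_data", "_cell_3"]:
    try:
        print("ok:", validate_user_name(candidate))
    except ReservedCellNameError as err:
        print("rejected:", err)
```

The error message states both what went wrong and how to fix it, which is the "clear action item" part of the suggestion.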

dmadisetti · 2024-09-18T15:10:42Z

A thought on this: the new cell hashing could be leveraged to come up with names that are unique to the cell and its position in the notebook's graph. This would also catch dupes, since identical cells would hash to the same cell name.

I think we could truncate the hash to be more friendly. Does a _ prefix seem like a reasonable cell name constraint?

This would be nice from a source control perspective too, because you could see the downstream effects that a single change might have.
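To illustrate the idea, here is a small sketch that derives a name from a cell's code plus the names of the cells it depends on, then truncates the digest. It uses plain SHA-256 from `hashlib` as a stand-in; marimo's actual cell hashing is not described in this thread and may work differently. `hashed_cell_name` and `name_chain` are hypothetical helpers.

```python
import hashlib


def hashed_cell_name(code: str, parent_names: list[str], length: int = 6) -> str:
    """Name a cell from its code and the names of its upstream cells."""
    digest = hashlib.sha256()
    digest.update(code.encode("utf-8"))
    # Sort parents so the name does not depend on the order they are listed in.
    for parent in sorted(parent_names):
        digest.update(b"\x00")  # separator between fields
        digest.update(parent.encode("utf-8"))
    return f"_cell_{digest.hexdigest()[:length]}"


def name_chain(first_cell_code: str) -> list[str]:
    """Name a two-cell notebook in which the second cell depends on the first."""
    upstream = hashed_cell_name(first_cell_code, [])
    downstream = hashed_cell_name("y = x + 1", [upstream])
    return [upstream, downstream]


print(name_chain("x = 1"))
print(name_chain("x = 2"))
```

Because each name folds in its parents' names, editing the first cell also renames the unchanged second cell, so a diff would show which cells a change reaches. Identical cells with identical parents get the same name, which is how duplicates would surface. Truncating the digest makes collisions between different cells possible, if unlikely, so a real scheme would need to handle that case.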

dmadisetti mentioned this issue Sep 14, 2024

app.function: top-level functions in notebook files #2293

Open

This was referenced Dec 11, 2024

improvement: generate unique function names to work with linting #3129

Closed

improvement: single '_' function names to work with linting #3143

Merged

mscolnick closed this as completed in #3143 Dec 12, 2024

mscolnick closed this as completed in e5af6a0 Dec 12, 2024
