IO::FileDescriptor & Socket finalizers do far too much #14807

ysbaddaden · 2024-07-12T21:12:55Z

The initial idea for the finalizer was to avoid leaking file descriptors when you'd call File.new instead of File.open(&) and then forgot to close it. See #780 and 3bfd62a.

The actual implementation called the #close method, and it hasn't changed ever since, and... it's actually doing a lot more things than the original idea meant to:

it flushes any buffered data;
it re-enqueues pending readers & writers;
it finally closes the file descriptor.

When called from regular code, this is the expected behavior, but let's contemplate that it may happen in a GC finalizer, that is be called during a GC collection, while the world is stopped and we want/need to resume it ASAP.

Note: while I think the unflushed buffer can happen in practice (we can write to a socket, forget to flush, then lose the reference) I don't think the pending reader/write can happen: it would mean a fiber got suspended trying to read or write, and they must have the file/socket reference in the function call somewhere (i.e. at least one pointer on the fiber stack) so the GC won't collect it.

Trying to flush means that it can try to write, hence call into the event loop, that may have to wait for the fd to be writable, which means calling into epoll_wait 😱 (while the world is stopped). The event loop implementation may need to allocate, or we get an error and try to raise an exception that will also try to allocate memory... during a GC collection 😱

Proposal: I'd merely call #file_descriptor_close in the finalizer and either always print a warning to STDERR when we do, or maybe only when there was pending buffered data (that won't be sent).

The text was updated successfully, but these errors were encountered:

straight-shoota · 2024-08-08T11:48:07Z

We need non-raising variants of file_descriptor_close. The default implementations raise if the system close errors. In a finalizer we shouldn't care about this error though (and it would involve an allocation for the exception).

I don't think printing a warning is a good idea. Closing in the finalizer doesn't really contribute to whether the buffer is flushed. If there was no finalizer, the file descriptor would never be closed (and no data flushed).
Maybe warnings could be useful as a tool while transitioning away from the current semantics with flush. But it shouldn't be an eternal thing. And then I don't see a good mechanism to decide when to enable it and when not. So I'd rather leave it away. Maybe we can provide an opt-in for such warnings.

ysbaddaden added kind:question topic:stdlib:system labels Jul 12, 2024

Blacksmoke16 added status:discussion kind:refactor and removed kind:question kind:refactor labels Jul 12, 2024

ysbaddaden mentioned this issue Jul 16, 2024

Epoll event loop (linux) #14814

Closed

20 tasks

straight-shoota mentioned this issue Aug 8, 2024

Avoid flush in finalizers for Socket and IO::FileDescriptor #14882

Merged

straight-shoota closed this as completed in #14882 Aug 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

IO::FileDescriptor & Socket finalizers do far too much #14807

IO::FileDescriptor & Socket finalizers do far too much #14807

ysbaddaden commented Jul 12, 2024 •

edited

Loading

straight-shoota commented Aug 8, 2024

IO::FileDescriptor & Socket finalizers do far too much #14807

IO::FileDescriptor & Socket finalizers do far too much #14807

Comments

ysbaddaden commented Jul 12, 2024 • edited Loading

straight-shoota commented Aug 8, 2024

ysbaddaden commented Jul 12, 2024 •

edited

Loading