Timeout while waiting for prepare_data to finish #19266
Labels
data handling
Generic data-related topic
docs
Documentation related
feature
Is an improvement or enhancement
📚 Documentation
I initialize two processes. The first process is the one that creates the data inside prepare_data.
The second process waits for the first process.
The problem is that when I generate a large dataset it takes more than 1800 seconds, which is the default DDP timeout.
So my script kept freezing.
I think that the timeout should be emphasized in the documentation when one uses prepare_data in the lightning data module.
Currently, it is mentioned that prepare_data is called only by one process, but it is not mentioned that the other processes are waiting for the first process to finish and that there is a timeout for this. Also, it is not mentioned when the other processes start counting the time. Is it at the beginning of prepare_data or at the end? It seems to be at the beginning, which is what caused my script to timeout.
The timeout error is also not very informative. It just says that it timed out. Given that it is not described how exactly prepare_data works it is difficult to interpret the error.
I guess that the developer assumed that prepare_data won't take that long, but when one deals with very large data, almost certainly it will need more than 1800 seconds.
cc @Borda @justusschock @awaelchli
The text was updated successfully, but these errors were encountered: