Comparing multiple approaches for analysing bigger than memory data in one computer.
Here I used the monthly 2021, 2022 and 2023 "High Volume For-Hire Vehicle Trip Records" files from the New York Taxi and Limousine Commission data (16GB).
You can execute the run.R file to generate the plot with the results of the benchmark. Here my results: