fix: fix logic to check for nested GTFS files in ZIP #1972
Hey @skalexch, could you take a look at the 14 datasets that contain new errors? (You can see all of them by clicking on the arrow.)
New Errors (14 out of 1808 datasets, ~1%) ✅
Details of new errors due to code change, which is less than the provided threshold of 1%.
@qcdyx the screenshot below shows the affected datasets and, above them, the folders that I extracted from them. I also included mdb-2854 as a control. For all of the affected datasets, the GTFS files do seem to exist within a subfolder. For the control dataset, the extracted folder has the same name as the ZIP file, which means that the files reside in the root directory. Please note that I could not download mdb-612 and mdb-1324.
c1342e4 to 13bf3f5
LGTM. Thanks for your contribution!
Summary:
This PR fixes a bug in our logic for checking whether a ZIP file we're loading has GTFS files in a subfolder. It looks like ZipInputStream.getNextEntry doesn't always return subfolders as separate entries, depending on how the ZIP file was created. The subfolder and ZIP file having the same name in #1912 was a red herring.
Closes #1912
Expected behavior:
We get an invalid_input_files_in_subfolder notice even if the subfolder is not treated as a standalone entry.
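To illustrate the idea, here is a minimal sketch of how a subfolder check can work without depending on directory entries. It is not the code from this PR: the class name NestedGtfsCheck and both helper methods are hypothetical, and the check simply assumes that any file entry whose name contains "/" lives in a subfolder. It uses only java.util.zip from the JDK.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper for illustration only; not the validator's actual class.
final class NestedGtfsCheck {

  /** Returns true if any file entry sits below the ZIP root, judged by its path alone. */
  static boolean hasFilesInSubfolder(InputStream in) throws IOException {
    try (ZipInputStream zip = new ZipInputStream(in)) {
      ZipEntry entry;
      while ((entry = zip.getNextEntry()) != null) {
        // Some archives contain no standalone directory entries, so waiting for
        // entry.isDirectory() to be true can miss nested files. Inspect the
        // path of each file entry instead (ZIP paths use "/" as separator).
        if (!entry.isDirectory() && entry.getName().contains("/")) {
          return true;
        }
      }
    }
    return false;
  }

  /** Builds an in-memory ZIP holding only the given file entries (no directory entries). */
  static byte[] zipOf(String... names) throws IOException {
    ByteArrayOutputStream bytes = new ByteArrayOutputStream();
    try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
      for (String name : names) {
        zip.putNextEntry(new ZipEntry(name));
        zip.write("stop_id\n".getBytes(StandardCharsets.UTF_8));
        zip.closeEntry();
      }
    }
    return bytes.toByteArray();
  }
}
```

With this sketch, an archive built as zipOf("feed/stops.txt") is reported as nested even though it has no "feed/" entry of its own, while zipOf("stops.txt") is not. A real check would probably also want to ignore unrelated nested files (for example OS metadata folders), which this sketch does not attempt.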
Testing:
Before:
[image]
After:
[image]
Please make sure these boxes are checked before submitting your pull request - thanks!
Run gradle test to make sure you didn't break anything