Add address string cleaning functionality #46

Merged 14 commits on Feb 3, 2024

@michplunkett (Owner) commented Feb 3, 2024

Describe your changes

Added an address_correction function that resolves many of the ambiguities that come up in address searching, such as expanding "Midway Pl." to "Midway Plaisance". The cross street problem is still not fixed; I'm saving that for a later PR.
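For illustration, here is a minimal sketch of the kind of substitution described above, assuming a simple regex replacement. The name correct_address_sketch and the pattern are hypothetical and are not taken from this PR's code; the actual address_correction function may work differently.

```python
import re

# Hypothetical pattern: treat "Midway Pl." as an abbreviation of
# "Midway Plaisance". The literal period keeps the pattern from
# matching an address that has already been expanded.
_MIDWAY_ABBREVIATION = re.compile(r"\bMidway Pl\.")


def correct_address_sketch(address: str) -> str:
    """Return the address with the abbreviated street name expanded."""
    return _MIDWAY_ABBREVIATION.sub("Midway Plaisance", address)


print(correct_address_sketch("1425 E. Midway Pl. (Public Way)"))
# prints: 1425 E. Midway Plaisance (Public Way)
```

Because the pattern requires the trailing period, running the sketch on an already-corrected address leaves it unchanged.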

Checklist before requesting a review

  • The code runs successfully.
(ucpd-incident-scraper-py3.11) michaelp@MacBook-Air-18 ucpd-incident-scraper % make correct_location                                                                        
python -m incident_scraper correct-location
[nltk_data] Downloading package wordnet to
[nltk_data]     /Users/michaelp/nltk_data...
[nltk_data]   Package wordnet is already up-to-date!
API queries_quota: 60
1425 E. Midway Pl. (Public Way) changed to 1425 E. Midway Plaisance (Public Way)
1130 E. Midway Pl. (Park District Skating Rink) changed to 1130 E. Midway Plaisance (Park District Skating Rink)
819 E. Midway Pl. (Public Way) changed to 819 E. Midway Plaisance (Public Way)
...
Area of Midway Pl. and S. Ellis Ave. changed to Area of Midway Plaisance and S. Ellis Ave.
1111 E. Midway Pl. (Winter Garden) changed to 1111 E. Midway Plaisance (Winter Garden)
1005 E. Midway Pl. (Park Dist. Property) changed to 1005 E. Midway Plaisance (Park Dist. Property)
1130 E. Midway Pl. (Skating Rink) changed to 1130 E. Midway Plaisance (Skating Rink)
78 of 16819 had their address updated.
78 addresses were updated.
Waiting up to 5 seconds.
Sent all pending logs.
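The "changed to" lines and the final count in the output above could come from a loop along these lines. This is only a sketch under assumptions: apply_corrections and its correct parameter are hypothetical names, and the real correct-location command also reads and writes stored incidents, which is omitted here.

```python
from typing import Callable, Iterable


def apply_corrections(
    addresses: Iterable[str], correct: Callable[[str], str]
) -> int:
    """Run a corrector over each address, log changes, return the change count."""
    total = 0
    updated = 0
    for address in addresses:
        total += 1
        corrected = correct(address)
        # Only log and count addresses that the corrector actually changed.
        if corrected != address:
            print(f"{address} changed to {corrected}")
            updated += 1
    print(f"{updated} of {total} had their address updated.")
    return updated


# Sample input: one abbreviated address and one that is already expanded.
sample = [
    "819 E. Midway Pl. (Public Way)",
    "1130 E. Midway Plaisance (Skating Rink)",
]
# Stand-in corrector for the demo; any str -> str function works here.
apply_corrections(sample, lambda a: a.replace("Midway Pl.", "Midway Plaisance"))
```

Passing the corrector in as a callable keeps the counting and logging separate from the cleaning rules, so new rules can be tried without touching the loop.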

@michplunkett self-assigned this Feb 3, 2024
@michplunkett changed the title from "Update address cleaning" to "Add address string cleaning functionality" Feb 3, 2024
@michplunkett merged commit 1cfb264 into main Feb 3, 2024
2 checks passed
@michplunkett deleted the testing_geocoders branch February 3, 2024 20:50