Crawl the flickr picture archive to get a large amount of pictures with GPS annotations. Now try to train a deep learning model to learn the GPS location of a picture. It could be a good idea to remove persons from the pictures automatically, but you must figure this out for yourselves. Create a visualization that shows the accuracy of your approach and/or allows to upload a picture and predicts its GPS location.
4571314348 47065077@N00 savagecat 2010-04-16 11:00:49.0 1272811190 Canon+EOS+400D+DIGITAL On+the+Wall b%26w,black+%26+white,choir+tour,hadrian%27s+wall,noir+et+blanc,portrait,sewingshields,st+george%27s,wall -2.335453 55.010991 12 http://www.flickr.com/photos/47065077@N00/4571314348/ http://farm5.staticflickr.com/4036/4571314348_03a71b7bbb.jpg Attribution License http://creativecommons.org/licenses/by/2.0/ 4036 5 03a71b7bbb 62ffbfb29f jpg 0
Photo/video identifierUser NSIDUser nickname- Date taken
Date uploadedCapture device- Title
- Description
- User tags (comma-separated)
Machine tags (comma-separated)- Longitude
- Latitude
- Accuracy
Photo/video page URL- Photo/video download URL
License nameLicense URLPhoto/video server identifierPhoto/video farm identifierPhoto/video secretPhoto/video secret originalPhoto/video extension originalPhotos/video marker (0 = photo, 1 = video)
It would be best to start with a subset of the data/images and expand as progress is made.
- Transform data
- Add column headers
- Clean data
- Drop unwanted columns
- Only keep instances with GPS coordinates
- Download images
- Implement face recognition
- Identify all images with faces in them
- Drop all such instances
- Develop a deep learning model
- Train the model based on tags linked to the GPS coordinates
- Note accuracy and other metrics
- Improvise wherever possible
- Test the model
- With a different subset of images
- Note accuracy and other metrics
- Improvise wherever possible
- Create a fancy visualization
- Make it to the hall of fame (result showcase)