Image Processing and Computer Vision Assignments

Assignment Module 1: Product Recognition of Food Products

Goal: Develop a computer vision system that, given a reference image for each product, is able to identify such product from one picture of a store shelf.

Implementation Details: We tackle this assignment by implementing from scratch a Generalized Hough Transform (GHT) with Local Invariant Features. To improve accuracy, we integrated a color consistency check on the detected bounding boxes to filter out incorrect matches. This improvement is crucial because the GHT processes only greyscale images and cannot differentiate between templates that are visually identical in shape but differ in color.

The provided scene images were significantly ruined by salt noise. After conducting a noise analysis, we identified that applying a combination of Median Filtering, BM3D, Non-Local Means Filtering, and sharpening significantly increased the number of keypoints detected in the scene images. This improvement greatly enhanced the performance of our algorithm.

Example Images:

Figure 1: Reference product templates used for recognition.

Figure 2: Recognition results on a store shelf image.

Figure 3: Visualization of noise reduction.

Assignment Module 2: Product Classification

Goal: Implement a neural network that classifies smartphone pictures of products found in grocery stores.

Implementation Details: To facilitate neural network training on our laptops, we explored efficient architectures, focusing particularly on ShuffleNet. This architecture builds upon the depthwise separable convolution introduced in Xception by incorporating grouped pointwise convolutions and a novel shuffle layer. Leveraging these shuffle units, we implemented a compact neural network that achieved 72% accuracy on the validation set without any prior knowledge or pretraining. After we use a pretrained net (Resnet-18) achieving 91% accuracy also on validation set, showing the impact of transfer learning and prior knowledge in improving model performance.

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
dataset		dataset
media		media
.gitignore		.gitignore
README.md		README.md
assignment1.ipynb		assignment1.ipynb
assignment2.ipynb		assignment2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Processing and Computer Vision Assignments

Assignment Module 1: Product Recognition of Food Products

Example Images:

Assignment Module 2: Product Classification

Example Images:

About

Releases

Packages

Contributors 4

Languages

alessioarcara/ipcv_assignments

Folders and files

Latest commit

History

Repository files navigation

Image Processing and Computer Vision Assignments

Assignment Module 1: Product Recognition of Food Products

Example Images:

Assignment Module 2: Product Classification

Example Images:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages