Video Summarization with ResNeXt and LSTM

Ever wished you could get a quick highlight reel of your favorite TV show or an important lecture? Look no further! This project is an implementation of a video summarization model that harnesses the power of ResNeXt for feature extraction and an LSTM network for sequence modeling. Our model is designed to predict the importance scores of video frames, which can then be used to generate a concise and engaging summary of any video. This project was created in Spring '22 for ACM AI at UC San Diego.

Overview

In this repository, you'll find the code for training and evaluating our video summarization model. By leveraging a pretrained ResNeXt network to extract features from video frames and an LSTM network to predict the importance scores of these frames, our model captures the essence of the video content. The importance scores are then used to select the most crucial frames, creating a summary that's both informative and entertaining.

Model Architecture

Encoder

Our encoder is the heavyweight of feature extraction, using a pretrained ResNeXt model. This powerful network dives into each video frame and extracts a feature vector, essentially capturing the frame's most significant details and nuances.

RNN Model

The RNN model, powered by an LSTM network, is the brain behind predicting the importance scores of the frames based on the features extracted by ResNeXt. Think of it as a meticulous editor who knows exactly which scenes make the cut. The network consists of several LSTM layers followed by fully connected layers, ensuring it captures the sequential nature of the video.

Training

Training our model is like a rigorous workout session. The training loop iterates through the dataset, passing batches of frames through the encoder and RNN model. It computes the loss and gradients to update the model parameters, ensuring our model gets smarter with each iteration.

Evaluation

Evaluating the model is akin to a performance review. The evaluation loop iterates through the validation dataset, computing the loss and predictions to assess how well our model is doing. It helps us fine-tune the model, making sure it picks out the best highlights every time.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
the-stellar-summarizer.ipynb		the-stellar-summarizer.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video Summarization with ResNeXt and LSTM

Overview

Model Architecture

Encoder

RNN Model

Training

Evaluation

About

Releases

Packages

Languages

Manav1712/video_summarization

Folders and files

Latest commit

History

Repository files navigation

Video Summarization with ResNeXt and LSTM

Overview

Model Architecture

Encoder

RNN Model

Training

Evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages