multimodal-machine-learning
Here are 6 public repositories matching this topic...
Reading list for Multimodal Large Language Models
Updated Aug 17, 2023
MIntRec2.0 is the first large-scale dataset for multimodal intent recognition and out-of-scope detection in multi-party conversations (ICLR 2024)
Updated Jun 28, 2024 - Python
Multimodal datasets for machine learning
Updated Jan 9, 2025 - Julia
This repository contains the code, dataset, and model outputs for the ICMI 2024 paper "Multimodal User Enjoyment Detection in Human-Robot Conversation: The Power of Large Language Models". It includes scripts for prompting LLMs, for training supervised models, and for evaluating multimodal enjoyment detection.
Updated Feb 6, 2025 - Python
This repository is cloned from https://github.com/HLR/LatentAlignmentProcedural. It explores a potential baseline for the textual_cloze task on the RecipeQA dataset (https://hucvl.github.io/recipeqa/).
Updated Feb 11, 2025 - Jupyter Notebook