multimodal-interactions

This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.

OpenEdge ABL
828
2 years ago

Multimodal Sarcasm Detection Dataset

OpenEdge ABL
340
8 months ago

Context-Dependent Sentiment Analysis in User-Generated Videos

Python
124
2 years ago

Mobile application for exploring fitness data using both speech and touch interaction.

TypeScript
77
2 years ago

Multimodal sentiment analysis using hierarchical fusion with context modeling

Python
44
2 years ago

Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances (ACL 2024)

Python
25
4 months ago

Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations

Jupyter Notebook
14
3 years ago

Draw diagrams quickly using voice and pen, with icons and text automatically suggested by AI as you talk.

JavaScript
8
7 years ago

A multimodal face liveness detection module that can be used in the context of face anti-spoofing

Python
6
7 months ago

Multimodal AI Assistant with Google Gemini-1.5-pro, gTTS, PIL, and SpeechRecognition Technologies!

Python
4
9 months ago

Technical Draft: A platform to augment web applications with multimodal interactions

TeX
3
10 years ago

Code for ICMI2020 and ICMI2021 papers: "Studying Person-Specific Pointing and Gaze Behavior for Multimodal Referencing of Outside Objects from a Moving Vehicle" and "ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle"

Jupyter Notebook
3
2 years ago

Project for Multimodal Interaction course (A.Y. 2019/2020), GesturePad

Python
2
5 years ago

Voice control of some Spotify features

C#
0
7 years ago

Gesture recognition challenge for the course "Multimodal Processing Recognition and Interaction" at the HES-SO university (Switzerland)

MATLAB
0
7 years ago

Developed a multimodal interactive quiz app allowing users to select answers via hand gestures. Created a user-friendly UI/UX in Figma and built the front end with React Native, using MongoDB for data management. Implemented a backend with Express and Node.js, and trained CNN models in Python for gesture recognition, enhancing user engagement.

JavaScript
0
2 years ago

Gesture control of some Spotify features

C#
0
7 years ago