Giter VIP home page Giter VIP logo

xiaoheng-zhang99's Projects

lavis icon lavis

LAVIS - A One-stop Library for Language-Vision Intelligence

low-rank-multimodal-fusion icon low-rank-multimodal-fusion

This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018

mcse icon mcse

NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings

medclip icon medclip

Medical image captioning using OpenAI's CLIP

medclip-1 icon medclip-1

A multi-modal CLIP model trained on the medical dataset ROCO

medclip-2 icon medclip-2

EMNLP'22 | MedCLIP: Contrastive Learning from Unpaired Medical Images and Texts

meld icon meld

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

meld-sentiment-speech-emotion-recognition icon meld-sentiment-speech-emotion-recognition

Multimodal EmotionLines Dataset (MELD) has been created by enhancing and extending EmotionLines dataset. MELD contains the same dialogue instances available in EmotionLines, but it also encompasses audio and visual modality along with text. MELD has more than 1400 dialogues and 13000 utterances from Friends TV series.

mental-health icon mental-health

Repository containing code for my internship at Valencia Polytechnic University (summer 2021)

mhyeeg icon mhyeeg

Official PyTorch repository for Hypercomplex Multimodal Emotion Recognition from EEG and Peripheral Physiological Signals, ICASSPW 2023.

mimic-cxr icon mimic-cxr

Code, documentation, and discussion around the MIMIC-CXR database

mkgformer icon mkgformer

Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

mm-dfn icon mm-dfn

Source code for ICASSP 2022 paper "MM-DFN: Multimodal Dynamic Fusion Network For Emotion Recognition in Conversations"

mmemotionrecognition icon mmemotionrecognition

Repository with the code of the paper: A proposal for Multimodal Emotion Recognition using auraltransformers and Action Units on RAVDESS dataset

mmtransformer icon mmtransformer

[CVPR 2021] "Multimodal Motion Prediction with Stacked Transformers": official code implementation and project page.

moco-cxr icon moco-cxr

MoCo-based unsupervised training for Chest X-Ray Interpretation

moel icon moel

MoEL: Mixture of Empathetic Listeners

mosei_umons icon mosei_umons

A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis

msvd-tl icon msvd-tl

Multi-Speaker Video Dialog with Frame-Level Temporal Localization

mtl4depr icon mtl4depr

Source code for paper Multi-Task Learning for Depression Detection in Dialogs (SIGDial 2022)

multimodal-deep-learning icon multimodal-deep-learning

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.