Giter VIP home page Giter VIP logo

mgr's Introduction

MGR

Multi-Granularity Reasoning for Social Relation Recognition from Images

ABSTRACT

Discovering social relations in images can make machines better interpret the behavior of human beings. However, automatically recognizing social relations in images is a challenging task due to the significant gap between the domains of visual content and social relation.

Existing studies separately process various features such as faces expressions, body appearance, and contextual objects, thus they cannot comprehensively capture the multi-granularity semantics, such as scenes, regional cues of persons, and interactions among persons and objects.

To bridge the domain gap, we propose a Multi-Granularity Reasoning framework for social relation recognition from images.

The global knowledge and mid-level details are learned from the whole scene and the regions of persons and objects, respectively.

Most importantly, we explore the fine-granularity pose keypoints of persons to discover the interactions among persons and objects.

Specifically, the pose-guided Person-Object Graph and Person-Pose Graph are proposed to model the actions from persons to object and the interactions between paired persons, respectively. Based on the graphs, social relation reasoning is performed by graph convolutional networks.

Finally, the global features and reasoned knowledge are integrated as a comprehensive representation for social relation recognition.

Extensive experiments on two public datasets show the effectiveness of the proposed framework.F

Framework

1585401791952

Experiments

1585401863792

1585401892052

Requirements

Python 3.6

PyTorch 0.3

Training

Feature Extraction

  1. Train PersonPair_Union model[6] to extract person node feature.

โ€‹ The code references git https://github.com/wzhouxiff/SR.git

  1. Extract object node feature though Mask RCNN

โ€‹ The code references git https://github.com/open-mmlab/mmdetection.git

  1. Extract pose node feature though SimpleBaseline

    The code references git https://github.com/microsoft/human-pose-estimation.pytorch

GCN

mgr's People

Contributors

mxtx0509 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.