Giter VIP home page Giter VIP logo

metalens's Introduction

MetaLens

Authors: Duy Ho, Bhuvan Chennoju image

“Metaverse”, a new form of social life via virtual reality, is a new ambitious vision set by Meta (formerly known as Facebook) that allows users to socialize in a virtual world. It is a groundbreaking concept in social networking in which we will perform many activities such as working, playing, studying, and interacting with each other in an immersive way. Thus, our project, MetaLens, plans to focus primarily on a vision in the future where daily lives are enhanced with technology and Artificial Intelligence (AI). Our lives will be surrounded by robots, drones, and other automatic systems that make our tasks a lot more convenient. Via AI, Deep Learning, Sentiment Analysis, and Object Recognition, MetaLens aims to help not only the citizens but also the government be aware of their citizen’s well-being and the city’s overall performance to derive appropriate solutions.

Datasets

Overview

image

Front End

Unity 3D:

  • Leading platform for 2D and 3D game and application development
  • Realistic physics mechanism and interaction (collision, gravity, speed, acceleration, …)
  • High-quality rendering and graphics
  • Cross-platform compatibility (mobile, android, PC, web, HoloLens, Oculus Quest, Oculus Rift, Vive, SteamVR, and Valve)
  • Abundant assets (standard, free, and paid) from Unity and other third parties
  • Strong community support and maintenance

OpenXR

  • Cross-platform compatibility (Oculus Quest, Oculus Rift, Vive, SteamVR, and Valve). image

Backend

  • Python: best framework for data science and deep learning
  • Keras: one of the most popular frameworks for building DL models
  • Jupyter Notebook: interactive tool to work and share Python code efficiently
  • Ngrok: Convenient to quickly and effectively establish connection to front end.
  • Detectron2: object detection and segmentation library with pretrained models ready for transfer learning and finetuning. image

UTK Face Dataset:

https://susanqq.github.io/UTKFace/ ~ 20,000 face images with gender, age, and ethnicity

Google Street View API https://developers.google.com/maps/documentation/streetview/overview ~ Houses, Utility Poles + Street Lights

COCO Dataset (80 objects)

https://cocodataset.org Vehicles

Road Damage Dataset

https://github.com/sekilab/RoadDamageDetector Road

Approach

Data Research/Collection

UTK has 2 versions:

  • Cropped + Grayscaled
  • Uncropped + Colored

Data Curation

  • Data simplification
  • Data Extraction
  • Data Structuring

Model Training (2 Phases):

Initial (naïve)

  • Close-up Face version (limited vision cues)
  • Simple < 10-layer CNN
  • Low accuracy for each category (~50-60%)

Revised

  • Colored Portrait version (more hints about clothes, skin tone, … )
  • Transfer Learning with ResNet-50 pretrained on ImageNet and Early Stopping
  • Increased accuracy to 70-90%

Model Inference and Deployment

  • Input: Image
  • Output: Age, Gender, Ethnicity, and Sentiment (Emotion)

API endpoint: deployed through ngrok

Application Deployment

Unity: PC version, Web, and VR

Model Predictions:

image

{
  "Age": "Middle-Aged",
  "Age Estimate": 55,
  "Ethnicity": "Indian",
  "Gender": "Male",
  "Sentiment": "Neutral"
}

Front-end Design (Unity):

image image gif1 gif1

Video

Video: https://youtu.be/wus7FLhRER4

Colab: https://colab.research.google.com/drive/1E7223DY3RbS-OVM6cZ3VO7MKQxmZ5XZe?usp=sharing

GitHub (VR Version): https://github.com/benamreview/HackARoo-Fall2021-VR-Assets

GitHub (PC Version): https://github.com/benamreview/HackARoo-Fall2021-Assets

PPT Slides (with animations & annotations): https://mailmissouri-my.sharepoint.com/:p:/g/personal/dhh3hb_umsystem_edu/ESgtQ36AxjpGjiy9yIZMPxEBW0nTIMS14dP1CtJY8M9EEA?e=Dresha

metalens's People

Contributors

duyhho avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.