AI Co-Creator

High-Level Description: This project is a voice assistant designed to be paired with the CAD program Autodesk Fusion 360. Its primary function is to interact with designers during their design process, providing real-time assistance and feedback.

To use the voice assistant, you must install both the Autodesk add-on and the Voice Assistant overlay.

Technical Description:

An Autodesk Fusion 360 add-on runs an HTTP server that the Voice Assistant communicates with. The add-on's code lives in a separate repository.

AI Co-Creator is an Electron-based application that runs as an overlay on Autodesk Fusion 360. The user controls when the voice assistant is listening by starting and stopping it. The recorded audio is sent to a server that calls the Whisper API for transcription. The app also takes screenshots and sends both the audio and the screenshots to a second server, which queries the OpenAI API to generate a response. The voice assistant then speaks the response. The app can also communicate with the Fusion 360 add-on to highlight parts of the screen.
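The flow described above (transcribe, generate a response, speak) can be sketched roughly as follows. This is an illustration only, not the app's actual code: `TurnInput`, `Services`, and `runTurn` are hypothetical names, the network calls are hidden behind injected functions, and forwarding the transcript to the response step is an assumption.

```typescript
// Hypothetical types: none of these names come from the project itself.
interface TurnInput {
  audio: Uint8Array;         // speech recorded between start and stop
  screenshots: Uint8Array[]; // screen captures taken during the same window
}

// Each function stands in for one remote call described in the README.
interface Services {
  transcribe(audio: Uint8Array): Promise<string>;                          // Whisper server
  generateResponse(input: TurnInput, transcript: string): Promise<string>; // OpenAI server
  speak(text: string): Promise<void>;                                      // text-to-speech
}

// Runs one interaction: transcribe the audio, ask for a reply, speak it.
async function runTurn(input: TurnInput, services: Services): Promise<string> {
  const transcript = await services.transcribe(input.audio);
  // Assumption: the transcript is passed along with the audio and screenshots.
  const reply = await services.generateResponse(input, transcript);
  await services.speak(reply);
  return reply;
}
```

Passing the services in as arguments keeps the sequencing easy to test with stubs, without touching the real servers.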

Features:

  • Voice Assistant Integration: Communicate with the AI assistant using voice commands to perform various tasks within Autodesk Fusion 360. The start and stop of audio and screenshot capture are controlled by the user.
    • The voice assistant has 4 states: sleeping, listening, processing, and responding
  • Overlay Functionality: The app runs as an overlay on top of Autodesk Fusion 360, providing seamless integration and accessibility.
  • Real-time Interaction: Receive real-time feedback and assistance from the AI as you work on your designs.
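The four assistant states listed above could be modeled as a small state machine. This is a minimal sketch, not the project's implementation: the event names (`start`, `stop`, `responseReady`, `speechDone`) and the transitions between states are assumptions made for illustration.

```typescript
// The four states come from the README; the events are hypothetical.
type AssistantState = "sleeping" | "listening" | "processing" | "responding";
type AssistantEvent = "start" | "stop" | "responseReady" | "speechDone";

// Assumed transition table: each state reacts to exactly one event.
const transitions: Record<AssistantState, Partial<Record<AssistantEvent, AssistantState>>> = {
  sleeping: { start: "listening" },            // user clicks start
  listening: { stop: "processing" },           // user clicks stop
  processing: { responseReady: "responding" }, // reply text has arrived
  responding: { speechDone: "sleeping" },      // reply has been spoken
};

// Returns the next state, or the current state if the event does not apply.
function nextState(state: AssistantState, event: AssistantEvent): AssistantState {
  return transitions[state][event] ?? state;
}
```

Ignoring events that do not apply to the current state means a stray click on stop while the assistant is sleeping leaves it unchanged.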

Folder Structure:

  • app: Contains the frontend code, built with React.
  • server: Contains an Express server that uses node-fetch and the OpenAI library to query the Whisper API, receive prompts, and generate responses for the Voice Assistant to speak back.

Main Technologies:

  • Electron: For creating a transparent overlay.
  • Express: For the backend server.
  • React: For the frontend.
  • OpenAI: For LLM technology.
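As a rough idea of how Electron can produce a transparent overlay, the sketch below builds a set of window options. The option names (`transparent`, `frame`, `alwaysOnTop`, `hasShadow`, `skipTaskbar`) are standard Electron `BrowserWindow` options, but the specific values and the helper `overlayWindowOptions` are assumptions, not settings taken from this project.

```typescript
// A subset of Electron's BrowserWindow constructor options, declared locally
// so the sketch is self-contained.
interface OverlayWindowOptions {
  transparent: boolean; // let the application underneath show through
  frame: boolean;       // false removes the title bar and window borders
  alwaysOnTop: boolean; // keep the overlay above other windows
  hasShadow: boolean;   // false avoids a visible shadow around the overlay
  skipTaskbar: boolean; // true hides the overlay from the taskbar
}

// Hypothetical helper returning options suited to an overlay window.
function overlayWindowOptions(): OverlayWindowOptions {
  return {
    transparent: true,
    frame: false,
    alwaysOnTop: true,
    hasShadow: false,
    skipTaskbar: true,
  };
}
```

In an Electron main process, an object like this would typically be passed to `new BrowserWindow(...)`.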

Installation:

  1. Clone the Repository:

    git clone https://github.com/yourusername/ai-cocreator-app.git
    cd ai-cocreator-app
  2. Install Dependencies: Ensure you have Node.js and npm installed. Then, install the required dependencies:

    npm install
    cd app
    npm install 
    cd ../server
    npm install

Usage:

  1. Start Fusion 360: Open Autodesk Fusion 360 on your computer and ensure the add-on is running (more steps to be added soon).

  2. Launch AI Co-Creator Frontend:

    npm start

    This will start the Electron application. The app will run as an overlay on top of Fusion 360.

  3. Launch the Whisper and Assistant Servers:

    cd server
    npm start 
  4. Activate Voice Assistant: Click start, speak your command, and then click stop. The assistant processes the recording and speaks its response.

  5. Perform Tasks: Utilize the voice assistant to perform actions within Fusion 360, such as creating sketches, applying constraints, or generating designs. The voice assistant will also ask you questions to help you design better.

Version History:

  • Version 1.0.0: Initial release with core functionality.

Contributors:

  • annaxu9
