Fuji-Web: AI-based Full Browser Automation 🗻

Fuji-Web is an intelligent AI partner that understands the user’s intent, navigates websites autonomously, and executes tasks on the user’s behalf while explaining each action step.

Demo

Demo video: FujiWebCompressed.mp4

How does it work?

Please read our blog post for a demo video, benchmarks, and a deep-dive technical overview!

Installing and Running

Download and install the extension in your browser

  1. Go to the releases page, find the latest version of the extension and download "fuji-extension.zip".
  2. Unzip the file.
  3. Load your extension on Chrome by doing the following:
    1. Navigate to chrome://extensions/
    2. Toggle Developer mode on
    3. Click Load unpacked
    4. Select the unzipped folder

Use the extension

Please note that you may need to refresh the page for the extension to work.

  1. Find the Fuji icon in the top right corner of your browser and click on it to open the side panel.
  2. Create an OpenAI API key, or use an existing one, and paste it into the provided box. The key is stored in your browser and is not uploaded to any third party.
  3. Finally, navigate to the webpage you want Fuji-Web to work on and type in the task you want it to perform.

Note: all prompts (text and image) are sent directly to the API of your selection. Fuji-Web does not attempt to collect any information from you.

Build the extension

If you want to build the extension from source, follow these instructions:

  1. Ensure you have Node.js installed. Development was done on Node v20, but some lower versions should also work.
  2. Clone this repository
  3. Install pnpm globally: npm install -g pnpm
  4. Run pnpm install
  5. Run pnpm dev to start the development server, or pnpm build to build the extension.

When loading the extension, you will need to load the dist folder created by the build process.
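
The build steps above can be sketched as a small shell script. This is a minimal sketch, not an official project script: the function names `node_major` and `build_extension` are hypothetical, and the script assumes you have already cloned the repository and are running it from the repository root. Because the instructions only say that "some lower versions" of Node work, the sketch warns on versions below 20 instead of aborting.

```shell
#!/usr/bin/env sh
# Hypothetical helper script; not part of the Fuji-Web repository.

# Extract the major version from a string such as "v20.11.1".
node_major() {
  version="${1#v}"        # drop a leading "v" if present
  echo "${version%%.*}"   # keep everything before the first dot
}

# Install dependencies and build the extension (steps 3 to 5 above).
build_extension() {
  major="$(node_major "$(node --version)")"
  if [ "$major" -lt 20 ]; then
    # Only a warning: the minimum supported version is not documented.
    echo "Warning: Node v$major detected; development was done on Node v20." >&2
  fi
  npm install -g pnpm &&   # step 3: install pnpm globally
    pnpm install &&        # step 4: install project dependencies
    pnpm build             # step 5: use "pnpm dev" for the development server
}

# Uncomment to run the build:
# build_extension
```

After a successful build, load the resulting dist folder through chrome://extensions/ as described in the installation steps.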

Roadmap

  • Expose API for easy integration with browser automation frameworks (e.g. Puppeteer, Playwright, Selenium)
  • Add support for more complex & cross-tab workflows
  • Add support for more browsing behaviors (select from dropdown, extract content from entire page etc.)
  • Add support for saving workflows
  • Add support for sharing workflows & instructions with others
  • Create a Wikipedia-like knowledge base where users can collaborate on knowledge that improves Fuji-Web's performance

Troubleshooting

Check out our Troubleshooting Guide for help with common problems.

Contributing

Interested in contributing to Fuji-Web? We'd love your help! Check out our Contribution Guide for guidelines on how to contribute, report bugs, suggest enhancements, and more.

We have also set up a dedicated channel for Fuji-Web feedback on Discord at https://discord.gg/yfMjZ8udb5.

Credits
