infrawhispers / anansi

open source tooling for AI search and understanding

Home Page: https://infrawhispers.github.io/anansi/

License: Apache License 2.0

Languages: Rust 92.11%, Python 3.37%, HCL 2.87%, Shell 0.67%, Dockerfile 0.43%, Makefile 0.38%, Starlark 0.17%
Topics: ai, embeddings, large-language-models, llms, machine-learning, ml, rust, rust-lang, webassembly, neural-networks, neural-search, semantic-search

anansi 🕸️

anansi is a fully featured content vectorization system that packages the latest advances in embedding generation, in-domain tuning, and vector storage in an easy-to-use form.

Core Features

๐ŸŽ๏ธ Performance

  • Rust implementation of FreshDiskANN with support for scalar quantization
  • Configurable RocksDB based storage engine
  • ONNX runtime support for CUDA accelerated embedding models

๐Ÿ—’๏ธ Developer Experience

  • Build indices on unstructured data without worrying about whether it is text, image, or video
  • Support for gRPC and HTTP clients
  • Single installation binary that can cross-compile to non-Linux targets

💡 Machine Learning

  • Use cutting-edge embedding models listed on the MTEB Leaderboard
  • Bin-pack model inference on the CPU or GPU, with support for request batching
  • Fine-tune embedding generation with in-domain samples

Getting Started

docker pull infrawhispers/anansi:latest
docker run --name anansi -it -p 50051:50051 -p 50052:50052 -v /.cache:/app/.cache infrawhispers/anansi:latest

[1] standalone embedding generation using INSTRUCTOR

curl \
-X POST http://172.17.0.1:50052/encode \
-H 'Content-Type: application/json' \
-d '{
    "batches":[{
        "model_name":"INSTRUCTOR_LARGE",
        "model_class":"ModelClass_INSTRUCTOR",
        "text":{
            "data": [
                {
                    "instruction": "Represent the Science title:",
                    "value": "3D ActionSLAM: wearable person tracking ..."
                },
                {
                    "instruction": "Represent the Nature title:",
                    "value": "Inside Gohar World and the Fine, Fantastical Art"
                }
            ]
        }
    }]}
'
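
For callers who prefer Python over curl, the same request can be sketched with only the standard library. This is a minimal illustration, not an official client: the helper names `build_encode_payload` and `encode` are hypothetical, the URL and field names are copied from the curl example above, and the response schema is not documented here, so the decoded JSON is returned unchanged.

```python
import json
import urllib.request

# Endpoint copied from the curl example above; adjust host and port for your setup.
ENCODE_URL = "http://172.17.0.1:50052/encode"


def build_encode_payload(pairs, model_name="INSTRUCTOR_LARGE",
                         model_class="ModelClass_INSTRUCTOR"):
    """Build the JSON body for /encode from (instruction, value) pairs.

    Hypothetical helper: it only mirrors the body shown in the curl example.
    """
    data = [{"instruction": instruction, "value": value}
            for instruction, value in pairs]
    return {
        "batches": [{
            "model_name": model_name,
            "model_class": model_class,
            "text": {"data": data},
        }]
    }


def encode(pairs, url=ENCODE_URL, timeout=30.0):
    """POST the payload to the server and return the decoded JSON response.

    The response format is an unknown here, so it is returned as-is.
    """
    body = json.dumps(build_encode_payload(pairs)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the container from the docker run command above listening on port 50052, calling `encode([("Represent the Science title:", "3D ActionSLAM: wearable person tracking ...")])` sends the first item of the curl example. Keeping payload construction separate from the network call makes the body easy to inspect or log before anything is sent.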

Documentation

We use Docusaurus to generate our documentation. Please refer to the READMEs here or check out the documentation website.


FAQ

What's with the name?

anansi (/əˈnɑːnsi/ ə-NAHN-see; literally translates to spider) is an Akan folktale character and god of stories, wisdom and knowledge. We thought it was an apt name as we aim to provide ML applications with turn-key memory and persistence.

How do I contact the developers?

Hop onto Discord via this invite link or shoot an email to [email protected]

How do I contribute?

We welcome contributions of all sizes and contributors at all levels! Please take a look at open issues or look at #contributions in the Discord.
