
Note: we are in the process of refactoring the code base for open sourcing. You will see lots of updates over the next few days, including an SSM playground sandbox. Be sure to WATCH this repo for these updates.

OpenSSM – “Small Specialist Models” for Industrial AI

OpenSSM (pronounced open-sesame) is an open-source framework for Small Specialist Models (SSMs), which are key to enhancing trust, reliability, and safety in Industrial-AI applications. Harnessing the power of domain expertise, SSMs operate either alone or in "teams". They collaborate with other SSMs, planners, and sensors/actuators to deliver real-world problem-solving capabilities.

Unlike Large Language Models (LLMs), which are computationally intensive and generalized, SSMs are lean, efficient, and designed specifically for individual domains. This focus makes them an optimal choice for businesses, SMEs, researchers, and developers seeking specialized and robust AI solutions for industrial applications.

Figure – Example of SSMs collaborating in a semiconductor manufacturing ecosystem

A prime deployment scenario for SSMs is within the aiCALM (Collaborative Augmented Large Models) architecture. aiCALM represents a cohesive assembly of AI components tailored for sophisticated problem-solving capabilities. Within this framework, SSMs work with General Management Models (GMMs) and other components to solve complex, domain-specific, and industrial problems.

Why SSM?

Many in the field see a clear trend toward specialization in AI models.

Specialization is crucial for quality ... not general purpose AI models – Eric Schmidt

... small models ... for a specific task that are good – Matei Zaharia

... small agents working together ... specific and best in their tasks – Harrison Chase

... small but highly capable expert models – Andrej Karpathy

As predicted by Eric Schmidt and others, we will see “a rich ecosystem to emerge [of] high-value, specialized AI systems.” SSMs are a central part of the architecture of these systems.

What OpenSSM Offers

OpenSSM addresses this need for specialized models directly, with the following benefits to the community, developers, and businesses:

  • Industrial Focus: SSMs are developed with a specific emphasis on industrial applications, addressing the unique requirements of trustworthiness, safety, reliability, and scalability inherent to this sector.

  • Fast, Cost-Effective & Easy to Use: SSMs are 100-1000x faster and more efficient than LLMs, making them accessible and cost-effective, particularly for industrial use where time and resources are critical factors.

  • Reliable Domain Expertise: Each SSM has expertise in a particular field or piece of equipment, offering precise and specialized knowledge that enhances trustworthiness, reliability, and safety for Industrial-AI applications. With self-reasoning, causal reasoning, and retrieval-based knowledge, SSMs provide a trustworthy source of domain expertise.

  • Vendor Independence: OpenSSM allows everyone to build, train, and deploy their own domain-expert AI models, offering freedom from vendor lock-in and security concerns.

  • Composable Expertise: SSMs are fully composable, making it easy to combine domain expertise.

Target Audience

Our primary audience includes:

  • Businesses and SMEs wishing to leverage AI in their specific industrial context without relying on extensive computational resources or large vendor solutions.

  • AI researchers and developers keen on creating more efficient, robust, and domain-specific AI models for industrial applications.

  • Open-source contributors believing in democratizing industrial AI and eager to contribute to a community-driven project focused on building and sharing specialized AI models.

  • Industries with specific domain problems that can be tackled more effectively by a specialist AI model, enhancing the reliability and trustworthiness of AI solutions in an industrial setting.

SSM Architecture

At a high level, SSMs comprise a front-end Small Language Model, an adapter layer in the middle, and a wide range of back-end domain-knowledge sources.

Figure – High-Level SSM Architecture

The above diagram illustrates the high-level architecture of an SSM, which comprises three main components:

  1. Small Language Model (SLM): This forms the communication frontend of an SSM.

  2. Adapters (e.g., LlamaIndex): These provide the interface between the SLM and the domain-knowledge backends.

  3. Domain-Knowledge Backends: These include text files, documents, PDFs, databases, code, knowledge graphs, models, other SSMs, etc.

SSMs communicate through both unstructured (natural-language) and structured APIs, catering to a variety of real-world industrial systems.
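The three components and the two communication styles can be pictured with a small Python sketch. This is an illustration only and not the OpenSSM API: every name in it (`KeywordBackend`, `Adapter`, `EchoSLM`, `SSM`, `discuss`, `call`) is hypothetical, the "language model" is a stand-in that simply echoes retrieved facts, and the example facts are made up.

```python
import re
from dataclasses import dataclass, field
from typing import Protocol


def _tokens(text: str) -> set[str]:
    """Lowercase words longer than three characters (a crude stop-word filter)."""
    return {w for w in re.findall(r"[a-z0-9]+", text.lower()) if len(w) > 3}


class Backend(Protocol):
    """Hypothetical domain-knowledge backend (documents, databases, other SSMs, ...)."""

    def query(self, question: str) -> list[str]: ...


@dataclass
class KeywordBackend:
    """Toy backend: returns stored facts that share a keyword with the question."""

    facts: list[str]

    def query(self, question: str) -> list[str]:
        wanted = _tokens(question)
        return [fact for fact in self.facts if wanted & _tokens(fact)]


@dataclass
class Adapter:
    """Hypothetical adapter: forwards a question to every backend it wraps."""

    backends: list[Backend] = field(default_factory=list)

    def retrieve(self, question: str) -> list[str]:
        results: list[str] = []
        for backend in self.backends:
            results.extend(backend.query(question))
        return results


class EchoSLM:
    """Stand-in for a small language model; a real SLM would generate text."""

    def generate(self, question: str, context: list[str]) -> str:
        if not context:
            return "I have no domain knowledge about that."
        return " ".join(context)


@dataclass
class SSM:
    """Front-end model plus adapter, exposing both communication styles."""

    slm: EchoSLM
    adapter: Adapter

    def discuss(self, question: str) -> str:
        """Unstructured (natural-language) entry point."""
        return self.slm.generate(question, self.adapter.retrieve(question))

    def call(self, request: dict) -> dict:
        """Structured entry point: dictionary in, dictionary out."""
        question = request["question"]
        facts = self.adapter.retrieve(question)
        return {"question": question, "facts": facts,
                "answer": self.slm.generate(question, facts)}


# Made-up facts, used only to exercise the sketch.
backend = KeywordBackend([
    "The pump needs an oil change every 500 hours.",
    "The valve is rated for 16 bar.",
])
ssm = SSM(slm=EchoSLM(), adapter=Adapter([backend]))

if __name__ == "__main__":
    print(ssm.discuss("How often does the pump need an oil change?"))
    print(ssm.call({"question": "What is the valve rated for?"}))
```

Keeping the adapter as a separate layer means a new knowledge source only has to implement `query`; the front-end model and its two entry points stay unchanged.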

Figure – Easily combine domain knowledge with SSMs

The composable nature of SSMs allows for easy combination of domain-knowledge sources from multiple models.
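One way to think about composability is that combining knowledge sources yields something that is itself a knowledge source, so combinations can be nested. The sketch below illustrates that idea in plain Python. It assumes a knowledge source can be modelled as a function from a question to a list of facts; `make_source` and `compose` are hypothetical helpers, not part of OpenSSM, and the facts are placeholders.

```python
from typing import Callable

# Assumption: a knowledge source is any callable mapping a question to facts.
KnowledgeSource = Callable[[str], list[str]]


def make_source(facts: dict[str, str]) -> KnowledgeSource:
    """Build a toy source that returns each fact whose key appears in the question."""
    def source(question: str) -> list[str]:
        lowered = question.lower()
        return [fact for key, fact in facts.items() if key in lowered]
    return source


def compose(*sources: KnowledgeSource) -> KnowledgeSource:
    """Combine sources into one; the result is itself a source, so it nests."""
    def combined(question: str) -> list[str]:
        merged: list[str] = []
        for source in sources:
            for fact in source(question):
                if fact not in merged:  # drop duplicates, keep first-seen order
                    merged.append(fact)
        return merged
    return combined


# Placeholder facts, used only to exercise the sketch.
PUMP_FACT = "Pumps are serviced by the maintenance team."
VALVE_FACT = "Valves are inspected during shutdowns."

pump_expert = make_source({"pump": PUMP_FACT})
valve_expert = make_source({"valve": VALVE_FACT})

# Two specialists combined into one, then combined again with an overlapping source.
plant_expert = compose(pump_expert, valve_expert)
site_expert = compose(plant_expert, make_source({"pump": PUMP_FACT}))

if __name__ == "__main__":
    print(plant_expert("Who looks after the pump and the valve?"))
    print(site_expert("Tell me about the pump."))
```

Because `compose` returns the same kind of object it accepts, a team of specialists can be treated as a single specialist by whatever sits above it.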

Near-Term Roadmap

  • Play with SSMs in a hosted SSM sandbox
  • Create SSMs in your own development environment, and integrate SSMs into your own AI apps
  • Capture domain knowledge in various forms into your SSMs
  • Apply SSMs in collaborative problem-solving AI systems

Getting Started

You can begin contributing to the OpenSSM project or use our pre-trained SSMs for your industrial projects. See our Getting Started Guide for more information.

Community

Join our vibrant community of AI enthusiasts, researchers, developers, and businesses who are democratizing industrial AI through SSMs. Participate in the discussions, share your ideas, or ask for help on our Community Forum.

Contribute

OpenSSM is a community-driven initiative, and we warmly welcome contributions. Whether it's enhancing existing models, creating new SSMs for different industrial domains, or improving our documentation, every contribution counts. See our Contribution Guide for more details.

License

OpenSSM is released under the Apache 2.0 License.
