Giter VIP home page Giter VIP logo

decoded-pathway's Introduction

Decoded Pathway

Overview

Using generative AI to chat with local documents has become a safer entry point for integrating artificial technology into various settings. This methodology helps maintain privacy and reduce hallucinations by grounding an AI's response in user-created documents. As the subject-matter-expert, you can accurately evaluate the responses, which is the position we want to be in when interacting with a technology we do not understand.

These models are built around splitting text into chunks and converting it to vectors. I created a process to convert raw data analysis results into a human-readable format so that the explanations around the results provide curated insights rather than exposing an LLM to raw data tables.

The goal was to create a way for anyone to ask questions about attributes associated with any O*Net occupation. I added context of how occupations are related and educational institutions that provide training as needed.

Data Files

  • O*NET Resource Center
    • Occupation Data - This file contains each O*NET-SOC code, occupational title, and definition/description.
    • Task Statements - This file contains the tasks associated with each O*NET-SOC occupation.
    • Technology Skills - This file contains the Technology Skills examples, including hot and in-demand technologies, associated with O*NET-SOC occupations.
    • Tools Used - This file contains the Tools Used examples associated with O*NET-SOC occupations.
    • Abilities - This file contains the Content Model Ability data associated with each O*NET-SOC occupation.
    • Interests - This file contains the general occupational interest (RIASEC) high-point codes and numeric profile data for each O*NET-SOC occupation.
    • Work Styles - This file contains the Content Model Work Styles data associated with each O*NET-SOC occupation.
    • Work Values - This file contains the Content Model Work Values data associated with each O*NET- SOC occupation.
    • Knowledge - This file contains the Content Model Knowledge data associated with each O*NET-SOC occupation.
    • Skills - This file contains the Content Model Skill data associated with each O*NET-SOC occupation.
    • Work Activities - This file contains the Content Model Work Activity data associated with each O*NET-SOC occupation.
    • Work Context - This file contains the Content Model Work Context data associated with each O*NET-SOC occupation.
    • Job Zones - This file contains each O*NET-SOC code and its corresponding Job Zone number.
    • Education, Training, and Experience - This file contains the percent frequency data associated with Education, Training, and Experience Content Model elements.
    • Education, Training, and Experience Categories - This file contains the categories associated with the Education, Training, and Experience content area.
  • National Center for Education Statistics
  • U.S. Census Bureau

Quick Start

GPT4ALL is a free tool that you can download, which is optimized to run LLMs on your local machine. I successfully ran everything on my 2020 M1 Mac with 16GB of memory. You can follow the steps below to get started:

  1. Download GPT4ALL.
  2. Install the following models: mistral-7b-openorca.Q4_0.gguf and SBert model.
  3. Create a folder to store your local documents.
  4. Move the localDocs/ONET.txt file into your new directory.
  5. Optional: Use the localDocs/single folder to use individual occupation files as reference material.
  6. In GPT4All, go to Settings > LocalDocs tab, find your directory, give the resource a name, and press "Add."
  7. Wait for it to index your files. It takes about 10 minutes (depending on your machine).
  8. Once indexing is complete, return to the chat window.
  9. Start a new chat and select the "database" icon from the top-right. This should open a popup with your local document.
  10. Select the resource. Now, all future responses will consider the local document for context.

If you get lost, you can refer to this documentation.

Example Chat

Prompt:

I am a dishwasher. What path should I follow to become an operations research analyst?

Response:

To become an Operations Research Analyst from the position of a Dishwasher, you would need to follow this career progression:

  1. Gain experience and education in mathematics or related fields such as Computer Science, Statistics, or Engineering. This can be done through community college courses while working as a dishwasher.
  2. Pursue an Associate's degree in Mathematics, Operations Research, Industrial Engineering, or a related field from a community college or online institution.
  3. Obtain relevant work experience by seeking internships or entry-level positions in data analysis, logistics, or operations research departments within various industries.
  4. Pursue further education and obtain a Bachelor's degree in Operations Research, Industrial Engineering, Mathematics, Statistics, or a related field from a university.
  5. Gain professional experience by working as an Operations Research Analyst for several years to develop expertise in the field. This may involve working on various projects within different industries, such as manufacturing, logistics, healthcare, or finance.
  6. Consider obtaining certifications and professional credentials relevant to your work, such as Certified Analytics Professional (CAP) from INFORMS or other industry-specific certifications.
  7. Continuously develop skills in data analysis, modeling, problem-solving, communication, and leadership through professional development courses, workshops, and conferences.
  8. Network with professionals within the field to establish connections that may lead to job opportunities or mentorship.
  9. Eventually, you can advance your career by pursuing a higher degree (such as Master's or Ph.D.) in Operations Research or a related field if desired. This will open up more advanced positions and research opportunities.

Prompt:

What schools should I consider for the Bachelor's degree?

Response:

Some top schools to consider for a Bachelor's degree in Operations Research or related fields include: Massachusetts Institute of Technology (MIT), Stanford University, Harvard University, Carnegie Mellon University, California Institute of Technology (Caltech), and Georgia Institute of Technology. These institutions are known for their strong programs in mathematics, engineering, and data analysis, which can be beneficial for pursuing a career as an Operations Research Analyst.

Prompt:

Are those schools aligned with the occupation education requirements?

Response:

Yes, these schools are known for granting credentials that align well with the educational requirements related to this occupation. These institutions offer programs in mathematics, engineering, and data analysis, which can be beneficial for pursuing a career as an Operations Research Analyst.

Prompt Observations

Context Matters

The context of the conversation matters. For example, I asked what skills are needed to be an architect, and it responded using ONet skills. I then asked what schools I should attend if I wanted to develop those skills, and it returned a list of universities associated with ONet skills. I got a different response when I asked which skills offer degrees in architecture. Combining those prompts returned a list of schools that offered degrees related to the previous skills found in the data.

decoded-pathway's People

Contributors

camaal avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.