Giter VIP home page Giter VIP logo

dataset-generator's Introduction

Hello!

Interests:

  • AI safety (alignment)
  • Machine learning & deep learning
  • Science
  • Maths

I was the original author and current maintainer of the Book Project and the Books API.

dataset-generator's People

Contributors

knjk04 avatar

Watchers

 avatar  avatar

dataset-generator's Issues

Remove extra characters in the exported JSON result

Given this query:

HP quotes

I get back this JSON:

{
  " Quote ": {
    "1": " \"Happiness can be found, even in the darkest of times, if one only remembers to turn on the light.\" ",
    "2": " \"It does not do to dwell on dreams and forget to live.\" ",
    "3": " \"It is our choices, Harry, that show what we truly are, far more than our abilities.\" ",
    "4": " \"Fear of a name increases fear of the thing itself.\" ",
    "5": " \"We are only as strong as we are united, as weak as we are divided.\" ",
    "6": " \"It takes a great deal of bravery to stand up to our enemies, but just as much to stand up to our friends.\" ",
    "7": " \"It is our choices that show what we truly are, far more than our abilities.\" ",
    "8": " \"Happiness can be found even in the darkest of times, if one only remembers to turn on the light.\" ",
    "9": " \"It is our choices that show what we truly are, far more than our abilities.\" ",
    "10": " \"It is our choices that show what we truly are, far more than our abilities.\" "
  },
  " Author ": {
    "1": " Albus Dumbledore ",
    "2": " Albus Dumbledore ",
    "3": " Albus Dumbledore ",
    "4": " Albus Dumbledore ",
    "5": " Albus Dumbledore ",
    "6": " Albus Dumbledore ",
    "7": " Albus Dumbledore ",
    "8": " Albus Dumbledore ",
    "9": " Albus Dumbledore ",
    "10": " Albus Dumbledore "
  }
}

Expected result:

  • No backslashes in the quotes
  • No leading and ending quotes in the column names and values

Exported JSON should be an array of objects containing columns in each object

Given this query:

HP quotes

I get back this JSON:

{
  "Unnamed: 0": {
    "0": null,
    "1": null,
    "2": null,
    "3": null,
    "4": null,
    "5": null,
    "6": null,
    "7": null,
    "8": null,
    "9": null,
    "10": null
  },
  " Quote ": {
    "0": "------",
    "1": " \"Happiness can be found, even in the darkest of times, if one only remembers to turn on the light.\" ",
    "2": " \"It does not do to dwell on dreams and forget to live.\" ",
    "3": " \"It is our choices, Harry, that show what we truly are, far more than our abilities.\" ",
    "4": " \"We must all face the choice between what is right and what is easy.\" ",
    "5": " \"Fear of a name increases fear of the thing itself.\" ",
    "6": " \"It is our choices that show what we truly are, far more than our abilities.\" ",
    "7": " \"It is our choices that make us who we are, and we can always choose to do what is right.\" ",
    "8": " \"Happiness can be found even in the darkest of times, if one only remembers to turn on the light.\" ",
    "9": " \"It is our choices that make us who we are, and we can always choose to do what is right.\" ",
    "10": " \"It is our choices that show what we truly are, far more than our abilities.\" "
  },
  " Author ": {
    "0": "-------",
    "1": " Albus Dumbledore ",
    "2": " Albus Dumbledore ",
    "3": " Albus Dumbledore ",
    "4": " Albus Dumbledore ",
    "5": " Albus Dumbledore ",
    "6": " Albus Dumbledore ",
    "7": " Albus Dumbledore ",
    "8": " Albus Dumbledore ",
    "9": " Albus Dumbledore ",
    "10": " Albus Dumbledore "
  }
}

Expected result:

  • No unnamed object
  • An array of objects where each object contains the quote and author

Containerise backend and frontend services

Using docker-compose, create services that run the backend (Flask) and frontend (streamlit).

The backend service should start first as the frontend should depend on the frontend service running.

Add option to use gpt-3.5-turbo

Add gpt-3.5-turbo functionality that uses the Chat Completions endpoint

Acceptance criteria:

  • Add gpt-3.5-turbo to the list of radio buttons
  • Query GPT-3.5 or Da Vinci 003 depending on which radio button was selected
  • Return an error message still with GPT-3.5 if no table could be generated
  • Add a tooltip that offers advice on whether a user should use Da Vinci or 3.5

Only show app logs

The logs currently show logs from the OpenAI and standard library. Restrict it to only this app's logs.

Allow users to use their own API Key

To save on costs, allow a user to add their own API key.

Acceptance criteria:

  • Only allow GPT 3.5 if a user is not supplying their own API key
  • Limit the number of queries to 3 if a user is not supplying their own API key
  • Have a text input box that allows a user to enter in their API key
  • Use the user's API key instead of the app's key for the duration of the session

Hide heading link

image

Remove the link to the header, ideally with Python code rather than CSS.

Add Claude

Once granted access to Anthropic Claude, add an endpoint that queries and returns a response from Claude.

Incorporate DALL-E

Add an endpoint that generates images using DALL-E.

The frontend should show a grid of images.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.