skribbliohints.github.io

This GitHub site contains data about the words used by skribbl.io. The English data was collected in late 2019 and was complete until November 9, 2022, when skribbl.io released an update that added more words. The English and German data are now outdated. The Spanish data is still up to date.

https://skribbliohints.github.io

The raw data is stored in three files in this repository: words.json (English), German.json (German), and Spanish.json (Spanish). The data was collected by a bot that played both public and private games on skribbl.io. It played private games against itself in two separate browser tabs, and it played public games against actual humans. However, around March 2, 2020, skribbl.io added reCAPTCHA, and since then it has placed the bot in public games with no other players, so the bot cannot play against humans. The English data was collected in late 2019, but the German and Spanish data were not collected until after reCAPTCHA had been added, so they do not have public game data.

Each data file uses JSON (JavaScript Object Notation) with the words themselves as the keys in the outermost object. Each word maps to an object with the following keys:

  • count: The total number of times the word was seen in both public and private games. Since the bot played many more rounds in private games than in public games, most of count comes from private games.
  • lastSeenTime: The timestamp when the word was last seen by the bot. This is in the format returned by the JavaScript Date.now() function, so it is the number of milliseconds elapsed since January 1, 1970 (UTC).
  • publicGameCount: Unfortunately, this field has a bug. It was supposed to be the number of times the bot saw the word in public games, but due to a bug in the bot's code, it was updated only if the bot guessed the word correctly. The bot guessed words based on the hint displayed by skribbl.io, so words with unique patterns ended up having a higher publicGameCount. The bug was fixed on April 23, 2020, but corrected data could not be collected because skribbl.io had introduced reCAPTCHA.
  • difficulty: The fraction of players, including the bot, that did not guess the word correctly in public games.
  • difficultyWeight: The number of players used to calculate the difficulty. Due to the bug in publicGameCount, this is the most accurate measurement of the frequency of the word in public games.

If a word has not been seen in a public game, it will not have the public game keys.
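
The structure described above can be read as in the following minimal sketch. The sample object, its words, and all of its numbers are made up for illustration, and summarizeWord is a hypothetical helper, not part of this repository. It shows how to convert lastSeenTime to a date and how to handle words that lack the public game keys.

```javascript
// Hypothetical sample in the documented shape; the words and numbers are made up.
const sampleWords = {
  apple: {
    count: 120,
    lastSeenTime: 1575158400000, // milliseconds since January 1, 1970 (UTC)
    publicGameCount: 3,
    difficulty: 0.25,
    difficultyWeight: 40,
  },
  zeppelin: {
    count: 95,
    lastSeenTime: 1583107200000,
    // No public game keys: this word was never seen in a public game.
  },
};

// Hypothetical helper: turn one raw entry into a summary object.
function summarizeWord(word, entry) {
  // The public game keys are optional, so test for them before reading.
  const seenInPublicGame = "difficulty" in entry;
  return {
    word,
    count: entry.count,
    // Date accepts the same millisecond value that Date.now() returns.
    lastSeen: new Date(entry.lastSeenTime).toISOString(),
    seenInPublicGame,
    difficulty: seenInPublicGame ? entry.difficulty : null,
    difficultyWeight: seenInPublicGame ? entry.difficultyWeight : 0,
  };
}

// The words are the keys of the outermost object.
const summaries = Object.entries(sampleWords).map(([word, entry]) =>
  summarizeWord(word, entry)
);
console.log(summaries);
```

With a real data file, the parsed contents of words.json, German.json, or Spanish.json would take the place of sampleWords.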

This website has some data analysis tools written in JavaScript. It does not use the raw field names. Instead, it uses descriptive text and it manipulates some fields. For example, instead of difficulty, the website has "Percent of players that guess correctly (including the bot)" which is 1 - difficulty. It also adds the "Difficulty percentile" field to each word. The percentile is calculated by the JavaScript code, so it is not stored in the data. The r= number below the graph is the Pearson correlation coefficient of the data shown in the graph.
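
The derived values mentioned above can be sketched as follows. This is not the website's actual code: the sample data is made up, the function names are hypothetical, and the percentile definition (the percentage of words with a strictly lower difficulty) is an assumption, since the exact formula is not stated here. The Pearson function uses the standard textbook formula.

```javascript
// Hypothetical entries in the documented shape; the numbers are made up.
const sample = {
  cat: { count: 10, difficulty: 0.1, difficultyWeight: 20 },
  bridge: { count: 20, difficulty: 0.3, difficultyWeight: 30 },
  zeppelin: { count: 30, difficulty: 0.5, difficultyWeight: 10 },
  lonely: { count: 15 }, // never seen in a public game
};

// Keep only words with public game data and add "percent correct",
// which is 1 - difficulty.
const rows = Object.entries(sample)
  .filter(([, entry]) => typeof entry.difficulty === "number")
  .map(([word, entry]) => ({
    word,
    count: entry.count,
    difficulty: entry.difficulty,
    percentCorrect: 1 - entry.difficulty,
  }));

// Assumed definition: percentage of words with a strictly lower difficulty.
function addDifficultyPercentile(items) {
  return items.map((item) => {
    const lower = items.filter((o) => o.difficulty < item.difficulty).length;
    return { ...item, difficultyPercentile: (100 * lower) / items.length };
  });
}

// Pearson correlation coefficient of two equal-length numeric arrays.
// Returns NaN if either array has zero variance.
function pearson(xs, ys) {
  const n = xs.length;
  const meanX = xs.reduce((a, b) => a + b, 0) / n;
  const meanY = ys.reduce((a, b) => a + b, 0) / n;
  let cov = 0;
  let varX = 0;
  let varY = 0;
  for (let i = 0; i < n; i++) {
    const dx = xs[i] - meanX;
    const dy = ys[i] - meanY;
    cov += dx * dy;
    varX += dx * dx;
    varY += dy * dy;
  }
  return cov / Math.sqrt(varX * varY);
}

const withPercentile = addDifficultyPercentile(rows);
const r = pearson(
  rows.map((row) => row.count),
  rows.map((row) => row.difficulty)
);
console.log(withPercentile, "r =", r);
```

Computing the percentile at load time, rather than storing it, keeps the raw files unchanged when the set of words changes.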
