Sound Names: Classify Names Using the Sequence of Sounds

Building on prior work that classifies names based on the sequence of characters, we create a model that capitalizes on the sequence of sounds to classify names.

To capture the phonetic similarity of different names, we first produce sound encodings of the names using Metaphone (https://pypi.org/project/Metaphone/#contents) and then train an LSTM on those encodings to test classification accuracy. We find that the accuracy is substantially lower than what we achieve when we apply an LSTM directly to the name strings. This suggests that spellings carry some information beyond the sound and, very plausibly, that sound-encoding algorithms do not fully capture how a name is pronounced.
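As a rough illustration of the encoding step, the sketch below turns names into sound codes and then into fixed-length integer sequences of the kind a sequence model such as an LSTM could consume. It is not the repository's code. `toy_sound_code` is a hypothetical stand-in encoder so that the example runs without dependencies; with the Metaphone package installed, one would presumably pass something like `lambda n: doublemetaphone(n)[0]` instead. The function names, the padding scheme, and `max_len` are all assumptions.

```python
from typing import Callable, Dict, List, Tuple


def toy_sound_code(name: str) -> str:
    """Hypothetical stand-in for a phonetic encoder (NOT Metaphone).

    Uppercases the name, keeps letters only, and drops every vowel
    (including Y) that is not the first letter.
    """
    letters = [c for c in name.upper() if c.isalpha()]
    return "".join(
        c for i, c in enumerate(letters) if i == 0 or c not in "AEIOUY"
    )


def build_vocab(codes: List[str]) -> Dict[str, int]:
    """Map each symbol to a positive integer; 0 is reserved for padding."""
    symbols = sorted({ch for code in codes for ch in code})
    return {ch: i + 1 for i, ch in enumerate(symbols)}


def to_padded_ids(code: str, vocab: Dict[str, int], max_len: int) -> List[int]:
    """Truncate or right-pad the symbol ids to exactly max_len entries."""
    ids = [vocab[ch] for ch in code][:max_len]
    return ids + [0] * (max_len - len(ids))


def encode_names(
    names: List[str],
    encoder: Callable[[str], str] = toy_sound_code,
    max_len: int = 8,
) -> Tuple[List[List[int]], Dict[str, int]]:
    """Encode names as sound codes, then as padded integer sequences."""
    codes = [encoder(n) for n in names]
    vocab = build_vocab(codes)
    return [to_padded_ids(c, vocab, max_len) for c in codes], vocab


if __name__ == "__main__":
    sequences, vocab = encode_names(["Smith", "Smyth", "Lee"], max_len=6)
    for seq in sequences:
        print(seq)
```

Under this toy encoder, "Smith" and "Smyth" map to the same sequence, which is the kind of phonetic collapsing the sound-based model relies on. It also shows why spelling information is lost before the model ever sees the name.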

In the future, we plan to ensemble the two models.

Scripts

  1. Download FL Voter Data
  2. Prepare FL Voter Data
  3. LSTM Model Based on Metaphone Encoding
  4. Comparison Between Ethnicolr, Soundnames, and a Naive Average of the Two Models
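A naive average of two models can be sketched as below, assuming each model returns a mapping from class label to predicted probability for a given name. This is an illustration, not the comparison script itself; the function names and the example labels and probabilities are hypothetical.

```python
from typing import Dict


def naive_average(
    probs_a: Dict[str, float], probs_b: Dict[str, float]
) -> Dict[str, float]:
    """Average two per-class probability mappings with equal weight.

    A label missing from one model is treated as probability 0.0 there.
    """
    labels = sorted(set(probs_a) | set(probs_b))
    return {
        label: (probs_a.get(label, 0.0) + probs_b.get(label, 0.0)) / 2
        for label in labels
    }


def predict_label(probs: Dict[str, float]) -> str:
    """Return the label with the highest probability."""
    return max(probs, key=probs.get)


if __name__ == "__main__":
    # Hypothetical outputs for one name from a character-based model
    # and a sound-based model.
    char_model = {"group_a": 0.6, "group_b": 0.4}
    sound_model = {"group_a": 0.2, "group_b": 0.8}
    averaged = naive_average(char_model, sound_model)
    print(averaged, predict_label(averaged))
```

Equal weighting is the simplest choice. A weighted average or a stacked model would be natural alternatives, since the sound-based model is reported to be less accurate.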

Authors

Suriyan Laohaprapanon and Gaurav Sood
