- 🔡 Work wtih NLP everyday, but time series is my crush.
- 📫 How to reach me: [email protected]
- 🔭 Co-Author of The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
- 🌱 Maintainer of PIISA Project
- 🤗 Contributor to SantaCoder
Name: Ian Yu
Type: User
Company: Groupby Inc
Location: Toronto
Streamlit App for my Asset Allocation System
Forecasting returns and risks for 5 major markets with a neural network, and allocate investment weighting to maximize return while minimizing risks. Click link to see the web app
PII Detection Code Developed by the AISC Community For BigScience Datasets
Tools for managing datasets for governance and training.
This script shows how you can calculate the distance and duration between an origin and many destinations using Python and Google API distance matrix
Gimme Mochi
Jupyter handsontable integration
An end-to-end asset allocation package
Text pre-processing for NLP datasets
My personal Nextjs 13 starter template!
PII Processing code to detect and remediate PII in BigScience datasets. Reference implementation for the PII Hackathon
Project setup through poetry and cookiecutter
A scikit-learn based module for multi-label et. al. classification
My first-time project to build a longer-term stock market forecast as the first step towards my goal of building an asset allocator optimizer.
This data is scraped from Whole Foods Featured On Sale Web Page. Features EDA, Amazon Prime to Non-Amazon Prime membership discounts on sale products as well as app deployment to show live insights.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.