melcutz Goto Github PK
Name: Claudiu Branzan
Type: User
Location: Seattle, WA
Name: Claudiu Branzan
Type: User
Location: Seattle, WA
Python parser for Adblock Plus filters
Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even when making big crawls (one billion pages).
Detect and classify pagination links
Simulation of measures to prevent spread of Covid19
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++
Implementation of research papers on Deep Learning+ NLP+ CV in Python using Keras, Tensorflow and Scikit Learn.
A library for debugging/inspecting machine learning classifiers and explaining their predictions
Extract embedded metadata from HTML markup
Tool to flatten stream of JSON-like objects, configured via schema
Formasaurus tells you the type of an HTML form and its fields using machine learning
A scalable frontier for web crawlers
A simple fuzzy matching set for python strings
Google Analytics collector-as-a-service (using GA measurement protocol).
Google Cloud Client Library for Python
Free Notebooks and code
Given a new image, determine if it is likely derived from a known image.
Stand-alone language identification system
Simple MCMC
A python library detect and extract listing data from HTML page.
Models built with TensorFlow
Run MapReduce jobs on Hadoop or Amazon Web Services
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Automatically exported from code.google.com/p/negex
Semantic natural language understanding at scale using Spark, machine-learned annotators and deep-learned ontologies
A simple algorithm for clustering web pages, suitable for crawlers
sqldf for pandas
Visual scraping for Scrapy
Predictive Services query client
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.