krickert Goto Github PK
Name: Kristian Rickert
Type: User
Company: Gazebo Today Magazine
Bio: a man has no bio
Location: New York
Blog: krickert.com
Name: Kristian Rickert
Type: User
Company: Gazebo Today Magazine
Bio: a man has no bio
Location: New York
Blog: krickert.com
Simple bash script that crawls a website from a list of files and saves the output. Useful for creating unit tests. Not much more.
Given a binary tree, calculates the maximum path
Norconex Web Crawler (or spider) is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.
Create jobs to launch crawls for selenium.
web crawler that works over selenium and extracts the text from the plain html
Client tools for working with Fusion, such as to support the hbase-indexer sending docs to a Fusion pipeline.
grpc micronaut serialization is broken in 4.2.0 - example code to demonstrate this behavior
Solr query parser plugin that performs proper query-time synonym expansion.
Implementation of a Gherkin language to test against selenium.
Plugins that extend Solr's capabilities
Automatically downloads a database of known IP addresses as well as other location data and creates a lucene index for spatial searching for IP addresses within a specific range (or other criteria). Stores lat/lon, zip, city, country, and IP addresses for fast lucene search. Index is 1GB upon completion.
Dead simple java wget. Just one static class and a status enum. No dependencies.
Sample code to return a 1x1 transparent GIF using Spring MVC as well as a simple servlet. This is useful if you want to implement tracking on your website. It's an in-memory servlet so it's fast as hell and won't open a file handle list most people do so dumbly.
Simple MongoDB oplog reader written in java. Made to read the oplog from multiple sources.
A committer allowing a norconex crawl to publish crawled data to a Kafka topic.
Mirror of Apache Lucene + Solr
Takes in markdown documents and outputs well structured test. Meant for a precursor for chunking in a text processing pipeline.
Integration between Micronaut and GRPC
simple micronaut kafka container test for kafka unit testing. Examples include kafka serialization with strings, with avro, and with protocolbufs.
a place to store the vectorized documents for solr search
NLP Named Entity Recognition Text Processor Microservice
Takes in a PIpeDocument for a PipeService and runs it through the configured stages.
Rag Models for protocol buffers
A Python Script that prepairs and installs a Raspberry Pi compatiable distro to an SD Card
Search API for the vector-based search engine ecosystem
RAG search engine based on wikipedia
Introduction to search: from query to results
Apache Solr open-source search software
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.