sohail-sankanur
Name: Sohail Sankanur
Type: User
Company: Monash University
Bio: Big Data Engineer
Location: Melbourne
Data analysis of very large text and CSV datasets. PySpark is used as the analysis tool and MongoDB as the database for storage.
In this project, I implemented a language analyser that investigates the linguistic characteristics of children with some form of language disorder. The analyser computes basic descriptive statistics on a number of linguistic features and presents them as visualizations. The dataset, known as ENNI (https://childes.talkbank.org/access/Clinical-MOR/ENNI.html), is a collection of narrative transcripts gathered for a clinical study carried out in Alberta, Canada, to study children with language disorders. Two sets of data were collected: the first is from children diagnosed with Specific Language Impairment (SLI), one form of language disorder; the second is from children with typical development (TD). Based on certain rules defined in the transcripts, I filtered out the relevant lines using regex and then used them to classify and visualize the data. I made extensive use of Python classes and functions throughout this project. Python packages used: glob, re, matplotlib, numpy.
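The regex filtering step described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes child utterances are marked with a `*CHI:` prefix (as in CHAT-format transcripts), and the names `CHILD_LINE`, `child_utterances`, `describe`, and the sample lines are all hypothetical.

```python
import re
from statistics import mean

# Assumption: child utterance lines start with "*CHI:" (CHAT-style transcripts).
CHILD_LINE = re.compile(r"^\*CHI:\s*(.*)$")
# Assumption: a "word" is a run of letters or apostrophes; punctuation is ignored.
WORD = re.compile(r"[A-Za-z']+")


def child_utterances(lines):
    """Return the text of every line spoken by the child."""
    utterances = []
    for line in lines:
        match = CHILD_LINE.match(line)
        if match:
            utterances.append(match.group(1))
    return utterances


def describe(utterances):
    """Compute simple descriptive statistics over a list of utterances."""
    lengths = [len(WORD.findall(u)) for u in utterances]
    return {
        "utterances": len(lengths),
        "words": sum(lengths),
        "mean_length": mean(lengths) if lengths else 0.0,
    }


# Hypothetical transcript excerpt, made up for illustration only.
sample = [
    "@Begin",
    "*EXA:\twhat happened next ?",
    "*CHI:\tthe giraffe took the ball .",
    "%mor:\tdet|the n|giraffe v|take",
    "*CHI:\tand he was happy .",
]

stats = describe(child_utterances(sample))
print(stats)
```

The resulting dictionary could then be computed per group (SLI and TD) and passed to matplotlib for plotting.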
Data wrangling project on an Uber dataset. Missing values, falsified values, and multiple types of outliers in the dataset have been removed using data wrangling tools and techniques.
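As a rough illustration of this kind of cleaning, the sketch below drops missing values and then removes outliers with the common 1.5 × IQR rule. The description above does not say which outlier techniques the project uses, so the IQR rule, the function name `drop_missing_and_outliers`, and the sample values are assumptions made for the example.

```python
from statistics import quantiles


def drop_missing_and_outliers(values, k=1.5):
    """Remove None entries, then remove values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    present = [v for v in values if v is not None]
    if len(present) < 2:
        return present  # quartiles need at least two data points
    q1, _, q3 = quantiles(present, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in present if low <= v <= high]


# Hypothetical trip distances; None marks a missing value, 100 is an outlier.
distances = [1, 2, None, 3, 4, 5, 100]
cleaned = drop_missing_and_outliers(distances)
print(cleaned)
```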
This is a combat simulator that pits one army against another. Army combat is simulated through well-defined rules, constraints, and error handling.
Cortex Analyzers Repository
Documentation of Cortex
Machine learning project to predict the likelihood of loan defaults.
Docker Compose file for a 3-node Elasticsearch cluster with Kibana, for self-learning.
Docker Compose file for a single-node Elasticsearch instance with security enabled, along with Kibana.
Facial image recognition using a deep neural network on smiling and neutral images.
Classifying fake and real news using deep learning.
Logs are sent to a network of Elasticsearch containers using Logstash, and Kibana is used for analysis.
Project to predict rain in Australia using machine learning. The dataset is large, so big data concepts and techniques have been used. PySpark is the tool used and MongoDB is used as the database.
pollen - A command-line tool for interacting with TheHive
Predictive Power Score (PPS) in Python
Apache Spark - A unified analytics engine for large-scale data processing
TA-thehive Cloud Edition
TheHive: a Scalable, Open Source and Free Security Incident Response Platform
A repository to share contributions related to TheHive Project