Giter VIP home page Giter VIP logo

Sohail Sankanur's Projects

big-data-analytics icon big-data-analytics

Performing Data Analysis of Text Data and CSV data considering the data is very large. PySpark has been used as the tool for analysis and MongoDB is used as Database for storage.

child_language_analyzer icon child_language_analyzer

In this project, I have implemented a language analyser which investigates the linguistic characteristics of children with some form of language disorder. The analyser can perform basic descriptive statistics on a number of linguistic features in the form of visualization. The dataset is known as ENNI [https://childes.talkbank.org/access/Clinical-MOR/ENNI.html] which is a collection of narrative transcripts gathered for a clinical study carried out in Alberta, Canada, to study children with language disorders. Two sets of data were collected: the first set is from children diagnosed with Specific Language Impairment (SLI) — one form of language disorders; and the second set is from children with the typical development (TD). Based on certain rules defined in the transcript, I have filtered out relevant lines using regex and then used them to clasify and visualize. I have made extensive use of Python Classes and Functions throughout this project. Python packages used: glob re matplotlib numpy

cleaning-uber-dataset icon cleaning-uber-dataset

Project for wrangling of Uber Dataset. Missing Values, Falsified Values and multiple type of outliers in the dataset has been removed using tools and techniques of Data Wrangling.

combat_simulator icon combat_simulator

This is a combat simulator which pits one army against another. The army combat is simulated through well defined rules, constraints and error handling.

machine-learning-to-predict-rain icon machine-learning-to-predict-rain

Project to predict Rain in Australia using Machine Learning. The datasets which is used is large hence Big Data Concepts and techniques have been used. Pyspark is the tool used and MongoDB is used as database

pollen icon pollen

pollen - A command-line tool for interacting with TheHive

ppscore icon ppscore

Predictive Power Score (PPS) in Python

spark icon spark

Apache Spark - A unified analytics engine for large-scale data processing

thehive icon thehive

TheHive: a Scalable, Open Source and Free Security Incident Response Platform

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.