Ashish Kumar Yadav's Projects
It calculates the accuracy score and confusion matrix for a logistic regression model. The dataset is about coupon used or not in an apparel store known as Simmons .
Hypothesis testing using T-test,ANOVA,chi-square test.
TO determine if there is any evidence about deiffernce in mean arsenic concentration in rural arizona community and phoenix metropolitan community.
Config files for my GitHub profile.
Basic Logical Interface of ATM machine of any bank purely using c++
This problem concludes which factor is significantly effecting the CAT Score out of College type,program type,and interaction factor type for sample data. Here factorial Experiment design and Two Way Anova is used.
Here i have taken an example of a school called as ABN Public School.In this we performed chi square test of independence to determine whether the variables 'gender' and 'student motivation' are independent of each other or not.
In this repo i have tried to explain how to calculate Euclidean Distance,manhattan distance, and Finally Calculating the Dissimilarity Matrix/Distance Matrix using python.
Basic c++ programs
Here we will learn how to identify a curvilinear model and how to do the modelling for that using OLS(Ordiniary Least Square) method. Plotting residual plots for the models.
It consists of examples of data visualisation using suitable type of chart on an example dataset - P12OfficeSupplies.csv
Performing DBSCAN(Density based spatial clustering of application with noise) Clustering. As the name suggest it is used specially for diligently handling the noise data or outliers in a dataset.
To check whether the St. john's wort drug treatment is more effective than placebo drug treatment in treating major depression patients.
Detecting Parkinson Using extreme gradient boosting(XGBOOSTING) Algorithm.
In this repository I have performed Exploratory data analysis on the dataset famously known as House Price Prediction.
This is an another project in which i have Performed Exploratory data analysis on a dataset about online retailers.
In this repository we have performed Exploratory Data analysis to visualise and clean the data. After that we have build two models that is Logistic Regression model and XGBClassifier model to predict the survivors values. And at last we have computed the accuracy for both of our model and also the classifiaction report of the logistic Regression Algorithm.
Mostl oftenly used Encoding techniques for categorical Varibales are performed here.
In this repository I have performed Exploratory Data Analysis on the dataset student_performance.csv. In which i have tried to detect outliers,missing values,relationship among features and across features,Categorical data and continuous/numerical data.
We have detected fake news with Python in this project. We took a political dataset, implemented a TfidfVectorizer, initialized a PassiveAggressiveClassifier, and fit our model. We ended up obtaining an accuracy of 92.74% in magnitude.
In this code handling of the missing values for the categorical features from any dataset is shown.
In this project we have performed all types of feature transfromation on the titanic dataset and we have seen the usage of qqplot to check whether a feature is normal/gaussian distributed or not.
This is a plain chatbot devloped using the OPENAI api. It leverages the following libraries - langchain, openai, huggingface_hub, python-dotenv, streamlit, pandas.
In this we have taken a sample data to check if that data follows normal distribution or not. A hypothesis test is performed to check this using the chi-square test pf goodness of fit.
In this code, we will determine that whether the frequency by which car arrives as per our dataset follows a posison distribution or not using the chi-Square test.
This code will check whether our data follows a uniforn distrbution or not. Using the goodness of fit test with the help of chi square method.
This project perform analysis about the term "Cloud Computing" . It analyses how many times the word has been searched and by how many people on daily,monthly and yearly basis in the whole world. After analysing this we can answer questions like when search was maximum,when it was minimum,what were the search trends, which countries have searched most about it etc. This projectcan help us understand what's the current requirement of market is and what problems needed to be solved for effective growth of any organiztaion.
In this code the missisng numerical values inside any feature is handled using various techniques which are mentioned in the coding part itself.