Daria Plewa's Projects
This project applied machine learning (the glm function on the Adult dataset) in R. The goal was to develop a model that predicts whether a person's income is under or over 50k per year.
Finding mutations in genomic data using the chi-squared test and parallel processing functions in Python and R
An evaluation of the performance of individual CNV detection software on state-of-the-art sequencing data. All analyses were performed using the Python and R programming languages.
This project explores the PBRM1-PIAS1 interaction in epithelial differentiation through ChIP-seq analysis, highlighting EZH2's role and implications for cholesterol biosynthesis in cellular processes.
Here I commit my projects of data cleaning and (mostly) visualization in tools such as Excel, Power BI, and Tableau.
Investigating HIV subtype evolution through maximum-likelihood and GTR model analysis, focusing on clade formation and genetic diversity. Utilizing BEAST for HIV-1 sequence analysis to estimate evolutionary rates and MRCA, highlighting subtype diversification.
Study of microsatellite sequences in the Red fox population
Genomic analysis of Canis lupus using genomic maps and phylogenetic trees
A robot powered training repository :robot:
A medical database project with user login and the ability to add new records to the database.
Analysis of the SNPs that have the greatest impact on the occurrence of mammary gland tumours in dogs.
The project focused on the analysis of foxes from 3 different farms and the subsequent presentation of trait heritability results, genetic trends, breeding values and relationship matrix.
Estimation of the Shapiro-Wilk test using the Monte Carlo method.
Config files for my GitHub profile.
PCA analysis of GUS data
Genomic Analysis Pipeline: Automate data preprocessing, variant calling, and annotation with Snakemake. Ensure reproducibility and reliability in genomic studies.
Code from the course "The Basics of Statistical Data Modelling"
Analyzing genetic variation and selection in three-spined stickleback populations, these projects apply statistical and evolutionary genomic analyses to understand environmental influences and identify outlier SNPs, highlighting the interplay between natural selection and population differentiation.
Translation of a FASTA file of nucleotide sequences into amino acid sequences.
This project leverages PLINK for GWAS Quality Control, focusing on SNP analysis and PCA for ancestry insights, showcasing the vital role of bioinformatics in enhancing population genetics research.
This project conducts spatial analysis on lung cancer tissue using Scanpy and data from 10x Genomics, focusing on preprocessing, quality control, and functional analysis of spatially variable genes. Insights into the molecular heterogeneity of lung cancer are uncovered, highlighting regions of interest for further research.
Suncharge company supply chain data visualisation.
This project focuses on survival analysis of patients with AIDS, using the Python library Lifelines.
The aim of this project was to create a classification model for patients with suspected brain tumour development based on MRI images with Keras and TensorFlow.
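
To illustrate the kind of task described in the FASTA translation project above, here is a minimal Python sketch. It is not the code from that repository: the function names `parse_fasta` and `translate` are hypothetical, and the sketch assumes the standard genetic code, reading frame 1, and translation that stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (an assumption; other codes exist, e.g. mitochondrial).
# Amino acids are listed in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def parse_fasta(text):
    """Return a dict mapping each FASTA header to its joined sequence."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line.upper())
    return {h: "".join(parts) for h, parts in records.items()}


def translate(dna):
    """Translate a nucleotide string in frame 1 until the first stop codon.

    Codons that are not in the table (e.g. containing N) become 'X'.
    """
    dna = dna.upper().replace("U", "T")  # accept RNA input as well
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    example = ">seq1\nATGGCC\nTAA\n"
    for name, seq in parse_fasta(example).items():
        print(name, translate(seq))
```

In practice, a library such as Biopython would likely be a more robust choice for real FASTA files; the sketch above only shows the core idea with the standard library.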