Giter VIP home page Giter VIP logo

inf506-dataanalysismethods's Introduction

DataAnalysisMethods -INF506 Turkish German University

ÜBUNG 1 invert normal_cdf by using binary search and plot data histogram (mu=0 and sigma=57)

ÜBUNG 2 Eine Münze wird 2, 3, 4, 5, 10, 15, 50, 150 Mal geworfen und jeder Wurf aufgezeichnet. Sie zeichnen und vergleichen die beobachteten posterioren Wahrscheinlichkeiten. (bitte kodieren Sie die Beta-Verteilung von Grund auf)

ÜBUNG 3

You should fetch the data from the URL http://spamassassin.apache.org/old/publiccorpus/ to create a spam filter dataset. This dataset should include emails annotated as spam (20030228_spam.tar.bz2) and not spam (20030228_easy_ham.tar.bz2) print(len(ham_filenames)) print(len(spam_filenames))

After fetching, you should read and sort data in the folders. By using the parser api of the email package (https://docs.python.org/3/library/email.parser.html) you should be able to parse the content of the emails. import email import email.policy print(ham_emails[1].get_content().strip()) print(spam_emails[6].get_content().strip())

Explore content type of emails whether they are text/plain or text/html. For plain text return content, otherwise return html to plain version. X = np.array(spam_emails + ham_emails, dtype=object) sample_html_spam = X[0] print(html_to_plain_text(sample_html_spam.get_content()))

ÜBUNG 4 Iris-Dataset aus der sklearn-Bibliothek laden. (1) Datenverständnis; deskriptive Statistikergebnisse, Überprüfung von Nulleinträgen (2) Korrelationsanalyse (3) Ausreißeranalyse (4) explorative Datenanalyse; Plotten Sie paarweise Relationen von Features, Plotten Sie gemeinsame und marginale Verteilungen von zwei Features

Bitte kommentieren Sie alle Grafiken in den Spalten (2), (3), (4)

inf506-dataanalysismethods's People

Contributors

busrasari avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.