datamining-machinelearning's Introduction

DataMining-MachineLearning

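The main goal of this study is to combine business intelligence and data mining techniques on the UCI credit card default dataset. Clustering models are used to explore customer segments, and classification algorithms are used to build a relatively stable and effective prediction model. The result is meant to give the relevant departments and card issuers a guideline for lowering the default rate and, in turn, reducing credit risk.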

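The article dataset is the OnlineNewsPopularity dataset from the UCI repository. It covers news articles published on Mashable in 2013-2014, with about 40,000 samples and 63 variables. The dependent variable in the original data is the number of shares per article. We use 1400 as the cut-off: articles with 1400 or more shares are labeled popular, and articles with fewer than 1400 shares are labeled not popular. We then use decision tree, bagging, and random forest algorithms to predict from an article's features whether it is popular. In our models, bagging and random forest predict better than the single decision tree: the prediction error rates are 33.7% for bagging, 33.7% for random forest, and 38.3% for the decision tree. As for influencing factors, article topic, publication time, keywords, and the type of referenced links all have an important effect on popularity.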
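The labeling rule and the error-rate comparison above can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the repository's code: the function names, the toy share counts, and the three sets of model predictions are all made up for the example, and the majority vote stands in for the aggregation step that bagging-style ensembles use.

```python
POPULAR_THRESHOLD = 1400  # cut-off stated in the text


def label_popular(shares):
    """Return 1 (popular) if shares >= 1400, else 0 (not popular)."""
    return 1 if shares >= POPULAR_THRESHOLD else 0


def majority_vote(predictions_per_model):
    """Combine 0/1 predictions from several models by majority vote."""
    n_models = len(predictions_per_model)
    return [1 if 2 * sum(votes) > n_models else 0
            for votes in zip(*predictions_per_model)]


def error_rate(y_true, y_pred):
    """Fraction of samples whose predicted label differs from the true one."""
    return sum(t != p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical share counts, used only to illustrate the labeling rule.
shares = [593, 1400, 2100, 711, 5000]
labels = [label_popular(s) for s in shares]

# Hypothetical predictions from three separately trained trees.
tree_predictions = [
    [0, 1, 1, 1, 1],
    [0, 0, 1, 0, 1],
    [1, 1, 0, 0, 1],
]
ensemble = majority_vote(tree_predictions)

for i, preds in enumerate(tree_predictions):
    print(f"tree {i}: error rate {error_rate(labels, preds):.2f}")
print(f"ensemble: error rate {error_rate(labels, ensemble):.2f}")
```

In this toy case each tree makes at least one mistake, but the mistakes fall on different samples, so the vote cancels them out. This is the usual intuition for why bagging and random forest can beat a single decision tree.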
