chinese_text_classification's Introduction

chinese-text-classification

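This repository contains the source code for the CSDN blog post 《Python中文文本分类》 (Chinese Text Classification in Python).

The master branch targets Python 3.6.

For the Python 2.7 code, see the python2.7 branch.

The repository is only meant to demonstrate how the code works, so it includes just a small part of the original dataset. The complete dataset can be downloaded from the links below: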

Training set

http://download.csdn.net/download/github_36326955/9747927

Test set

http://download.cs

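If you have any questions, please raise them in the Issues section of this project's GitHub page, or visit the blog post mentioned above and leave a comment below it.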

step1: corpus_segment.py

step2: corpus2Bunch.py (create the directories train_word_bag and test_word_bag yourself before running it)

step3: TFIDF_space.py

step4: NBayes_Predict.py
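
The four steps above can be sketched end to end with the standard library alone. This is a minimal illustration, not the code from the scripts in this repository: every function name is hypothetical, the hand-written TF-IDF and naive Bayes are pure-Python stand-ins for whatever the scripts actually use, and the documents are assumed to be already segmented into space-separated words (the kind of output a segmentation step would produce). The sample documents and labels are made up for the demo.

```python
import math
import os
from collections import Counter, defaultdict


def make_word_bag_dirs(base_dir="."):
    """Create the two output directories that step 2 expects to exist."""
    paths = [os.path.join(base_dir, name)
             for name in ("train_word_bag", "test_word_bag")]
    for path in paths:
        os.makedirs(path, exist_ok=True)  # no error if it already exists
    return paths


def fit_idf(train_docs):
    """Learn smoothed inverse document frequencies from the training docs."""
    n_docs = len(train_docs)
    doc_freq = Counter(t for doc in train_docs for t in set(doc.split()))
    return {t: math.log((1 + n_docs) / (1 + df)) + 1
            for t, df in doc_freq.items()}


def tfidf(doc, idf):
    """Weight one segmented document; terms unseen in training are dropped."""
    counts = Counter(t for t in doc.split() if t in idf)
    return {t: c * idf[t] for t, c in counts.items()}


def train_nb(vectors, labels, alpha=1.0):
    """Fit a multinomial naive Bayes model on TF-IDF weighted vectors."""
    vocab = {t for vec in vectors for t in vec}
    class_docs = Counter(labels)
    class_weights = defaultdict(Counter)
    for vec, label in zip(vectors, labels):
        class_weights[label].update(vec)  # sum term weights per class
    model = {}
    for label, weights in class_weights.items():
        total = sum(weights.values()) + alpha * len(vocab)
        model[label] = {
            "log_prior": math.log(class_docs[label] / len(labels)),
            "log_like": {t: math.log((weights[t] + alpha) / total)
                         for t in vocab},
        }
    return model


def predict(model, vec):
    """Return the label with the highest log-posterior score."""
    def score(label):
        params = model[label]
        return params["log_prior"] + sum(
            w * params["log_like"][t] for t, w in vec.items())
    return max(model, key=score)


# Made-up, pre-segmented sample data (assumption: words separated by spaces).
train_docs = ["球队 比赛 进球 球员", "比赛 冠军 球员 训练",
              "股票 市场 投资 基金", "银行 利率 投资 市场"]
train_labels = ["sports", "sports", "finance", "finance"]
test_docs = ["球员 进球 比赛", "基金 利率 市场"]

idf = fit_idf(train_docs)
model = train_nb([tfidf(d, idf) for d in train_docs], train_labels)
predictions = [predict(model, tfidf(d, idf)) for d in test_docs]
print(predictions)
```

Calling make_word_bag_dirs() from the project root before step 2 creates train_word_bag and test_word_bag if they are missing and leaves them alone if they already exist.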

If you find this project useful, perhaps consider leaving a tip? Scan with your phone:

chinese_text_classification's People

Contributors

sheldonresearch

chinese_text_classification's Issues

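vectorizer.fit_transform error

Hello, when I run your source file TFIDF_space.py, the line tfidfspace.tdm = vectorizer.fit_transform(bunch.contents) fails with TypeError: cannot use a string pattern on a bytes-like object
With the code from your blog post, I instead get TypeError: 'builtin_function_or_method' object is not iterable. What is causing this? I'm just starting out and don't understand it well, so I'd appreciate your guidance. My WeChat is zhan10; I look forward to your reply.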
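
The issue does not include enough context to pinpoint the cause, so the following is only a sketch of how these two messages can arise in Python 3, using the standard library alone. It assumes, without confirmation from the repository, that some text (documents or stop words) was read in binary mode and left as bytes, and that a method such as splitlines was referenced without parentheses. The helper names are hypothetical; the regular expression is the default token pattern documented for scikit-learn's text vectorizers.

```python
import re

# A str (not bytes) pattern, like the one text vectorizers typically apply.
TOKEN_PATTERN = re.compile(r"(?u)\b\w\w+\b")


def tokenize(doc, encoding="utf-8"):
    """Decode bytes to str before matching, then return the tokens."""
    if isinstance(doc, bytes):
        doc = doc.decode(encoding)
    return TOKEN_PATTERN.findall(doc)


def error_message(func, *args):
    """Call func(*args) and return the TypeError message, or None."""
    try:
        func(*args)
    except TypeError as exc:
        return str(exc)
    return None


# Bytes as they would come from open(path, "rb").read() (assumed scenario).
raw = "文本 分类 示例".encode("utf-8")

# Applying a str pattern to bytes raises the first TypeError from the issue.
bytes_error = error_message(TOKEN_PATTERN.findall, raw)

# Iterating over a method object (missing call parentheses) raises the second.
method_error = error_message(list, "的\n了".splitlines)

# Decoding first avoids the bytes/str mismatch.
tokens = tokenize(raw)

print(bytes_error)
print(method_error)
print(tokens)
```

If either assumption matches the failing setup, decoding file contents to str right after reading them, and checking that every method is actually called, would be the first things to try.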
