Giter VIP home page Giter VIP logo

shihua110's Projects

book icon book

Deep Learning 101 with PaddlePaddle (『飞桨』深度学习框架入门教程)

cluecorpus2020 icon cluecorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

clustering icon clustering

Clustering / Subspace Clustering Algorithms on MATLAB

hac-text-clustering icon hac-text-clustering

It's the HAC algorithm that Im using to sort newspaper articles by news. You can adapt it to pretty much any type of text.

jiagu icon jiagu

Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类

keyword_extraction icon keyword_extraction

利用Python实现中文文本关键词抽取,分别采用TF-IDF、TextRank、Word2Vec词聚类三种方法。

kmeans-emails-clustering-visualization-nlp icon kmeans-emails-clustering-visualization-nlp

KMeans-Emails-Clustering-Visualization-NLP: KMeans is used to cluster the emails. The words in the contents of emails are tokenlized and stemmed. This project transforms the corpus into vector space using tf-idf.By multidimensional scaling, the clustering result is visualized.

kmeans_elbow icon kmeans_elbow

Code for determining optimal number of clusters for K-means algorithm using the 'elbow criterion'

nlp-project-book-insights-with-plotly icon nlp-project-book-insights-with-plotly

Plotly-Dash NLP project. Document similarity measure using Latent Dirichlet Allocation, principal component analysis and finally follow with KMeans clustering. Project is completed with dynamic visual interaction.

optimization-of-kmeans-algorithm icon optimization-of-kmeans-algorithm

K-Means is a clustering algorithm which is used for cluster analysis in data mining; it partitions the data set into k clusters. In this project, K-Means algorithm is optimized using PSO (Parm Swarm Optimization)in terms of time. PSO simulates the social behavior of birds and helps to improve candidate solution iteratively. This project is made in python and has been tested on some standard data sets.

text_analysis_tools icon text_analysis_tools

中文文本分析工具包(包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词)

text_cluster_ensemble icon text_cluster_ensemble

文本聚类集成,使用K-Means获得聚类成员,使用组平均的层次聚类算法对共协矩阵再次划分;数据集从复旦大学中文文本分类语料库中选取

textinfoexp icon textinfoexp

自然语言处理实验(sougou数据集),TF-IDF,文本分类、聚类、词向量、情感识别、关系抽取等

tweet-clustering icon tweet-clustering

Embedding Tweets using Doc2Vec (vectorizer) and clustering tweet vectors using Kmeans

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.