Giter VIP home page Giter VIP logo

stock_price_finbert's Introduction

STOCK_PRICE_FINBERT

The reviews of guba were taken as the initial corpus, and the Finbert model was used to analyze the polarity of Chinese comments and predict the stock rise.

Project Natures

The project is mainly composed of crawlers, Finbert model and hypothesis testing.

The crawler technology uses Xpath and Json technology to crawl 20W reviews on the stock bar forum of Oriental Fortune.com.

The Finbert model supports layer-by-layer thawing and accumulation of gradients. At the same time, random segmentation of the data set is used in each epoch, which not only adds randomness to model training, reduces the risk of overfitting, and can alleviate the problem of small sample size to a certain extent. Finally, the bert model is encapsulated to facilitate future model expansion, and Finbert is given the Sklearn interface.

Hypothesis testing is used to assess the degree of correlation between polarity and stock gains in a certain period of time.

Use of code

Click the link below to download the pre-trained bert model and place it in /models/language_model/. https://drive.google.com/open?id=1AQitrjbvCWc51SYiLN-cJq4e0WiNN4KY

Issue to be solved

The scattered topics of social comments make it difficult to clean a large amount of dirty data, which seriously affects the performance of the model.

stock_price_finbert's People

Contributors

muuyesen avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.