Giter VIP home page Giter VIP logo

projeto_final's Introduction

Breno Arosa's Undergraduate Project

This thesis compares the use of traditional ML algorithms to Convolutional Neural Networks on evaluating sentiment analysis of tweets. Combining to this, the use of semi-supervised techniques enables the development of models without requiring huge labelled datasets. Therefore, the present work is a study case on the use of CNN for tweets' sentiment analysis in english to enable the use on languages without labelled data.

This project is heavily inspired by the work of Yoon Kim Convolutional Neural Networks for Sentence Classification and Alec Go et al. Twitter sentiment classification using distant supervision.

Abstract

Social networks have changed the way people interact and they become more and more present in their lives. The production of digital content, in turn, accompanies this growth. This large amount of data produced hinders the process of extracting information, since such data are mostly unstructured.

Tools like natural language processing are able to aid automating this procedure. Among these techniques, machine learning algorithms have been shown to be efficient text classifiers in tasks such as sentiment analysis. In parallel, it has been observed in recent years the emergence of Deep Learning techniques that have broken performance barriers in the most diverse areas of artificial intelligence. However, the efficiency of these models depends on large training datasets, which have a costly production process since the labelling of this data is done manually.

This work presents the elaboration of a method for generating Deep Learning classifiers for sentiment analysis of social networks messages without the necessity of manually annotated datasets. In this respect, a dataset will be formed with noisy annotation and will be used to train convolutional neural networks. The results obtained by the Deep Learning classifiers will be evaluated in comparison to machine learning algorithms traditionally applied in natural language processing.

Compiled Version

projeto_final.pdf

Compiling from source

The Scons make system is being used.
Compiling from source can be achieved by executing:

cd path_to_projetc/
scons .

projeto_final's People

Contributors

brenoarosa avatar

Watchers

James Cloos avatar  avatar

Forkers

iabreuda

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.