Giter VIP home page Giter VIP logo

sgnlp's People

Contributors

atenzer avatar benedictleedm avatar jonheng avatar k-weiming avatar raymondng76 avatar vincenttzc avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

sgnlp's Issues

download_pretrained for demo API does not cache downloaded files/models

To allow the containers to start up quicker, models and files were downloaded and cached during build time.

Recent changes in the huggingface transformers package has broken this functionality:

  • Released in v4.22.0
  • Issue

Possible choices moving forward:

  • Write a simple caching utility function
  • Stick to versions of transformers before 4.22.0

Change demo api to use gevent worker

  • Using multiple workers of the default type 'sync' in gunicorn is not working on Kubernetes
  • Workers constantly terminated due to signal 9
  • Try gevent to see if it works out

Common api setup

  • Create a function or class that handles common setup between the demo apis

Add LIF model and utilize allennlp framework

  • Original LIF code uses allennlp framework: https://github.com/nusnlp/LIF
  • Earlier attempts of reimplementing the model in pytorch led to worse results
  • For now, we will retain the original code and framework used to train it
  • Add a class with a from_pretrained method such that usage experience will be similar to other models

Add overview of docs

The documentation's initial page does not elaborate much on what exists in the documentation.

Further elaboration on what is available can help users get started.

  • Add elaboration on side tabs (quickstart, advanced tutorial, api reference)

Fix multi-word aspect issue with Sentic-GCN preprocessor

The current implementation of preprocessor matches a single aspect index for the purpose of matching postprocessor output. The aspect index field for process_input payload should be expended to handle aspects with multiple indexes.

Wrong assert comparison for SenticGCN dataclass

Latest SenticGCN implementation for the Dev branch.
In the dataclass.py, post_init method in SenticGCNTrainArgs, there are the following assertions,

assert self.repeats > 1, "Repeats value must be at least 1."
assert self.patience > 1, "Patience value must be at least 1." 

The comparison operator should be >= instead.

LSR improvements

  • Add tests
  • Fix bug that happens on certain inputs
  • Ensure training is stable and replicable
  • Refine documentation

UFD additional tutorial

  • Add additional tutorial on how to use UFD to train and evaluate on custom dataset
  • Bug fix for UFD parse_args_and_load_config util function

Add parent class for preprocessor

  • Create a module named sgnlp.base
  • Add abstractmethods for preprocess, save, load
  • Add batch iteration to parent __call__
  • Parent __call__ should return a dictionary

Rumour Detection Improvements

  • Change demo api output from a fraction to percentage
  • Change module file names to lowercase
  • Remove dependency on the datasets package

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.