Giter VIP home page Giter VIP logo

coderecommendations's People

Contributors

endruk avatar

Stargazers

 avatar

Watchers

 avatar  avatar

coderecommendations's Issues

Split large x into chunks

To also train on large x inputs, I can chunk down the input sequences and train them on multiple iterations of the encoder.

[index] * 80000 -> x * [[index] * z]
where x stands for the number of encoder iterations and z for the length of each chunk.

Ignore batches that are too long

Due to the different time_lengths of all batches it could be that some batches may be too long to fit on a single GPU memory.
Implement a mechanic to prevent using those batches in training.

Multiprocessing Vocab Creation Bug

The vocab creation terminates wit hthis error:

2019-02-06 13:17:46,482 [INFO] sizes of the dataset:
2019-02-06 13:17:46,483 [INFO] training: 197416
2019-02-06 13:17:46,483 [INFO] validation: 29612
2019-02-06 13:17:46,483 [INFO] testing: 19742
2019-02-06 13:17:46,498 [DEBUG] start vocab creation
Traceback (most recent call last):
  File "./run_training.py", line 58, in <module>
    dataset.build_vocab(top_k=top_k, num_processes=num_processes)
  File "/mnt/raid/data/karge/Github/CodeRecommendations/Seq2Seq_Pytorch_test/Data/dataset.py", line 121, in build_vocab
    result = process.get()
  File "/usr/lib/python3.5/multiprocessing/pool.py", line 608, in get
    raise self._value
  File "/usr/lib/python3.5/multiprocessing/pool.py", line 385, in _handle_tasks
    put(task)
  File "/usr/lib/python3.5/multiprocessing/connection.py", line 206, in send
    self._send_bytes(ForkingPickler.dumps(obj))
  File "/usr/lib/python3.5/multiprocessing/connection.py", line 393, in _send_bytes
    header = struct.pack("!i", n)
struct.error: 'i' format requires -2147483648 <= number <= 2147483647

This indicates that the pandas dataframe pass does not work.
Size of the CSV-file: 15GB

This is a SO-thread for this issue:
https://stackoverflow.com/questions/47776486/python-struct-error-i-format-requires-2147483648-number-2147483647

So: I have to re-work multiprocessing vocab creation
For now I stick with single process vocab creation

Build generation workflow

As the title says: build a generation workflow to have a full result.
(to test models and for final usage)

Feature Design

Improve training by using features

  • Add features to dataset
  • Add features to models
  • Test models

Add validation test generation

At the end of a validation iteration there can be a test generation, using one element of the validation set and use it to generate an output - print the result.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.