Giter VIP home page Giter VIP logo

mlis-test's Introduction

Review Assignment Due Date Open in Visual Studio Code

Internship Technical Challenge

Details

Please modify and commit this file to fill in your details below:

Full Name: Tamal Chowdhury Student ID: 20561092 Username: ppxtc2

Task

Your task is to finish implementing the functions provided in interview/custom_slm.py, using the docstrings and unit tests as a specification for what the function does. This class implements a custom language model with a restricted vocabulary, provided in the data/1000.txt file. All tokens generated using this vocabulary list should be able to be generated by this custom class.

Please complete an implementation for all function bodies that are currently filled with "pass".

Do not modify function names or variable names of the class as these will be used for testing. Some functions have already been implemented for you to save time. These implementations do not need to be modified to pass. Inside the data folder, we have provided a file that can be used to construct your model object for testing.

Make sure you can pass the provided unit tests and verify your implementation is correct. Please note that passing the unit tests is not sufficient to guarantee a correct implementation.

Once you are satisfied with your submission, make sure to commit your final version to the main branch in this repository.

Getting Started

To start, you must locally clone this repository to your local machine using Git. Use Git best practices while implementing your solution. Make sure to test your implementation using the data provided and to verify yourself that your code is correct and robust.

Rules

  • DO NOT use any AI assistance tools in your submission, such as ChatGPT or GitHub Copilot. If you have these tools built into your IDE, please disable them for the submission.
  • You are only allowed to use the libraries provided in the requirements.txt file and packages provided in the python standard library (e.g. os and typing etc).
  • Your code must run 'as submitted', with no additional packages or software required.
  • You can use the internet to access package documentation, StackOverflow answers etc, but try to write the code yourself and avoid copying and pasting large chunks of code.

Guidance

  • Follow Git best practices throughout your development.
  • Try not to spend too much time on the task.
  • Interact with your implementation for testing using a notebook like interface. The provided dev.py can be used with VS Code to allow a Jupyter-like workflow, without needing to use a notebook file. DO NOT commit any Jupyter notebook files with this repository.
  • Passing all the unit tests does not guarantee a correct and robust implementation.
  • If your machine is not powerful enough to run the GPT2 medium model, you can try to develop using the small model by changing "openai-community/gpt2-medium" to "openai-community/gpt2" in the __init__ function. The unit tests are unlikely to pass with this model. Make sure to change the model back upon submission. Another option is to use a service like Google Colab which should be able to use the models.

mlis-test's People

Contributors

tomchow01 avatar github-classroom[bot] avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.