Giter VIP home page Giter VIP logo

fakh-nlp-coding-challenge's Introduction

Coding Challenge

This coding challenge is to be done for the position as NLP Engineer for the joint project of the Robert Koch Institute and the World Health Organization Hub on public health intelligence. The deadline is 2024-01-27, 23:59 CET.

Task description

We made two queries to our ElasticSearch database. The queries returned information on potentially relevant articles about public health events from two groups with different surveillance objectives. The files are named data_query_from_9367.json and data_query_from_9578.json. Your task is to build a classifier that can identify whether an article belongs to query 9367 or 9578 using only text data (title, description, or fullText). Do not assume that data is clean.

Additionally, you should demonstrate your skills by analyzing the data in one more way. This additional analysis should be related to natural language processing, however, there is no other restriction.

Evaluation criteria

  • Aim to spend no more than 2-3 hours on the aforementioned tasks.
  • Create a private repository for your work on GitHub and invite aauss to view the repository:
    • Make your work reproducible. We will not contact you if something is unclear.
    • Document your work well but also be concise.
    • Your work should run on a basic laptop.
    • No paid services should be required to run your code.
    • Make a release on GitHub before the deadline. Then send me an invite. You should not make any commits after the deadline.
  • Follow best practices of writing code.
  • Follow best practices in machine learning and natural language processing.
  • Present your work well. This can be done in a notebook or similar.
  • Discuss results, bugs, and confusion. Be critical.
  • Accurate performance on the dataset has the least priority.
  • You will not be able to do all the tasks equally well in this time. Plan your time well.
  • You can use any package or programming language to complete your tasks. There is a preference for Python but you can use any language you feel most comfortable with for this task.

During the interview

We will use the aforementioned evaluation criteria to mark your work. Furthermore, we will ask you specific questions about your code and approach. Lastly, we will try to challenge certain aspects of your work. This may include mistakes or missed opportunities for improvement.

fakh-nlp-coding-challenge's People

Contributors

faisalsouz avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.