
domain-name-classification-with-gcn-kaggle's Introduction

Classification is a widely used task with applications across many disciplines, including bioinformatics, computer vision, and natural language processing.

In this challenge, you are given a subgraph of the Greek web graph where nodes correspond to domain names (~65K domains). A directed edge between two nodes indicates that there exists a hyperlink from at least one page of the source domain to at least one page of the target domain. Furthermore, you are provided with the textual content of webpages crawled from a subset of these domain names (~40K domains).
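As a minimal sketch of how the directed graph described above could yield simple per-domain features, the snippet below counts in-degree and out-degree from an edge list. The edge-list representation, the function name `degree_features`, and the example domain names are assumptions for illustration; the challenge text does not specify the data format.

```python
from collections import defaultdict


def degree_features(edges):
    """Return {domain: (in_degree, out_degree)} for a directed edge list.

    edges: iterable of (source_domain, target_domain) pairs (assumed format).
    Duplicate pairs are collapsed, since an edge only records that at least
    one hyperlink exists between the two domains.
    """
    in_deg = defaultdict(int)
    out_deg = defaultdict(int)
    for src, dst in set(edges):
        out_deg[src] += 1
        in_deg[dst] += 1
    nodes = set(in_deg) | set(out_deg)
    return {n: (in_deg.get(n, 0), out_deg.get(n, 0)) for n in nodes}


# Hypothetical domains, not taken from the dataset.
example_edges = [("a.gr", "b.gr"), ("a.gr", "c.gr"), ("b.gr", "c.gr")]
features = degree_features(example_edges)
```

Degree counts are only one possible graph-theoretical signal; they could be combined with textual features before classification.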

A subset of these domains was manually classified into 10 categories and split into a training and a test set.

Your task is to predict the categories to which the domain names of the test set belong using graph-theoretical, textual, and other information. You are provided with starter code that frames this task as a classification problem.

For each domain name of the test set, your model should predict the category to which this domain name belongs.

The evaluation metric for this competition is the logarithmic loss. This metric is defined as the negative log-likelihood of the true class labels given a probabilistic classifier's predictions.
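The metric described above can be sketched in a few lines of Python. This is an illustrative implementation, not the competition's official scorer: the function name `multiclass_log_loss`, the clipping constant `eps`, and the row renormalization are assumptions that mirror common practice for avoiding `log(0)`.

```python
import math


def multiclass_log_loss(y_true, y_pred, eps=1e-15):
    """Mean negative log-likelihood of the true class labels.

    y_true: list of integer class indices, one per sample.
    y_pred: list of per-sample probability lists, one probability per class.
    eps:    clipping constant (an assumption) that keeps log() finite.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    total = 0.0
    for label, probs in zip(y_true, y_pred):
        # Clip each probability away from 0 and 1, then renormalize the row.
        clipped = [min(max(p, eps), 1.0 - eps) for p in probs]
        norm = sum(clipped)
        total -= math.log(clipped[label] / norm)
    return total / len(y_true)


# A uniform guess over 10 categories scores log(10), roughly 2.30.
uniform_loss = multiclass_log_loss([3], [[0.1] * 10])
```

Lower values are better: a confident correct prediction contributes close to 0, while a confident wrong prediction is penalized heavily.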

Contributors

sotirislegkas
