Giter VIP home page Giter VIP logo

science-of-genius-title-impact's Introduction

Science of Genius - Analysis of Titles and Abstracts

Communication is an essential component of the scientific endeavor, yet the relationship between the textual properties of scientific papers and their reception by the scientific community is relatively unknown. As a component of the Science of Success project, this project will explore whether specific lexical factors are indicative of the attention an article received, as measured by normalized citation indices, by combining analytical tools from natural language processing, data science, and the science of science. The initial stages of the study will focus on the temporal and disciplinary variance in the length and syntactic features of article titles in the Web of Science, a massive dataset containing over 50 million articles published since 1900. The project also aims to develop a quantitative model, which can estimate the impact a scientific article could create in the community.

This quantitative model of article presentation will be useful for maximizing the impact of future articles and thereby accelerate scientific growth.

Environment


  • Installing dependecies for the project

    • sudo apt-get install python3-tables
    • sudo pip3 install seaborn pandas networkx

Building Data For Analysis


  • These notebooks shows data preparation steps for the analysis.

Exploratory Data Analysis


  • These notebooks explore the temporal distribution of structures in titles of Applied Physics articles conditioned on Journals in which they appear.

Models - Title


  • These models show how linear/weighted linear regression models have been used to predict log c5 (citation counts five years from the year of publication).

Novelty


  • These notebooks show how new (interesting) words come up in literature (titles) and how they decay through time.

Semantics EDA


  • These notebooks show how variations in parts of speech of titles have changed over the years for different disciplines.

Propagation of Words


  • These notebooks show different methods of selecting concepts from titles of publications and how growth and decay of concepts happen for different disciplines.

Word Usage Fluctuations


  • These notebooks show how much fluctuations occur in word usage. We also try to characterize these fluctuations to known distributions.

science-of-genius-title-impact's People

Contributors

srjit avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.