HArXiv

A highlighting tool for reading arXiv papers.

How to Install

  1. Clone this repo.

git clone https://github.com/Erutan-pku/harxiv.git

  2. Install Scrapy.

See https://doc.scrapy.org/en/latest/intro/install.html

How to Run

  1. Download the arXiv paper information and save it to arxiv_20210529.json.

time scrapy crawl arxiv -o arxiv_20210529.json

  2. Use arxiv_20210529.json to generate a web page with highlights.

python3 postdeal.py arxiv_20210529.json arxiv_20210529.html
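
The two steps above share one date-stamped file name, so it can be convenient to derive both commands from a single date. The following is a minimal sketch, not part of the repo: the helper name `build_commands` is hypothetical, and it assumes the `arxiv_YYYYMMDD` naming pattern used in the examples above. It only prints the commands and does not run them.

```python
import datetime
import shlex


def build_commands(day):
    """Return the crawl and render commands for one date (hypothetical helper)."""
    stamp = day.strftime("%Y%m%d")          # e.g. "20210529"
    json_path = f"arxiv_{stamp}.json"       # crawler output
    html_path = f"arxiv_{stamp}.html"       # highlighted page
    crawl = ["scrapy", "crawl", "arxiv", "-o", json_path]
    render = ["python3", "postdeal.py", json_path, html_path]
    return crawl, render


if __name__ == "__main__":
    # Print the commands for today so they can be copied into a shell.
    for cmd in build_commands(datetime.date.today()):
        print(shlex.join(cmd))
```

For `datetime.date(2021, 5, 29)` this yields the same two commands shown in the steps above, without the `time` prefix.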

Configuration

See config.json. You can change the subject to crawl (spider: domain), ignore certain subjects or conference names that appear in the description (ignore_subj and ignore_desp), highlight phrases in titles (title_highlight), and highlight authors (author_highlight).

{
  "spider": {
    "domain": "cs.CL",
    "top_n": 9999
  },
  "ignore_subj": ["Sound (cs.SD)", "Audio and Speech Processing (eess.AS)"],
  "ignore_desp": ["INTERSPEECH"],
  "title_highlight": [
    ["lower_weight", ["Summarization", "Machine Translation"]],
    ["blue", ["Named Entity Recognition"]],
    ["red", ["Question Answering", "Question Generation", "Reading Comprehension"]]
  ],
  "author_highlight": [
    ["red", ["Zhiyuan Liu", "Maosong Sun"]]
  ]
}
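
To illustrate how these settings might be applied, here is a rough sketch in Python. It is not the logic of postdeal.py, which this README does not describe. Everything beyond the config keys is an assumption: the function names `is_ignored` and `title_style`, the paper fields `subjects` and `description`, case-insensitive phrase matching, and the rule that the first matching entry wins.

```python
import json

# A trimmed copy of the example configuration above.
CONFIG_TEXT = """
{
  "ignore_subj": ["Sound (cs.SD)", "Audio and Speech Processing (eess.AS)"],
  "ignore_desp": ["INTERSPEECH"],
  "title_highlight": [
    ["lower_weight", ["Summarization", "Machine Translation"]],
    ["blue", ["Named Entity Recognition"]],
    ["red", ["Question Answering", "Question Generation", "Reading Comprehension"]]
  ]
}
"""

config = json.loads(CONFIG_TEXT)


def is_ignored(paper, config):
    """Assumed rule: skip a paper if any subject or description keyword is listed.

    `paper` is a hypothetical dict with "subjects" (list) and "description" (str).
    """
    if any(subj in config["ignore_subj"] for subj in paper.get("subjects", [])):
        return True
    description = paper.get("description", "")
    return any(word in description for word in config["ignore_desp"])


def title_style(title, config):
    """Assumed rule: return the style of the first entry with a matching phrase."""
    lowered = title.lower()
    for style, phrases in config["title_highlight"]:
        if any(phrase.lower() in lowered for phrase in phrases):
            return style
    return None  # no highlight applies


paper = {
    "title": "A Survey of Question Answering",
    "subjects": ["Computation and Language (cs.CL)"],
    "description": "Accepted at ACL 2021",
}
if not is_ignored(paper, config):
    print(title_style(paper["title"], config))
```

Under these assumptions, the order of entries in title_highlight matters when a title contains phrases from more than one group.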
