datetime is not imported before use

OOUTREACHY PROJECT

See OutreachyProposal for background.

NOTE: This is still a work in progress. If you see a bug or something wrong please do let me know. Thanks

This repository is a collection of python modules written as work for Outreachy internship with Wikimedia Foundation with guidance of Mike Peel, the mentor.

Development (is) being done with Python 3.8.0 and master branch of Pywikibot package. Some modules may requires additional libraries, where such is the case, is noted in the brief module note below.

common.py
- This is a meta module that contains the base logic and generic functions that all the other modules can use to avoid code duplication. It facilatates converting value to approrriate data type for wikibase needs as well as pushing the collected data to the data repository (Wikidata)
official_website.py
- This module extracts official website links from Wikipedia article and add them to corresponding data item of the page on the repo. This module uses BeautifulSoup library (4.9.3) apart from the standard requirments. It does not validate that the url is actually working, but it does ensure that it is valid URL in structure.
twitter_username.py
- This module primarily extracts Twitter usernames of subjects from Wikipedia page, or set of pages, and then use the username to extract its corresponding numeric id from Twitter. The username is then exported to Wikidata as Twitter username claim, and the numeric identifier as Numeric id qualifier. This module requires Twitter developer API key to work fully correctly.
mb_release_group_data.py
- This module currently extracts and processes MusicBrainz release group identifier. It can also work on sing page or set of pages. The value extract is fully validated by default it will loop and process pages of the relevant Wikipedia category
lepindex-id.py
- This module extracts LepIndex (an dentifier for a Lepidoptera taxon in the UK Natural History Museum's 'Global Lepidoptera Names Index') from Wikipedia articles and stores them in the data repository. It can work with arbitrary page or set of pages (categorized) such as the set automatically generated by this wikipedia category.
book_data.py
- This modules can be used to extract and export multiple value statements from wikipedia articles about books to Wikidata. Presently it can process a single page or list of pages and primarily extract either one, two or all of these: OCLC number, ISBN number (both 10 and 13) as well as Number of pages. There's a basic validation for each value extracted to reduce chance of invalid values.
power_stations.py
- This modules extracts data from articles about Power stations on Wikipedia.
find_a_grave-id.py
- This modules works with Find a Grave dentifier. The relevant value is also extracted from Wikipedia and basic validation is applied. It is then exported to the corresponfing item of the wiki page as a Find A Grave memorial ID claim statement. The script, by default, loops through this relevant category on English Wikipedia
theatre-venue-data.py
- This modules extracts data from Wikipedia articles about stadia, arenas, other sporting venues, as well as theatres and cinemas.
world_football_dot_net.py
nft_data.py
alumni_data.py
game_data.py
Next
Next
Next

LICENSE

The code in this responsitory is made available under the MIT LICENSE.

Navigate interactively at sourcegraph

ammarpad / outreachyproject Goto Github PK

outreachyproject's Introduction

OOUTREACHY PROJECT

See OutreachyProposal for background.

LICENSE

outreachyproject's People

Contributors

Stargazers

Watchers

Forkers

outreachyproject's Issues

datetime is not imported before use

requirements.txt missing

License missing

Summary is empty.

how to set up pywikibot to actually upload the changes?

add a good summary when uploading

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent