Giter VIP home page Giter VIP logo

argus's People

Contributors

didriklundberg avatar qwoper avatar ran4 avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

argus's Issues

Handle environments for dates of death and birth

I have written methods of handling some common environments for date of birth (and age). The below should all parse perfectly well:
{{birth date|1993|2|4|df=yes}}
{{dob|1993|2|4|df=yes}}
{{birth date and age|1993|2|4|df=yes}}
{{bda|1993|2|4|df=yes}}

Yet there remains more obscure birth date environments, and death date environments as can be seen here: http://en.wikipedia.org/wiki/Template:Birth,_death_and_age_templates

These need to be taken care of accordingly.

Fetch more statistics from the database.

We need to be able to fetch more interesting statistics from the database.

Examples of cool stuff: Things that can be fitted to graphs and tested against hypotheses.

Revert to using mixed upper- and lowercase characters in attribute values

We've done initial testing on parsing attribute values and achieved a rugged functionality. Sooner or later, we are going to have to revert to using mixed upper- and lower-case characters in order to be able to display the information neatly. This will have to be fixed before the attribute values are sent to be parsed.

We are still going to want attribute keys to be lower-case only, though.

Categories aren't handled

Consider if we should read category information from Wikipedia markup language in xmlwikiparser2.py.

Clean up git repository

This stuff should be presentable. Currently, the project looks like a mess. We could start by putting things in separate subdirectories - readme files and similar in the root directory, code in one folder, input in one folder, output in one (or the two latter in the same), and so on.

Remove pictures from attribute values

We don't want pictures in our attribute values. We can identify pictures by having certain file extensions, but we must remove them in a smart way so that we might eventually still salvage the rest of the attribute value, if there is more than just an image.

Examples of what we want to remove:
[[File:Andrei Tarkovsky.jpg|240px]]
Jacques-Louis David - The Emperor Napoleon in His Study at the Tuileries - Google Art Project.jpg

Handle several lists in one attribute value

One attribute value might hold several lists. Empirically, this has been shown to occur with a significant frequency. We are going to need to handle this either by splitting up these values before we send them to be parsed, or at parse time.

Find out obscure shorthand and alternate names for list environments

The Wikipedia list environment "plainlist" can apparently also be invoked as "plain list", and so on. We need to find thorough documentation on this matter - apparently the template articles on Wikipedia are insufficient.

Alternatively, we can find these by trial and error.

Handle more environments!

The "convert" environment

Example: {{convert|550|ft|m|0}}

The convert environment displays and converts values in different metric and imperial units. We need to parse all cases which might appear in biographical infoboxes.

"oldstyledate" environment

Displays date both in old and new style.

"army" environment

Displays flag and name of army.

"post-nominals" environment

Displays orders from British Empire.

"linktext" environment

Displays list of links.

"air force" environment

Displays flag and name of air force.

"flag" environment

Displays flag and country name.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.