dwcvocabs's People

Contributors

marcelooyaneder, tucotuco

dwcvocabs's Issues

Merge TILMAP Vocabs

In Vocabularies folder in Dropbox.

Also, please check that you have incorporated the UMZM vocabs still in that folder. I believe you did, but if not, merge those, too.

Then delete 'em.

Merge 20 sets of vocabularies

All in Dropbox>Vocabularies.

These include past vocabs that did not merge completely as well as several new sets from recent datasets.

Remove standard field from Geography

We are not using this. If one wanted to know if it is standard or not, one could check that the verbatim fields are the same as the standardized fields.
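The check described above can be sketched as follows. This is only an illustration: the column pairs in `FIELD_PAIRS` are assumed names, not the actual headers of the Geography vocabulary file.

```python
# Hypothetical (verbatim, standardized) column pairs; the real file may differ.
FIELD_PAIRS = [
    ("verbatimCountry", "country"),
    ("verbatimStateProvince", "stateProvince"),
]

def is_standard(row, pairs=FIELD_PAIRS):
    """Return True when every verbatim value equals its standardized value."""
    return all(row.get(verbatim, "") == row.get(standard, "")
               for verbatim, standard in pairs)

already_standard = {
    "verbatimCountry": "Chile", "country": "Chile",
    "verbatimStateProvince": "Maule", "stateProvince": "Maule",
}
needs_mapping = {
    "verbatimCountry": "chile", "country": "Chile",
    "verbatimStateProvince": "Maule", "stateProvince": "Maule",
}
```

Because the answer can be derived from the row itself, a stored "standard" column would carry no extra information.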

Clean up continent for countries having more than one
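One way to find the affected rows is sketched below. It assumes the vocabulary rows can be read as dictionaries with `country` and `continent` keys; both names are assumptions about the table layout.

```python
from collections import defaultdict

def countries_with_multiple_continents(rows):
    """Map each country to its continents, keeping only countries with more than one."""
    continents = defaultdict(set)
    for row in rows:
        country = row.get("country", "").strip()
        continent = row.get("continent", "").strip()
        if country and continent:
            continents[country].add(continent)
    return {c: sorted(v) for c, v in continents.items() if len(v) > 1}

# Illustrative rows only, not taken from the real vocabulary.
sample_rows = [
    {"country": "Russia", "continent": "Europe"},
    {"country": "Russia", "continent": "Asia"},
    {"country": "Chile", "continent": "South America"},
]
```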

Revise Oaxaca for Districts

Oaxaca has 570 municipalities divided in districts. Revise Oaxaca geography to include districts at the county level and municipalities at the municipality level.

Spanish CSV translation

Hi, I'm a native Spanish speaker, and I can add a CSV file of the DwC terms based on your DarwinCloud.csv file, or edit it as you wish.
While searching I found that a lot of terms were missing, so I'll gladly help. Greetings from Chile.

Merge Vocabs - 3 sets

In Dropbox>Vocabularies:

ALMNH PaleoBotany
APSU MemphisHerps
SIO MarineVerts

The SIO data is a priority for me so that I can rerun this large data set against the newest vocabs.

Complete CountryStateCounties table

This table is used to automatically fill in county-level information from the combination of country, stateProvince, and verbatimCounty for combinations that have been seen and checked before. As of today 1266 combinations outside of North America need to be checked.
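The automatic fill described above amounts to a keyed lookup. A minimal sketch, assuming the checked combinations are loaded into a dictionary keyed by (country, stateProvince, verbatimCounty); the `CHECKED_COMBINATIONS` entry is made up for illustration.

```python
# Hypothetical checked combination: (country, stateProvince, verbatimCounty) -> county
CHECKED_COMBINATIONS = {
    ("United States", "California", "Alameda Co."): "Alameda",
}

def fill_county(record, lookup=CHECKED_COMBINATIONS):
    """Return (record, matched); fill 'county' only for combinations checked before."""
    key = (
        record.get("country", ""),
        record.get("stateProvince", ""),
        record.get("verbatimCounty", ""),
    )
    if key not in lookup:
        # Unseen combination: leave the record untouched so it can be reviewed.
        return record, False
    filled = dict(record)
    filled["county"] = lookup[key]
    return filled, True
```

Records that come back unmatched are the ones that still need to be checked by hand.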

Verbatim islands and islandGroups to island and islandGroup

For records that have verbatimIsland or verbatimIslandGroup populated, transfer these values to island and islandGroup as appropriate, removing the word "Island" from the island field except where "Island" is an essential part of the name (e.g., "Long Island").
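A rough sketch of the island rule follows. The exception set `KEEP_ISLAND` is hypothetical and would have to be curated by hand; the sketch only handles a trailing "Island" and ignores forms such as "Island of ...".

```python
import re

# Hypothetical list of names where "Island" is an essential part of the name.
KEEP_ISLAND = {"Long Island"}

def standardize_island(verbatim):
    """Drop a trailing 'Island' unless the full name is a listed exception."""
    name = " ".join(verbatim.split())  # collapse stray whitespace
    if name in KEEP_ISLAND:
        return name
    return re.sub(r"\s+Island$", "", name)
```

For example, `standardize_island("Santa Cruz Island")` gives `"Santa Cruz"`, while `"Long Island"` is returned unchanged.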

Redo Russian Geography

Geography for Russia is inconsistent in terms of the forms of names chosen. At times they are English interpretations, at times, transliterations of the Russian.

sex.csv missing "Female" with first letter upper-case

Female is not present in the mapping so I believe it is getting passed through as-is ("Female" gets published rather than "female").

Sample:

http://portal.vertnet.org/o/ku/kui?id=21259

Download:

Your VertNet download file is now available for a limited time at
https://storage.googleapis.com/vn-downloads2/vertnet_female_specimens-4ca385f9dd4c425d92ed7cf9151bd394.tsv.

Query: sex:female
Matching records: 61000
Request submitted: 2016-01-14T18:49:00.376040
Request fulfilled: 2016-01-14T19:04:16.854560
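The pass-through behavior suspected in this issue can be illustrated with a small sketch. It assumes the mapping is applied as an exact, case-sensitive lookup; that is a guess about the pipeline, not its confirmed behavior, and the mapping contents here are made up.

```python
def standardize(value, mapping):
    """Exact-match lookup; values without an entry pass through unchanged."""
    return mapping.get(value, value)

def missing_case_variants(mapping):
    """List lower, upper, and capitalized variants of mapped keys that have no entry."""
    missing = set()
    for key in mapping:
        for variant in (key.lower(), key.upper(), key.capitalize()):
            if variant not in mapping:
                missing.add(variant)
    return sorted(missing)

# Hypothetical excerpt of a sex vocabulary: verbatim value -> standard value
sex_mapping = {"female": "female", "F": "female"}
```

With this mapping, `standardize("Female", sex_mapping)` returns `"Female"` as-is, and `missing_case_variants(sex_mapping)` reports `"Female"` among the variants to add.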

Determine and document rules for waterBody

To help clarify instances where multiple terms are possible, among other issues.

Example 1: North Atlantic Ocean vs Northern Atlantic Ocean vs Atlantic Ocean.

Example 2: The order in which tributaries are listed (alpha order or order of progression).

Happy to help.

Redo Kenyan Geography

Geography uses former provinces rather than current counties as first-level subdivisions.

Waterbody content when waterbody is unnamed

Question:

How do we complete the dwc:waterBody field if the waterbody is unnamed and the verbatim is "unnamed creek" or "unnamed tributary"? Do we treat this the same as if the verbatim contained "unknown"?

Currently, I am leaving "unnamed creek" and treating it in the same way I would treat a proper place name, in part because it implies the recorder might be aware that the waterbody is, in fact, unnamed and NOT unknown.

Implement no-multiplicity rule in geography

There are still geography vocabs that give a list of named places separated by commas, such as "Wyoming, Montana". These should be omitted following the rule that the values must be standard values for a single administrative region.
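A simple screen for such values might look like the sketch below. It flags any standardized value containing more than one comma-separated part, so a legitimate single name that contains a comma would also be flagged and would need manual review.

```python
def has_multiple_places(value):
    """Return True when a value lists more than one comma-separated place."""
    parts = [part.strip() for part in value.split(",")]
    return len([part for part in parts if part]) > 1

def find_multiplicity(values):
    """Return the values that violate the single-region rule."""
    return [value for value in values if has_multiple_places(value)]
```

For example, `find_multiplicity(["Wyoming, Montana", "Wyoming"])` returns `["Wyoming, Montana"]`.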

Merge UMZM Verts

Zip in Dropbox>Vocabularies.

Contains Mamm, Aves, at least one parasite, and maybe a fish or two.

Add geography vocabulary from MaNIS, HerpNET, ORNIS

The Gazetteer of aggregated georeferences has a large set of distinct higher geography. In anticipation of a locality service that includes these data and can resolve the higher geography to standardized values, add any new geography combinations to the LookupGeography table and resolve Canada, USA, and Mexico.

Geography in OpenRefine

Run LookupGeography through OpenRefine to see what can be cleaned up quickly. This worked well with LookupClassification.

Review "incorrectable"

Formalize the criteria along with the patterns for associated error values. Incorrectable should mean that there is an irresolvable inconsistency.
Is it worth adding a flag for "unable to locate"? Right now that information is in the error field only.
Should there be a flag for "there is more than one"? Right now that information is in the error field only.
Should there be a flag for "no current equivalent administrative region"? Right now that might only be in the notHigherGeography field along with things that simply are not administrative regions.
Add flag for "verified"? This could act as a double-check, meaning that everything is complete and accurate according to "opinionBy" as of the opinionDate.

Add separate Classification table for Plantae, reserve existing for Animalia

Homonyms have become a prevalent issue for SIB Colombia using the vocabs, so the proposed solution is to separate the Classifications for plants and animals to avoid the majority of these issues.
Solution is admittedly a band-aid, and use of a taxonomic lookup service is the better eventual solution to add outside the migrator.
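The effect of splitting the table can be sketched as follows, using Ficus (a plant genus in Moraceae and also a gastropod genus in Ficidae) as an example homonym. The table and function names are hypothetical, and the tables hold only this one illustrative entry.

```python
# Hypothetical per-kingdom lookup tables: genus -> higher classification
PLANT_CLASSIFICATION = {"Ficus": {"kingdom": "Plantae", "family": "Moraceae"}}
ANIMAL_CLASSIFICATION = {"Ficus": {"kingdom": "Animalia", "family": "Ficidae"}}

TABLES = {
    "Plantae": PLANT_CLASSIFICATION,
    "Animalia": ANIMAL_CLASSIFICATION,
}

def lookup_classification(kingdom, genus):
    """Look up a genus in the table for its kingdom; None if either is unknown."""
    table = TABLES.get(kingdom)
    if table is None:
        return None
    return table.get(genus)
```

With a single shared table, the two Ficus entries would collide on the same key; with one table per kingdom, each lookup resolves within its own kingdom. Homonyms within a kingdom remain unresolved, which is why this is only a partial fix.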

Redo Ireland Geography

Remove level with Munster, Leinster, etc. and use modern counties as first-level subdivisions, matching GADM.
