Giter VIP home page Giter VIP logo

dwca_hunter's People

Contributors

dependabot[bot] avatar dimus avatar gaurav avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

dwca_hunter's Issues

(itis) Database artifacts should be ignored

Database artifacts (any taxon name with the word 'artifact' in the unacceptability_reason field) are errors in ITIS, and I think they should be ignored when creating the DwC-A file.

suggest to index Plazi's treatment bank

hi @dimus et al.

Are you still maintaining globalnames.org ?

If so, I was hoping you can consider the following:

Plazi https://plazi.org keeps an extensive list of taxonomic literature and associated taxonomic names.

Plazi exports these taxonomic literature <> name links as DwC-A and register them with GBIF.

You can find their publications at https://www.gbif.org/occurrence/search?dataset_key=6384b520-7e9f-4874-a414-76c2e9b01d74&type_status=TYPE .

As a user (GloBI), I would like to be able use the Global Names resolvers to find taxonomic treatments in Plazi.

The taxonomic treatments can be located by linking the TaxonId fields that are available in the Plazi publications.

for example, when I lookup: Rhinolophus denti , I expect to find a Plazi name with id 885887A2FFC88A21F8B1FA48FB92DD65.taxon (also see https://www.gbif.org/occurrence/2597533915). This identifier can then be translated into a link to the related taxonomic treatment via http://treatment.plazi.org/id/885887A2FFC88A21F8B1FA48FB92DD65 .

See also attached screenshots.

Screenshot from 2020-09-07 14-18-39
Screenshot from 2020-09-07 14-18-23

fyi @myrmoteras

As a User I want to see the ASM Mammalian Diversity Database appear as a data source

As a User I want to see the ASM Mammalian Diversity Database appear as a data source in https://resolver.globalnames.org and related services .

It appears that the https://mammaldiversity.org resource is being used among mammal researchers.

I was able to download attached dump mammal.json.gz using curl 'https://mammaldiversity.org/species-account/api.php?q=*' -H 'User-Agent:' | gzip > mammal.json.gz .

Note that the User-Agent was somehow needed.

See globalbioticinteractions/globalbioticinteractions#446 .

FYI @tigerhawkvok @n8upham

mammal.json.gz

taxon ids of unknown provenance in Mammal Species of the World v3

When inspecting the json results for matches of Enhydra lutris against Mammal Species of the World (data source id 174), taxon_id 28576 and internal_id 14001090 is found (see below). However, it appears that the internal_id 14001090 is the identifier that the Mammal Species of the World exposes to link to their taxon pages (e.g., https://www.departments.bucknell.edu/biology/resources/msw3/browse.asp?id=14001090). For some reason, the taxon_id 28576 cannot be found in any of the Mammal Species of the World data products.

This suggests that the (current) internal_id are suitable to be used as taxon ids (incl. in path ids hierarchy).

   
data_source_id 174
data_source_title "The Mammal Species of The World"
gni_uuid "3096feea-1216-5f59-ab70-fcff3492cef6"
name_string "Enhydra lutris Linnaeus 1758"
canonical_form "Enhydra lutris"
classification_path "Mammalia|Carnivora|Caniformia|Mustelidae|Lutrinae|Enhydra|Enhydra lutris"
classification_path_ranks "class|order|suborder|family|subfamily|genus|species"
classification_path_ids "1|25367|27246|28538|28539|28575|28576"
taxon_id "28576"
local_id "14001090"
edit_distance 0
imported_at "2018-08-04T20:50:18Z"
match_type 2
match_value "Exact match by canonical form"
prescore "3|0|0"
score 0.988
29  

(resource_itis.rb) get_ranks should use both rank_id and kingdom_id

At the moment, get_ranks uses the rank_id to store rank names. However, ITIS uses duplicate rank_ids to identify differently named ranks between kingdoms (e.g. "phylum" has kingdom_id=1, rank_id=30, while "division" has kingdom_id=3, rank_id=30). Using only rank_ids therefore forces everybody onto the ranks defined for kingdom Chromista (kingdom_id=6).

This should be a pretty easy fix: changing every piece of code that uses @Ranks to use both rank_id and kingdom_id when looking up the appropriate term. This should be pretty easy, as @Ranks is only used on three lines.

Update Index Fungorum

Paul Kirk sent a new dump of Index Fungorum, it has to be converted to DWCA and imported.

Adding The Parasite Tracker TCN Taxonomy to Global Names

@dimus
The Arctos community would like to explore the possibility of adding ectoparasite taxonomy in use by institutions participating in the Parasite Tracker TCN to Global Names. The TPT is working to assemble taxonomy reference files for major groups of ectoparasites, with the names and classifications in csv format. Please let me know what steps would be involved with integrating these with Global Names.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.