globalnamesarchitecture / gnlist-resolver-gui

This tool allows cross-mapping scientific name checklists to a variety of biodiversity databases

License: MIT License

Ruby 17.21% Elm 39.95% Shell 0.65% JavaScript 3.08% HTML 37.80% CSS 0.75% Dockerfile 0.57%

gnlist-resolver-gui's Issues

As a User I want to have a scalable service

Currently we cannot run more than one pod, because the non-commercial version of ingress can send a request to any pod in the system. We need to organize a mount that can be shared by all pods. This will also allow us to do rolling restarts. I will also double-check that minikube works correctly.

As a User I want to know which date of Catalogue of Life or other repositories I am using

Use this data in the name of the file:

name of the original file
name of the data source
date for the data source release
date when the match was done

As a separate ticket: add a setting that allows configuring the file name.
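One way the four pieces listed above could be combined into a file name is sketched below. The function name `result_file_name`, the separator, the `matched-` prefix, and the example dates are all hypothetical; the ticket does not prescribe a format.

```python
from datetime import date
from pathlib import Path


def result_file_name(original_file, data_source, release_date, match_date):
    """Build an output file name from the four items listed in the ticket.

    The layout (underscore-separated, ISO dates) is an assumption made for
    illustration only.
    """
    stem = Path(original_file).stem  # original name without extension
    source = data_source.strip().lower().replace(" ", "-")
    return (
        f"{stem}_{source}_{release_date.isoformat()}"
        f"_matched-{match_date.isoformat()}.csv"
    )


# Hypothetical example values
print(result_file_name(
    "my_checklist.csv", "Catalogue of Life", date(2017, 3, 1), date(2017, 6, 15)
))
```

Keeping the format in a single function would also make it straightforward to swap in a user-configured template later.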

As a User I want to easily know if there is only one found match, or several

Add a field that will tell how many rows correspond to this match

As a user I want a more consistent ETA indicator

As a System Administrator I would like to be able to use list-resolver by installing a static HTML page

We need to figure out the feasibility of this approach. If people install the project as a web page, they would need to download Elm's JS file from a remote link, and then some combination of CORS/proxy would do the trick of connecting to the server.

~~This ticket is about an investigation phase. If we find the approach feasible we will set our system to support "application as a static web page" approach.~~

As a User I want to have classification field in the output

As a User, eventually, I want a clean GUI using Material Design approach

Placeholder for MDL framework, with initial implementation

As a User eventually I want to understand why my requests fail during file upload

Error handling (top level) placeholder

As a Developer I do not want to have old tickets conflicting with new ones

We moved the prototype code from gn_crossmap_web with its whole history. As a result, when we create a ticket in the old repository, the numbers are duplicated with the old code. We need to flatten the old history.

As a user I want to have online documentation that explains what I should do at each step

As a User I want to see outlinks to used header terms

Ingestion progress bar does not show how much time was spent after completion

Changes from

As a Developer I want to refactor Crossmap-based names to ListResolver-based names

This is a split from issue #7, as this work needs to be done before switching to the gnindex-api resolver. We will keep the branch for #7 separate from master for quite a while, and it is good to keep it as similar as possible to master, to make upstream rebases simpler.

Problems with inputRank N/A

Infraspecies without authors (Natrix natrix cypriaca) gives n/a as inputRank.

Orobanche densiflora Salzmann ex Reuter in DC. gives inputRank n/a. Why?

Output of a small file seems to be a binary file rather than CSV?

see input and result files attached

Speed of processing big files needs to be improved

Capitalized infraspecies are not handled

An infraspecific epithet written with a capital letter produces a wrong canonical match. E.g.
Iris humilis Georgi subsp. Arenaria (Waldst. et Kit.) A.et D.Löve
Astragalus macrocarpus DC. subsp. Lefkarensis
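As an illustration of one possible pre-processing workaround (not the parser's actual behaviour), the sketch below lowercases the first letter of the word that follows a rank marker. The function name and the set of rank markers handled (`subsp.`, `ssp.`, `var.`) are assumptions; a real fix would belong in the name parser itself.

```python
import re

# Rank marker, whitespace, then a capitalized first letter of the epithet.
# Only a few common markers are covered; this is an assumption, not a full list.
RANK_THEN_CAPITAL = re.compile(r"\b(subsp\.|ssp\.|var\.)(\s+)([A-Z])")


def lowercase_infraspecific_epithet(name):
    """Lowercase the first letter of the epithet that follows a rank marker."""
    return RANK_THEN_CAPITAL.sub(
        lambda m: m.group(1) + m.group(2) + m.group(3).lower(), name
    )


print(lowercase_infraspecific_epithet("Astragalus macrocarpus DC. subsp. Lefkarensis"))
print(lowercase_infraspecific_epithet(
    "Iris humilis Georgi subsp. Arenaria (Waldst. et Kit.) A.et D.Löve"
))
```

This heuristic would mis-handle a rank marker followed directly by a capitalized author name, so it is only a stopgap.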

As a developer I want to have a test framework for Elm

As a User I want to have the ability to use languages other than English

Placeholder for i18n

As a User I want ingestion part of the workflow to go faster when I use GraphQL based resolver

The ingestion step depends on the biodiversity gem. It makes sense to migrate to the gnresolver project, as it is significantly faster and has more features. For this we need to move the gn_list_resolver gem to JRuby.

Some UTF-8 characters are not depicted correctly

Output is in UTF-8. Why do some accepted species names have special characters depicted correctly, while others don't? E.g. & versus &amp; etc.

As a User I do not want to go to resolution step if my data are not suited for it.

If my data do not have enough information for the resolution process, I want to see an error message that explains what needs to be done for this step to succeed. I should not be able to click the "Continue" button in this case.

Investigate flexibility of Material Design for different looks for different organizations

As a User I want faster, better quality results of list resolution

Migrate to GraphQL API of gnindex

As a Catalogue of Life User I want to have the output in CoL's Darwin Core archive format

As a Developer I prefer to fork gn_crossmap_web code rather than start from an empty repo

Clone gn_crossmap_web and start refactoring

As a User I want an option of splitting matched name into name and authorship fields

As a user I want to have a choice between 'simple' and 'advanced' results

Simple results have 3 categories:

match
fuzzy
no match

Advanced categories map onto the simple ones as follows:
exact + canonical = match
fuzzy = fuzzy
partial + partial fuzzy + genus + unmatch + error = no match
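The grouping described in this ticket amounts to a lookup table. A minimal sketch follows; the category strings are taken from the ticket, while the names `SIMPLE_CATEGORY` and `simplify` are hypothetical.

```python
# Advanced match category -> simple category, as listed in the ticket.
SIMPLE_CATEGORY = {
    "exact": "match",
    "canonical": "match",
    "fuzzy": "fuzzy",
    "partial": "no match",
    "partial fuzzy": "no match",
    "genus": "no match",
    "unmatch": "no match",
    "error": "no match",
}


def simplify(advanced_category):
    """Collapse an advanced category into one of the three simple ones."""
    return SIMPLE_CATEGORY[advanced_category]


print(simplify("canonical"))      # match
print(simplify("partial fuzzy"))  # no match
```

With a table like this, the 'simple' view can be derived from the 'advanced' results at display time, so only one set of results needs to be stored.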

As a User I want to see highlights where fuzzy match names differ

Docker image is bloated because of all the junk development files, and it does not create main.js

This ticket grew from a bug fix into an improvement, so I'll give it an estimate

Single column CSV does not show correctly

Fuzzy matching does not work for Adenophora lilifolia

I would expect Adenophora lilifolia to get some score, because Adenophora liliifolia, with only one additional character, is an accepted name. http://webservice.catalogueoflife.org/col/webservice?name=Adenophora+liliifolia
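To make the "one additional character" observation concrete, here is a plain Levenshtein edit-distance sketch. It is a generic textbook implementation, not the algorithm the resolver uses, and it says nothing about why the resolver returned no fuzzy match.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute or keep
            ))
        previous = current
    return previous[-1]


print(edit_distance("Adenophora lilifolia", "Adenophora liliifolia"))  # 1
```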

As a User I want a slick and clear Material Design interface

We do have a Material Design framework, but it is not polished yet, so we need to enhance it.

Show how the Catalogue of Life logo will look on the page

As a User I want to avoid mistakes when I enter terms for the header

We have to normalize headers to the terms that the program understands. To do that we need to create a workflow that prevents entering wrong combinations of data.

As a user I want to be able to save the resolution page and see how it progressed on another computer

Currently the Resolution "page" has to stay open from the start to the end of the process. Users should be able to leave the page and return to it later while the resolution continues in the background.

As a User I want the ability to move back and forth in the process in case I need to change some of my resolution settings

As a Catalogue of Life user I want a return of different ID

The tax_id field that the resolver imports from CoL and outputs in the results is useless: it is just a database index id that changes with every edition and is not used in other CoL services. Unfortunately CoL is in a transition phase from offering LSIDs to 'natural keys' after the AC2017 edition. These are, however, already in DwC, in the references field. I would suggest extracting them from the references field. Example: from http://www.catalogueoflife.org/annual-checklist/details/species/id/1d761fa6e15f9ba277ad7784af78c8b4/synonym/5fc5c8ab89caeede5ede03be346369a7 the identifier can be extracted for both the synonym and its current accepted name. It may also be helpful to give the whole URL in the output.
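A rough sketch of the suggested extraction, based only on the URL shape in the example above. The function name and the returned keys are hypothetical, and the reading that the `/species/id/` segment is the accepted name while `/synonym/` is the synonym is an assumption inferred from the ticket text.

```python
import re

# Matches .../species/id/<hex>[/synonym/<hex>] as seen in the example URL.
COL_IDS = re.compile(r"/species/id/([0-9a-f]+)(?:/synonym/([0-9a-f]+))?")


def extract_col_ids(references_url):
    """Return the identifiers found in a CoL 'references' URL, or None."""
    found = COL_IDS.search(references_url)
    if found is None:
        return None
    return {
        "species_id": found.group(1),  # assumed: the accepted name
        "synonym_id": found.group(2),  # None when the URL has no synonym part
    }


url = (
    "http://www.catalogueoflife.org/annual-checklist/details/species/id/"
    "1d761fa6e15f9ba277ad7784af78c8b4/synonym/5fc5c8ab89caeede5ede03be346369a7"
)
print(extract_col_ids(url))
```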

File upload page significantly slowed down for big files after refactoring

Upload of the file takes about the same time, but the 'wait' period after upload increased from 7 to 40 seconds for a 1-million-name file, according to my measurements on production servers.