Giter VIP home page Giter VIP logo

Comments (21)

proceps avatar proceps commented on July 1, 2024

@dhobern, we already have 2 ways to see this stats in TW. In the Filter Nomenclature, you can search by updated date range and a person who made the change. You can add the family name with descendants to the search parameters and get the stats for a single family. We also have a specialized task called the Project Activity, where you can track for each user of the project how many records were created/updated, not just in taxonomy, in any TW model, how much time was spent, number of records edited per hour, etc. Please try and let us know if this satisfy your needs.

from taxonworks.

dhobern avatar dhobern commented on July 1, 2024

Thanks @proceps - yes, I realise I can do this, but I really want to start automating various metrics for datasets in ChecklistBank ad especially those that are components for COL, so I'd like to have the necessary information exposed there, and adding these two fields to the export felt like it should be a small/quick matter.

from taxonworks.

mjy avatar mjy commented on July 1, 2024

@dhobern We have a "Verifier" role that would add more explicity semantics than the Housekeeping created/updated. If we created the ability to batch add this role/metadata via the TaxonName filter (and see it's individual use on the radial annotator) would that be a better way to record this data? I'm hesistant to overload the semantics of houskeeping updated/created by linking them to things like Scrutinizer.

from taxonworks.

dhobern avatar dhobern commented on July 1, 2024

Hmmm - I fixed around 2000 names over the weekend (combinations included) and would like to be able to develop a traffic-light-based map for the whole insect order to understand how recently each name was touched by someone - without seriously slowing myself down.

The Created/Updated housekeeping elements are really useful for anyone using TW to clean up dirty data, and I don't have time to edit yet another radial link each time I'm working my way through tidying a name. So I'm not sure the Verified flag would fit what I need (although I guess it would be nice to have such a flag I could use at the level of subgenera and up so I could easily mark sections that I believe are currently complete).

The scrutinizer and scrutinizerDate fields in COLDP are really mainly used as a modifiedBy/modifiedAt pair, so what you already have seems a perfect fit to me.

from taxonworks.

mjy avatar mjy commented on July 1, 2024

@dhobern Curious- would an internal report meet your needs or does Checklist bank have some capabilities to do this? I.e. it seems this is going to be of use to others as well. If you have a sample plot (napkin sketch) and/or table please share. We can make a generalized report that loops valid family names pretty easily.

from taxonworks.

dhobern avatar dhobern commented on July 1, 2024

Thanks @mjy - I couldn't generate my report internally in TaxonWorks - I need to integrate with the datasets that COL uses to replace some of the worst sections in GLI. Right now, this would be external to ChecklistBank, but COL wants to increase the internal metrics for all datasets there, so I would expect some components would migrate into the CLB functionality.

I suppose my general thought is that this is useful contextual information for many analyses and presentations of the exported data and it seems that it should be part of what a user gets when they download from TW.

from taxonworks.

mjy avatar mjy commented on July 1, 2024

I would suggest COL needs to implement Housekeeping concepts. Too often something like this gets done, and it gets sloppy. This seems particularly important when we are trying to give attribution to people for what they actually did (we can't promise every updater is a scrutinizer), and when we try to record proper metadata provenance trails.

I don't mind adding this, but it's going to have big stars all over it. We literally just introduced the Georeferencer role to deal with this exact problem (we were attributing Georeferencer to people who added data, not people who did the Georeferencing).

from taxonworks.

dhobern avatar dhobern commented on July 1, 2024

Thanks @mjy - let me discuss with Markus - maybe we can add optional timestamp elements to all COLDP tables and use those instead.

from taxonworks.

mdoering avatar mdoering commented on July 1, 2024

Adding modified/modifiedBy to ColDP for all records makes sense to me and is already part of the database model anyhow. I am having more problems with the scrutinizer property which exists from the start of COL. There hardly seem to be sources out there that track this concept and most often you find the housekeeping modifiedBy being used for it instead.

from taxonworks.

mdoering avatar mdoering commented on July 1, 2024

ColDP doesn't mind about extra columns, so you could already include 2 new columns (I would probably call them modified & modifiedBy) if you want while I can work on supporting them in CLB

from taxonworks.

mdoering avatar mdoering commented on July 1, 2024

CatalogueOfLife/coldp#73

from taxonworks.

dhobern avatar dhobern commented on July 1, 2024

Thanks @mdoering and @mjy - based on this discussion, I would be thrilled if TW could include modified (date or datetime) and modifiedBy in (each of) the CSV outputs included in the TW COLDP downloads.

from taxonworks.

mjy avatar mjy commented on July 1, 2024

@mdoering @dhobern Sounding better, now we have to do one better and leapfrog the oldness. We need to be able to include ORCiD or Wikidata or other global identifiers as pointers to the Person/People in question. I believe TDWG is moving forward with something like identifiedByID, perhaps we can have updatedByID, createdByID in addition to updatedBy and createdBy(agent name)? Note I'm not necessarily advocating for also including the created concept, but it is standard Housekeeping in hopefully all "modern" approaches.

from taxonworks.

mdoering avatar mdoering commented on July 1, 2024

It is standard, but I rarely see any use for the created fields. Do you? I', happy to also have modifiedByID, but I wonder if we can just place the ORCID in modifiedBy in case we have that. I know you don't like overloading...

from taxonworks.

mjy avatar mjy commented on July 1, 2024

I do like created from a Time perspective, lag/latency means a lot. We have found it useful in filtering results as well when we are trying to track down provenance-related issues.

Classic ontology related responses from what I've experienced include the ID and a human readable label, even if redundant. If we just pass ORCiD then you're going to have to lookup names if you add any functionality on top of the dataset- if you're not planning to do any of that then one field should be fine I think.

from taxonworks.

mdoering avatar mdoering commented on July 1, 2024

I will anyway lookup orcids and have to find also a way to manage users similar to how we already track local CLB users. I would actually love to only ever see ORCIDs or other resolvable identifiers instead of usernames like smith or green88 that are surely not globally unique. That is rather unlikely to exist in most systems though.

from taxonworks.

mjy avatar mjy commented on July 1, 2024

I would actually love to only ever see ORCIDs

Perfect. We'll send ORCiDs or names in that field then.

from taxonworks.

dhobern avatar dhobern commented on July 1, 2024

Great - thank you both so much

from taxonworks.

dhobern avatar dhobern commented on July 1, 2024

@mjy - Thanks. When will we see this change in the site? I just created a small COLDP export for a genus and it seems not yet to be included.

from taxonworks.

mjy avatar mjy commented on July 1, 2024

In general when you see the most recent CHANGELOG that has a release number in front of the changes they will be live. We're hoping to have it live this week or early next.

from taxonworks.

dhobern avatar dhobern commented on July 1, 2024

Thanks @mjy

from taxonworks.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.