Giter VIP home page Giter VIP logo

bcdm's Introduction

Barcode Core Data Model

Welcome to the repository for the Barcode Core Data Model, a metadata standard designed to revolutionize the way DNA barcode data is structured, shared, and utilized across the scientific community. This standard represents a collaborative effort to establish a comprehensive framework that not only enhances data exchange but also fosters innovation in the development of tools and applications centered around DNA-based biodiversity data.

Purpose

The DNA Barcode Metadata Standard ensures that DNA barcode datasets are structured in a unified, consistent, and clear manner, facilitating interoperability and effective data sharing between research groups and databases. This standardization is intended to streamline the cataloging and analysis of biodiversity data, enabling researchers to collaborate more efficiently.

Core Features

  • Comprehensive coverage of essential data elements related to DNA-based species identification, including specimen collection, DNA sequences, and taxonomic classification.
  • Establishment of a common vocabulary for data interpretation
  • Alignment with existing biodiversity data models and databases, including the Barcode of Life Data System (BOLD) and Darwin Core Data, streamlining data exchange.
  • Serves as a foundation for the development of analytical tools, databases, and resources that utilize DNA barcode data, opening new avenues for research, conservation, and education.

Files in this repository

This document provides the definitions and format for the current supported universal fields included in the Barcode Core Data Model (BCDM). Please notice that the BCDM field identifier_email is not part of the exported fields.

Column name Definition
field Standardized/universal name for the data field.
data_type Data type of the field. For example: integer, float,string, and string:date
data_format Additional formatting constraint for the data value. For example: a timestamp field may follow this format %d-%b-%y; default indicates no extra formatting constraint specified.
definition Meaning of the term, including their controlled vocabulary when applicable.

This document provides mapping of the BCDM data fields to the BOLD Relational Database.

Column name Definition
bcdm_field Standardized/universal name for the data field.
bold_db The relational database to which the data of the universal field are residing, in the format: server:database_name. Example: db1:newdb12
bold_db_table The specific database table to which the universal field is mapped to.
bold_db_field The specific database field to which the universal field is mapped to.
rules Set of regex used for basic data validation.

To provide more convenient and streamlined access to the data, a single relational database view (aka. single pane view) containing all the universal data fields that have been defined.

Column name Definition
bcdm_field Standardized/universal name for the data field.
bold_field Contains information on how to map the universal field to the database through the single pane view. Most cases the format will be the fieldname as in the single pane view. However there might be additional joins to other tables required which will be denoted with double underscore __.

This document provides the mapping of the BCDM to Darwin Core standard. Please notice that not all BCMD fields have an equivalent term in the Darwin Core standard and therefore they will not be part of the export. DarwinCore fields can be found in the following link: Darwin Reference Guide

Column name Definition
bcdm_field Standardized/universal name for the data field.
dwc_field Corresponding term in the Darwin Core glossary.
bcdm_type Data type of the field as in the BCDM. For example: integer, float,string, and string:date
bcdm_format Additional formatting constraint for the data value as in the BCDM. For example: a timestamp field may follow this format %d-%b-%y; default indicates no extra formatting constraint specified.
dwc_type Data type of the field as in the Darwin Core standard.
dwc_format Additional formatting constraint for the data value as in the Darwin Core standard.

bcdm's People

Contributors

dnadiversity avatar rvosa avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.