Giter VIP home page Giter VIP logo

geodacenter / opioid-policy-scan Goto Github PK

View Code? Open in Web Editor NEW
11.0 6.0 14.0 1.45 GB

The Opioid Environment Policy Scan provides access to data at multiple spatial scales to help characterize the multi-dimensional risk environment impacting opioid use in justice populations across the United States.

R 16.13% HTML 43.41% Jupyter Notebook 40.14% Python 0.32%
spatial-data public-health spatial-analysis accessibility opioids health covid-19

opioid-policy-scan's Introduction

Opioid Environment Policy Scan (OEPS) Data Warehouse

Public Site - OEPS Explorer

Visualize, download, and explore OEPS data on the OEPS Explorer.

About

The Opioid Environment Policy Scan (OEPS) is a free, open-source data warehouse to help characterize the multi-dimensional risk environment impacting opioid use and health outcomes across the United States.

The OEPS provides access to data at multiple spatial scales, from U.S. states down to Census tracts. It is designed to support research seeking to study environments impacting and impacted by opioid use and opioid use disorder (OUD), inform public policy, and reduce harm in communities nationwide.

We developed the OEPS as a free, open-source platform to aggregate and share publicly-available data at the Census tract, zip code, county, and state levels. Geographic boundary shapefiles are provided for ease of merging datasets (csv files) for exploration, spatial analysis, or visualization. Download the entire data repository, or you can filter and download by theme or spatial scale with the OEPS Explorer. All datasets are accompanied by documentation, detailing their source data, year, and more. Learn more about our methods and approaches, including the risk environment framework, on our Methodology page.

The OEPS is led by the Healthy Regions and Policies Lab at the Center for Spatial Data Science, University of Chicago. It was developed for the NIH's JCOIN Methdology and Advanced Analaytic Resource Center (MAARC). See Team and Acknolwedgements below for more.

Citation

Susan Paykin, Dylan Halpern, Qinyun Lin, Moksha Menghaney, Angela Li, Rachel Vigil, Margot Bolanos Gamez, Alexa Jin, Ally Muszynski, and Marynia Kolak. (2021). GeoDaCenter/opioid-policy-scan: Opioid Environment Policy Scan Data Warehouse (v1.0). Zenodo. https://doi.org/10.5281/zenodo.5842465

Wiki

We welcome open source contributions and feedback, including suggesting or contributing relevant data, application development, or sharing applied research. To learn more, visit the OEPS Wiki.

Data Overview

Variable constructs are grouped thematically below to highlight the multi-dimensional risk environment of opioid use in justice populations. In the Metadata column, linked pages provide more detail about the data source, descriptions of data cleaning or processing, and individual variables included.

Geographic Boundaries

Variable Construct Variable Proxy Source Metadata Spatial Scale
Geographic Boundaries State, County, Census Tract, Zip Code Tract Area (ZCTA) US Census TIGER/Line, 2018 Geographic Boundaries State, County, Tract, Zip
Crosswalk files County, Census Tract, Zip Code Tract Area (ZCTA) HUD Office of Policy Development and Research Crosswalk Files County, Tract, Zip

Policy Variables

Variable Construct Variable Proxy Source Metadata Spatial Scale
Prison Incarceration Rates Prison population rate and prison admission rate by gender and ethnicity Vera Institute of Justice, 2016 PS01 / Prison Variables County
Jail Incarceration Rates Jail population rate by gender and ethnicity Vera Institute of Justice, 2017 PS02 / Jail Variables County
Prescription Drug Monitoring Programs (PDMP) Any PDMP; Operational PDMP; Must-access PDMP; Electronic PDMP OPTIC, 2017 PS03 / PDMP State
Good Samaritan Laws Any Good Samaritan Law; Good Samaritan Law v1.0 arrest OPTIC, 2017 PS04 / GSL State
Naloxone Access Laws Any Naloxone law; law allowing distribution through a standing or protocal order; law allowing pharmacists prescriptive authority OPTIC, 2017 PS05 / NAL State
Medicaid Expenditure Total Medicaid spending KFF, 2019 PS06 / MedExp State
Medicaid Expansion Spending for adults who have enrolled through Medicaid expansion KFF, 2018 PS07 / MedExpan State
Syringe Services Laws Laws clarifying legal status for syringe exchange, distribution, and possession programs LawAtlas, 2019 PS08 / Syringe State
Medical Marijuana Laws Law authorizing adults to use medical marijuana PDAPS, 2017 PS09 / MedMarijLaw State
State & Local Government Expenditures Government spending on public health, welfare, public safety, and corrections US Census, 2018 PS11 / Government Expenditures State, Local

Health Variables

Variable Construct Variable Proxy Source Metadata Spatial Scale
Drug-Related Death Rates Death rate from drug-related causes CDC WONDER, 2019 10-year Health01 / Drug-Related Death Rate State, County
Hepatitis C Rates HepC prevalence, mortality HepVu, 2017 Health02 / Hepatitis C State, County
Physicians Number of Primary Care and Specialist Physicians Dartmouth Atlas, 2010 Health03 / Physicians Tract, County, State
Opioid Prescription Rates Opioid prescription rate HepVu, CDC 2018-2019 Health04 / Opioid Indicators State, County
Opioid Mortality Rates Rates of death from narcotic drug overdoses HepVu, NVSS, 2014-2019 Health04 / Opioid Indicators State, County
Access to MOUDs Travel time (drive, walk, bike) and distance to nearest MOUD SAMHSA 2019, Vivitrol 2020 Access01 / Access: MOUDs Tract, Zip, County, State
Access to FQHCs Travel time (drive) and distance to nearest Federally Qualified Health Center (FQHC) US COVID Atlas, HRSA, 2020 Access02 / Access: FQHCs Tract, Zip, County, State
Access to Hospitals Travel time (drive) and distance to nearest hospital CovidCareMap, 2020 Access03 / Access: Hospitals Tract, Zip, County, State
Access to Pharmacies Travel time (drive) and distance to nearest pharmacy InfoGroup, 2018 Access04 / Access: Pharmacies Tract, Zip, County, State
Access to Mental Health Providers Travel time (drive) and distance to nearest mental health provider SAMSHA, 2020 Access05 / Access: Mental Health Providers Tract, Zip, County, State
Access to Substance Use Treatment (SUT) Services Travel time (drive) and distance to nearest substance use treatment (SUT) service SAMHSA, 2021 Access06 / Access: Substance Use Treatment Tract, Zip, County, State
Access to Opioid Treatment Programs (OTP) Travel time (drive) and distance to nearest Opioid Treatment Program (OTP) SAMHSA, 2021 Access 07 / Access: Opioid Treatment Programs Tract, Zip

Demographic Variables

Variable Construct Variable Proxy Source Metadata Spatial Scale
Race & Ethnicity Percentages of population defined by categories of race and ethnicity ACS, 2018 5-year DS01/ Race & Ethnicity Variables State, County, Tract, Zip
Age Age group estimates and percentages of population ACS, 2018 5-year DS01 / Age Variables State, County, Tract, Zip
Population with a Disability Percentage of population with a disability ACS, 2018 5-year DS01 / Other Demographic Variables State, County, Tract, Zip
Educational Attainment Population without a high school degree ACS, 2018 5-year DS01 / Other Demographic Variables State, County, Tract, Zip
Social Determinants of Health (SDOH) SDOH Neighborhood Typologies Kolak et al, 2020 DS02 / SDOH Typology Tract
Social Vulnerability Index (SVI) SVI Rankings CDC, 2018 DS03 / SVI County, Tract, Zip
Veteran Population Population as defined by veteran status ACS, 2018 5-year DS04 / Veteran Population Variables State, County, Tract, Zip
Household Type Household Types and Group Quarters Populations ACS, 2018 5-year DS05 / Household Type Variables State, County, Tract, Zip
Homeless Population Homelessness Census Variables HUD, 2018 DS06 / Homeless Population Variables State, County, Tract, Zip

Economic Variables

Variable Construct Variable Proxy Source Metadata Spatial Scale
Employment Trends Percentages of population employed across industries ACS, 2018 5-year EC01/ Jobs by Industry State, County, Tract, Zip
Essential Worker Jobs See COVID-19 Variables EC02 / Jobs by Occupation
Unemployment Rate Unemployment rate ACS, 2018 5-year EC03/ Economic Variables State, County, Tract, Zip
Poverty Rate Percent classified as below poverty level, based on income ACS, 2018 5-year EC03/ Economic Variables State, County, Tract, Zip
Per Capita Income Per capita income in the past 12 months ACS, 2018 5-year EC03/ Economic Variables State, County, Tract, Zip
Foreclosure Rate Mortgage foreclosure and severe delinquency rate HUD, 2009 EC04 / Foreclosure Rate State, County, Tract
Internet Access Percentage of Households without Internet access ACS, 2019 5-year EC05/ Economic Variables State, County, Tract, Zip

Physical Environment Variables

Variable Construct Variable Proxy Source Metadata Spatial Scale
Housing Occupancy Rate Percent occupied units ACS, 2018 5-year BE01 / Housing State, County, Tract, Zip
Housing Vacancy Rate Percent vacant units ACS, 2018 5-year BE01 / Housing State, County, Tract, Zip
Long Term Occupancy Percentage of population living in current housing for 20+ years ACS, 2018 5-year BE01 / Housing State, County, Tract, Zip
Mobile Homes Percent of housing units classified as mobile homes ACS, 2018 5-year BE01 / Housing State, County, Tract, Zip
Rental Rates Percent of housing units occupied by renters ACS, 2018 5-year BE01 / Housing State, County, Tract, Zip
Housing Unit Density Housing units per square mile ACS, 2018 5-year BE01 / Housing State, County, Tract, Zip
Urban/Suburban/Rural Classification County classification USDA-ERS BE02 / Rural-Urban Classifications County
Urban/Suburban/Rural Classification Zip code and Census tract classification USDA-ERS BE02 / Rural-Urban Classifications Tract, Zip
Alcohol Outlet Density Alcohol outlets per square mile, alcohol outlets per capita InfoGroup, 2018 BE03 / Alcohol Outlets State, County, Tract, Zip
Hypersegregated Cities US metropolitan areas where black residents experience hypersegregation Massey et al, 2015 BE04 / Community Overlays County
Southern Black Belt US counties where 30% of the population identified as Black or African American US Census, 2000 BE04 / Community Overlays County
Native American Reservations Percent area of total land in Native American Reservations US Census TIGER/Line, 2018 BE04 / Community Overlays County
Residential Segregation Indices Three index measures of segregation: dissimilarity, interaction, isolation ACS, 2018 5-year BE05 / Residential Segregation County, State, Zip
NDVI Normalized Difference Vegetation Index (NDVI) average value Sentinel-2 MSI, 2018 BE06 / NDVI State, County, Tract, Zip
Parks Coverage Percent of land area covered by parks and green space OSM, 2022 BE07 / Parks State, County

COVID-19 Variables

Variable Construct Variable Proxy Source Metadata Spatial Scale
Essential Worker Jobs Percentage of population employed in Essential Jobs as defined during the COVID-19 pandemic ACS, 2018 5-year EC02 / Jobs by Occupation State, County, Tract, Zip
Cumulative Case Count Daily cumulative raw case count (01/21/20 - 03/03/2021) The New York Times, 2021 COVID01 / COVID Variables State, County
Adjusted Case Count per 100K Daily cumulative adjusted case count per 100K population (01/21/20 - 03/03/2021) The New York Times, 2021 COVID02 / COVID Variables State, County
7-day Average Case Count 7-day average case count (01/21/20 - 03/03/2021) The New York Times, 2021 COVID03 / COVID Variables State, County
Historical 7-day Average Adjusted Case Count per 100K 7-day average adjusted case count per 100K population (01/21/20 - 03/03/2021) The New York Times, 2021 COVID04 / COVID Variables State, County

Full Documentation

Please refer to the complete Data Documentation for more information about individual datasets, variables, and data methods. Contact Susan Paykin with any questions.

Team

The OEPS is led by the Healthy Regions & Policies Lab team including Susan Paykin, Qinyun Lin, Dylan Halpern, and Marynia Kolak, along with Moksha Menghaney and Angela Li.

Acknowledgements

The OEPS was developed for the Methodology and Advanced Analytics Resource Center (MAARC), part of the NIH-HEAL Initiative Justice Community Opioid Innovation Network (JCOIN). The Healthy Regions & Policies Lab leads spatial analytics for the MAARC, which provides data infrastructure and statistical and analytic expertise to support individual JCOIN studies and cross-site data synchronization.

This research was supported by the National Institute on Drug Abuse, National Institutes of Health, through the NIH HEAL Initiative under award number UG3DA123456. The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the NIH, the Initiative, or the participating sites.

opioid-policy-scan's People

Contributors

a-moosh avatar angela-li avatar balcaglayan avatar bucketteofivy avatar cavillanueva1 avatar juhe0120 avatar makosak avatar menghamo avatar mradamcox avatar nofurtherinformation avatar oliang2000 avatar qinyun-lin avatar rnvigil avatar spaykin avatar wam0 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

opioid-policy-scan's Issues

Reconcile out discrepancy between `master` and `main` branch

At some point in this repository's history, a main branch was created to become the new default. However, the old default, master still exists and when I preview a pull request from the old master into the new main there is a long list of commits that would be applied. I doubt we should actually be applying all of these commits, but I would like to look through them to figure out what they contain, and, if relevant, use git cherry-pick to pull them into main.

Ultimately, the master branch should be deleted.

Inspect and merge historic data

The historic datasets in OEPS-historical-data branch ultimately need to be merged into the main branch. Some updates to the datasets may need to be made, so that may as well take place before the merge is performed.

Standardize shapefiles

A little overhaul of the spatial data here would be good. Some tasks include:

  • Inspect non-join columns, like state name or county name, and perhaps add more if needed. These will be columns that can be added to joined and exported data, so formatting and making them look good is important.
  • Standardize GEOIDs across spatial resolutions and time (e.g. the join field for ZTCAs should be the same name in 2010 shp as 2018).
  • Update columns in all CSVs as needed to streamline the joins.

Update homelessness variables

Revise/integrate mds a bit more to link the two estimates and related proxies (household type & homeless pop); remove ACS from point in estimate variable since that one only uses HUD.

Find/create smaller ZCTA 2010 shapefile

The file we've been able to find so far from the Census Bureau is really big, abou 800mb. We don't need geometries with that much detail, so it would be better to have a generalized file to use going forward. I have been asking folks at the CB, maybe there is an official generalized file available, as seems to exist for more recent years.

If not, I think it would be acceptable to run a generalization operation on the file (in QGIS or elsewhere), as long as the contiguous boundaries are retained.

Fill out remaining 1980, 1990, and 2000 DS01 variables

The variables present in the DS01 historic data do not match the variables listed in the DS01 data tables documentation. This seems to be because Social Explorer's "Historic Census Data on 2010 Census Tracts" datasets do not include the counts needed for the DS01 data table documentation, which was likely caused by the historic censuses not aggregating their data directly into those relevant categories. But, as our historic DS01 data files are based on Social Explorer's data, we are also missing those categories.

However, there does seem to be a workaround for the some of the data. The historical censuses seem to have released dis-aggregated tract level race, ethnicity, age, and education attainment data from which most of the missing data can be reconstructed. I'm currently planning to download this data from IPUMS NHGIS and then crosswalk the data to 2010 census tracts using weights from the Longitudinal Tract Database, but have a few open questions about data comparability I wanted to track here that will need answered prior to merging these changes. Namely:

  1. The 1980 Census seems to have asked if respondents were of Spanish origin as opposed to Hispanic origin, which they started doing in 1990. Is it sufficient to simply note this discrepancy in documentation, or is there research indicating that the difference in wording heavily changed how respondents interpreted the question? (I don't currently believe this is the case, but it's probably wise to double check anyhow).
  2. The 1980 Census also reports "Years of School Completed" with categories such as "High School: 1-3 years" and "High School: 4 years," whereas future censuses report "Educational Attainment" with categories such as "9th to 12th grade, no diploma" and "High School graduate (includes equivalency)." At minimum, this means that any estimate of percent population with less than a high school diploma (for the noHSP variable) will exclude GEDs for the 1980 population but not for 1990 on. Are these sufficiently different that the 1980 Census education variable should be renamed or treated differently, or is it sufficient to just note this discrepancy in the documentation?
  3. Due to "major differences between the disability questions," the US Census Bureau does not advise comparisons disability data comparisons between the Censuses taken prior to 1990 and the 2000 Census. As an example of these discrepancies, disability data collected in the 1980 and 1990 Censuses only consider the civilian non institutionalized population of 16 years of age and older, whereas the 2000 Census considers the civilian non institutionalized population of 5 years of age and older. It is probably desirable to make the differences between the 1980/1990 and 2000 disability data as apparent as possible for end users. Towards that end, do we want to separate out the 1980 and 1990 disability data into a unique variable to reflect this difference in collection methodology?

Access to Internet

##B28002_001: Estimate!!Total:
##B28002_002: Estimate!!Total:!!Estimate!!Total:!!With an Internet subscription
##B28002_012: Estimate!!Total:!!Internet access without a subscription
##B28002_013: Estimate!!Total:!!No Internet access

We can calculate % of households without access to the Internet using B28002_013/B28002_001.

Add county & state-level access metrics for all resources

Update access metrics with county & state level resources.

These will likely have a different methodology (ex. % of tracts within 30-min distance, or average distance across tracts), but it's fine as long as documented and integrated within the same file.

Homelessness proxy %

Using a doubled-up housing %; may require a literature review. Preferred at census tract scale and higher.

Fix variable name disagreements

There are a few disagreements between the variable names in data dictionaries and in the CSV files themselves. Creating this ticket to track their resolution ahead of the v2 release.

Ran a validation script against the table definitions (which are generated directly from the data dictionaries) and the CSVs themselves. Here is the output, i.e. the variable names that this ticket should address. Note that some of these have more to do with geometry fields, and should be fixed via #68.

VALIDATE INPUT SOURCE: csv/C_1980.csv
WARNINGS ENCOUNTERED: 0

VALIDATE INPUT SOURCE: csv/C_1990.csv
WARNINGS ENCOUNTERED: 2
  1 source columns missing from schema: Age15_24P
  1 schema fields missing from source: A15_24P

VALIDATE INPUT SOURCE: csv/C_2000.csv
WARNINGS ENCOUNTERED: 2
  1 source columns missing from schema: Age15_24P
  1 schema fields missing from source: A15_24P

VALIDATE INPUT SOURCE: csv/C_2010.csv
WARNINGS ENCOUNTERED: 2
  1 source columns missing from schema: VacP
  1 schema fields missing from source: VacantP

VALIDATE INPUT SOURCE: csv/C_Latest.csv
WARNINGS ENCOUNTERED: 1
  1 source columns missing from schema: Unnamed: 0

VALIDATE INPUT SOURCE: csv/S_1980.csv
WARNINGS ENCOUNTERED: 2
  2 source columns missing from schema: STATEFP, Age15_24P
  4 schema fields missing from source: G_STATEFP, GEOID, STUSPS, A15_24P

VALIDATE INPUT SOURCE: csv/S_1990.csv
WARNINGS ENCOUNTERED: 2
  2 source columns missing from schema: STATEFP, Age15_24P
  4 schema fields missing from source: G_STATEFP, GEOID, STUSPS, A15_24P

VALIDATE INPUT SOURCE: csv/S_2000.csv
WARNINGS ENCOUNTERED: 2
  3 source columns missing from schema: STATEFP, Age15_24P, OccP
  4 schema fields missing from source: G_STATEFP, GEOID, STUSPS, A15_24P

VALIDATE INPUT SOURCE: csv/S_2010.csv
WARNINGS ENCOUNTERED: 1
  4 schema fields missing from source: G_STATEFP, STUSPS, ChildrenP, Age18_64

VALIDATE INPUT SOURCE: csv/S_Latest.csv
WARNINGS ENCOUNTERED: 2
  3 source columns missing from schema: TotPopE, NoHSP, PrMisuse20
  3 schema fields missing from source: TotPop, NoHsP, PrMsuse20P

VALIDATE INPUT SOURCE: csv/T_1980.csv
WARNINGS ENCOUNTERED: 2
  3 source columns missing from schema: NoHSP, ChildrenP, OccP
  4 schema fields missing from source: TRACTCE, COUNTYFP, STATEFP, NoHsP

VALIDATE INPUT SOURCE: csv/T_1990.csv
WARNINGS ENCOUNTERED: 2
  4 source columns missing from schema: Age15_24P, NoHsp, ChildrenP, OccP
  5 schema fields missing from source: TRACTCE, COUNTYFP, STATEFP, A15_24P, NoHsP

VALIDATE INPUT SOURCE: csv/T_2000.csv
WARNINGS ENCOUNTERED: 2
  4 source columns missing from schema: Age15_24P, NoHsp, ChildrenP, OccP
  6 schema fields missing from source: TRACTCE, COUNTYFP, STATEFP, A15_24P, NoHsP, PciE

VALIDATE INPUT SOURCE: csv/T_2010.csv
WARNINGS ENCOUNTERED: 2
  2 source columns missing from schema: GiniCoeff, VacP
  7 schema fields missing from source: TRACTCE, COUNTYFP, STATEFP, AgeOv18, NonRelFhhP, NonRelNfhhP, VacantP

VALIDATE INPUT SOURCE: csv/T_Latest.csv
WARNINGS ENCOUNTERED: 2
  3 source columns missing from schema: TotPopE, NoHSP, FqhcMinDis
  3 schema fields missing from source: TotPop, NoHsP, MinDisFqhc

VALIDATE INPUT SOURCE: csv/Z_1980.csv
WARNINGS ENCOUNTERED: 2
  4 source columns missing from schema: ZCTA, Age55_59, Ov65P, PacIsP
  4 schema fields missing from source: GEOID, PacISP, HispP, Ovr65P

VALIDATE INPUT SOURCE: csv/Z_1990.csv
WARNINGS ENCOUNTERED: 2
  3 source columns missing from schema: ZCTA, Ov65P, PacIsP
  4 schema fields missing from source: GEOID, PacISP, HispP, Ovr65P

VALIDATE INPUT SOURCE: csv/Z_2000.csv
WARNINGS ENCOUNTERED: 2
  3 source columns missing from schema: ZCTA, Ov65P, PacIsP
  4 schema fields missing from source: GEOID, PacISP, HispP, Ovr65P

VALIDATE INPUT SOURCE: csv/Z_2010.csv
WARNINGS ENCOUNTERED: 2
  3 source columns missing from schema: PacIsP, MedInc, VacP
  2 schema fields missing from source: PacISP, VacantP

VALIDATE INPUT SOURCE: csv/Z_Latest.csv
WARNINGS ENCOUNTERED: 0

New metadata file is needed.

The following variables in data of the Census Tract scale do not appear in any metadata files. We will have to create new metadata files for them.

SocEcAdvIn
LimMobInd
UrbCoreInd
MicaInd

Update metadata files for 2.0 release

We discussed how to approach the metadata update that is needed now that a lot of files structures and dictionaries have changed. The approach we agreed upon has two parts which are described below. Note: the new data dictionaries are in XSLX format, and can be found here: https://github.com/GeoDaCenter/opioid-policy-scan/tree/main/data_final/dictionaries. These will be helpful references for step 1 and in step 2 they will need to be updated. (These steps are more suggestion for how to go about the process, not required workflow).

This only concerns all non-geography markdown metadata files.

1 . Update all existing metadata markdown files.

The files here: https://github.com/GeoDaCenter/opioid-policy-scan/tree/main/data_final/metadata from v1 are still very much relevant after the reorganization, but they need to be updated in the following ways:

  • Update variable names where necessary
  • Update Themes where necessary
    • Match to values in the data dictionaries
    • Edit: On further inspection, these themes are handled elsewhere, outside of the markdown files. So this step will be handled in a different ticket.
  • Update author/modified like so:
    Author: <original author>
    Last Modified: <new date>
    Last Modified By: <updater>
    

2. Add Metadata Location column to the new Data Dictionaries

A new Metadata Location column will be added to each of the new XLSX data dictionaries, with a URL pointing to the GitHub-hosted location of the corresponding Markdown file for each variable row. These should be the "raw" urls referencing the main branch (we'll update to the 2.0 tag later, just before creating that release). For example:

https://raw.githubusercontent.com/GeoDaCenter/opioid-policy-scan/main/data_final/metadata/Access_FQHCs_MinDistance.md

This will also offer a good QA/QC opportunity for whether we are missing markdown files: each row must have a value for Metadata Location.

Remove master branch

The master branch of this repo was deprecated in favor of main at somepoint, but there are a lot of hard-coded urls that still point to resources on GitHub in the master branch. These should all be removed in favor of the v1.0 tag.

Remove/archive v1 datasets ahead of v2 release, promote new CSVs

To prepare for the upcoming release of the v2 datasets, which have been consolidated by spatial resolution and now include historical tables, we should at least move all of the old CSVs into a new directory, like data_final/v1.1 (must double-check the actual release number here), and move the contents of data_final/consolidated into a more primary location. Maybe something like data_final/v2.0/tables and data_final/v2.0/dictionaries.

There is other content in the data_final directory that we'll have to figure out what to do with, but this ticket only concerns the old and new CSVs.

Proposal: Split explorer branch into new repository

I've been reading through this repo and its accompanying wiki and other documentation in order to understand it better (which is thorough and really helpful), and I'd like to propose that the explorer branch, which holds the static website and is deployed through Netlify, is split into a separate repository. Here are a few reasons I think this would be a good move:

  • As it is, the explorer branch is completely independent of the main branch, and CSV files are manually copied from main to explorer for publication, or fetch calls in explorer access content in main through direct calls to https://github.raw...., so there is no functional logic to link the two (I may be missing something though).
  • Archives of the main branch are published to Zenodo (one per release) and associated with a DOI.
    • This means that release tags on this repo get tied to versions in Zenodo of the same DOI, so tagging releases of the explorer branch isn't really possible, at least without adding a good bit of confusion to the release lineage.
    • This also means that significant functional changes to the main branch, like moving the explorer codebase into it as a subdirectory, seem inappropriate.
  • Similarly, we will need to make changes to the explorer website in the future, at the very least to set netlify configs like pinning a node version (for example, the deployment/build process seems to break with node >= 17 (or so), due to an openssl issue, thanks @bucketteOfIvy for finding this), so developing on the main branch of a new repo will be generally less cumbersome and more sustainable.

We should be able to clone the branch independently into a new repo while retaining all relevant commit history--the process would be something like this not just a copy-paste into a new blank repo. This means that all past contribution history would remain intact (this is a requirement as far as I'm concerned).

I'm thinking the new repo would be something like healthyregions/oeps-explorer.

Anyway, I'm writing this ticket out mostly to get the idea in front of @spaykin and @nofurtherinformation, as you have been the main contributors to this repo, and I don't think a change like this can happen without your input. Like I mentioned, work will definitely need to happen on the explorer at some point this summer, so this is essentially a preparatory step for that.


to complete:

  • resolve explorer-debug branch (merge into explorer, or just document and delete?)
  • create new repo as described above
  • update this repo's readme as needed
  • update this repo's wiki as needed
  • look around for references to the explorer in other literature and update urls
  • setup netlify build from new repo (this may include a domain change as well)
  • delete explorer branch from this repo

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.