Giter VIP home page Giter VIP logo

mcmurray-wabiskaw-preprocessed-datasets's Introduction

McMurray-Wabiskaw-preprocessed-datasets

McMurray/Wabiskaw preprocessed datasets

Purpose

This repository serves as a place to put datasets preprocessed for specific purposes that originally came from Athabasca Oil Sands Data McMurray/Wabiskaw Oil Sands Deposit - Electronic Data zip file. The original dataset is described in Open File Report 1994-14 and can be downloaded from this webpage.

A first use-case is a place to put McMurray datasets preprocessed to only have data useful for top prediction and facies prediction that are called in Rockhound, a Python package for quickly and easily calling easy to work with geologic datasets into either a pandas array or xarray. This reduces the need to find, clean, and arrange data when all you need is a starter dataset.

Information About The Original Datasets

The original dataset is a collection of over 2000 wells made public by the Alberta Geological Survey's Alberta Energy Regulator. To quote their webpage, "In 1986, Alberta Geological Survey began a project to map the McMurray Formation and the overlying Wabiskaw Member of the Clearwater Formation in the Athabasca Oil Sands Area. The data that accompany this report are one of the most significant products of the project and will hopefully facilitate future development of the oil sands." It includes well log curves as LAS files and tops in txt files and xls files. There is a word doc and a text file that describes the files and associated metadata.

Wynne, D.A., Attalla, M., Berezniuk, T., Brulotte, M., Cotterill, D.K., Strobl, R. and Wightman, D. (1995): Athabasca Oil Sands data McMurray/Wabiskaw oil sands deposit - electronic data; Alberta Research Council, ARC/AGS Special Report 6.

Please go to the links below for more information and the dataset:

Report for Athabasca Oil Sands Data McMurray/Wabiskaw Oil Sands Deposit http://ags.aer.ca/document/OFR/OFR_1994_14.PDF

Electronic data for Athabasca Oil Sands Data McMurray/Wabiskaw Oil Sands Deposit http://ags.aer.ca/publications/SPE_006.html Data is also in the repo folder: SPE_006_originalData of the original repo for this project here.

The Latitude and longitude of the wells is not in the original dataset. @dalide used the Alberta Geological Society's UWI conversion tool to find lat/longs for each of the well UWIs. A CSV with the coordinates of each well's location can be found here. These were then used to find each well's nearest neighbors.

Please note that there are a few misformed .LAS files in the full dataset, so the code in this repository skips those.

If for some reason the well data is not found at the links above, you should be able to find it here.

Credits to Original Dataset Authors:

Wynne, D.A., Attalla, M., Berezniuk, T., Berhane, H., Brulotte, M., Cotterill, D.K., Strobl, R.S. and Wightman, D.M. (1994): Athabasca Oil Sands database; McMurray/Wabiskaw deposit; Alberta Research Council, ARC/AGS Open File Report 1994-14, 51 p.

Also please see the SPE_006.txt file in the dataset zip file for additional information on contacting the originally authors.

License and use constraints

In the metadata file for the original datasets SPE_006.txt the dataset is described as Access Constraints: Public and Use Constraints: Credit to originator/source required. Commercial reproduction not allowed.

The preprocessed datasets built on the original datasets therefore follow a similar scheme. The license that best matches the use constraints of the original dataset is the Attribution-NonCommercial 2.0 Generic (CC BY-NC 2.0) license. A human-readable version of this license is here. The key points are free to share, must attribute, and non-commercial so please don't try to resell the data.

Code for processing original dataset into preprocessed datasets

Currently the Jupyter notebooks in the preprocessing_code directory are a good summary of the steps used to preprocess the files. Various python modules in predictatops project were used in the preprocessing. Eventually that code will be extracted and included here such that all preprocessing is in this repository.

Preprocessed datasets derived from original

Original datasets but with LAS well log files that don't load well separated and geographic coordinates CSV added saved in Zip file

  • Location in repository: processed_datasets
  • mannville_demo_data.zip
  • SHA256: 5e60e02306b15ce7bcb6ec35278566696a6b37627ce095360cdba26ef3f99fdf
  • What is included:
  • How to load:

Preprocessed dataframe for prediction of Top McMurray Pick Saved in single .h5

  • Location in repository: processed_datasets
  • name: mcmurry_top_dataframe.h5.zip
  • SHA256: b546eb7f1ceccfc3b0db0dcd8b0f204387aca717d88836700f6ed29d692d442b
  • What is included:
  • How to load:

Preprocessed dataframe from facies prediction in single .h5

  • Location in repository: processed_datasets
  • name: mcmurry_facies_dataframe.h5.zip
  • SHA256: 367d7de144d430cb014db04281092fa3179c122e3396cdd111008f6885f9fe0c
  • What is included:
  • How to load:

mcmurray-wabiskaw-preprocessed-datasets's People

Contributors

justingosses avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

abv-hub

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.