Giter VIP home page Giter VIP logo

wdl_mm's Introduction

Wasserstein Distributional Learning via Majorization-Minimization

Introduction

This is the GitHub repo for paper Wasserstein Distributional Learning via Majorization-Minimization.

Data

Abstract

There are two real-world applications in our paper.

The data sets for the climate modeling are the daily temperature data from Berkeley Earth and the physical driver data from Intergovernmental Panel on Climate Change (IPCC).

The data sets for the income modeling are the income distribution data from American Community Survey (ACS) and the county-level health indices of from the County Health Rankings & Roadmaps program

In our experiment, we first calculated quantile and density functions from the raw point data. We then performed analyses using these transformed distribution functions.

Availability

The data sets used in this paper are publicly availalable and the access links are provided in the Description section.

Description

The data sets used in this paper are publicly available and the links to download them are provided below.

The daily temperature data set can be downloaded at http://berkeleyearth.org/data/.

The physical driver data can be downloaded at http://www.climatechange2013.org/images/report/WG1AR5_AIISM_Datafiles.xlsx.

The income distribution data of Year 2014 from ACS can be downloaded at https://www2.census.gov/programs-surveys/acs/data/pums/.

The county-level health indices of Year 2014 from the County Health Rankings & Roadmaps program can be downloaded at https://www.countyhealthrankings.org/.

Code

Description

The WDL algorithm is implemented in Python 3. All the function definitions can be found in folder lib/WDL.py.

The first comparison method Frechet regression is implemented using the R package frechet.

The second comparison method CLR regression is implemented by adapting the R codes from the reference paper Talská, R., Menafoglio, A., Machalová, J., Hron, K., & Fišerová, E. (2018). Compositional regression with functional response. Computational Statistics & Data Analysis, 123, 66-85.

Instructions for Use

Reproducibility

  1. The subfolder named codes/simulation contains the codes to reproduce all the results in simulations, including Table 1 and Figure 2&3.
  2. The subfolder named codes/climate contains the codes to reproduce all the results in climate modeling, including Figure 1, Figure 4&5&6, and Figure D in the Appendix.
  3. The subfolder named codes/income contains the codes to reproduce all the results in income modeling, including Table 2 and Figure 7&8.

wdl_mm's People

Contributors

chengliangtang avatar

Stargazers

Liyuan Yang avatar an-seunghwan avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.