spatial-ews / spatialwarnings Goto Github PK

View Code? Open in Web Editor NEW

14.0 11.0 4.0 20.42 MB

An R package to compute spatial early-warning signals of ecosystem degradation

License: Other

R 88.81% C++ 9.09% C 0.77% Makefile 0.06% Shell 0.12% MATLAB 1.15%

r ecology space indicators catastrophic shifts critical points transitions

spatialwarnings's People

Contributors

Stargazers

Watchers

Forkers

agmayor w529 jguerber liuqx315

spatialwarnings's Issues

Fit patch size distributions

fitpsd.R et patchsizedistr_ews.R

patchsizedistr_ews.R seems to be obsolete.

Add the documentation.

May be we have also to think to the structure of this function ? Cumulative patch size distribution could be separated from power law fitting function ?

Spectral EWS: Add the workflow function

Add a workflow functions (spectral_spews + print/summary/plot methods) that informs the use about :

changes in shape of powerspectrum
summarise those changes in SDR

In more details :

[doc] Spectral EWS workflow

Make sure the spectral ews workflow is documented the following functions should have their respective roxygen comments:

Remove the discrete= argument in indiator_* functions

Make sure what the x-axis is in the r-spectrum returned by ic_powerspectrum

Add demo data

There are a couple of data files in the original repo, but no documentation on how to interpret them.

Could we add a couple of valid binary matrices as data objects that we can use for writing examples?

Article/Vignette

As put in http://r-pkgs.had.co.nz/vignettes.html :

A vignette is like a book chapter or an academic paper: it can describe the problem that your package is designed to solve, and then show the reader how to solve it. A vignette should divide functions into useful categories, and demonstrate how to coordinate multiple functions to solve problems. Vignettes are also useful if you want to explain the details of your package. For example, if you have implemented a complex statistical algorithm, you might want to describe all the details in a vignette so that users of your package can understand what’s going on under the hood, and be confident that you’ve implemented the algorithm correctly.

Following this idea, I suggest dividing our vignette into several parts :

`source()` command in R functions

Some R script contain source() command. We don't need those commands since when the package is loaded, all the scripts of the R folder are sourced.

A grep command in the repository gives that 20 source() command are present, we need to delete them.

To work locally, you can install the package by doing

install_github("fdschneider/spatial_warnings")
library(spatialwarnings)

Normally, it should work and you don't need to source manually individual scripts.

If you modify a function locally and you want to test it, you can run devtools::load_all() which load all the scripts which are in the R/ folder.

ForestGapModel

Check 2d phase diagram axes as per example.R code. Are the axes swapped?
Not all replicates for d values very close to transition in the continuous phase have non-zero densities. Patch Size distributions to be calculated only on the basis of finite density snapshots.

Create big wrapping function for all indicators

Create a function that will compute all the indicators and produce a summarizable/plottable object.

# We need a catchy short name for that
all_indicators <- function(data, # `binary_matrix` object
                           indicators = list(variance = indicator_variance, psd = indicator_psd, ...), # list of functions
                           ?)

It should return an object of class indicators with the following structure:

list(variance = list(value = xx, ...), # OR variance = xx
     skewness = list(value = xx, ...), 
     psd = `psd object`, 
     ...) # Component names are taken from the list of functions above.
(class: c('spatial_indicators', 'list'))

Numerical indicators (e.g. variance, skewness, moran's I) should have a value= in their output list so they get recognized by the summarize function.

define R package structure

We need to wrap this into a valid R package.

The globally accepted input should be a binary matrix, A.

Any factorial matrix F with the focal state "+" can be easily converted into such by A <- F == "+".
Same works for integer matrices, I. A <- I == 1

Many of the present functions can be run on that directly. Others need patch counts. Thus a core function label()should provide a fast labelling of patches and return a matrix with integer patch IDs.

Then a secondary function could take in the binary matrix as input and return a vector of patch sizes and/or a table of cumulative patch sizes.

Spectral EWS workflow

Make sure the spectral ews workflow works: the following functions should be implemented and working.

Generic SPEWS: Notes on testing for significance near the critical point

For all of them we compute a null distribution of indicator values based on shuffled matrices, but the way we test against this distribution might have to differ from indicator to indicator:

Moran's I should always be higher so we just do a one-tailed test for higher than expected values
Variance should always be higher in the coarse-grained observed data than in the coarsed-grained null dat (so one-tailed test too). However, the significance can decrease just before the tipping point (e.g. observed in Sabiha's runs for the pq model).
Skewness can increase or decrease depending on whether the shift is upwards or downwards. In all cases, the absolute value of skewness should always increase before a critical point. This absolute value skewness is expected to be lower on a null coarsed-grain matrix (??) than observed.

[doc] Misc. functions needing documentation

These functions will be available to users as they can be useful, and need documentation:

rspectrum (compute the r-spectrum)
coarse_grain
patchsizes (get patch size distribution of a binary matrix)
as.binary_matrix (convert to binary matrices)

Transform the indicator_* functions to fit in the workflow

Currently there is no standard naming for the output of the indicator_* functions: we need that so the summary function can produce the right output for each of them (e.g. a table with values for numerical indicators, see #19).

Numerical indicators (e.g. variance, skewness, etc.)

In particular, all indicators that return "a single number" and not a complex object (e.g. a curve fit) should return EITHER :

a single numeric value (numeric vector of length 1) OR
a list with (but not necessarily limited to) a component named value

Here is a list of the numerical functions that need to be modified :

indicator_moran
indicator_skewness (/!\ [Alex] I'm currently cleaning so please wait a bit)
indicator_variance (/!\ [Alex] I'm currently cleaning so please wait a bit)
indicator_largestpatch

Non numerical indicators (e.g. patch-size distr, etc)

These functions should explicitely add a specific class to their output. For example, indicator_fitpsd sets the class of its output to psdfit.

There should be a summary method available for this class that will be called within the global summary of all the indicators. (e.g. summary.psdfit).

These ones are :

I might have forgotten some indicators so please add them below :o)

Create as.* functions (for output)

After running all the indicators the user is likely to want to export the results to a more standard object like a data.frame (so it can do beautiful plots in ggplot for example).

To do this we need functions that take the output object:

list(variance = list(value = xx, ...), # OR variance = xx
     skewness = list(value = xx, ...), 
     psd = `psd object`, 
     ...) 
(class: c('spatial_indicators', 'list'))

and convert it to something like this

> as.data.frame(indic_object) 

data.frame of x rows and x columns
         replicate value
variance 1            xx
variance 2            yy
variance 3            zz
skewness 1            aa
skewness 2            bb
(etc.)

class: c('data.frame', 'list')

Non numerical indicators cannot fit into this data structure so I suggest dicarding them.

[doc] Patch-based EWS workflow

Make sure the patch-based ews workflow is documented: the following functions should have their respective roxygen comments:

patchdistr_spews (main function)
- plot methods
- predict methods

Add functions that help a user choose a coarse-graining size

This function should print the ic_value ~ coarse_graining size plot so the user can decide on an optimal coarse-graining size.

When the input is several matrices along a gradient, then we don't know on which snapshot this curve should be displayed. Moreover they are not comparable across values of the gradient as density changes.

A rule of thumb could be using (1) something not too close to the tipping point and (2) with a density that is not too high.

Split r-spectrum and theta-spectrum in different functions

Make the package CRAN-compatible

If we plan to submit our package on the CRAN, we need to pass all the required automatic tests. CRAN reviewers are very strict about this so all NOTEs/WARNINGs/ERRORs must be squashed.

The package can be checked within R using the devtools::check() function. Pass the argument cran=TRUE to run a CRAN-like check.

Note that this is not to be confused with the unit-tests run using test(). What check() does is checking whether the package will be runnable on other computers and whether it follows the CRAN guidelines. (check() run the unit tests though and stops if those fail).

Example output:

* using log directory ‘<snip>/spatial_warnings.Rcheck’
* using R version 3.2.1 (2015-06-18)
<snip>
Status: 1 ERROR, 10 WARNINGs, 6 NOTEs
See

Error during installation of spatialwarnings package

Error:
unexpected end of input in spatial_indicators_main.R

Allow people to coarse grain when computing moran's indicator ?

print.generic_spews_summary assumes that summary object has pvalues

e.g. This code produces an error

ex <- summary(generic_spews(forestdat$matrices), null_replicates = 0)
print(ex)

Generic EWS Workflow

Make sure the generic ews workflow works: the following functions should be implemented and working.

Detrending breaks the layout of significance table for generic_spews

When using detrending, means can be negative due to floating point errors, so a minus sign is added in front of the number => an extra column is added and the layourt breaks. Similarly, plots show a bogus trend.

Solution : set means to zero when detrending has been used.

Reproducible example:

a <- generic_spews(forestdat$matrices, detrend = TRUE)
a.summary <- summary(a)

a.summary # broken layout
plot(a.summary) # broken mean trend

Add function for fractal indicators

input is a binary matrix or a list of binary matrices (using S3 methods)
See discussion at fdschneider/caspr#4

Add a warning when a matrix looks binary but it is not

Indicator functions should return a warning when a matrix looks like a binary matrix while not being one as it has big consequences on the way indicators are computed.

Maybe the right test could be length(unique(states)) <= 1

Funtion lbl to label patches in a matrix.

Change the structure of the fonction in the standard way as it was proposed by Alex.
Add the documentation

myfftshift_ews.R Matlab function

Add documentation

This function is called in the case of spectral analysis, right ?

Fractal geometric function

Find reference of the index.
Change the structure of the fonction in the standard way as it was proposed by Alex.
Add the documentation

Create as.binary_matrix.* functions (for input)

The package needs to do its best try at converting data to binary matrices. The best way to do this I think is to create a binary_matrix class and as.binary_matrix.* S3 methods :

as.binary_matrix.data.frame
as.binary_matrix.matrix (for non-binary matrices)
as.binary_matrix.list (for lists of stuff, determines what kind of objects the list is made of and apply the appropriate as.* function to them)
as.binary_matrix.ca_run (not to be defined in this package but in caspr)
?

Notes for the spectral density ratio

We should use the following ranges:

remove the zero frequency
low_range: lower 20%
high_range: higher 20%

Summary function / general workflow

Having all the single-indicator functions, we need a user-friendly function now that computes all the indicators and reports about their values. I suggest following the usual pattern of creating a new object class "shift_indicator" and add usual methods.

A user workflow could look this way (a bit similar to a glm workflow basically) :

convert original data to binary matrix(/ces) (use as.* function family)
run the indicators function to create the result object containing all the appropriate indicator values (naming is to be discussed)
using the summary function on the indicators object, see what indicators show an interesting value
plot diagnostics

What I have in mind for a summary function is something with an output like :

 > summary(glm(Sepal.Length ~ Sepal.Width + Species, data = iris))

Call:
glm(formula = Sepal.Length ~ Sepal.Width + Species, data = iris)

Deviance Residuals: 
     Min        1Q    Median        3Q       Max  
-1.30711  -0.25713  -0.05325   0.19542   1.41253  

Coefficients:
                  Estimate Std. Error t value Pr(>|t|)    
(Intercept)         2.2514     0.3698   6.089 9.57e-09 ***
Sepal.Width         0.8036     0.1063   7.557 4.19e-12 ***
Speciesversicolor   1.4587     0.1121  13.012  < 2e-16 ***
Speciesvirginica    1.9468     0.1000  19.465  < 2e-16 ***

---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Thoughts ? Ideas for the plot function ?

Add spatial indicator function : indicator_largest_patch

Use one of the templates provided in [1] to add a function that returns the largest patch size (with user documentation).

[1] https://github.com/fdschneider/spatial_warnings/blob/master/R/indicator_example.R

Add patch size distribution fitting

input should be a vector of patch sizes.
returns a list of fitting results for all candidate distributions and a general summary of the best fitting function. see issue fdschneider/caspr#2

Broken indicators

Some indicators will not run on the example datasets :

indicator_powerspectrum (cannot find function rspec_ews)
indicator_corrfunc (missing value where TRUE/FALSE needed)

2D bifurcation diagrams

Here are the bifurcation diagrams with two cross-sections (Figure in the same style as Flo's Fig. 2).

In top row plots, areas shaded in gray indicate that density of mussels/veg/trees is zero. Contour lines indicate the density.

I also put this figure in the dropbox with the corresponding code (in ./figures/bifurcation_diagrams)

Correlation function

Find reference of the index.
Change the structure of the fonction in the standard way as it was proposed by Alex.
Add the documentation

rspec_ews.R function to calculate r-spectrum and theta-spectrum.

Change the structure of the fonction in the standard way as it was proposed by Alex.
Add the documentation

Add patch label function

Factorize some code in ic_{variance|skewness|moran}

Most of the code is shared between those three functions, most of the code can be factorized. (not a high priority though)

Computation of skewness on zero-sd matrices

Right now skewness returns NA if the vector of values has a std. deviation of zero. It should return zero instead.

Coarse graining crashes R with subsize = 0

Add a check in coarse_grain() to catch this ? (or maybe do it at higher level to avoid exceptions in highly-repeated code).

[doc] Generic EWS workflow

Make sure the generic ews workflow is documented the following functions should have their respective roxygen comments:

generic_spews (main function)
- plot methods
indictest.generic_spews_* (assess significance)
- plot methods

Morans' I (spatial autocorrelation)

Find publication of the index. I have already the paper of Moran (1950) but it's quite hard to read. There are others publications on this index.
Change the structure of the fonction in the standard way as it was proposed by Alex.
Add the documentation

High-resolution results

The model results are starting to arrive !

Grazing model (~220 MB, ~3GB in memory)
Musselbed model
Forestgap model (~88MB)

As usual these files contain two variables: upper_branch and lower_branch. Both are lists of three components :

DatBif
snaps
time_series (new!) containing the full time series of global and local cover

access them with upper_branch[[1]] (or 2 or 3)

List of all the indicators that we want in the package

It would be great to have the list of all the indicator functions that we need for the package in this respository.

Spectral main function

Change the structure of the fonction in the standard way as it was proposed by Alex.
Add the documentation

Reduced matrix function

It seems a function to do coarsed grain

Change the structure of the fonction in the standard way as it was proposed by Alex.
Add the documentation

Patch-based EWS workflow