r-spatialecology / shar

:package: Analyse species-habitat associations in R

Home Page: https://r-spatialecology.github.io/shar

License: GNU General Public License v3.0

R 100.00%

habitat-association landscape-ecology point-pattern-analysis spatial-analysis

shar's People

r4gis kant kundyyy davidvanegal lionel68 tretherington zekemarshall rubak spatstat-revdep chriswudel

shar's Issues

Using classes

Maybe instead of checking if objects are suitable, introduce classes?

Reproduce point pattern to a larger window-extent

Hello,

I would like to reconstruct a point pattern from small tree-plot data for a larger area (e.g. from 0.3 ha to 1 ha).
For this, I tried reconstruct_pattern_marks(). However, I learned that it produces a pattern with the same number of points.
Do you have any advice on which function I can use for this? I wonder if there is an option to vary the number of points and the window extent of the reconstruction.

Thanks,
kiranaw

Add arguments to shift torus translation map only n cells

Better color palette for plot_randomized_raster()

Use viridis scale

Use envelope style of plotting

Community guidelines

Regarding @openjournals/joss-reviews#3811: JOSS requires "clear guidelines for third parties wishing to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support", so I think you need to add something to the README to explain that.

I think I also forgot to include this for my JOSS paper 😳, so if it is helpful you could look at my virtualNicheR package to see how I included that information.

Update pcf

Check if https://zenodo.org/record/8026693 can be used to optimize reconstruction

Homepage

Can you please do your homepage magic @marcosci? I started with a .yaml file but am not quite sure how to build it.

Paper typos

Regarding @openjournals/joss-reviews#3811, I think there are a couple of typos that you could correct:

Line 41: environmental conditions need to be broken by randomizing the data as a null model
Line 45: The first approach to simulate null model data, is to randomize the environmental data

select_species

Not sure if the function is really needed or just an old artifact of older package structure

Test data examples potentially confusing?

Regarding @openjournals/joss-reviews#3811 I'm wondering if you could improve the test data examples slightly to avoid any confusion.

Ideally, I think I should be able to run the test data and get exactly the same results to confirm everything is working properly. This is a bit challenging for you, as the software uses data randomisation, so you are unlikely to get exactly the same results. However, I think it would be helpful if the conclusions drawn from the results stay the same.

Everything seems to be fine with

results_habitat_association(pattern = species_a, raster = torus_trans)
## > Input: randomized raster | Quantile thresholds: negative < 0.025 - positive > 0.975
##   habitat count lo hi significance
## 1       1     9  2 14         n.s.
## 2       2    25  9 24     positive
## 3       3    27 11 26     positive
## 4       4     0 11 29     negative
## 5       5    12  5 18         n.s.

for which I get identical output in R.

But for

results_habitat_association(pattern = reconstruction, raster = landscape_classified)
## > Input: randomized point pattern | Quantile thresholds: negative < 0.025 - positive > 0.975
##   habitat count    lo    hi significance
## 1       1     8  1.00 23.35         n.s.
## 2       2    22 25.85 49.10     negative
## 3       3    33 40.80 68.05     negative
## 4       4    19 44.00 71.10     negative
## 5       5   118 20.85 60.05     positive

I get the following result

results_habitat_association(pattern = reconstruction, raster = landscape_classified)
> Input: randomized point pattern | Quantile thresholds: negative < 0.025 - positive > 0.975
  habitat count    lo    hi significance
1       1     8 27.90 48.10     negative
2       2    22 52.90 77.05     negative
3       3    33 29.00 57.30         n.s.
4       4    19 24.90 46.10     negative
5       5   118 11.85 26.00     positive

that has quite different lo and hi values, and different significance results for habitat classes 1 and 3.

I am assuming that all is running correctly, but I feel a bit nervous that my conclusion is different to that of the test data example.

I would suggest you:

Double-check that the output in the README is correct and up to date. I suppose there is a chance you changed the test data but forgot to update the README output.
Consider setting the random seed with set.seed() as part of the example code so that the results will be the same.
Perhaps manipulate the test data such that, while the exact numbers may vary, at least the significance of the results is consistent.
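
The second suggestion can be sketched in base R, so it runs without shar installed. Note that randomized_counts() is a hypothetical stand-in for a randomization step and is not a shar function; the seed value is arbitrary.

```r
# Hypothetical stand-in for a randomization step (not a shar function):
# assigns n_points to habitats at random and counts the points per habitat.
randomized_counts <- function(n_points = 200, n_habitats = 5) {
  habitats <- sample(seq_len(n_habitats), size = n_points, replace = TRUE)
  table(factor(habitats, levels = seq_len(n_habitats)))
}

# Setting the same seed before each run makes the random draws repeatable.
set.seed(42)
run_1 <- randomized_counts()

set.seed(42)
run_2 <- randomized_counts()

# TRUE: both runs produced exactly the same counts.
identical(run_1, run_2)
```

In a README example the same idea would presumably mean calling set.seed() once before the randomization function, so that readers can compare their output with the printed table.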

Clear statement of dependencies

Regarding @openjournals/joss-reviews#3811: JOSS requires a "clearly-stated list of dependencies". At the moment you state in the README that "shar is mainly based on the spatstat (Baddeley et al. 2015) and raster (Hijmans 2017) package", but I think you should be specific, as shar also requires the Rcpp package.

I would suggest listing these somewhere, perhaps within the installation section.

I have spotted that you have an open issue about whether to include the rcpp requirement. This is very much a style thing, but I prefer to have absolutely minimal package dependencies to make maintenance easier. So you could help resolve this issue by removing that rcpp dependency - but a clear list of the remaining dependencies would still be helpful I think 😄

Use inherits() to test class inheritance

e.g.,

shar/R/calculate_energy.R

Line 61 in f4788a9

if (!class(pattern) %in% c("rd_pat", "rd_mar")) {

should be instead

if (!inherits(pattern, c("rd_pat", "rd_mar"))) {

Reference: https://developer.r-project.org/Blog/public/2019/11/09/when-you-think-class.-think-again/

This is part of openjournals/joss-reviews#3811
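
A small base R illustration of why the two checks differ. The object below is a toy with two classes, built by hand for this sketch; it is not created by shar.

```r
# A toy object with more than one class, as S3 objects often have.
pattern <- structure(list(), class = c("rd_pat", "list"))

# class() returns a character vector, so %in% yields one logical per class.
# The result has length 2 and is therefore not a valid condition for if().
class_check <- !class(pattern) %in% c("rd_pat", "rd_mar")
length(class_check)

# inherits() always returns a single TRUE or FALSE, whatever the number
# of classes, so it is safe to use inside if().
inherits_check <- !inherits(pattern, c("rd_pat", "rd_mar"))
length(inherits_check)
```

Recent R versions raise an error when the condition of if() has length greater than one, which is why the class() comparison can break for objects with several classes.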

Check if ppp and raster have same extent

Makes sense in results_habitat_association()
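
One possible shape for such a check, as a base R sketch. The helper same_extent() and the hand-typed bounding boxes are assumptions for illustration; in practice the values could come from the window of the ppp object and the extent of the raster.

```r
# Hypothetical helper (not part of shar): compares two bounding boxes given
# as c(xmin, xmax, ymin, ymax), allowing small floating-point differences.
same_extent <- function(bbox_pattern, bbox_raster, tolerance = 1e-8) {
  stopifnot(length(bbox_pattern) == 4, length(bbox_raster) == 4)
  isTRUE(all.equal(unname(bbox_pattern), unname(bbox_raster),
                   tolerance = tolerance))
}

# Assumed inputs: for a spatstat ppp object the values could be taken from
# pattern$window$xrange and pattern$window$yrange, for a raster object from
# raster::extent(). Here they are typed in by hand.
bbox_pattern    <- c(0, 1000, 0, 1000)
bbox_raster_ok  <- c(0, 1000, 0, 1000)
bbox_raster_bad <- c(0, 500, 0, 1000)

same_extent(bbox_pattern, bbox_raster_ok)   # extents match
same_extent(bbox_pattern, bbox_raster_bad)  # extents differ
```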

Annealing

Keep relocated pattern even if energy did not decrease in a few cases if e.g. annealing = 0.001
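
A minimal sketch of such an acceptance rule, assuming that "annealing" is the probability of keeping a relocation that did not lower the energy. The function accept_relocation() is hypothetical and does not describe how shar implements this.

```r
# Hypothetical acceptance rule (not shar's implementation): a relocation is
# always kept if it lowers the energy, and otherwise kept with probability
# `annealing`.
accept_relocation <- function(energy_old, energy_new, annealing = 0.001) {
  if (energy_new < energy_old) {
    return(TRUE)
  }
  stats::runif(1) < annealing
}

set.seed(1)

# An improvement is always kept.
accept_relocation(energy_old = 0.5, energy_new = 0.4)

# Share of worse relocations that are kept anyway; on average this is
# close to the annealing probability.
mean(replicate(10000, accept_relocation(0.5, 0.6, annealing = 0.001)))
```

Occasionally accepting a worse pattern is the usual way for this kind of optimization to escape local minima; whether a fixed probability or a decreasing one fits better is left open here.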

Rcpp

rcpp_sample is a little bit faster than sample, but maybe that is not worth the Rcpp dependency, since it's the only function?

Integrate reconstruct_pattern_multi

This issue tracks all steps to integrate the new reconstruct_pattern_multi function into the package. For this, I created a new branch called changes_multi.

Update NEWS and cran-comments
Update CITATION
Update README and Homepage with new citation info
Update DESCRIPTION (Author, version, imports)
Make sure file structure is consistent (e.g., separate internal function file for each function)
Run R-CMD-Check
Update testthat suite
Maybe add vignette?

calculate_energy for reconstruct_marks

Calculate_energy and calculate_mean_energy

Better use of functions is needed, e.g. wrapper using plot_randomized_pattern and calculate_energy

Fix Deploy pkgdown

Clarify expectations around NA data in the raster landscape

Regarding openjournals/joss-reviews#3811, I think there should be a note somewhere, perhaps in the README or the paper (maybe both), on how NA data in the raster landscape is expected to be handled.

All the examples and supporting papers cited seem to work with landscapes that are "complete" in that points can occur anywhere within the unit-square for the point-process models and there is habitat everywhere within the raster landscape. But I suspect that in reality it will be quite common for the study area to be a more complex shape than a rectangle, and therefore there will be NA cells within the raster landscape in which points cannot occur. So I am left wondering if it is theoretically reasonable to use the methods in shar with that kind of data. Essentially I am querying how NA data in a raster landscape is handled.

If shar can't handle NA data that is absolutely fine, but I do think it should be highlighted to potential users if that will preclude use of the package so that they can quickly eliminate shar as an option if it won't work for them.

Apologies if I have missed that information somewhere, but if it does exist, perhaps it could be made more prominent as I think that could have a massive bearing for potential users.

Broken link in README

"Both functions return a list with randomized rasters and the observed one. For more information on the methods, please click here."

The here link currently doesn't go anywhere.

Bring reconstruct_pattern functions under one

The four reconstruct_pattern functions could be brought together into one, where the user specifies the type of reconstruction wanted. The function could then look like:

reconstruct_pattern(pattern, type = c("homo", "hetero", "cluster", "mark"))
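
The proposed signature could be wired up with match.arg() and switch(), as in the sketch below. The four worker functions are placeholders that only return a label; their names are assumptions and they do not stand for the real shar functions.

```r
# Placeholders for the four existing reconstruction routines (names are
# assumptions for this sketch; each one just returns a label).
reconstruct_homo    <- function(pattern, ...) "homogeneous reconstruction"
reconstruct_hetero  <- function(pattern, ...) "heterogeneous reconstruction"
reconstruct_cluster <- function(pattern, ...) "cluster reconstruction"
reconstruct_mark    <- function(pattern, ...) "mark reconstruction"

# Single entry point: match.arg() validates `type` and falls back to the
# first choice ("homo") if the argument is not supplied.
reconstruct_pattern <- function(pattern,
                                type = c("homo", "hetero", "cluster", "mark"),
                                ...) {
  type <- match.arg(type)
  switch(type,
         homo    = reconstruct_homo(pattern, ...),
         hetero  = reconstruct_hetero(pattern, ...),
         cluster = reconstruct_cluster(pattern, ...),
         mark    = reconstruct_mark(pattern, ...))
}

reconstruct_pattern(pattern = NULL)                  # default type
reconstruct_pattern(pattern = NULL, type = "mark")   # explicit type
```

An unknown type, e.g. type = "foo", is rejected by match.arg() with an error, so no extra input check is needed for that argument.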

Simulate pattern each n_random

Put simulated <- ... into n_random loop

Allow to provide lists of pattern to reconstruct_marks

Why a point pattern approach to habitat analyses

Regarding @openjournals/joss-reviews#3811 requirement for a statement of need, I am not aware of any point pattern software in R that allows someone to do habitat analyses, so in that regard I am happy that this software fills a gap.

However, I think it would be helpful for me to understand if there are situations where this approach is beneficial over the multitude of other ways I could take xy point data and a raster and produce some sort of habitat/species distribution/ecological niche model.

I don't feel like I can articulate this question very well, so I do apologise for that 😬, and I haven't had time to fully read and digest the cited papers, but I think it could be very helpful to potential users to understand any specific instances where these kinds of approaches are beneficial when compared to other similar presence-background approaches available in R packages such as dismo, biomod2, zoon, adehabitat, and the like.

I guess I would say that at the moment I don't see what the benefit is of treating my xy location data as a point pattern and analysing it through shar.

I am mentioning this as if you can highlight that benefit somehow, then I think you will increase the likelihood of ecologists engaging with shar.

Include links to citations in README

Regarding @openjournals/joss-reviews#3811 I would have found it helpful to have links to the citations in the README. Any chance you could please add the missing links you have in the paper to the README as well? I think people who find your package via GitHub rather than via JOSS would find that helpful.

Quantum plot

Add a quantum plot style to plot_randomized_pattern(x, method = "sf").

Torus for point pattern

Technically it should be possible to torus translate the point pattern as well
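
For a rectangular window, the idea amounts to shifting all coordinates by the same offset and wrapping them around the window edges. The helper torus_shift_points() below is a hypothetical base R sketch working on plain coordinate vectors, not on ppp objects.

```r
# Hypothetical helper (not part of shar): shifts points by a fixed offset
# and wraps them around the edges of a rectangular window.
torus_shift_points <- function(x, y, xrange, yrange, shift_x, shift_y) {
  width  <- diff(xrange)
  height <- diff(yrange)
  list(
    x = xrange[1] + (x - xrange[1] + shift_x) %% width,
    y = yrange[1] + (y - yrange[1] + shift_y) %% height
  )
}

shifted <- torus_shift_points(x = c(1, 5, 9), y = c(2, 2, 8),
                              xrange = c(0, 10), yrange = c(0, 10),
                              shift_x = 3, shift_y = 4)
shifted$x  # 4 8 2 (the third point wraps around the right edge)
shifted$y  # 6 6 2 (the third point wraps around the top edge)
```

Because every point receives the same shift, the distances between points are preserved apart from pairs separated by the wrap-around, which is what keeps the spatial structure of the pattern largely intact.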

Allow r for all reconstruction functions

Better use of classes

Save used method (e.g. fitted or reconstructed)
Better printing
Save energy
Save stopping criterion

Ideas future updates

In case you have ideas for future updates, simply post them here!

simplify argument for all functions that would return only one element (e.g. reconstruct_pattern() with n_random = 1 and return_input = FALSE)
add mark-correlation reconstruction
stop pattern reconstruction if energy doesn't change (or only very little) for n iterations
Make a difference between verbose (for warnings) and progress printing

list_to_randomized

Remember to add list_to_randomized() to NEWS.md

Stop reconstruction if energy levels off
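
One way such a stopping rule might look, as a base R sketch. The function energy_levelled_off() and its arguments n and tolerance are assumptions for illustration and do not exist in shar.

```r
# Hypothetical stopping rule: TRUE if the relative change in energy over the
# last `n` iterations is below `tolerance`.
energy_levelled_off <- function(energy, n = 100, tolerance = 1e-4) {
  if (length(energy) <= n) {
    return(FALSE)  # not enough iterations yet to judge
  }
  recent <- utils::tail(energy, n + 1)
  # Guard against division by zero if the energy is already 0.
  reference  <- max(abs(recent[1]), .Machine$double.eps)
  rel_change <- abs(recent[1] - recent[length(recent)]) / reference
  rel_change < tolerance
}

# Made-up energy trace: decreasing for 50 iterations, then flat for 20.
energy <- c(seq(1, 0.1, length.out = 50), rep(0.1, 20))

energy_levelled_off(energy[1:50], n = 10)  # still decreasing
energy_levelled_off(energy, n = 10)        # flat over the last 10 iterations
```

Inside a reconstruction loop, a check like this could be evaluated every iteration, or every few iterations, and trigger a break once it returns TRUE.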

ads package

Use ads package for pcf

Provide example pattern to derive characteristics to reconstruct

README UTF code

There seems to be a problem with the references, probably an issue with UTF codes

Calculation of energy

Introduce features to weight different summary functions or mean them
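
A sketch of what weighting could mean here, assuming the energy is available separately for each summary function. The helper combine_energy(), the names gest and pcf, and the numbers are made up for illustration.

```r
# Hypothetical helper: combines the energies of several summary functions
# into one value, using equal weights unless others are supplied.
combine_energy <- function(energies, weights = rep(1, length(energies))) {
  stopifnot(length(energies) == length(weights),
            all(weights >= 0), sum(weights) > 0)
  stats::weighted.mean(energies, w = weights)
}

# Made-up energies for two summary functions.
energies <- c(gest = 0.02, pcf = 0.08)

combine_energy(energies)                     # plain mean: 0.05
combine_energy(energies, weights = c(1, 3))  # pcf counts three times: 0.065
```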

Add warning to fit_point_process() if marked

Heterogeneous reconstruction

Start with heterogeneous pattern

Create plot methods for the shar object

Regarding issue openjournals/joss-reviews#3811, it would be easiest if plot methods were created for the objects derived from shar (i.e. rd_pat), so basically instead of having to call:

pattern_random <- fit_point_process(species_a, n_random = 19, process = "cluster")
plot_randomized_pattern(pattern_random)

One could simply do:

plot(pattern_random)

This could be achieved by turning the plot_* functions into plot.rd_pat and plot.rd_ras, check http://adv-r.had.co.nz/S3.html for instance.
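
A minimal sketch of the S3 approach. The plot_randomized_pattern() defined here is only a placeholder so the example runs without shar, and the object is a hand-built toy carrying the rd_pat class.

```r
# Placeholder for the existing plotting function; the real
# plot_randomized_pattern() lives in shar and draws the figure.
plot_randomized_pattern <- function(pattern, ...) {
  message("plotting ", length(pattern), " patterns")
  invisible(pattern)
}

# S3 method: plot() dispatches here for objects of class "rd_pat".
plot.rd_pat <- function(x, ...) {
  plot_randomized_pattern(x, ...)
}

# Toy object carrying the class, for illustration only.
pattern_random <- structure(list(a = 1, b = 2), class = "rd_pat")

# The generic finds plot.rd_pat() through the class attribute.
plot(pattern_random)
```

Inside a package, the method would additionally need to be registered, for example with an S3method(plot, rd_pat) entry in the NAMESPACE file.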

Own function for core reconstruction

Use internal function for core reconstruction

Don't plot every iteration

Leads to crashes

Add precision in DESCRIPTION file

Regarding openjournals/joss-reviews#3811, shar deals with point pattern data linked to categorical (discrete) environment variables. This information on the data requirements should be in the DESCRIPTION file. Maybe:

Therefore, information about the location of the species (as a point pattern) is needed together with environmental conditions (as a categorical raster).

Support/switch to terra

Probably worth supporting terra soon, or even making terra the default and allowing raster as a legacy mode

Warning in reconstruct_pattern_homo

Return warning if mean(pcf) is above threshold because clustered
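
A sketch of such a check in base R. Values of the pair correlation function around 1 indicate complete spatial randomness and values clearly above 1 indicate clustering; the helper warn_if_clustered(), the threshold of 1.5, and the example values are assumptions for illustration.

```r
# Hypothetical check (not part of shar): warns if the mean of the supplied
# pair correlation values is above `threshold`.
warn_if_clustered <- function(pcf_values, threshold = 1.5) {
  mean_pcf <- mean(pcf_values, na.rm = TRUE)
  if (mean_pcf > threshold) {
    warning("Mean pcf (", round(mean_pcf, 2), ") is above ", threshold,
            ": the pattern appears clustered.", call. = FALSE)
  }
  invisible(mean_pcf)
}

# Made-up values close to 1: no warning.
warn_if_clustered(c(0.9, 1.0, 1.1))

# Made-up values well above 1: the warning is caught and its text returned.
tryCatch(warn_if_clustered(c(2.5, 2.0, 1.5)),
         warning = function(w) conditionMessage(w))
```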

Combine reconstruct_* functions

The functions to reconstruct have a lot of repeated code, so could be combined

Streamline code starting from if(return_input){ for all randomization

Axis in pair correlation plots

This is related to openjournals/joss-reviews#3811. The pair correlation plots from plot_reconstructed_pattern sometimes show axes that do not span the whole range of the data:

pp <- spatstat.core::rLGCP(win=spatstat.geom::owin(c(0, 10), c(0, 10)))
ff <- fit_point_process(pp)
plot_reconstructed_pattern(ff)

Include breaks into results_habitat_association

@ZekeMarshall had the idea to include the breaks in addition to the classes in the results_habitat_association() table.

That's a good idea. However, the breaks are not necessarily included in the provided raster object. Maybe add an argument (default NA or NULL) that allows providing breaks in the same order as the classes. In this case, the breaks from classify_habitats() could be used. Or maybe a data.frame, which would allow merging classes with breaks.

Related: maybe write a short helper function to convert the classIntervals object to a data.frame.
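
A sketch of how such a helper and the merge could look in base R. The function breaks_to_df(), the column names breaks_lo and breaks_hi, and all numbers are made up for illustration; the breaks are passed as a plain numeric vector so the example does not depend on classInt.

```r
# Hypothetical helper: turns a numeric vector of class breaks into a
# data.frame with one row per habitat class.
breaks_to_df <- function(breaks) {
  n_classes <- length(breaks) - 1
  data.frame(habitat   = seq_len(n_classes),
             breaks_lo = breaks[-length(breaks)],
             breaks_hi = breaks[-1])
}

# Made-up breaks; a classIntervals object stores its breaks in $brks.
breaks <- c(0, 0.2, 0.5, 1)

# Made-up results table with one row per habitat class.
result <- data.frame(habitat = 1:3, count = c(9, 25, 27))

# Attach the lower and upper break of each class to the results.
merged <- merge(result, breaks_to_df(breaks), by = "habitat")
merged
```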