medrxivr discussions from ropensci

medrxivr's Issues

Fix info on default date for mx_api_content() to reflect actual default date

According to the docs the default from_date is "2019-06-01"

Lines 6 to 7 in bdec02c

#' @param from_date Earliest date of interest. Defaults to 1st June 2019
#' (earliest medRxiv record was posted on 25th June 2019).

However, the actual default "from_date" is "2013-01-01".
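A quick way to check for this kind of mismatch is to read the default straight from the function's formal arguments. The sketch below uses a stand-in function, `demo_api_content()`, which is an assumption made only so the example runs without medrxivr installed; it is not the real `mx_api_content()`.

```r
# Sketch: compare a function's documented default with its actual default.
# demo_api_content() is a stand-in defined here for illustration only;
# it is NOT the real medrxivr::mx_api_content().
demo_api_content <- function(from_date = "2013-01-01",
                             to_date = as.character(Sys.Date())) {
  invisible(NULL)
}

# formals() returns the argument list of a function, including defaults.
actual_default <- formals(demo_api_content)$from_date
documented_default <- "2019-06-01"

# FALSE here means the documentation and the code disagree.
defaults_agree <- identical(actual_default, documented_default)
```

With the package installed, the same check can be run as `formals(medrxivr::mx_api_content)$from_date`.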

Add option to automatically search for both capitalised and uncapitalised versions of terms

For example,

mx_search("dementia", auto_caps = TRUE)

would find both "Dementia" and "dementia".
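One possible way to implement this, assuming search terms are treated as regular expressions, is to replace the first letter of the term with a character class covering both cases. `both_cases()` below is a hypothetical helper written for illustration, not an existing medrxivr function.

```r
# Hypothetical helper (not a medrxivr function): build a regular expression
# that matches a term whether or not its first letter is capitalised.
both_cases <- function(term) {
  first <- substr(term, 1, 1)
  rest <- substr(term, 2, nchar(term))
  # Only wrap letters; leave terms starting with digits or symbols untouched.
  if (!grepl("^[[:alpha:]]$", first)) {
    return(term)
  }
  paste0("[", toupper(first), tolower(first), "]", rest)
}

pattern <- both_cases("dementia")
titles <- c("Dementia risk factors", "Vascular dementia", "Lipid levels")
matches <- grepl(pattern, titles)
```

An `auto_caps = TRUE` argument could then apply a helper like this to every term before searching.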

mx_caps() prevents wildcards from being properly handled.

results <- medrxivr::mx_search(medrxivr::mx_snapshot(),
                               medrxivr::mx_caps("mendelian randomi*ation"))

mx_api_content() fails if the last page doesn't contain any records

This seems to be due to the fact that the number of records given by the "total" metadata is more than the total number of records actually available.

As of 14:39 on 04/01/2021, the "total" metadata field reports 148231 records. However, if you set the counter to any record within 31 of this figure (e.g. https://api.biorxiv.org/details/biorxiv/2013-01-01/2021-01-04/148201), you get a "No posts found" message. Because medrxivr uses the "total" field to work out how many pages it needs to cycle through to download the whole database, this sometimes causes an error when the last page it expects, based on "total", is empty.

Note that as more records are added to the API, the hard-coded figures above will no longer demonstrate the issue.
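A minimal sketch of one way to make the paging loop tolerant of this: stop as soon as a page comes back empty, rather than trusting "total". `fetch_page()` is a stand-in that simulates the API with made-up numbers; neither it nor `download_all()` is medrxivr code.

```r
# Sketch only: fetch_page() simulates an API request and is NOT a medrxivr
# or bioRxiv API function. It returns up to 100 records starting after
# `cursor`, or an empty data frame once the available records run out.
fetch_page <- function(cursor, available = 250) {
  if (cursor >= available) {
    return(data.frame(id = integer(0)))
  }
  data.frame(id = seq(cursor + 1, min(cursor + 100, available)))
}

download_all <- function(reported_total, page_size = 100) {
  pages <- list()
  cursors <- seq(0, reported_total - 1, by = page_size)
  for (cursor in cursors) {
    page <- fetch_page(cursor)
    # Stop on an empty page instead of assuming "total" is exact.
    if (nrow(page) == 0) {
      break
    }
    pages[[length(pages) + 1]] <- page
  }
  do.call(rbind, pages)
}

# The reported total (331) overstates the 250 records actually available,
# so the fourth page is empty and the loop ends early without an error.
records <- download_all(reported_total = 331)
```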

Fix spelling

Requested by editor:

ropensci/software-review#380

Run styler

Requested by editor:

ropensci/software-review#380

Release medrxivr 0.0.3

Prepare for release:

Submit to CRAN:

usethis::use_version('major')
devtools::submit_cran()
Approve email

Wait for CRAN...

Unable to resolve r-lib/actions@master in GitHub Actions

Description

While preparing the workflow directory and required actions in a GitHub Actions workflow, the following error is encountered:

Error: Unable to resolve action `r-lib/actions@master`, unable to find version `master`

Impact

This prevents the workflow from running.

Fix image location for pkgdown

Via https://ropensci.r-universe.dev/ui#builds we see

Missing images in 'README.md': 'articles/data_sources.png'
ℹ pkgdown can only use images in 'man/figures' and 'vignettes'

Cross-link to rbiorxiv in README

nicholasmfraser/rbiorxiv#1

NEAR operator limited to 9 terms

Any NEAR distance of 10 or more (NEAR10 and above) fails.
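I have not checked how medrxivr parses NEAR internally, but one plausible cause is that only a single digit is captured for the distance. The sketch below shows a parser that accepts any number of digits. `expand_near()` is hypothetical, handles one direction only (first term before second), and is not the package's implementation.

```r
# Hypothetical parser (not medrxivr code): expand "termA NEARn termB" into a
# regular expression allowing up to n words between termA and termB.
expand_near <- function(query) {
  # [0-9]+ captures one or more digits, so NEAR10, NEAR25, ... are accepted.
  parts <- regmatches(
    query,
    regexec("^(.+?)\\s+NEAR([0-9]+)\\s+(.+)$", query, perl = TRUE)
  )[[1]]
  if (length(parts) == 0) {
    return(query)  # no NEAR operator found; return the query unchanged
  }
  n <- as.integer(parts[3])
  paste0(parts[2], "(\\s+\\S+){0,", n, "}\\s+", parts[4])
}

pattern <- expand_near("systematic NEAR12 review")
found <- grepl(pattern, "a systematic and rather long literature review",
               perl = TRUE)
```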

Server down

It seems like the bioRxiv/medRxiv APIs are down entirely. I'm experiencing the following error with every request:
Error : (2002) Connection refused

Can you confirm this? All examples provided in the docs (https://api.biorxiv.org) are failing.
For example: https://api.biorxiv.org/details/biorxiv/2018-08-21/2018-08-28/45 results in the described error.

Are you experiencing the same? Posting here, since it might affect your entire package.


Adding Altmetrics or download number to the search function

Wondering if there's an option to add altmetrics to the search function, and also the number of times a preprint's PDF has been downloaded.

Add function to report number of "hits" per individual search term.

It is often useful to see the number of "hits" (records returned) for each individual element of the search, so that when designing the search strategy you can interrogate which elements influence the returned records the most. For example, if the search is:

topic1 <- c("dementia", "Alzheimer's")  # Combined with Boolean "OR"
topic2 <- c("lipids", "cholesterol")    # Combined with Boolean "OR"

query <- list(topic1, topic2)           # Combined with Boolean "AND"

results <- mx_search(mx_snapshot(), query)

Then passing query to the proposed mx_reporter() function would return something like the below:

# Total number of records found by your search: XX

# Total topic 1 records: XX
# - dementia: XX
# - Alzheimer's: XX

# Total topic 2 records: XX
# - lipids: XX
# - cholesterol: XX
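A rough sketch of how such a function might count hits, using toy data rather than a real snapshot. `mx_reporter_sketch()` and the `toy` data frame are made up for illustration. Matching here is a plain, case-sensitive substring search, which may differ from how `mx_search()` matches terms.

```r
# Hypothetical sketch of the proposed reporter; not an existing medrxivr
# function. `data` needs character columns `title` and `abstract`; `query`
# is a list of character vectors (terms ORed within a topic, topics ANDed).
mx_reporter_sketch <- function(data, query) {
  text <- paste(data$title, data$abstract)
  per_topic <- lapply(query, function(terms) {
    # One logical vector per term: which records mention it?
    hits <- lapply(terms, function(term) grepl(term, text, fixed = TRUE))
    names(hits) <- terms
    hits
  })
  # A record matches a topic if it matches any of its terms (Boolean OR).
  topic_match <- lapply(per_topic, function(hits) Reduce(`|`, hits))
  # A record matches the search if it matches every topic (Boolean AND).
  overall <- Reduce(`&`, topic_match)
  list(
    total = sum(overall),
    topic_totals = vapply(topic_match, sum, integer(1)),
    term_counts = lapply(per_topic, function(hits) {
      vapply(hits, sum, integer(1))
    })
  )
}

toy <- data.frame(
  title = c("dementia and lipids", "cholesterol in Alzheimer's disease",
            "dementia care pathways", "statins and cholesterol"),
  abstract = c("a cohort study", "a case series", "a review", "a trial"),
  stringsAsFactors = FALSE
)

query <- list(c("dementia", "Alzheimer's"), c("lipids", "cholesterol"))
report <- mx_reporter_sketch(toy, query)
```

Here `report$total` is the number of records matching the full search, `report$topic_totals` gives one count per topic, and `report$term_counts` gives one count per term.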

Error in count/100 : non-numeric argument to binary operator

Session Info

 setting  value
 version  R version 4.2.2 (2022-10-31)
 os       macOS Ventura 13.2.1
 system   x86_64, darwin17.0
 ui       RStudio
 language (EN)
 collate  en_US.UTF-8
 ctype    en_US.UTF-8
 tz       America/Toronto
 date     2024-03-20
 rstudio  2023.09.1+494 Desert Sunflower (desktop)
 pandoc   NA

> mx_api_content(
+     from_date = "2013-01-01",
+     to_date = as.character(Sys.Date()),
+     clean = TRUE,
+     server = "medrxiv",
+     include_info = FALSE
+ )
Error in count/100 : non-numeric argument to binary operator
> mx_data <- mx_api_content(from_date = "2020-01-01",
+                           to_date = "2020-01-07")
Error in count/100 : non-numeric argument to binary operator
> if(interactive()){
+     mx_data <- mx_api_content(from_date = "2020-01-01",
+                               to_date = "2020-01-07")
+ }
Error in count/100 : non-numeric argument to binary operator

> preprint_data <- mx_api_content(server = "biorxiv")
Error in count/100 : non-numeric argument to binary operator
> preprint_data <- mx_api_content()
Error in count/100 : non-numeric argument to binary operator

Offhand, I can see that this is caused by a bad value of count: the message means that count is not numeric by the time it reaches count/100. I believe the same error has been included automatically in the docs.
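For what it's worth, a quick base-R check shows which kinds of value raise this particular message. NA, NULL and 0 divide without an error; a character string or a list does not. So one possibility (an assumption, since I have not inspected the API response) is that count arrives as a non-numeric type. `divides_ok()` and `safe_page_count()` are hypothetical helpers, not medrxivr code.

```r
# Base R only. Does `count / 100` run without an error for a given value?
divides_ok <- function(count) {
  tryCatch({
    count / 100
    TRUE
  }, error = function(e) FALSE)
}

ok_zero <- divides_ok(0)           # numeric: no error
ok_na <- divides_ok(NA)            # logical NA is coerced: no error
ok_null <- divides_ok(NULL)        # gives numeric(0): no error
ok_chr <- divides_ok("1500")       # character: non-numeric argument error
ok_list <- divides_ok(list(1500))  # list: non-numeric argument error

# Hypothetical guard: coerce and validate the count before dividing.
safe_page_count <- function(count, page_size = 100) {
  count <- suppressWarnings(as.numeric(count))
  if (length(count) != 1 || is.na(count)) {
    stop("API did not return a usable record count")
  }
  ceiling(count / page_size)
}

n_pages <- safe_page_count("1500")
```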

I am guessing either something changed server-side or in a dependency that invalidated the lib's logic. I am writing a Python version now; will let you know if I find the problem and solution.

pkgdown is failing

The pkgdown GitHub action is failing, but the Jenkins version works fine. It gives the error message:

-- Building function reference -------------------------------------------------
Error in check_missing_topics(rows, pkg) : 
  Topics missing from index: medrxivr

Remove the pkgdown.yml GitHub actions file.

 Error in check_missing_topics(rows, pkg) : 
  All topics must be included in reference index
• Missing topics: medrxivr, mx_caps

Note that for topics you do not want to include in the index you can create an "internal" section https://pkgdown.r-lib.org/reference/build_reference.html?q=internal#missing-topics

You can also use the @keywords internal tag and redocument for, say, the package manual page.

To check all topics are listed, after editing the configuration file you can run pkgdown::check_pkgdown().
