Giter VIP home page Giter VIP logo

Comments (9)

datawookie avatar datawookie commented on September 15, 2024

Hi @YonghuiDong, sorry for the delay in getting back to you on this. Can you please give me more specifics of what you have in mind? Perhaps point me to a feed and let me know what data you would like to get back? What do you mean by "partial" contents? How would you like to specify what portion of the contents you get back? Thanks, Andrew.

from feeder.

YonghuiDong avatar YonghuiDong commented on September 15, 2024

Thanks for your kind help. I would also like to fetch part of the contents from the link as well, e.g., the first 200 words of the contents from the link.

Current feedeR version only exports title, date and link from Rss feed.
feedeR:: feed.extract("https://feeds.feedburner.com/RBloggers")

The R package scifetch can export contents as well. description in the figure.
scifetch::getrss("https://feeds.feedburner.com/RBloggers")

Screen Shot 2020-04-12 at 13 01 30

The probeem with scifetch is that it does not supports RSS feeds of scientific journals very well. Therefore I am wondering if feedeR can do the similar job.

from feeder.

datawookie avatar datawookie commented on September 15, 2024

The probeem with scifetch is that it does not supports RSS feeds of scientific journals very well.

Understood.

Just so that I have an idea of what you are wanting to achieve, please can you point me to some examples of relevant "RSS feeds of scientific journals" and give me an idea of what you'd be looking for in the output.

from feeder.

datawookie avatar datawookie commented on September 15, 2024

Hi @YonghuiDong , I've added the description field to the RSS output. Changes are in the dev branch (commit hash 64f6113). Can you please test and let me know if that does what you want? Thanks, Andrew.

from feeder.

YonghuiDong avatar YonghuiDong commented on September 15, 2024

@datawookie Amazing, Thanks a lot Andrew!

I have tested the RSS of RBloggers, and the result is exactly what I need.

feedeR:: feed.extract("https://feeds.feedburner.com/RBloggers")

Screen Shot 2020-04-13 at 14 06 47

However, when I test the scientific journal, e.g. Nature Plants, the description are all NAs.

feed.extract("http://feeds.nature.com/nplants/rss/current")

Screen Shot 2020-04-13 at 14 02 58

Here is the RSS feed of Nature Plants: http://feeds.nature.com/nplants/rss/current. You can see from the link that there is a short summary for each article.

Below is an example of the output from feedly.com. The summary of the article is also included.

Screen Shot 2020-04-13 at 14 19 38

Is it possible that feedeR can fetch the summary as well?

Many scientific journals are non-open access journals, which require subscriptions. But the abstract of each article is available. It will be even better if feedeR can fetch the abstract of the article.

Thanks a lot again.

Yonghui

from feeder.

datawookie avatar datawookie commented on September 15, 2024

However, when I test the scientific journal, e.g. Nature Plants, the description are all NAs.

Yes, that's because it's an RDF feed and I implemented the fix for an RSS feed (based on the fact that the original example you provided me with was https://feeds.feedburner.com/RBloggers).

from feeder.

datawookie avatar datawookie commented on September 15, 2024

I've pushed updates for RDF feed to the dev branch. Please test and let me know.

from feeder.

YonghuiDong avatar YonghuiDong commented on September 15, 2024

@datawookie Hi Andrew,

I have tested several different journals, this version works very well. Thanks a lot!!

Yonghui

from feeder.

datawookie avatar datawookie commented on September 15, 2024

Great. I'll merge into the master branch once I hear back on the other pending issue.

from feeder.

Related Issues (16)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.