Comments (13)
Hi Luciano, did you have a problem with that feed?
from feeder.
Yes, and other feeds I tested here (brazillian portuguese)
All give the error: Unable to parse date.
other feeds:
http://feeds.folha.uol.com.br/mercado/rss091.xml
http://rss.uol.com.br/feed/economia.xml
from feeder.
Aha! Okay, busy working with this issue already. Have a look at Issue #2. German locale in that case.
from feeder.
Thanks! Your package is very helpful!
from feeder.
No problem. Should have these date issues resolved soon. I am going to bed now though. Sorry. I'll be back to fix this in 8 hours though.
from feeder.
Hi Luciano,
Okay, I am working on a local fix. Please try this out: feedeR_0.0.2.tar.gz.
Let me know how that goes.
Thanks,
Andrew.
from feeder.
Hi Luciano, I have resolved Issue #2. I think that you might find that the changes to the master branch will also resolve your problems. Let me know. Thanks, Andrew.
from feeder.
Thanks Andrew. It worked for most of the feeds. I still found a problem in one of them:
feed.extract("http://feeds.folha.uol.com.br/mercado/rss091.xml")
Error: Unable to parse date.
In addition: Warning message:
All formats failed to parse. No formats found.
Also, for the feeds that worked, Im having a problem with encoding (the feed is brazilian portuguese). Do you think itΒ΄s possible to resolve that?
Here is the sessionInfo()
R version 3.3.1 (2016-06-21)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Debian GNU/Linux stretch/sid
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8 LC_PAPER=en_US.UTF-8 LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
loaded via a namespace (and not attached):
[1] Rcpp_0.12.6 lubridate_1.5.6 XML_3.98-1.4 digest_0.6.10 dplyr_0.5.0.9000 withr_1.0.2
[7] assertthat_0.1 bitops_1.0-6 R6_2.1.2 DBI_0.5 git2r_0.15.0 magrittr_1.5
[13] httr_1.2.1.9000 stringi_1.1.1 curl_1.1 devtools_1.12.0 tools_3.3.1 stringr_1.0.0
[19] RCurl_1.95-4.8 feedeR_0.0.3 memoise_1.0.0 tibble_1.1
from feeder.
Hi Luciano,
I have added another date/time format to deal with that feed. Please install again from GitHub and confirm that this resolves your problem.
Unfortunately the encoding of these feeds lies outside of the scope of this package at present. I'm really just focusing on accessing the data from the feeds. It'd be tricky to try and cater for all possible encodings. I think that in the first instance you'd need to try and handle this on a feed-by-feed basis.
If, however, you have an idea for how this might be incorporated into the package, please let me know and I'll see what I can do.
Thanks,
Andrew.
from feeder.
Thanks Andrew,
The date worked perfectly now.
Regarding the encoding, maybe you can incorporate a parameter to the function call (feed.extract) and pass it to the XML package?
Im using a solution now that works using something like this:
xmlParse(getURL(url, .encoding = "ISO-8859-2"))
If you could do something like (feed.extract(url, encoding)) maybe it could help.
from feeder.
Okay, have a look at the repository now. I've added an encoding
argument to feed.extract()
.
from feeder.
Working perfectly now!
Thanks a lot Andrew. Your package will be very useful!!
from feeder.
Cool. Let me know if there are any other suggestions.
from feeder.
Related Issues (16)
- Google-News: feed.extract returns error HOT 2
- Not able to parse Date from Cnbc RSS feeds. HOT 1
- Space required after the Public Identifier error HOT 1
- Tag mismatches from Glassdoor feed HOT 1
- cannot parse Google allert RSS
- RSS without date HOT 2
- Feature request HOT 9
- NIH Reporter RSS feed error HOT 2
- Encoding issues HOT 17
- locale de_DE.UTF-8: feed.extract parse date error HOT 6
- Empty description in RSS
- Unable to feed.extract slashdot.com feed HOT 3
- Unable to parse date
- Getting Error while extracting reuters RSS feed. HOT 1
- feed.extract() is failing with reuters finance RSS feed. Please see below the error. HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from feeder.