Comments (5)
Some of its numbers, for example DP.DPL for Depreciation.Depletion, don't appear to come from the correct source. In fact, I'm unsure where it's getting those numbers from. Checking the adjoining rows doesn't make it clear that some off-by-one error is assigning LE's values elsewhere, but it's possible that it's incremented sufficiently by that point that the "off-by-one" error is pretty enormous. That, and logically then the last X rows should essentially be lacking data. Something that isn't true.
Will look into this more deeply tomorrow.
from qmj.
Playing with it now.
So taking a small sample of ~100 companies around LE still attributes incorrect data to LE.
However, the get_info function on LE alone does produce correct data.
If I take the last company in my subset, LPI, I find that the raw financials does indeed produce the correct cash flow statement.
Hm. Perhaps quantmod is grabbing the wrong data?
from qmj.
Scratch that, it seems as if the correct data is being grabbed by get_info. Unsure why I thought otherwise just above. Will check tidyinfo.
from qmj.
Tidyinfo is producing the correct data. Though it's possibly being somewhat harsh in cutting out 2012 data for LE due to the lack of a balance sheet, it doesn't seem pressing.
Will try re-generating our financials data set to see if inaccuracy persists. Maybe data set was produced by an older, buggy version?
from qmj.
Correction*
Upon closer inspection, my conclusion is that get_info/tidyinfo is correctly grabbing and producing data.
My guess is that, as we only have access to the past 4 10-K filings, we were grabbing weird/old data from LE.
To be completely sure, I've also re-built our financials data set. LE now has reasonable data. On top of that, it's also moved forward
It's also moved forward around 700 rows. Implicitly this means that there was an enormous gain in information in the preceding companies, which is also concerning.
from qmj.
Related Issues (20)
- Do we want to set up a (relatively painless) way of updating/retrieving the companies from the Russell 3000 Index? HOT 1
- Worth Repeating explanation of Russell 3000 in Prices and Financials data documentation? HOT 3
- Documentation for get_companies is unintuitive in explaining how it works. HOT 5
- If statement in market_data HOT 3
- Providing a function to clean temporary data HOT 1
- get_companies() regex cuts out several companies when directly copying and pasting from the Component List HOT 2
- get_prices - quantmod getSymbols function changed HOT 2
- Phantom Bugs # 17 and # 18 HOT 1
- Off-hand Thought: Dealing With Missing Information HOT 3
- Observation: get_prices is slow to aggregate the various chunks of raw price data into a single data object HOT 1
- For Case Study: Reducing the Number of Companies for which we produce No Quality Score
- Issue: Financials Data Table Overfilled with Erroneous Data, Missing Some Gathered Data HOT 8
- Library qmjdata does not automatically load when loading qmj
- Impose consistency across variable names and function names
- qmjdata is not available HOT 4
- ?qmj leads to "No documentation for 'qmj' in specified packages and libraries. HOT 1
- README markdown file for github repo is badly, badly out of date HOT 4
- qmj package documentation file is out of date HOT 1
- Cleaning up tidy_prices HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from qmj.