Comments (8)
Focusing on AAC right now (of Issue #18 fame).
Fetching and processing data for a tiny cluster of companies around AAC produces the correct raw data, the correct processed data, and profitability, growth, payouts, safety, and quality scores. The subscores of each component are also calculated.
from qmj.
Playing with a smaller data set (800 companies), it seems likely that tidyinfo is having hiccups with companies that have a '.' in their ticker.
from qmj.
May be resolved. The issue may have been a regex error in tidyhelper that wrote NA years for companies with a '.' in their ticker. Testing further.
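To illustrate the kind of failure described here, the sketch below shows how a parser that assumes tickers never contain a '.' can lose the year. It is not the tidyhelper code: the `<ticker>.<year>` label format, the function names `naive_year` and `anchored_year`, and the sample labels are all assumptions made for illustration, and Python is used only because the pitfall is language-agnostic (qmj itself is an R package).

```python
import re
from typing import Optional


def naive_year(label: str) -> Optional[int]:
    """Assume the ticker has no '.', so the year is the second piece."""
    piece = label.split(".")[1]
    # None plays the role of NA when the piece is not a year.
    return int(piece) if piece.isdigit() else None


def anchored_year(label: str) -> Optional[int]:
    """Take the four digits at the end of the label, whatever the ticker is."""
    match = re.search(r"\.(\d{4})$", label)
    return int(match.group(1)) if match else None


# Hypothetical labels: one plain ticker and one ticker containing a '.'.
labels = ["AAC.2014", "BRK.B.2014"]
print([naive_year(x) for x in labels])     # [2014, None]
print([anchored_year(x) for x in labels])  # [2014, 2014]
```

Anchoring the pattern to the end of the label avoids depending on what the ticker looks like, which is one plausible way to fix this class of bug.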
from qmj.
Tentatively resolved with #27.
Looking at the results for 800 companies, quality scores were produced for all but 23 of them. From a quick sampling, quality scores were missing because the data was either absent or severely deficient, which is great.
Will close this issue once I look at a few other possibilities for the error.
from qmj.
Data inaccuracies of this magnitude are resolved with #27.
from qmj.
Issue with a few extra edge cases:
P
SPTN
Possibly more. Both are likely beating the regex in some unplanned way.
from qmj.
The issue with P, at least, appears to be that the company made two filings within the same calendar year. I'll map the process from the beginning and go for more than a band-aid fix this time.
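One way to handle two filings in the same calendar year is to keep a single filing per ticker and year. The sketch below keeps the most recent one. This is an assumption about the desired behavior, not the fix that was actually applied: the record layout, the function name `latest_filing_per_year`, and the sample values are all made up for illustration.

```python
from datetime import date


def latest_filing_per_year(filings):
    """Keep one filing per (ticker, calendar year): the most recent one."""
    latest = {}
    for filing in filings:
        key = (filing["ticker"], filing["filed"].year)
        if key not in latest or filing["filed"] > latest[key]["filed"]:
            latest[key] = filing
    return sorted(latest.values(), key=lambda f: (f["ticker"], f["filed"]))


# Made-up filings: two land in calendar year 2013.
filings = [
    {"ticker": "P", "filed": date(2013, 1, 31), "revenue": 1.0},
    {"ticker": "P", "filed": date(2013, 12, 31), "revenue": 2.0},
    {"ticker": "P", "filed": date(2014, 12, 31), "revenue": 3.0},
]

for filing in latest_filing_per_year(filings):
    print(filing["ticker"], filing["filed"].year, filing["revenue"])
```

Keying on the fiscal period instead of the calendar year would be another option if that information is available in the raw data.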
from qmj.
Good news. After modifying the tidying functions, data processing is vastly improved, enough that the number of companies that don't produce quality scores is now 113, a third of its previous value. A quick random sampling suggests that the biggest remaining issue is quantmod being unable to return either financial or stock price data.
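Instead of random sampling, the unscored companies could be tallied by which input is missing. The sketch below is only an illustration: the tickers, the `(has_financials, has_prices)` flags, and the function names `missing_reason` and `tally_missing` are hypothetical and are not part of qmj or quantmod.

```python
from collections import Counter


def missing_reason(has_financials: bool, has_prices: bool) -> str:
    """Label which input is missing for a company without a quality score."""
    if not has_financials and not has_prices:
        return "no financials or prices"
    if not has_financials:
        return "no financials"
    if not has_prices:
        return "no prices"
    return "data present"


def tally_missing(unscored):
    """Count unscored companies by the reason their data is unusable."""
    return Counter(missing_reason(f, p) for f, p in unscored.values())


# Made-up tickers mapped to (has_financials, has_prices).
sample = {
    "AAA": (False, True),
    "BBB": (True, False),
    "CCC": (False, False),
    "DDD": (False, True),
}
print(tally_missing(sample).most_common())
```

Companies that land in "data present" but still have no score would be the ones worth inspecting by hand.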
from qmj.
Related Issues (20)
- Do we want to set up a (relatively painless) way of updating/retrieving the companies from the Russell 3000 Index?
- Worth Repeating explanation of Russell 3000 in Prices and Financials data documentation?
- Documentation for get_companies is unintuitive in explaining how it works.
- If statement in market_data
- Providing a function to clean temporary data
- get_companies() regex cuts out several companies when directly copying and pasting from the Component List
- get_info or tidyinfo is handling some data badly. Possibly incorrectly inserting anomalous data.
- get_prices - quantmod getSymbols function changed
- Phantom Bugs # 17 and # 18
- Off-hand Thought: Dealing With Missing Information
- Observation: get_prices is slow to aggregate the various chunks of raw price data into a single data object
- For Case Study: Reducing the Number of Companies for which we produce No Quality Score
- Library qmjdata does not automatically load when loading qmj
- Impose consistency across variable names and function names
- qmjdata is not available
- ?qmj leads to "No documentation for 'qmj' in specified packages and libraries."
- README markdown file for github repo is badly, badly out of date
- qmj package documentation file is out of date
- Cleaning up tidy_prices