arena's Issues
Find percentage of people using no phone or using 2 phones
We only detect a fraction x of the people, because not everybody has a smart phone and because some people have 2 smartphones. The number x should be found online in some reports or papers. Then our calculations should scale to take into account x, too.
Validation of density estimation
Either by Monte Carlo simulations or by analyzing video data
Conservation of mass
Taking all detected MACs into account?
Because the smoothing may partially distribute them inside the stadium, and because it is unlikely that outsiders would be detected (wall etc, what Jan said on Friday 10/06).
More concretely, slide #3 from the Friday 10/06 presentation. We should not exclude those detected outside the region.
In this way, we actually don't need to adjust the integrals inside to be exactly 1, because of the added effect from the outsiders. Then both the full and the dotted lines at slide 15 should "overlap".
The stadium boundary is artificial anyway.
Take care of the randomized addresses
One solution is:
- make a statistics of e.g. what (average or median?) percentage p of the addresses detected in a time window is randomized.
- exclude all randomized addresses during calculation of density
- after all calculation is done, scale the histogram to take into account that p% of data was ignored.
After all, we only detect a fraction x of the people, because not everybody has a smart phone and because some people have 2 smartphones. The number x should be found online in some reports or papers. Then our calculations should scale to take into account x, too.
Determine time window size
Fine-tuning the method
Our method involves the superposition of a series of Gaussian 'bumps' which are centred on top of the fitted positions. To do so, we smooth each position separately and keep it in memory as a density histogram.
However, this is mathematically very similar to kernel density estimation and the use of radial basis or weight functions, which does not require to build separate histograms. We have to check whether using one implementation or the other has any effect on the results, as this choice might influence the amount of truncation and approximation introduced in the statistical modelling, and thus the accuracy.
Find a venue for paper submission
Conferences or journals
e.g. very relevant based on other papers that use mobile data or crowd analytics : EPJ Data Science
http://link.springer.com/journal/volumesAndIssues/13688
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.