sonjageorgievska / arena Goto Github PK

Jupyter Notebook 59.83% HTML 39.98% Python 0.19%

arena's Issues

Find percentage of people using no phone or using 2 phones

We only detect a fraction x of the people, because not everybody has a smart phone and because some people have 2 smartphones. The number x should be found online in some reports or papers. Then our calculations should scale to take into account x, too.

Validation of density estimation

Either by Monte Carlo simulations or by analyzing video data

Conservation of mass

Taking all detected MACs into account?

Because the smoothing may partially distribute them inside the stadium, and because it is unlikely that outsiders would be detected (wall etc, what Jan said on Friday 10/06).

More concretely, slide #3 from the Friday 10/06 presentation. We should not exclude those detected outside the region.

In this way, we actually don't need to adjust the integrals inside to be exactly 1, because of the added effect from the outsiders. Then both the full and the dotted lines at slide 15 should "overlap".

The stadium boundary is artificial anyway.

Take care of the randomized addresses

One solution is:

make a statistics of e.g. what (average or median?) percentage p of the addresses detected in a time window is randomized.
exclude all randomized addresses during calculation of density
after all calculation is done, scale the histogram to take into account that p% of data was ignored.

After all, we only detect a fraction x of the people, because not everybody has a smart phone and because some people have 2 smartphones. The number x should be found online in some reports or papers. Then our calculations should scale to take into account x, too.

Determine time window size

Fine-tuning the method

Our method involves the superposition of a series of Gaussian 'bumps' which are centred on top of the fitted positions. To do so, we smooth each position separately and keep it in memory as a density histogram.

However, this is mathematically very similar to kernel density estimation and the use of radial basis or weight functions, which does not require to build separate histograms. We have to check whether using one implementation or the other has any effect on the results, as this choice might influence the amount of truncation and approximation introduced in the statistical modelling, and thus the accuracy.

Find a venue for paper submission

Conferences or journals

e.g. very relevant based on other papers that use mobile data or crowd analytics : EPJ Data Science
http://link.springer.com/journal/volumesAndIssues/13688

sonjageorgievska / arena Goto Github PK

arena's Issues

Find percentage of people using no phone or using 2 phones

Validation of density estimation

Conservation of mass

Taking all detected MACs into account?

Take care of the randomized addresses

Determine time window size

Fine-tuning the method

Find a venue for paper submission

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent