asteca / asteca



Code for the ASteCA package.

Home Page: http://asteca.github.io/

License: GNU General Public License v3.0

Python 100.00%

astronomy astrophysics star-clusters python asteca

asteca's People

philrosenfield mmfranco25 msolpera karthika-ai chihuanbin gimpeller sarashenyy

asteca's Issues

Performance issues with integ magnitude

This function needs to be optimized; it takes far too long to finish for medium-sized clusters and above.
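One possible direction, sketched under the assumption that the curve is the running integrated magnitude of all stars brighter than a given limit: sort the magnitudes once and accumulate the flux in a single pass instead of recomputing the sum at every step. The function name and its input are hypothetical and not taken from the ASteCA source.

```python
import math


def integrated_magnitude_curve(mags):
    """Return (limiting magnitude, integrated magnitude) pairs.

    Hypothetical helper: assumes the curve is built by adding stars
    from brightest to faintest and combining their fluxes.
    """
    total_flux = 0.0
    curve = []
    for m in sorted(mags):
        # Convert the magnitude to a relative flux and accumulate it.
        total_flux += 10.0 ** (-0.4 * m)
        # Convert the accumulated flux back to a magnitude.
        curve.append((m, -2.5 * math.log10(total_flux)))
    return curve
```

After the initial sort, this costs one pass over the stars, so the run time grows as N log N rather than with the square of the number of stars.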

Localization bug in p-value

Sometimes this will happen:

 .../functions/get_p_value.py", line 142, in get_pval
p_vals_cl.append(float(str(p_val_cl)[4:]))
ValueError: invalid literal for float(): 0,2902803

Need to make the number handling locale-independent so that a comma is never used as the decimal separator.
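A minimal workaround sketch, assuming the value reaches Python as a string that may carry a decimal comma (as in the traceback above). The helper name is hypothetical; fixing the locale at the source that produces the string would be the cleaner solution.

```python
def to_float(text):
    """Parse a number that may use a comma as its decimal separator.

    Hypothetical helper: assumes the string holds a single number with
    no thousands separators.
    """
    return float(str(text).strip().replace(",", "."))
```

For example, to_float("0,2902803") and to_float("0.2902803") give the same value.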

Write best parameters to file

Add best fitted parameters (metallicity, age, extinction, distance modulus) to final output file.

Assignment of binary masses

There's a bug in synth_clust where sometimes (haven't reproduced it) the array mass_bin0 is a float, possibly because m1 is empty.

Fix data_input

Once the data_output is stable, fix 'clusters_input.dat'.
1- It could match the output file's format, or
2- have a different, easier to read format.

Missing term in likelihood

Check Cabrera-Caño & Alfaro 1990, there appears to be a missing 1/(n-1) term in the likelihood function.

Create bad pixels mask

To be used on frames with complicated geometries or bad pixels that show blank portions.

Empty or bad pixel regions could be either filled with an average sample of stars from the frame or marked as ignored.

Manual detection

Give the option to supply a file with one set of xmin, xmax, ymin, ymax values per line (as many lines as needed) to leave out rectangular sections of the frame that are either empty or unusable for some reason.

Related: #68, #107
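The manual option could be sketched as follows, assuming a whitespace-separated text file with one rectangle per line and '#' comments. The file layout and both function names are assumptions made for illustration, not an existing ASteCA format.

```python
def parse_rectangles(lines):
    """Read 'xmin xmax ymin ymax' rectangles, one per line.

    Blank lines and anything after a '#' are ignored.
    """
    rects = []
    for line in lines:
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        xmin, xmax, ymin, ymax = (float(v) for v in line.split())
        rects.append((xmin, xmax, ymin, ymax))
    return rects


def in_excluded_region(x, y, rects):
    """Return True if the point (x, y) falls inside any rectangle."""
    return any(x0 <= x <= x1 and y0 <= y <= y1 for x0, x1, y0, y1 in rects)
```

Passing an open file object as lines works, since iterating over a file yields its lines.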

Auto detection

Perhaps use Monte Carlo to automatically detect empty regions in the frame? Generate N random points and check which ones have no neighbor stars around them. Grouping these points we could detect the empty areas.
Even simpler would be to obtain the KDE for the field and identify as empty regions those that are in a curve less than x% the maximum value. The problem here would be how to check if a star is within a closed curve of accepted density value, or outside of it. More complicated even are non-closed curves.
No need to use curves. Just check the center of each bin in the 2D spatial histogram. If the density at that point is below a certain threshold (for example, the maximum KDE density value for all the bins in the frame), assume that bin is in an empty region (i.e., composed of "bad pixels") of the frame and mark it as invalid.
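A rough sketch of the bin-based check, with two simplifications that are assumptions of this example: raw star counts per bin stand in for the KDE density, and the threshold is taken as a fraction (frac) of the most populated bin. All names are hypothetical.

```python
def empty_bins(xs, ys, x_range, y_range, n_bins, frac=0.1):
    """Flag bins of an n_bins x n_bins spatial histogram as empty.

    Returns a nested list where mask[i][j] is True when the bin in
    column i (x) and row j (y) holds fewer stars than
    frac * (count in the most populated bin).
    """
    x_lo, x_hi = x_range
    y_lo, y_hi = y_range
    counts = [[0] * n_bins for _ in range(n_bins)]
    for x, y in zip(xs, ys):
        # Ignore stars that fall outside the frame limits.
        if not (x_lo <= x <= x_hi and y_lo <= y <= y_hi):
            continue
        # Clamp so that points on the upper edge land in the last bin.
        i = min(int((x - x_lo) / (x_hi - x_lo) * n_bins), n_bins - 1)
        j = min(int((y - y_lo) / (y_hi - y_lo) * n_bins), n_bins - 1)
        counts[i][j] += 1
    threshold = frac * max(max(row) for row in counts)
    return [[c < threshold for c in row] for row in counts]
```

Stars falling in flagged bins can then be skipped, and the flagged area can be subtracted when computing densities.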

Fix colorbar position

The colorbar in the CMD with the membership probs keeps moving around when the GA and/or the p-value test are disabled/enabled.

Don't stretch cluster when zooming in

If the cluster is located near a border (see NGC1863), the zoom will look stretched because the axes won't be the same size and the plot is always square.

Add integrated color index

Obtain not only the integrated magnitude curve but also the integrated color index.

Add errors to synthetic cluster.

Use those errors in the likelihood calculation.

Crash proof the code

Use exceptions to handle any possible crash in any function, so that the code itself doesn't crash but instead jumps to the next cluster file in the loop (if any).

Resources:

https://docs.python.org/2/tutorial/errors.html#handling-exceptions
http://openbookproject.net/thinkcs/python/english3e/exceptions.html
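The idea can be sketched as a loop that wraps the per-cluster work in a try/except block. Both process_all and the process_one callback are hypothetical names standing in for whatever the main loop actually calls.

```python
import traceback


def process_all(cluster_files, process_one):
    """Run process_one on every file and keep going after a failure.

    Returns the list of files whose processing raised an exception.
    """
    failed = []
    for path in cluster_files:
        try:
            process_one(path)
        except Exception:
            # Report the full traceback, then move on to the next file.
            traceback.print_exc()
            failed.append(path)
    return failed
```

Catching Exception rather than using a bare except keeps Ctrl-C (KeyboardInterrupt) working, so a long run can still be stopped by hand.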

Subtract averaged field integrated mag?

Check whether the averaged field integrated magnitude curve should be subtracted from the cluster region integrated magnitude curve to obtain a more accurate estimate of the true cluster integrated magnitude.
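If the subtraction is adopted, it has to be done in flux rather than in magnitudes. A minimal sketch, assuming the field flux only needs to be scaled by the ratio of the cluster region area to the field region area (the function name and the area_ratio parameter are hypothetical):

```python
import math


def field_subtracted_magnitude(mag_cluster_region, mag_field, area_ratio=1.0):
    """Integrated magnitude of the cluster after removing the field flux.

    area_ratio is (cluster region area) / (field region area).
    Returns None when the field is as bright as, or brighter than,
    the cluster region, since no magnitude is defined in that case.
    """
    flux_region = 10.0 ** (-0.4 * mag_cluster_region)
    flux_field = area_ratio * 10.0 ** (-0.4 * mag_field)
    flux = flux_region - flux_field
    if flux <= 0.0:
        return None
    return -2.5 * math.log10(flux)
```

The None branch matters for heavily contaminated regions, where the averaged field can outshine the cluster region at some magnitude limits.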

Re-define cluster region

See Ref 27/SL351: the cluster region as defined cuts off a portion of the cluster. Perhaps define it as a square centered on the center of the cluster with side length 2*1.5*r_cl.
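A sketch of the proposed square region, reading the side length as 2 x 1.5 x r_cl (that is, a half-side of 1.5 times the cluster radius), which is an interpretation of the expression above. The function name and the factor parameter are hypothetical.

```python
def in_cluster_square(x, y, center, r_cl, factor=1.5):
    """Return True if (x, y) lies in the square cluster region.

    The square is centered on `center` and has side 2 * factor * r_cl.
    """
    half_side = factor * r_cl
    return abs(x - center[0]) <= half_side and abs(y - center[1]) <= half_side
```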

Add no background flag

If the cluster is too large or the frame too small, it could happen that the region selected for obtaining the background falls inside the cluster.

In this case the radius, background, density, field regions, bayesian decont algor, p-value test should be skipped.

Add a flag so that the user can indicate when this happens or.

Add Saha's W parameter?

Write a function that calculates Saha's W parameter between the cluster region and all the field regions. It is another version of what the p-value distribution function does.

On second thought, not sure it is the same thing.

This short article, "The W-function applied to the age of Globular Clusters", Rengel & Bruzual (2002), uses the W function to estimate ages for GCs.

The method used is similar to what ASteCA does to estimate the cluster probability of being a real physical entity through the KDE p-value: compares synthetic clusters of the same age with each other to generate a distribution of W values, then compares the observed cluster with synthetic clusters of the same age, and finally selects the "best" age estimate as that which produces the largest overlap between distributions.

More details can be found in the PhD thesis (dead link) on which the article is based. It states that the number of model points is fixed to the number of observed points (stars); see p. 29.

Confirmed by Dr P Saha: the W function should be used when the number of model points is fixed.

Dr Saha suggested to fix this parameter to a large value (as large as possible) and assign per-star masses after the fitting is completed. But, as stated by Dr Saha: "If M is very large, W should go to the Poisson formula", which sort of defeats the purpose of using W.

This statistic is also discussed in "Bayesian isochrone fitting and stellar ages", Valls-Gabaud (2014), who concludes that W is:

"the statistic of choice to be used in the context of CMD modeling"

Create input params file.

Add a file containing all the values that are currently hard-coded.
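One way to do this with the standard library is an INI-style file read through configparser. This is only a sketch: the section name and the two parameters shown are placeholders, not actual ASteCA settings.

```python
import configparser

# Placeholder defaults; the real list would hold the hard-coded values.
DEFAULTS = {"n_field_regions": "10", "kde_bandwidth": "0.5"}


def load_params(text):
    """Parse INI-style text and fall back to DEFAULTS for missing keys."""
    parser = configparser.ConfigParser()
    parser.read_dict({"params": DEFAULTS})
    parser.read_string(text)
    section = parser["params"]
    return {
        "n_field_regions": section.getint("n_field_regions"),
        "kde_bandwidth": section.getfloat("kde_bandwidth"),
    }
```

Loading the defaults first means that a parameter left out of the user's file keeps its previous hard-coded value.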

Bug in GA (possibly in decode)

There's an issue in the elitism/decode/fitness_eval block where the best solution is apparently not being passed along to the fitness_eval function.

I suspect this is related to the decode_ function not transforming the solution correctly.

Removal of stars in p-value is wrong

Right now before comparing the cluster region with a field region, a number of n_f stars are removed from the cluster region where n_f is the number of stars in that field region.

This results, for heavily contaminated regions, in a cluster region almost devoid of stars which forces high p-values for the cluster-field regions comparisons. For clusters not too contaminated the effect is diminished.

This was introduced via issue #12.

Add average field integ mag to integ mag plot

Generate an average of all the field regions defined and add it to the integrated magnitude plot.

Also add completeness limit line to the integ mag plot.

Isoch fitting process

Finish isoch fit process and merge into main code.

Download 2MASS data?

Retrieving data from Vizier: http://www.aspylib.com/doc/astrometry_queries.html

Main package: http://www.astropy.org/

Accept a CMD from any single photometric system

Generalize the code to process a CMD from any arbitrary photometric system defined in the Girardi set.

Old attempts: Old 0.2.0 branch with 70 commits, 34 older commits

Make total mass & binary fraction variable

Make the total mass and the binary fraction with which the synthetic cluster is created two more variable parameters.

Correct comments in get-regions

Correct the description of flag_area_stronger (currently tied to the decont algor in the comments) and other things that need it.

Use metallicity and age steps

Currently the steps defined in the input file for these two parameters have little use.

Make it so that these values are used when reading the isochrone files so as to skip values in between.
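The selection could look roughly like this, assuming the metallicity or age values available in the isochrone files are known as a list and only those falling on the user's (start, stop, step) grid should be kept. The function name and the tolerance are hypothetical.

```python
def select_on_grid(available, start, stop, step, tol=1e-6):
    """Keep the values of `available` that lie on the requested grid.

    A value is kept if it is within `tol` of start + k * step for some
    integer k and falls inside [start, stop].
    """
    selected = []
    for value in sorted(available):
        if value < start - tol or value > stop + tol:
            continue
        # Index of the nearest grid point to this value.
        k = round((value - start) / step)
        if abs(start + k * step - value) <= tol:
            selected.append(value)
    return selected
```

For instance, with available log(age) values spaced by 0.05 and a requested step of 0.1, every other value is skipped.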

Read mem probs from file

Add an option to read the membership probabilities from file (from a previous run or user-provided) to speed up calculations.
See what happens with the plotted KDE when this happens.

Select CMD, not mag, color and phot system separated

Restrict the selection to a given CMD so that it defines the magnitude, color and photometric system used.

There's no point in letting these be picked separately until the code learns how to deal with them separately (currently it does not).

Restrict radius for high CI clusters

When the cluster has a high CI (contamination index), the radius should be restricted to a value lower than the one found by the get_radius function. This way fewer field stars will be present in the r<r_cl CMD and the isochrone fitting process will be more accurate.

Remove R^2 from QQplot

It's useless.

Add a flag to not output a figure file at all

'cluster_region': re-define

The cluster-region array is constantly being used and the stars in it are always being filtered to only use stars inside the cluster's radius. Re-write this so this "cleaning" is not needed anymore.

Re-write 'get_most_probable_memb'

It needs a re-name and a re-write. The name does not accurately describe what it does anymore and neither do the descriptions inside the function.

Subtract field stars integ magnitude?

Should I subtract the averaged integrated magnitude from the field regions from the cluster region integrated magnitude curve?

Add plot of best-fit synthetic cluster.

That.

Fix minimum in integ mag y axis

Curve goes below the plot, minimum is set too high. See BSDL654.

Read p-value qq-plot data from file

Create a file that can store all the values necessary for the p-value and qq-plot functions to be processed without running them, just reading data from said file.

Re-locate + discard plots

1- Move the integ magnitude plot to the first column, fifth row.
2- Move the memb probability distribution to the second column, fifth row.
3- Discard the m_p>0.75 diagram.
4- Discard the N_c CMD diagram.
5- Displace the full CMD two columns to the right.
6- Replace the m_p>0.5 diagram with m_p>mu and place it after the full CMD.