Comments (6)
there is a lot to this,
- it seems that they are selecting every 2 raws in the training DF, but why if we cropped an area to training why skip, or why to divide the data in 1/2
- it seems that they are using scorer.run() for comparing against the GT data, but no idea what the parameters mean, and what the training has to do with anything here.
- the training parameter
- if it is true, means only get training area,
- exclude training area
from dc1.
The above method uses coordinates the following coordinates : 1400: {"ra_min": -0.2688, "ra_max": 0.0, "dec_min": -29.9400, "dec_max": -29.7265},
which is the same as the one used when cropping the data
so it seems that there is no data leakage
from dc1.
New Questions.,
- why then there is still a duplication of sources
- why do we need to separate the training image from the full image, if we can just shuffle the whole df later from the full image
- why are we skipping every two rows ON THE TRAINING DATA
In the meantime, I will follow my own path unless someone convinces me otherwise
from dc1.
That explains why when we sued ML on the testing set, there was no sign of duplications
from dc1.
New Questions.,
- why then there is still a duplication of sources using the np.close
- (maybe just the sources are close to each other (bad explanation, but the code above looks good) )
- why do we need to separate the training image from the full image, if we can just shuffle the whole df later from the full image
- why are we skipping every two rows ON THE TRAINING DATA
-why are we using the smaller image in the training, and the bigger image in the testing, I don't understand.
In the meantime, I will follow their solution no problem
from dc1.
training area before the slice:
from dc1.
Related Issues (9)
- Possible data leakage HOT 2
- Adding a threshold parameter to the primary beam correction method HOT 1
- The change that happens in the number of sources in the different frames
- Do we need the first two links in the ReadMe files?
- Feedback HOT 1
- is the code based on the final DC results? ? ? HOT 1
- Source finding HOT 2
- Primary Beam correction HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dc1.