keck-datareductionpipelines / framework Goto Github PK
View Code? Open in Web Editor NEWProject management and simple code repository for the DRP framework project
Project management and simple code repository for the DRP framework project
Generate a set of simulated data files (FITS files) which can be used to test our data model. These simulated data should cover a range of possible formats and should include formats which represent raw data from various instruments and formats which might exist only in intermediate processing steps.
One goal is to cover as many possibilities as possible to test that the data model (and eventually the primitives which act on that data model) correctly handle all aspects of the data including the array data itself, variance information, mask information, header information, and table data.
Types of input data
We should confirm that basic mathematical operators (addition, subtraction, multiplication, etc.) implemented in simple primitives propagate each type of data (pixel data, header, mask, variance, table) properly.
Examples:
Note that handling of mask and variance is embedded in astropy.nddata.NDData
and astropy.nddata.CCDData
if we choose to use those.
Describe how the requirements will support scientific reproducibility.
This is probably best done as a summary after other requirements are completed.
Related to #3.
The system should have tools to generate a record of the processing history. The goal is to record enough information to reproduce exactly the resulting reduced data from the same raw inputs (data and arguments). This record should be relatively easy to read (for both humans and computers) and not contain excess information.
It should be possible to write code which parses the record and produces all the inputs necessary to run the pipeline to reproduce the same result. This code is not necessarily something provided by the system described here, but the need should be considered in the design.
In principle, this information could be recorded to a log file, however, a log file has a slightly different goal and it includes a more linear history of what has happened and likely does not satisfy the need to make this simple to read without excess information.
A second possibility would be to record this information to the output data file (e.g. a FITS file header). This would be built in to our data representation.
A third possibility is to record this to a flat file on disk (i.e. a "proctable").
Finally, one could record the information to a database. This would be in some ways similar to the proctable described above, but has the disadvantage of being less human readable.
Fills REQ_5, REQ_6, REQ_9
Fills REQ_3 and REQ_4
Fills REQ_12, REQ_13, REQ_14
Fills REQ_7?
Fills REQ_1, REQ_2
I am experimenting with this
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.