A Package to help Handel Typical Data Engineering Tasks, including:
- Scheduling
- Storage
- Formatting
- Validation
- Tagging / Labelling
- Profiling
- Cleaning
- Transformations
- Bussiness Rules
- Encryption
- State Machine (State Transition)
current_version = "2.0.0"
The Data Engineering Toolkit is still under initial development and is being tested with Python 3.11.4 version.
The Data Engineering Toolkit will follow semantic versioning for its releases, with a {major}.{minor}.{patch}
scheme for versions numbers, where:
major
versions might introduce breaking changesminor
versions usually introduce new features and might introduce deprecationspatch
versions only introduce bug fixes
poetry lock --no-update
poetry install -E dev
pip install data_engineering_toolkit
- Open new terminal
- "Windows-Key + R" will show you the 'RUN' box
- Type "cmd" to open the terminal
cd <Path To>/DataEngineeringToolkit
python -m venv venv
venv\Scripts\activate
- Open new terminal
- "Control + Option + Shift + T" to open the terminal
cd <Path To>/DataEngineeringToolkit
python -m venv venv
source venv/bin/activate
(venv) python -m pip install -r requirements.txt
(venv) (venv) $ poetry init
(venv) poetry lock --no-update
(venv) poetry install -E dev
(venv) bumpver update --minor
(venv) poetry build
(venv) poetry config repositories.testpypi https://test.pypi.org/legacy/
(venv) poetry publish -r testpypi
(venv) poetry publish