Composable Logs

Composable Logs is a Python library for running ML/data workflows on stateless compute infrastructure, which may be ephemeral or serverless.

In particular, Composable Logs makes it possible to track ML experiments without a dedicated tracking server (and database) for recording ML metrics, models, or artifacts. Instead, these are emitted as logs using the OpenTelemetry standard, an open standard in software engineering with growing support.

It can be useful to think of the logs emitted by Composable Logs as similar to the logs emitted by unit test frameworks (for example, the JUnit format).

For example, log events emitted by Composable Logs can be written to a JSON file or sent to any log storage that supports OpenTelemetry (span) events. In either case, no separate tracking service is needed just for ML experiments.
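To make the idea concrete, here is a minimal, standard-library-only sketch of logging ML metrics as span-like events in a file and reading them back. It does not use the Composable Logs or OpenTelemetry APIs: the record layout (`name`, `start_time`, `end_time`, `events`), the one-JSON-object-per-line file format, and the helper names `record_span` and `read_metrics` are all assumptions made for illustration.

```python
import json
import time
from contextlib import contextmanager
from pathlib import Path


@contextmanager
def record_span(name, log_path):
    """Collect events for one task and append them as one JSON line.

    Hypothetical helper: a simplified stand-in for an OpenTelemetry span.
    """
    span = {"name": name, "start_time": time.time(), "events": []}

    def log_metric(key, value):
        # Each metric is stored as an event attached to the span.
        span["events"].append({"type": "metric", "key": key, "value": value})

    try:
        yield log_metric
    finally:
        # The span is written when the task ends, even if it raised.
        span["end_time"] = time.time()
        with Path(log_path).open("a", encoding="utf-8") as f:
            f.write(json.dumps(span) + "\n")


def read_metrics(log_path):
    """Return {span name: {metric key: value}} from the log file."""
    metrics = {}
    with Path(log_path).open(encoding="utf-8") as f:
        for line in f:
            span = json.loads(line)
            for event in span["events"]:
                if event["type"] == "metric":
                    metrics.setdefault(span["name"], {})[event["key"]] = event["value"]
    return metrics
```

A task would wrap its work in `with record_span("train-model", "spans.jsonl") as log_metric:` and call `log_metric("accuracy", 0.97)`. Any later process can then recover the metrics with `read_metrics("spans.jsonl")`, with no tracking server involved.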

A captured JSON log can then be converted into a static website based on MLflow.

Composable Logs uses the Ray framework for parallel task execution.

For more details:

Documentation and architecture

https://composable-logs.github.io/composable-logs

Live demo

Using Composable Logs, one can run an ML training pipeline using only a free GitHub account. This uses:
- GitHub Actions: to trigger the ML pipeline daily and for each PR.
- Build artifacts: to store OpenTelemetry logs of past runs.
- GitHub Pages: to host a static website for reporting on past runs.
The static website is rebuilt after each pipeline run (by extracting relevant data from past OpenTelemetry logs). This uses a fork of MLflow that can be deployed as a static website, https://github.com/composable-logs/mlflow.
Code for the pipeline (MIT license): https://github.com/composable-logs/mnist-digits-demo-pipeline
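
The rebuild step can be pictured as a small script that scans the stored logs of past runs and writes one summary file for the static site to load. The following is a minimal sketch, assuming each past run is stored as a JSON file with hypothetical `run_id` and `metrics` fields. The function name `build_site_data` and the output file `runs.json` are illustrative and are not taken from the demo pipeline.

```python
import json
from pathlib import Path


def build_site_data(logs_dir, site_dir):
    """Summarize all run logs in logs_dir into site_dir/runs.json.

    Assumed (hypothetical) log layout: {"run_id": str, "metrics": {...}}.
    """
    runs = []
    # Sort by file name so the output is stable between rebuilds.
    for log_file in sorted(Path(logs_dir).glob("*.json")):
        run = json.loads(log_file.read_text(encoding="utf-8"))
        runs.append({"run_id": run["run_id"], "metrics": run.get("metrics", {})})

    out_dir = Path(site_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    out_file = out_dir / "runs.json"
    out_file.write_text(json.dumps(runs, indent=2), encoding="utf-8")
    return out_file
```

Because the output is a plain file, it can be served by any static host, which is what makes a setup like GitHub Pages sufficient for reporting.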

Public roadmap and planning

https://github.com/orgs/composable-logs/projects/2/views/3

Install via PyPI

Latest release

pip install composable-logs
https://pypi.org/project/composable-logs

Snapshot of latest commit to main branch

pip install composable-logs-snapshot
https://pypi.org/project/composable-logs-snapshot
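
To check which of the two packages is installed in the current environment, a short script using Python's standard `importlib.metadata` module can help. This is a generic sketch and not part of Composable Logs; the helper name `installed_version` is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Distribution names as published on PyPI (see the links above).
for name in ("composable-logs", "composable-logs-snapshot"):
    print(name, "->", installed_version(name) or "not installed")
```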

Any feedback/ideas welcome!

License

(Note: As of January 2023, this project was renamed from pynb-dag-runner to composable-logs.)

