dapritchard / workloganalyzer

Analyze Worklog Records

Home Page: https://dapritchard.github.io/worklogAnalyzer/

License: GNU General Public License v3.0

R 100.00%

workloganalyzer's Issues

Add an overall summary to `effort_summary` report

Currently the sum of the top-level hours is not reported. This would only be useful when effort_style is "effort" or "effort_and_percent" since the percentage effort would always be 100% at the top level (but it could still be reported).

Convert worklog nodes to S4 representation

I'm planning to try converting to S4 for the following reasons.

S4 gives some added type support, which makes it easier to ensure that the data structures are valid (e.g. all of the children are of the right type, etc.). Normally I find S4 classes less convenient to work with as a user so I don't tend to use them, but... (see next bullet)
I want the trees to be a more "opaque" type, that is to say I want users to only interact with them via the functions/methods provided by the library rather than manipulating the nodes directly, which I believe using S4 will encourage.
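
The validity checking described above can be sketched as follows. This is a minimal illustration only: the class names `demo_leaf` and `demo_node`, the slot layout, and the messages are assumptions made for the example, not the package's actual definitions.

```r
library(methods)

# Hypothetical leaf class: wraps a data frame of worklog entries
setClass("demo_leaf", representation(worklogs = "data.frame"))

# Hypothetical node class: holds a named list of child nodes or leaves
setClass(
  "demo_node",
  representation(children = "list"),
  validity = function(object) {
    kids <- object@children
    # Every child must itself be a node or a leaf
    is_ok_type <- vapply(
      kids,
      function(x) is(x, "demo_node") || is(x, "demo_leaf"),
      logical(1L)
    )
    if (!all(is_ok_type)) {
      return("@children must all be demo_node or demo_leaf objects")
    }
    # Children must be named, and the names must not repeat
    nms <- names(kids)
    if (length(kids) > 0L && (is.null(nms) || anyDuplicated(nms) > 0L)) {
      return("@children names must be unique")
    }
    TRUE
  }
)

leaf <- new("demo_leaf", worklogs = data.frame(duration = 1))
ok   <- new("demo_node", children = list(a = leaf, b = leaf))

# Duplicate child names are rejected when the object is constructed
bad <- tryCatch(
  new("demo_node", children = list(a = leaf, a = leaf)),
  error = function(e) conditionMessage(e)
)
```

Because the check runs whenever `new` is given slot values, an invalid tree cannot be built through the constructor, which fits the goal of keeping the type opaque.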

Add a function to partition worklogs data frame by regular intervals

Something like

partition_by_intervals(
  worklogs   = wkls,
  period     = period,
  start      = start,
  directions = "both"
)
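
One possible shape for such a function is sketched below. It is an assumption-laden illustration: the name `partition_by_intervals_sketch`, the `start` column, and a `period` given in seconds are all hypothetical, and the `directions` argument is not implemented. Entries before `start` simply receive negative interval indices.

```r
# Hypothetical sketch: split a worklogs data frame into fixed-width
# intervals measured from `start`. `period` is a width in seconds.
partition_by_intervals_sketch <- function(worklogs, period, start) {
  # Seconds elapsed between `start` and each entry (negative if earlier)
  offset <- as.numeric(difftime(worklogs$start, start, units = "secs"))
  # Interval index: 0 for [start, start + period), -1 for the one before
  index <- floor(offset / period)
  split(worklogs, index)
}

# Made-up example data
wkls <- data.frame(
  description = c("a", "b", "c", "d"),
  start = as.POSIXct(
    c("2024-01-01 12:00:00", "2024-01-08 09:00:00",
      "2024-01-09 09:00:00", "2024-01-16 09:00:00"),
    tz = "UTC"
  )
)

week  <- 7 * 24 * 60 * 60
parts <- partition_by_intervals_sketch(
  worklogs = wkls,
  period   = week,
  start    = as.POSIXct("2024-01-08", tz = "UTC")
)
```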

Fix 'worklogs_from_parents' error: @children names must be unique

The following is the error message from running worklogs_from_parents:

Error in `map2()` at worklogAnalyzer/R/add-hierarchy.R:116:4:
ℹ In index: 1.
ℹ With name: personal.
Caused by error in `validObject()`:
! invalid class “worklogs_node” object: @children names must be unique
Run `rlang::last_trace()` to see where the error occurred.

Fix tests of worklogs 'new' constructors

Many of the tests in tests/testthat/test-constructor-new.R are commented out because they fail.

Add a function to filter worklogs by using regular expressions

It may be convenient to filter worklogs by one or more regular expressions.

Possible features:

You could choose to filter by node names, leaf names, or both
Multiple regular expressions would be additive
Maybe include an "exclude"-like option?

See how other applications like find, ld, rsync, etc. handle this.
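
The include/exclude idea can be sketched on a plain character vector of names. The helpers `matches_any` and `filter_names` are hypothetical and operate on names only; applying them to an actual worklogs tree is left open.

```r
# TRUE for each element of `x` that matches at least one pattern
matches_any <- function(patterns, x) {
  hit <- rep(FALSE, length(x))
  for (pattern in patterns) {
    hit <- hit | grepl(pattern, x)
  }
  hit
}

# Hypothetical helper: keep names matching any `include` pattern and
# none of the `exclude` patterns
filter_names <- function(nms, include, exclude = character(0L)) {
  keep <- matches_any(include, nms) & !matches_any(exclude, nms)
  nms[keep]
}

task_names <- c("asclepias", "codelist", "nsBuild", "nsProjects")
kept <- filter_names(
  task_names,
  include = c("^ns", "list$"),
  exclude = "Projects"
)
```

Here the include patterns are additive (a name matching either one is kept) and the exclude pattern is applied afterwards, so `kept` holds "codelist" and "nsBuild".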

Do we allow a worklogs tree to be just a leaf?

Right now we think of a worklogs tree with a single task as being a worklogs_node with a single child worklogs_leaf. But we could potentially end up with a situation where we have a tree that's just a worklogs_leaf. For example:

By calling new
By calling extract_worklogs pointing towards a worklogs_leaf

Some potential approaches:

1. We could potentially treat this as a special case that's effectively the same as a worklogs_node with a single child worklogs_leaf. We could take its name from the description in the internal data frame. But then we'd have to program around this everywhere.
   1. If the leaf doesn't contain any worklog entries then we won't be able to extract a name.
      1. We could throw an error in this case.
      2. We could make the name of the worklogs an argument to new and store it with the object. Since we don't really want users to call new directly this might not be a big deal. Furthermore, it might be convenient to store the name of the worklogs along with the object (presumably as duplicated information in lockstep with the names of the parent's @children slot).
2. As a variant of (1) we could try to "fix" the situation whenever we encounter it and wrap the worklogs_leaf in a worklogs_node. But if we've already programmed around the special case everywhere is this even helpful?
3. We could throw an error when we do encounter it.
   1. We'd have to create top-level versions of functions to catch this case.
   2. Would this be annoying to the user?

Should 'worklogs' know the "schema" of the tree and ensure it is consistent?

When a 'worklogs' object is first created, we should probably check the consistency of the data frames stored in the leaves.

There may be situations where we'd like to return an "empty" worklogs tree, such as for the following:

We filter all of the worklog entries out of the tree
We extract a subtree without any worklogs in it
We delete the last worklogs leaf

In such situations we'd like as_tibble and friends to return a data frame with 0 rows and the correct names and types of the columns. We could search around and find a leaf to get the schema, but maybe we'd prefer to get that information at creation time and store that information in every element of the tree.
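
One way to capture such a schema, shown here as a sketch with made-up column names, is to keep a zero-row slice of a data frame, which preserves the column names and types.

```r
# Hypothetical worklog entries; the column names are assumptions
entries <- data.frame(
  description = c("write tests", "write tests"),
  start = as.POSIXct(
    c("2024-01-01 09:00:00", "2024-01-01 13:00:00"),
    tz = "UTC"
  ),
  duration = c(1.5, 2),
  stringsAsFactors = FALSE
)

# A zero-row slice keeps the column names and types, so it can act as
# the schema for an "empty" result
prototype <- entries[0L, , drop = FALSE]

# First class of each column in the zero-row data frame
column_classes <- vapply(
  prototype,
  function(col) class(col)[1L],
  character(1L)
)
```

Storing something like `prototype` at creation time would avoid having to search the tree for a leaf later.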

We might also want to require that the information is provided at creation time.

Fix 'worklogs_from_parents' error: parent prototype must be consistent with children

The error message is the following.

ℹ In index: 1.
ℹ With name: personal.
Caused by error in `validObject()`:
! invalid class “worklogs_node” object: parent prototype must be consistent with children
Run `rlang::last_trace()` to see where the error occurred.

Fix 'worklogs_from_parents' error: invalid tags field

The error message I'm seeing is the following:

Caused by error in `validObject()`:
! invalid class “worklogs_leaf” object: the tags field must be a character vector without NAs

Add GitHub actions to perform R CMD check

Set up a GitHub Actions workflow to run R CMD check.

Consider including 'labels' with 'worklogs_leafs'

It feels like the labels belong with the worklogs_leafs since they don't change and it saves us from having to pass them in later.

More complete testing for 'worklogs' routines

A few ideas of things to check:

Worklog data frames with 0 rows
Worklog data frames with differing types
Worklogs where the name of a leaf child doesn't match the name of a task
Worklog data frames with more than 1 task description

Fix test ensuring that all description elements are equal

new for worklogs_leaf throws an error for invalid input
new(...) did not throw the expected error.

Add option to display clock time in `effort_summary`

Currently I am thinking there could be three options:

Proportion only, e.g. 10%
Proportion and time, e.g. 4:10 (10%)
Time only, e.g. 4:10
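
The three display options can be sketched with a small formatter. The function name `format_effort`, its arguments, and the style name "percent" are assumptions for illustration; effort is taken in minutes.

```r
# Hypothetical formatter for one row of an effort summary.
# `minutes` is the effort for the row, `total_minutes` the overall total.
format_effort <- function(minutes, total_minutes,
                          style = c("percent", "effort_and_percent", "effort")) {
  style   <- match.arg(style)
  minutes <- as.integer(round(minutes))
  # Clock time as hours:minutes, with minutes zero-padded to two digits
  clock   <- sprintf("%d:%02d", minutes %/% 60L, minutes %% 60L)
  # Share of the total, rounded to a whole percent
  percent <- sprintf("%.0f%%", 100 * minutes / total_minutes)
  switch(
    style,
    percent            = percent,
    effort_and_percent = sprintf("%s (%s)", clock, percent),
    effort             = clock
  )
}

format_effort(250, 2500, "percent")             # "10%"
format_effort(250, 2500, "effort_and_percent")  # "4:10 (10%)"
format_effort(250, 2500, "effort")              # "4:10"
```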

Split worklogs.R into multiple files

Use the Collate field in DESCRIPTION to specify the order in which the files are read in.

See:

If present, the collate specification must list all R code files in the package [...] as a whitespace separated list of file paths relative to the R subdirectory.

Fix 'worklogs_from_parents' error: the start, end, and duration columns must be consistent

The full error message is the following:

Caused by error in `validObject()`:
! invalid class “worklogs_leaf” object: the start, end, and duration columns must be consistent
Run `rlang::last_trace()` to see where the error occurred.

Fix error when running worklogs_from_parents

The following error message is given:

child prototypes must be consistent

Add README file

Create a README.md file. Probably it will be useful to create a README.Rmd file and then convert that to Markdown. I'm not sure yet if that conversion should be automated in some way.

Add 'subtree_remove' routine

Consider taking a character vector as input specifying the subtree to delete. Alternatively we could try to do something a little bit more akin to purrr::chuck.
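
The character-vector approach can be sketched on plain nested lists. The function `remove_subtree` below is hypothetical and stands in for the real tree classes; like purrr::chuck, it throws an error when the path does not exist.

```r
# Hypothetical sketch: remove the element reached by following `path`
# (a character vector of child names) through a nested named list
remove_subtree <- function(tree, path) {
  stopifnot(is.list(tree), is.character(path), length(path) >= 1L)
  name <- path[[1L]]
  if (!name %in% names(tree)) {
    stop("no child named '", name, "'")
  }
  if (length(path) == 1L) {
    # Last step of the path: drop the child itself
    tree[[name]] <- NULL
  } else {
    # Otherwise descend and replace the child with its pruned version
    tree[[name]] <- remove_subtree(tree[[name]], path[-1L])
  }
  tree
}

tree <- list(
  "Development projects" = list(asclepias = "leaf", codelist = "leaf"),
  "Research projects"    = list(P0036 = "leaf")
)

pruned <- remove_subtree(tree, c("Development projects", "codelist"))
```

After the call, "Development projects" keeps only its `asclepias` child and "Research projects" is untouched.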

Change structure to give each task its own subtree

I think it's necessary to include each task in its own subtree so that we can identify tasks with 0 worklogs in them. If they are rows in a data frame then we can't do so. Also I suspect that it will eliminate some programming around special cases.

Create summary method

What I am currently thinking is to do something like the show method, i.e. something that prints out the data like a tree, but with effort proportion included.

Perhaps we could have an option for dropping levels that are 0%.

Here's a first pass at what I am imagining.

                                                  Effort proportion
.                                                 -----------------
├── Development projects                                   88% 
│  ├── (+4) A-team                                               3%
│  ├── (+26) asclepias                                          28%
│  ├── (+4) codelist                                             5%
│  ├── (+1) data pipeline orchestration layer                    2%
│  ├── (+8) event-data-model                                    15%
│  ├── (+7) interval-algebra                                     8%
│  ├── (+1) noviverse-site                                       7%
│  ├── (+20) nsBuild                                            17%
│  └── (+1) nsProjects                                           3%
└── Research projects                                      12%
   ├── (+2) P0024 Fracture Prediction                            2%
   ├── (+1) P0025 Osteoporosis Negative Control                  3%
   ├── (+1) P0036                                                0%
   ├── (+3) P0053 Osteoporosis Comparative Effectiveness         3%
   ├── (+5) P0073 Migraine NCO                                   3%
   └── (+1) P0076 market insight PHRs                            1%

Add a name slot to 'worklogs_leaf'

This is related to #23 so that we can treat a sole worklogs_leaf the same way as we would a worklogs_node with a single child worklogs_leaf.

Make config labels able to be NULL

Some of the config@labels elements are not required. To signal this option, allow the values to be NULL.

How to represent worklogs?

Top level representation

Should we force worklogs to be represented using a data frame, or could we alternatively allow arbitrary nested structures?

Representation as a data frame

Suppose we have a constructor that creates a tibble with attached metadata about how to obtain start time, end time, tags, description, etc. How do we ensure that the information is up-to-date whenever we try to use one of the package functions/methods on a possibly modified version of the object?

The following are some possible strategies to handle the issue described in the preceding paragraph.

1. We don't ensure that the attached metadata is up-to-date, but instead perform runtime checks to make sure that the information makes sense (is of the right types, etc).
2. The constructor transforms the input data into a data frame with a standardized form (standard column names and types, etc.). Then runtime checks are performed as in (1).
3. We try to create wrappers for data frame methods like [ that keep the metadata up-to-date.
4. Create some type of abstract class that can be exported to a data frame as needed.
5. Take a hash using digest or similar of the object at construction time to ensure that it hasn't changed.

Representation as a list-of-lists

Represent hierarchy by using a list representation. Possibly the leaves of the hierarchy (individual tasks) could be represented using a data frame.

Error when running 'worklogs_from_parents'

The main part of the error message is the following:

trying to get slot "children" from an object of a basic class ("NULL") with no slots