Giter VIP home page Giter VIP logo

codingformedicine's Introduction

Hi! 👋 I’m Chris Lovejoy.

I’m the co-founder of a healthtech start-up (Chief Technology Officer) and ExplainThisPaper.com (Chief Scientific Officer).

I’ve been a Data Scientist, Medical Doctor, Academic Researcher and worked at a fast-growing healthtech start-up (Cera Care). I have a YouTube Channel and wrote a book about learning.




codingformedicine's People

Contributors

chris-lovejoy avatar emilyjin11 avatar guzalbulatova avatar jjcato9 avatar lawrenceadams avatar medic-code avatar v-sher avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

codingformedicine's Issues

Package Versions, Package Managers?

@chris-lovejoy and I have had some issues with getting notebooks to run due to differing pandas versions etc

Concretely, here are my relevant packages:

(coding_for_medicine)  [truncated] > conda list | grep -E 'pandas|numpy|seaborn|matplotlib|imbalanced-learn'
imbalanced-learn          0.9.1                    pypi_0    pypi
matplotlib                3.6.1           py310h2ec42d9_0    conda-forge
matplotlib-base           3.6.1           py310he725631_0    conda-forge
matplotlib-inline         0.1.6              pyhd8ed1ab_0    conda-forge
numpy                     1.23.4          py310h1b7c290_0    conda-forge
pandas                    1.5.1           py310hecf8f37_0    conda-forge
seaborn                   0.11.2               hd8ed1ab_0    conda-forge
seaborn-base              0.11.2             pyhd8ed1ab_0    conda-forge

Should we pin these to a specific version, and have a prompt to install the right version on each run?

Also are we using conda or pip? If using Colab I imagine this doesn't really matter ~ but good to get uniformity across notebooks

Optimal Data Source?

@chris-lovejoy Do you have a particular idea of how you want students to get data?

My current method uses the kaggle CLI, however I thought it was surprised by how much of a faff it is to use (having to create an account, manually pasting a .json file into ~/.kaggle etc.) - not beginner friendly in my opinion.

Did you invision just including the data within the repo, or having the notebook grab it from a source?

Not hugely important but I suppose it would be ideal to have a uniform approach between notebooks.

Making breast_cancer_features exercise more interactive

The breast cancer features exercise is only 'loosely' interactive. It encourages users to manipulate the code, but has no actual requirements for this and can run front-to-back without modification. Therefore, this exercise does not currently pass Principle 2 (of being interactive).

To improve it, there should be more explicit "stop points" where users are prompted to modify the code. For example, this could describe functions that could be used, but which aren't yet implemented - and invite the user to implement them and play around.

update format of README (and rename files as appropriate)

remove number-based sorting of exercises

create a table that lists all exercises, what technologies they include, how difficult, links to the respective notebook, etc

(also add an intro on the README detailing how to get started with any of the notebooks)

Create a requirements file for package versions used across all exercises

The simplest way to do this will be create a pip virtual env (as per current readme descriptions), install the packages, and make sure that all notebooks run without problem -> if so, then can save as requirements.yml file and add a description to the readme for installing those dependencies

Stroke Prediction Exercise

Start work on Stroke Prediction exercise at over here (file)

Rough outline for things to cover/notebook structure:

  • Get data / load into runtime / imports etc.
  • Data Exploration
  • Build models
  • Explore Accuracy Paradox
    • Look at better alternatives, e.g. F1/AUC/etc
  • Overfitting and underfitting

Anything else worth going into @chris-lovejoy ?

Is it possible to be added as a collaborator to the repo?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.