Giter VIP home page Giter VIP logo

dissertation's People

Contributors

cscorley avatar

Watchers

 avatar  avatar  avatar  avatar

dissertation's Issues

Introduction

  • Introduction
  • Terminology
  • Motivation
  • Research Goals and Problems
  • Methodology
    • Why changesets?
    • Datasets and Benchmarks
    • On LDA: random seeds, batch, and online?
  • Organization

Rolling synopsis

  • Write a "rolling synopsis" to help organize and express the current state of things (Authoring pg 53)
  • send it to Nick ASAP
  • consider sending to Carver?

Refactor table/figure building

Much of the table/figure building is spread across multiple notebooks and it's becoming difficult to keep everything in order. These need to be refactored so everything is systematic and defensible

Boxplots should be split

Would like one boxplot to remain as they are, and another boxplot that excludes outliers. I hope the latter 'zooms' on the boxplot a bit so highlight performance differences for non-outlier ranks

Update all RQ numbers and references

Many of these are likely out of date w.r.t. their order.

Should be possible to grab the current chapter & section numbers and inject them instead!

DIT experiment approach

yeah, i really dislike the 'snapshot' approach for DIT i used
i should have done something else

from an experimental perspective
depends both on the input data and the heuristic
where input data varies by snapshot and changesets, and heuristic varies by change counting and indexing dev documents
having only the input data differ would have been better, and its not unobtainable

(need to edit this later to clarify)

Add random seed comparison

There is a need to show why a good seed or a bad seed can change outcome.

Visual idea: compare two LDA state matrices after update with each document/chunk

Reformat all chapters (again)

Turns out there's a lot of repeated text & ideas that would be better off served if the chapters were inverted with sections.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.