cscorley / dissertation Goto Github PK
View Code? Open in Web Editor NEWLicense: MIT License
License: MIT License
For random seed, show before/after of two different seeds
Historical should have more success than either of the others
add highlights to the config result tables for configs used in other areas of the work
currently only set once, but should reset every LDA init
Possibly split up the combination chapter (after reorg) into two separate ones. It seems there's plenty of need to separate model optimization and corpus optimization given how much data I have. Blocked by #4
Much of the table/figure building is spread across multiple notebooks and it's becoming difficult to keep everything in order. These need to be refactored so everything is systematic and defensible
Would like one boxplot to remain as they are, and another boxplot that excludes outliers. I hope the latter 'zooms' on the boxplot a bit so highlight performance differences for non-outlier ranks
Many of these are likely out of date w.r.t. their order.
Should be possible to grab the current chapter & section numbers and inject them instead!
yeah, i really dislike the 'snapshot' approach for DIT i used
i should have done something elsefrom an experimental perspective
depends both on the input data and the heuristic
where input data varies by snapshot and changesets, and heuristic varies by change counting and indexing dev documents
having only the input data differ would have been better, and its not unobtainable
(need to edit this later to clarify)
There is a need to show why a good seed or a bad seed can change outcome.
Visual idea: compare two LDA state matrices after update with each document/chunk
Right now the topics are spread across chapters. Instead, flip chapters and sections.
search for articles using abstracts
Turns out there's a lot of repeated text & ideas that would be better off served if the chapters were inverted with sections.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.