Giter VIP home page Giter VIP logo

content_miner's People

Contributors

davidackroyd99 avatar

Watchers

 avatar  avatar

content_miner's Issues

Purge duplicates from wordlist

  • Main screen has button to purge duplicates from wordlist
  • Clicking it removes duplicate terms from the wordlist chosen on the main GUI

Demo video

Before V0.1 public release, there should be a video in the README that quickly shows people how the software works

Profiler

The output needs to show some stats about how long the process took so that performance can be measured and improved.

Unit tests of existing code

  • Simple asset statements in an if __name__ == '__main__' section of each file
  • Make sure that these tests can be run for the project automatically by PyCharm or similar

GUI options to customise output

  • Tick box to enter 'scan mode'
  • 'Scan mode' is called something better (maybe 'targets only'?)
  • Main screen has textbox for allowed number of unknown targets (default 1)
  • Tick box for whether or not we want target frequency after target in the output

Console app

  • Software actually works as a console app rather than some janky Python script
  • Command line arguments for wordlist and content
  • Switch for scan mode (default false)
  • Switch for adding target frequency (default true)

SRT support

.srt files are cleaned of subtitle crap before being imported (may involve an extra dependency)

Simple GUI

  • GUI opens when application is run from cmd line
  • Main screen has textbox for known wordlist filepath (required field)
  • Main screen has textbox for content filepath (required field)
  • Main screen has textbox for destination filepath (required field)
  • Main screen has button marked 'Find Sentences'
  • Clicking 'find sentences' will execute the existing sentence finding functionality and save the result to the destination filepath

Tidy GUI

  • Explore layout/theming w/ TTK
  • All the elements are lined up in some sort of layout that doesn't look ridiculous
  • Window heading (next to minimize, exit, etc.) shows name and version number

Error handling

Any I/O issues are gracefully handled and the user gets a message letting them know what went wrong.

Pipelines

  • Dev pipeline which lints & runs the tests (to be run before PRs are merged)
  • Release pipeline which lints, runs the tests and creates the installer

Clean up existing repo

  • Have proper .gitignore
  • Only contain files for software, no personal wordlists or content

Progress indicator

Because the process takes a vaguely long time, the user will want to know how far through they are and that the app is actually doing something.

Basic organisation of code/refactoring

  • The code needs to be organised into separate files (not all just in main.py)
  • Public/private functions
  • Docstring at the top of each file, roughly explaining scope
  • Docstring for each public function
  • Also need type hints

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.