The metroscope from syvwlch

metroscope's Issues

Start using Pull Requests instead of coding against the master branch

Implement rhyme detection

Tag all words with their rhyming part, to make it easy to find terminal and internal rhymes.
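
A minimal sketch of that tagging, assuming each word already comes with its CMU-style phones as a space-separated string. The helper names `rhyming_part_of` and `tag_rhymes` and the sample words are illustrative only, and the rule used (everything from the last stressed vowel to the end) is one common definition of a rhyming part, not necessarily the one the project will settle on.

```python
def rhyming_part_of(phones):
    """Return the phones from the last stressed vowel to the end of the word."""
    parts = phones.split()
    # Walk backwards looking for a vowel with primary (1) or secondary (2) stress.
    for i in range(len(parts) - 1, -1, -1):
        if parts[i][-1] in "12":
            return " ".join(parts[i:])
    # No stressed vowel found: fall back to the whole pronunciation.
    return phones


def tag_rhymes(words_with_phones):
    """Map each word to its rhyming part, so equal tags mean the words rhyme."""
    return {word: rhyming_part_of(phones) for word, phones in words_with_phones}


tags = tag_rhymes([
    ("sleep", "S L IY1 P"),
    ("deep", "D IY1 P"),
    ("woods", "W UH1 D Z"),
])
```

With tags like these, terminal rhymes are lines whose last words share a tag, and internal rhymes are repeated tags inside a line.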

Add a staging app to Heroku setup

Add support for alternate pronunciations from cmu_dict & custom_dict

So after some refactoring in PR #68, the only thing I pull from CMU dict is the phones.
Currently, when I do, I use the first item in the list and ignore any others, and in the custom dict I only store one.

So to support multiple pronunciations, I would need to:

Make the 'phones' key in custom dict point to a list containing the current string.
Add a new property holding the index into the word's phones list. Setting it out of range raises an IndexError, and it is irrelevant when the list is None.
Update all the consumers of _phones to use the index, not zero.
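
The three steps above could look roughly like this. `WordSketch` is a hypothetical stand-in for WordBuilder, and the names `phones_index` and `_phones_list` are assumptions made for the sketch; only `_phones` and the IndexError behaviour come from the notes above.

```python
class WordSketch:
    """Hypothetical stand-in for WordBuilder, holding several pronunciations."""

    def __init__(self, word, phones_list=None):
        self.word = word
        self._phones_list = phones_list  # list of phones strings, or None
        self._phones_index = 0

    @property
    def phones_index(self):
        return self._phones_index

    @phones_index.setter
    def phones_index(self, value):
        if self._phones_list is None:
            return  # no pronunciations known, so the index is irrelevant
        if not 0 <= value < len(self._phones_list):
            raise IndexError(f"no pronunciation #{value} for {self.word!r}")
        self._phones_index = value

    @property
    def _phones(self):
        """Consumers read this instead of indexing the list with zero."""
        if self._phones_list is None:
            return None
        return self._phones_list[self._phones_index]


# Illustrative data: two pronunciations of the same spelling.
word = WordSketch("read", ["R EH1 D", "R IY1 D"])
word.phones_index = 1  # switch to the second pronunciation
```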

Mark disagreements between meter and word pronunciation

Currently I’m just applying the meter directly to each word.
I should at least mark where the meter disagrees with the word’s pronunciation
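
A rough sketch of that marking, assuming the meter and the word's stresses are both strings with one digit per syllable. The function name `meter_disagreements` is made up for the example, and treating secondary stress ("2") as compatible with either meter slot is an assumption, not a decision recorded in this issue.

```python
def meter_disagreements(meter, stresses):
    """Return the syllable positions where the meter contradicts the word."""
    clashes = []
    for position, (wanted, actual) in enumerate(zip(meter, stresses)):
        if actual == "2":
            continue  # assumption: secondary stress can go either way
        if wanted != actual:
            clashes.append(position)
    return clashes


# A trochaic word ("10") dropped into an iambic slot ("01") clashes on both syllables.
clashes = meter_disagreements("01", "10")
```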

Add unit tests for 404/500 errors on major pages

Should prevent deploying code that just errors out on page load.

Store the phones for the original word in WordBuilder instance and custom dict

Store the full phones string, because that's the underlying data for both the stresses() method and the rhyming_part() method of the pronouncing package.
That means that once I have the phones in custom_dict, I should stop using stresses_for_word() and call stresses() directly instead, and add a _phones property to WordBuilder.

Make custom_dict fail safe

Add default values for when a particular key is missing in the dict for an entry, eg the entry has syllables but no stresses.
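
One possible shape for this, assuming custom_dict is a plain dict of per-word dicts. The key names follow the ones mentioned in these issues (syllables, stresses, phones); the `None` defaults and the helper name `safe_entry` are assumptions for the sketch.

```python
# Assumed defaults: None means "not provided, fall back to other sources".
DEFAULT_ENTRY = {"syllables": None, "stresses": None, "phones": None}


def safe_entry(custom_dict, word):
    """Return the word's entry with every expected key present."""
    entry = custom_dict.get(word.lower(), {})
    # Keys present in the entry win over the defaults.
    return {**DEFAULT_ENTRY, **entry}


custom_dict = {"metroscope": {"syllables": ["met", "ro", "scope"]}}
entry = safe_entry(custom_dict, "Metroscope")
```

Callers can then read `entry["stresses"]` without a KeyError and decide what to do when it is None.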

Move the poems to a database

Instead of storing the poems in text files and running the analysis each time, load the entire data object into a database with the following tables:

poems
poets
meters

Update the run script to:

load the four sample poems into the database on start
load the poem objects from the database instead of from disk
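
A self-contained sketch of the three tables and the load-on-start step. It uses the standard library's sqlite3 with an in-memory database purely to stay runnable on its own; the column names, the helper names, and the placeholder rows are all assumptions, and the real app may well use a different database layer.

```python
import sqlite3

# Assumed schema: the issue only names the tables, not their columns.
SCHEMA = """
CREATE TABLE poets  (id INTEGER PRIMARY KEY, name TEXT NOT NULL UNIQUE);
CREATE TABLE meters (id INTEGER PRIMARY KEY, name TEXT NOT NULL UNIQUE,
                     pattern TEXT NOT NULL);
CREATE TABLE poems  (
    id INTEGER PRIMARY KEY,
    title TEXT NOT NULL,
    body TEXT NOT NULL,
    poet_id INTEGER NOT NULL REFERENCES poets(id),
    meter_id INTEGER NOT NULL REFERENCES meters(id)
);
"""


def load_samples(conn, samples):
    """Create the tables and insert (title, body, poet, meter, pattern) tuples."""
    conn.executescript(SCHEMA)
    for title, body, poet, meter, pattern in samples:
        # UNIQUE names plus OR IGNORE keep poets and meters from duplicating.
        conn.execute("INSERT OR IGNORE INTO poets (name) VALUES (?)", (poet,))
        conn.execute(
            "INSERT OR IGNORE INTO meters (name, pattern) VALUES (?, ?)",
            (meter, pattern),
        )
        conn.execute(
            "INSERT INTO poems (title, body, poet_id, meter_id) VALUES (?, ?, "
            "(SELECT id FROM poets WHERE name = ?), "
            "(SELECT id FROM meters WHERE name = ?))",
            (title, body, poet, meter),
        )
    conn.commit()


def poems_by(conn, poet):
    """Load poem titles for one poet from the database instead of from disk."""
    rows = conn.execute(
        "SELECT poems.title FROM poems "
        "JOIN poets ON poets.id = poems.poet_id "
        "WHERE poets.name = ? ORDER BY poems.title",
        (poet,),
    )
    return [title for (title,) in rows]


conn = sqlite3.connect(":memory:")
load_samples(conn, [
    ("Placeholder Poem A", "first line\nsecond line",
     "Placeholder Poet", "iambic pentameter", "0101010101"),
    ("Placeholder Poem B", "another line",
     "Placeholder Poet", "iambic pentameter", "0101010101"),
])
```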

Check custom dictionary before CMU

Now that there’s a custom dictionary for words that don’t show in CMU, check it first.

Should be faster, and would allow overriding CMU if needed?
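
The lookup order could be as simple as the sketch below. Both dictionaries are plain dicts here as stand-ins for the real ones, and `lookup_phones` and the sample entries are hypothetical.

```python
def lookup_phones(word, custom_dict, cmu_dict):
    """Check the custom dictionary first, then fall back to CMU."""
    key = word.lower()
    if key in custom_dict:
        return custom_dict[key]  # custom entries override CMU
    return cmu_dict.get(key)  # None when neither dictionary knows the word


# Illustrative stand-ins for the two dictionaries.
cmu = {"read": "R EH1 D"}
custom = {"read": "R IY1 D", "metroscope": "M EH1 T R AH0 S K OW2 P"}
phones = lookup_phones("Read", custom, cmu)
```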

Research using pytest fixtures

Read up on pytest fixtures
Make list of good candidates for first use

Get rid of all self.assert in favor of plain assert

With pytest, should get all the introspection needed on plain assert.
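
For example, a conversion might look like this. `count_syllables` is a made-up helper standing in for whatever the real tests exercise.

```python
import unittest


def count_syllables(stresses):
    """Hypothetical helper: one stress digit per syllable."""
    return len(stresses)


# Before: unittest style, with the assertion going through self.
class CountSyllablesTest(unittest.TestCase):
    def test_count(self):
        self.assertEqual(count_syllables("010"), 3)


# After: a plain function with a plain assert, which pytest can collect and report on.
def test_count():
    assert count_syllables("010") == 3
```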

Add users

So the site can keep track of the scansion proposed by various users.

Install pytest

Get pytest to run our existing tests

Already know pytest doesn’t support subTest, so there’s gonna be some work involved.

Remove all subTest use

Switch to Test Driven Development

Worked well for me last summer, time to do the same with this project.

stress_line marks vowels that are silent/elided

Broke this off from #12 since the fix will be different.

Currently I can force an elision by replacing the offending vowel with an inverted comma in the original text, but that looks weird when the syllable is silent in normal speech.

Alternatively, I can add an entry to the custom dict, but that will only scale for the most common occurrences.

Refactor show_stress_line and uppercase_stress_line into a single function

That way we scan the line only once, and the syllabification can help the stress marks land inside a different syllable.

Add database to Flask

install SQLAlchemy’s flask plugin
create the first table
add a shell context processor
add data via shell

Start using Travis CI to do Continuous Integration

Ruh-roh: in CMU dict the vowel stress marker “1” marks the strongest stress.

I assumed that it was:

0 is no stress
1 is optional stress
2 is required stress

And I was wrong: in CMU dict, 0 is no stress, 1 is primary stress, and 2 is secondary stress.
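
For reference, a small sketch of the CMU dict convention, where 1 is primary stress and 2 is secondary. The names `STRESS_MEANING` and `stresses_from_phones` and the sample phones are illustrative; the function just keeps the stress digits from a phones string.

```python
# CMU dict stress markers, attached to the end of each vowel phone.
STRESS_MEANING = {
    "0": "no stress",
    "1": "primary stress",
    "2": "secondary stress",
}


def stresses_from_phones(phones):
    """Keep only the stress digits, one per vowel, in order."""
    return "".join(ch for ch in phones if ch in STRESS_MEANING)


pattern = stresses_from_phones("P OW1 AH0 T R IY0")
labels = [STRESS_MEANING[digit] for digit in pattern]
```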

Custom dict should include a full WordBuilder instance for each entry

This is the next logical step to address #13 and other issues going forward with words that aren't in the CMU dictionary and/or don't break down to syllables properly with the current syllabifier.

Make Metroscope a module

First step towards TDD, and allows creation of poem-specific scripts.

Set up Heroku to push master to production after a pull request

Show readme.md on the About page

Never have the same code written twice, right?

Looks like Markdown is the right package to install for this.

Make custom dictionary an optional argument

Cleaner than having it defined in the body of the function.
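
Something along these lines, using the usual `None` default so the shared dictionary is never mutated by accident. `DEFAULT_CUSTOM_DICT` and `syllables_for` are hypothetical names, and the sample entry is made up.

```python
# Hypothetical module-level default, used when the caller passes nothing.
DEFAULT_CUSTOM_DICT = {"metroscope": {"syllables": ["met", "ro", "scope"]}}


def syllables_for(word, custom_dict=None):
    """Look up a word's syllables, letting callers swap in their own dictionary."""
    if custom_dict is None:
        custom_dict = DEFAULT_CUSTOM_DICT
    entry = custom_dict.get(word.lower())
    return entry["syllables"] if entry else None


default_result = syllables_for("Metroscope")
override_result = syllables_for("metroscope", {"metroscope": {"syllables": ["me", "tro", "scope"]}})
```

It also makes tests easier, since each test can pass in a tiny dictionary of its own.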

Tooltips don't show on mobile

They only show on desktop and they're not styled by Bootstrap...

Use pytest to run automated tests

Once pytest runs our existing tests with no failures:

Switch to using it to run them in automation for Continuous Integration

Consider replacing pronouncingpy with NLTK

Since we already switched to NLTK for the syllabification, we could also use:

http://www.nltk.org/_modules/nltk/corpus/reader/cmudict.html

Would allow using same format for custom_dict, maybe?

Add more poems!

Add the major poems from chapter one of the handbook of poetry.

Allow users with CHANGE_METER permission to choose which meter to apply to the current poem.

Everyone starts with the default scansion of a poem, but when logged in is able to edit it.

Start using the Flask test_client for functional testing

Write one test using test_client

Implement syllable-hyphenation

Use Pyphen with a custom dictionary in front to fix those cases where hyphenation doesn’t break along all syllables, e.g. equi-va-lent versus e/qui/va/lent.

Try adding a word to custom dict with just syllables

Should work now that we .extend phones list if a word gets syllables from custom dict but not phones.

Consider adding a CLI to Metroscope

With a default colored text output, perhaps?

Research installing metroscope in editable mode via pipenv

Should allow easy imports anywhere in the project, including tests which could go back to a separate folder.

Show more data using tool tips?

Use tool tips to show optional info about a word, such as # of syllables, stress pattern, etc...

Make Metroscope.py into a package

Move test text to a file

Get rid of Hamlet while you’re there too.

Switch from SonoriPy to NLTK

SonoriPy has been incorporated into NLTK and will no longer be updated, as per recent update to its repository.

Need to plan a transition, probably once I'm really solid on the unit tests for the functions that use it.

Add rhyming_part to custom_dict

Two ways to do this:

Just store the actual phones as if they came from CMU_dict.
Store a word that is in CMU_dict and rhymes with the custom word.

Refactor line-level logic into a class

Current line level logic manipulates strings directly.

Refactor that logic into a class while separating the analysis logic from the display logic. An instance of the line-level class should contain a list of the WordBuilder class instances for all of the words in the line, and build any string representations from that list as needed.

For this refactoring exercise, there will be three string representations:

str
repr
refactored logic to generate the HTML currently shown on the site
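
A rough sketch of what that class could look like. `WordStub` is a hypothetical stand-in for WordBuilder, `LineSketch` is a placeholder name, and the HTML markup is invented for the example rather than copied from the site.

```python
from html import escape


class WordStub:
    """Hypothetical stand-in for WordBuilder: just text and a stress pattern."""

    def __init__(self, text, stresses):
        self.text = text
        self.stresses = stresses

    def __repr__(self):
        return f"WordStub({self.text!r}, {self.stresses!r})"


class LineSketch:
    """Holds the words of a line and builds every string view from that list."""

    def __init__(self, words):
        self.words = list(words)

    def __str__(self):
        return " ".join(word.text for word in self.words)

    def __repr__(self):
        return f"LineSketch({self.words!r})"

    def to_html(self):
        # Display logic only: no analysis happens here.
        spans = (
            f'<span data-stresses="{escape(word.stresses)}">{escape(word.text)}</span>'
            for word in self.words
        )
        return " ".join(spans)


line = LineSketch([WordStub("the", "0"), WordStub("woods", "1")])
```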
