Comments (12)
Hey, the Colab link is unavailable, could you fix it? I really want to solve this problem on my end, and your draft might help.
from markovify.
@markelovstyle I was gonna make a small Colab implementation based on markovify for a Russian corpus, so gimme a week or so and I will post a working example.
Thanks for your interest in this library, @wooferclaw. I'm not familiar enough with Russian to be able to diagnose the issue. One step, however, would help me and/or a more Russian-familiar community member resolve this: could you attach a minimal Python script that demonstrates the problem?
I would check your text/char encoding first. Then I would try well_formed=True and reject_reg=' ' (a space, for example) to disable rejection of non-standard chars.
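A minimal sketch of the encoding check, assuming the corpus arrives as raw bytes in an unknown encoding. The helper name `decode_corpus` and the list of candidate encodings are illustrative assumptions, not part of markovify; only the standard library is used.

```python
def decode_corpus(raw, encodings=("utf-8", "cp1251", "koi8-r")):
    """Return (text, encoding) for the first candidate that decodes cleanly.

    UTF-8 goes first because it is strict: Cyrillic text saved in a
    single-byte encoding almost never passes as valid UTF-8. The
    single-byte candidates accept nearly any input, so their order is
    only a guess (an assumption of this sketch).
    """
    for enc in encodings:
        try:
            return raw.decode(enc), enc
        except UnicodeDecodeError:
            continue
    raise ValueError("none of the candidate encodings decoded the data")


# Example: bytes written by a Windows tool in cp1251.
raw_bytes = "Привет, мир".encode("cp1251")
text, used = decode_corpus(raw_bytes)
print(used, text)
```

Once the text decodes cleanly it is an ordinary `str`, and it is worth eyeballing a few lines before building a model from it.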
@jsvine If you're interested, I can assist with the Russian and Hebrew languages. Also, I wanted to thank you for markovify! Great job. I love it.
Hi @asigalov61, and thank you for the offer and kind words. If you would like to provide examples of, and improvements to, using Markovify in those languages, that'd be great. Feel free to open an issue or PR, or to email me directly ([email protected]).
Hello, how are things going with the Russian-language integration? I really need Markovify in my project, but unfortunately I work with a Russian corpus.
@markelovstyle Hey, I can help with Russian stuff.
Have you tried this?
f.write(TXT_String.encode('utf-8', 'replace'))
markov_text_model = markovify.NewlineText(text, well_formed=False, state_size=markov_chain_state_size)
And then try smaller state sizes (e.g. 2) and also higher overlap limits. Other settings are important too.
Let me know.
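One way to try the "higher overlaps" suggestion, as a sketch. markovify's `make_sentence` returns None when every attempt is rejected, and its `tries`, `max_overlap_ratio`, and `test_output` keyword arguments control that check. The helper `make_sentence_relaxed` and the specific ratio values are hypothetical, chosen only for illustration.

```python
def make_sentence_relaxed(model, overlap_ratios=(0.7, 0.85, 1.0), tries=100):
    """Call model.make_sentence with progressively looser overlap limits.

    `model` is assumed to be a markovify text model such as
    markovify.NewlineText; any object with a compatible
    make_sentence(**kwargs) method works.
    """
    for ratio in overlap_ratios:
        sentence = model.make_sentence(tries=tries, max_overlap_ratio=ratio)
        if sentence is not None:
            return sentence
    # Last resort: skip the overlap test against the original corpus.
    return model.make_sentence(tries=tries, test_output=False)


# Usage with markovify (assumed installed; `text` is your corpus string):
#   import markovify
#   model = markovify.NewlineText(text, well_formed=False, state_size=2)
#   print(make_sentence_relaxed(model))
```

Looser limits mean more of the output may be copied verbatim from the corpus, so this trades originality for getting a result at all.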
@markelovstyle Take a look at my implementation of markovify:
https://github.com/asigalov61/Markovify-Piano/blob/main/Markovify_Piano.ipynb
You are welcome to use the code, as it supports the full range of UTF-8 chars, which should work fine for Russian as well.
Thanks, it works, but not as well as English. In most cases I get None.
@markelovstyle If it works sometimes, it means you need to adjust the generator settings.
Also, the corpus must be properly formatted. This implementation requires sufficiently long sentences and a sufficiently long corpus.
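A rough way to sanity-check corpus size before building a model, as a sketch. It assumes a newline-delimited corpus (one sentence per line, as with `markovify.NewlineText`). The helper `corpus_stats` and the "usable line" heuristic (more words than the state size) are assumptions for illustration, not rules defined by markovify.

```python
def corpus_stats(text, state_size=2):
    """Summarize a newline-delimited corpus.

    A line counts as "usable" here if it has more words than
    `state_size`; this threshold is a heuristic of this sketch.
    """
    lines = [line.split() for line in text.splitlines() if line.strip()]
    usable = [words for words in lines if len(words) > state_size]
    return {
        "lines": len(lines),
        "usable_lines": len(usable),
        "words": sum(len(words) for words in lines),
    }


sample = "a b c\n\nd e\nf g h i"
print(corpus_stats(sample, state_size=2))
```

If most lines are not usable at the chosen state size, lowering the state size or adding more text is likely to help more than tuning the generator.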
Guys, I appreciate your help very much. @asigalov61 I have tried many encoding variations, none of them worked.
Here, guys. I made a draft version. Works great on my end. Try it out and let me know.
https://colab.research.google.com/drive/1OLagaj21zjV5kxjR5DIU4kx7jHQG8ggt?usp=sharing