Comments (1)
hi @githubclj, interesting idea but its not something we want to implement because if we go that way we will need to always generate all possible combinations for a dataset. The firsts versions of chatito used to work that way, but for large datasets it just takes too much time to generate all combinations and if i don't generate all combinations, the utterances at the end of the generation process would never be seen. If you really need to have the same dataset, you can try to generate all possible combinations, the data will contain the same amount of examples each time but with different order. Just be aware that it is not recommended to generate all combinations.
Chatito takes ideas from probabilistic programming languages, and the idea is that the utterances are pulled from a cloud of probabilities on demand, and since we can pull a subset of all the possibilities, we need them to be a random sample, hope this makes it clear why this feature is not going to be implemented.
from chatito.
Related Issues (20)
- relex
- Unhandled crash when generating testing data HOT 3
- Online ide HOT 2
- Optional slots HOT 1
- [BUG] Slot regression between v2.1.5 and v.2.2.1 HOT 5
- Import failing HOT 2
- Weighted probability HOT 10
- Snips NLU output format error HOT 1
- 数据量太大,然后速度太慢了 HOT 2
- How can I add previous generated json file with new examples? HOT 1
- How can I add Number? HOT 1
- "Can't generate X examples" warning doesn't say which intent it is referring to HOT 2
- How to use Chatito in angularjs HOT 1
- Training/Testing Number Via Cli? HOT 2
- how to use regex_features? HOT 1
- Downloading dsl files? HOT 1
- How to start Chatito on local host HOT 1
- I got JavaScript heap out of memory when training HOT 1
- How to determine whether happened over-fit?
- Save entities for test HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from chatito.