Comments (9)
Hello,
Glad to hear about your interest in MLeap. We are currently making a big push to get to version 0.3.0, which will get us much closer to a stable API to work with. We have three pull requests to get through, and then we will release it to Maven Central. Over the next week, we will be building out documentation and examples showing how to use MLeap.
Thank you for your patience. We'll check back in on this ticket as we build out the documentation.
from mleap.
We had to write our own OneHotEncoder because Spark’s uses metadata to perform the transformation. This is unusual for Spark models, so we instead use a custom estimator/model that stores the size of the output vectors, which means we don’t need to rely on metadata to do one-hot encoding.
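To make the idea concrete, here is a minimal plain-Python sketch of an estimator/model pair in which the model carries the output vector size as its own state. This is not MLeap's or Spark's actual code: the class names `SizedOneHotEstimator` and `SizedOneHotModel` are hypothetical, and it assumes the input is already an integer category index.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class SizedOneHotModel:
    """Hypothetical model: the output vector size is part of the model itself."""

    size: int  # number of category slots, fixed at fit time

    def transform(self, index: int) -> List[float]:
        # Only the stored size is needed here, no column metadata.
        if not 0 <= index < self.size:
            raise ValueError(f"index {index} is outside [0, {self.size})")
        vector = [0.0] * self.size
        vector[index] = 1.0
        return vector


class SizedOneHotEstimator:
    """Hypothetical estimator: learns the size from the training indices."""

    def fit(self, indices: List[int]) -> SizedOneHotModel:
        if not indices:
            raise ValueError("cannot fit on an empty list of indices")
        # Assumption: categories are dense integer indices starting at 0.
        return SizedOneHotModel(size=max(indices) + 1)


model = SizedOneHotEstimator().fit([0, 2, 1, 2])
print(model.size)
print(model.transform(1))
```

Because the size lives in the model, it can be written out and read back with the rest of the model's attributes, and scoring a single value needs nothing beyond the model object.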
Re bundle.ml: You can choose whether to serialize to JSON or to protobuf. Your choice should depend on which transformers you’re using and which models you’re training (e.g. for tree-based models we recommend protobuf). I’ll add documentation and an example of serialization to protobuf.
As for TC (TrueCar), they were using it when we left, but we have not stayed up to date on the latest usage there. If you’d like to talk with someone who’s running it in production, we can make an intro.
Best
Mikhail
from mleap.
@ltang We have added fairly complete documentation here:
http://mleap-docs.combust.ml/
Please let us know in a separate issue if you would like us to add more specific documentation.
Cheers,
Hollin
from mleap.
Hi @ltang ,
We checked in a few examples in both Scala and PySpark. Can you give them a try?
https://github.com/combust-ml/mleap-demo/tree/master/notebooks
We're working on getting a public model server up in the next few weeks so you can test .deploy() as well.
Best
M
from mleap.
Thanks. I'll give it a try and get back to you in a few days.
from mleap.
@seme0021 Thanks for the demos.
I just walked through the Airbnb example (except the deploy part). It works out of the box!
I could not get Toree to work due to a dependency conflict, so I may have to dig deeper. But the example works fine as long as I import all the MLeap dependencies.
Several minor questions:
- Why is the demo using MLeap's OneHotEncoder rather than Spark's default one?
- Is the bundle.ml format simply a bundle of JSON files (node.json and model.json)?
- Is TrueCar already using MLeap?
Our own use case requires some additional transformers. I will check your source code to see whether we can add custom transformers.
from mleap.
Hey, just giving an update here. We are building out comprehensive documentation for MLeap 0.5.0. You can see it here as it develops:
from mleap.
@geoHeil I responded to this question in another thread, but I'll post here as well.
Here is a list of supported transformers: http://mleap-docs.combust.ml/core-concepts/transformers/support.html
We are coming out with scikit-learn modules on PyPI with the next release of MLeap, 0.6.0, which should be a fairly quick release after 0.5.0.
from mleap.