Comments (3)
After some research I do not think it is worth implementing the protocol. It is not a good match for describing arrow data, since it does not support bitpacked booleans. Anything with nulls becomes impossible to describe.
We have now optimized our __array__
implementation to provide zero copy conversions where possible. I think that should suffice for integrations.
If someone can make a strong case for implementing __array_interface__
, I will consider reopening this.
from polars.
This means one could eventually to import torch; torch.tensor(dataframe)
, am I right? But it would still be up to Polars dev team to minimise the number of copies made, etc.
from polars.
This means one could eventually to
import torch; torch.tensor(dataframe)
, am I right? But it would still be up to Polars dev team to minimise the number of copies made, etc.
Actually, torch.tensor
does not seem to be using the NumPy protocol. We implement __array__
but it still cannot take a DataFrame. You have to explicitly export to NumPy first, e.g. torch.tensor(df.to_numpy())
or use our new functionality df.to_torch()
.
from polars.
Related Issues (20)
- Need fill_infinity feature HOT 2
- Inconsistent API of join_asof
- unreachable code panic on invalid argument for unique HOT 1
- Extract Underlying Coding from a Categorical or Enum Datatype HOT 2
- Cast categorical and enum data types directly to signed integers
- scan_csv should be able to read "0" and "1" into a boolean type
- serde deserialisation of AnyValue doesn't work HOT 2
- Problem with list eval on length 1 dataframes
- Add example showing how to unpivot multiple columns HOT 1
- Add pre-filtered decode to new-streaming Parquet source
- Dataframes that have both strings and categories cannot be serialized and deserialized from disk.
- LazyFrame.map_batches() ordering guarantees
- computation of list.len for null list seems incorrect HOT 1
- Schema for groupby-agg of literal raised to some power does not match `collect` result.
- Severe memory issues with `rolling` and `group_by`
- Schema inference fails when colums are produced without a name with pyodbc and sql server HOT 2
- StringCacheMismatchError when using joblib.Parallel and Categorical data HOT 1
- Expr.rank() function changed to unstable sort in polars >=1
- Significant performance difference depending on how I use the "filter" method HOT 5
- readJSON Fails on JSON with Newline Characters HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from polars.