Comments (3)
What is counter-intuitive? We don't have a notion of null rows, so we cannot make an ignore_nulls argument. They behave differently, and count will not be exposed as a general catch-all. That's why we only expose it on Expr. And pl.len() will be exposed as a catch-all. They are different for a reason.
from polars.
Everything conforms to how the documentation specifies.
import polars as pl
from polars import col
df = pl.DataFrame({"a": [1, 2, None]})
df.select(pl.count()) # returns 3 (counts all rows)
df.select(pl.count("a")) # returns 2 (counts non-nulls only)
df.select(col("a").count()) # returns 2 (counts non-nulls only)
df.group_by(pl.lit(True)).count() # returns 3 (counts all rows)
The reason we cannot ignore nulls in group_by().count() is because there is no concept of a null row. Is a [0, 1, null] a null row? Is [null] a null row? (It's not the same as an empty row, which would be [].) A group_by operation returns a frame, not a row, hence we cannot say how many non-null rows there are, because there is no concept of a null row. Does this make sense?
I agree that it is confusing. It is self-consistent, but it is nonetheless confusing.
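To make the "null row" ambiguity concrete, here is a minimal sketch in plain Python. It deliberately uses no polars calls: rows are tuples, None stands in for null, and every helper name is hypothetical and only for illustration. It shows that a per-column non-null count is well defined, while "non-null rows" depends on which definition you pick.

```python
# Plain-Python illustration (hypothetical helpers, not polars API).
# Each tuple is one row; None plays the role of null.
rows = [
    (0, 1, None),        # partly null
    (None, None, None),  # entirely null
    (2, 3, 4),           # no nulls
]


def count_rows(rows):
    """Row count: every row counts, whatever it contains."""
    return len(rows)


def count_non_null(rows, col_index):
    """Column count: number of non-null values in a single column."""
    return sum(1 for row in rows if row[col_index] is not None)


def count_rows_without_any_null(rows):
    """One possible 'non-null row' definition: no value in the row is null."""
    return sum(1 for row in rows if all(v is not None for v in row))


def count_rows_not_all_null(rows):
    """Another possible definition: at least one value in the row is non-null."""
    return sum(1 for row in rows if any(v is not None for v in row))


total_rows = count_rows(rows)
non_null_first_col = count_non_null(rows, 0)
non_null_last_col = count_non_null(rows, 2)
strict_rows = count_rows_without_any_null(rows)
lenient_rows = count_rows_not_all_null(rows)
```

The two row-level definitions disagree on the same data, which is the point made above: a frame-level count has no single obvious meaning for "ignore nulls", whereas a count on one column does.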
from polars.
@ritchie46 @mcrumiller Thanks for the explanations. It seems like I missed the following statement in the doc.
This way of using the function is deprecated. Please use :func:`len` instead.
from polars.