Comments (6)
It looks like these work fine, with respect for sorting and grouping ?
tb.filter(d => op.row_number() < 10)
tb.groupby('groupvar').filter(d => op.row_number() < 10)
but of course a more compact writing could prove convenient
from arquero.
It certainly works, but it's verbose and hard to keep straight for a pretty common operation in exploratory analysis particularly.
E.g., since row_number is one-indexed, that code tb.filter(d => op.row_number() < 10)
actually returns the first 9 items, not the first 10! And tail operations aren't symmetric with respect to the index put in:tb.filter(d=>op.row_number() > op.count() - 10
does return 10.
from arquero.
yes indeed, the tail operation is more complex to write:
tb.params({nrows: naiss_tb.numRows()}).filter((d,$) => op.row_number() > $.nrows - 10)
or
let nrows = tb.numRows();
tb.filter(d => op.row_number() >= nrows - 10) ;
from arquero.
I'm open to adding a slice()
method to tables that provides syntactic sugar over window based filtering (or something more performant for ungrouped tables), followed by a reify()
call.
The most delicate aspects concern grouped tables. I'm assuming each group should be sliced separately. Still, the ordering of rows in the output would be solely determined by any orderby
criteria, oblivious to groups.
Following JavaScript conventions, the method could have a table.slice(start[, stop])
signature, where negative values indicate an offset from the last row of the group. Output tables would retain the groupby
criteria of the input.
Note that standard JavaScript does not support "inside out" specifications like the slice(-1, 0)
described above. I'm unconvinced that Arquero should go down that route. An alternative would be to take two slices and concatenate them, or use a non-sugared windowed filter.
How does this sound?
from arquero.
I would be satisfied with a slice operator that follows JS slice syntax while respecting (and preserving) group_by. Selection of non-contiguous subsets is not very important and can be done inside filter
as described above. tbl.slice(-5)
for tail(5)
works great.
from arquero.
Added slice()
verb in v2.2.0.
from arquero.
Related Issues (20)
- arquero op.first_value function issue HOT 2
- table.print() should return table to enable chaining HOT 2
- Citation for academic paper HOT 2
- Nuxt 2.0 build failing due to apache-arrow exports HOT 6
- Add op functions to work with Map and Set objects HOT 1
- `fromCSV` fails with uncaught TypeError on CSV with headers only HOT 1
- Join ignores empty string as suffix
- Table expressions do not support underscores as numeric separators in numeric literals
- CSV parse functions don't get run on null values
- Failing during production build: minification problem? HOT 10
- Verb to drop columns by name? HOT 2
- derive can not handle string? HOT 2
- Table concatenation results in empty table
- Problems getting Arquero to find it's types in Typescript HOT 5
- array_agg and undefined/none values HOT 3
- fromArrow -> Unrecognized type: "undefined"
- COUNTIF-like aggregate function
- NextJS swcMinify is breaking arquero
- Apache Arrow 15 support HOT 9
- docs: broken links HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from arquero.