Comments (6)
@SuaYoo in the draft PR, this currently only sorts on the new order if you explicitly pass the ?sortBy=lastCrawlTime
param (modified the existing sort by field instead of adding a new one, making it effectively lastUpdated
).
I see your point though and will look into changing it to return a lastUpdated
field and use that instead
from browsertrix.
Another option:
- Add
currCrawlStartTime
to db - Sort desc by
{currCrawlStartTime -1, lastCrawlTime -1}
, which should return current crawls first, then finished crawls that have completed ordered descending bylastCrawlTime
. A multi-field index would speed this up as well.
or a variant: sort on lastCrawlStartTime
(as current crawl), lastCrawlTime
, and status
.
Will try a few options and see what is the simplest solution with the desired result.
from browsertrix.
Sorry for the late response--to clarify, is this work handling default sorting by the backend, or adding a new field lastUpdated
? The latter would be preferred, so that the frontend both sort by the value and show the value without having to re-calculate it.
from browsertrix.
Can sortBy
accept a secondary/additional sort fields? Then we can keep lastCrawlTime
and the frontend can request to filter sortBy=lastCrawlTime,finished,started,modified,created
.
from browsertrix.
Can
sortBy
accept a secondary/additional sort fields? Then we can keeplastCrawlTime
and the frontend can request to filtersortBy=lastCrawlTime,finished,started,modified,created
.
We can do that! The trade-off would be either we need to create a bunch of indices for the possible combinations, or lookups might be slower than they would with a predictable combination and accompanying index.
from browsertrix.
@SuaYoo if you're cool with it I think I'd prefer to just set a lastUpdated
field in the db and build an index around that to keep the query and sorting fast. Otherwise we might get back into the slow lookups territory we moved away from recently.
Sorting options for workflows would then be: ("created", "modified", "firstSeed", "lastCrawlTime", "lastUpdated")
, with lastUpdated
being either the current crawl start time if a workflow is running or the finished time of the last crawl if not.
from browsertrix.
Related Issues (20)
- [Bug]: Crawl Workflow Status Stuck at "Waiting (At Capacity)" on Local Deployment
- [Feature]: search collection items by tags
- [Feature]: Improve UX of prefix search or switch to fulltext search HOT 2
- [Feature]: QA should include certain workflow settings
- [Feature]: Org Billing Page
- [Change]: Graph non-HTML page QA results as a discrete bar HOT 2
- Use first seed for workflows with no name in browser profile detail workflows list
- Shoelace progress rings always display at 100% completion in Chrome HOT 1
- Shoelace button groups don't appear correctly HOT 2
- [Bug]: Profile VNC connection fails while profile browser is still running (was: Profile ping returning success after expired) HOT 1
- [Feature]: Allow setting scale for QA runs in helm chart
- [Bug]: The copy-field label is inside the field
- [Feature]: Show and update the QA results bar graph while analysis is running
- [Change]: Update column sorting for all tables HOT 1
- Indicate pages with significant failures/unable to be analyzed separately from "No data" in QA meter HOT 1
- Use rounded border radius on QA meter bars
- [Bug]: QA analysis fails all the time for "pol frontpage with all context"
- Add button to QA crawl in Watch Crawl tab when crawl completes
- QA: Show number of files and errored pages separately from QA meter HOT 1
- [Bug]: Ensure the qa configmap updated for long running QA runs
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from browsertrix.