ukwa / ukwa-ui Goto Github PK
View Code? Open in Web Editor NEWA new user interface for the UK Web Archive
License: BSD 3-Clause "New" or "Revised" License
A new user interface for the UK Web Archive
License: BSD 3-Clause "New" or "Revised" License
Links to all the reading rooms are broken
Currently, switching to Advanced Search adds the right fields, but also requires the user to enter a standard query in the 'Enter search phrase...' box. The Advanced Search form should replace that box, or at least render that box optional.
Secondarily, the search is sometimes returning unexpected results, but i think it's because the Solr endpoint is defaulting to OR
logic when searching rather than AND
. As indicated here this can be changed by extending the query with &q.op=AND
or by changing the Solr server configuration.
The search display reverts to basic - UX query: is this ok?
As raised by Somaya, some links are coming out double-quoted. e.g. https://beta.webarchive.org.uk/en/ukwa/noresults
Looking at the source, it seems the messages files that contain 'over escaped' HTML fragments. e.g.
https://github.com/ukwa/ukwa-ui/blob/master/src/main/resources/i18n/messages.properties#L73
Which creates a link to https://beta.webarchive.org.uk/en/ukwa/"cookies"
when the browser tries to make sense of the HTML.
One of a set of questions that arose during an internal code review.
This codebase appears to have no functioning tests. There is one test, but it does not appear to test anything, and breaks the build. Please remove or improve the test.
We need to add an advanced search page, and the proposal is to base this on the corresponding page in Shine. Can you work with Chris P. to pursue this?
Example:
https://dev.webarchive.org.uk/gd/ukwa/contact
Scots page, reCaptcha in English.
I don't know if there are country codes for Welsh and Scots, if not, don't worry about hacking it, this can be closed.
The NLS logo in the footer needs replacing (i've emailed the new jpg to Mindy and Lee)
If you hover over the filter Help symbols (the Question Mark symbols, not the chevrons), the tooltip displayed is suffixed "tooltip", e.g. "Access facet help tooltip"
The last word shouldn't be there.
I suspect these are intended as placeholders for more useful tooltips though.
One of a set of questions that arose during an internal code review.
When an error occurs, such as one of the Solr back-ends being unavailable, a response is given that is generic and unhelpful.
All error pages should be themed as normal.
Also, ideally, the codebase should have a development
mode where stack traces are reported through the UI, and a production
mode where the stack traces are collapsed/hidden and a user-friendly message is returned. The production
mode should also email errors to a configurable email account.
The idea that the general search should simultaneously search across both the Collections and Targets Solr Service as well as the resource-level full-text index. The type
facet could then be used to filter the faceted results.
Depends on whether the schemas are able to be cross-searched.
Home Page >
Enter a search term (no filter) >
n results returned >
Change language (e.g. change from English to Welsh) >
Issue 1: 0 results returned for "" (keyword has been reset) >
Enter a keyword and search >
Issue 2: 0 results returned for "" (but a keyword has been supplied) >
Reset filters >
Enter search term >
n results returned
Issue 3: Why do filters need to be reset? We haven't set them!
One of a set of questions that arose during an internal code review.
Although solrj
is on the classpath and in the pom.xml
, it does not appear to be used anywhere. The code itself however re-implements a lot of the SolrJ client logic, like building HTTP requests, setting query parameters, escaping values and so on.
Why did you re-invent this logic rather than re-using SolrJ?
(see https://cwiki.apache.org/confluence/display/solr/Using+SolrJ for more details on using SolrJ)
The following comments were noted in review:
• The FAQ page is too full on, too many sparse titles all over the page, it should be presented as an expandable list
• it's formatting is out of sync with the standard links in the footer.
• And when you click on the individual links, it can be quite jumpy, it doesn't look good
enter http://henryjacksonsociety.org/ into search box and get 'error 500 server error'
enter henryjacksonsociety.org/ and it works fine (the website expected is the top result
We (the BL) have changed accounts with surveymonkey. Can you replace the old link with this one:
https://www.surveymonkey.co.uk/r/ukwasurvey01
Raised by @JasonNotOnHereYet
What symbols users should use when specify search data in single field, in order to be transformed into field list and passed to SOLR query?
Wayback Landing Page is always English.
Example:
From the Welsh version of:
Special Collections > Climate Change Debates > 10:10uk
https://dev.webarchive.org.uk/cy/ukwa/collection/369 >
https://dev.webarchive.org.uk/cy/ukwa/wayback/OA/29991231999999/http://www.1010uk.org/
We land on
https://www.webarchive.org.uk/wayback/archive/20170630224615/https://1010uk.org/
It should be
https://www.webarchive.org.uk/wayback/archive-cy/20170630224615/https://1010uk.org/
Staff browsers are connecting directly to the HTTPS website rather than going via the staff proxy. So, to offer staff the right link we need to whitelist the whole staff LAN address range.
I'm not sure how this works for the IPv6-based users coming in on their laptops, so this might not resolve the whole problem.
As discussed, we need this component set up to build via Docker Hub and deployable via environment variables. etc.
Home page search box wouldn't accept characters when typed.
Also, the tooltip indicating the empty field is misplaced, it points above the field, not to it.
Moto g4, Android 7, Chrome
"Special Collections" overlaps the text.
https://dev.webarchive.org.uk/en/ukwa/collection
Samsung galaxy Tab 10 inch screen SM-T520
Android 4.4.2
Chrome browser
Example:
...webarchive.org.uk/cy/ukwa/info/nominate
Question mark appearing in the middle of words.
"Diogelwch wefan o?r DG"
"hail-gynllunio?n sylweddol"
etc.
Jack Russell, not Jack Russel
If I click on Advanced Search, the url stays the same. Not necessarily a bug (because it displays and hides a frame within the current page), just questioning this from a UX point of view.
Later on, we should move the Solr setup into this repo and close down that old one. We should then fix up the Docker Hub build.
Thinking about using web archives for storytelling, it occurs to me that one way this is done for other media is via embeds, like YouTube videos embedded in a blog, or Twitter Cards so more info can be embedded in Twitter conversations. We should consider adding some kind of embedded version of a web page, e.g. an autogenerated thumbnail API and oEmbed hook?
LIst of (possible?) omissions:
Main Heading/Banner: UK Web Archive and sub heading
Selected Filter information
The nominate items
Basically I've identified those from looking at the missing tags in the properties file, hope that's enough to go on.
Once ukwa/ukwa-manage#25 and dependent tasks are complete, we will need to update the the UI to make use of them.
The language labels (English/Welsh/Scottish) appear to be displayed inconsistently in the different languages.
E.g. I would expect the label to be in the language of that to be selected.
So on an English or Scottish page, "Welsh" would be displayed as "Cymraeg", not "Welsh", say.
However on the English and Scottish pages this is so, whereas on the Welsh page:
"English" is displayed as "Saesneg" - the Welsh for "English". Should be "English".
Scottish"is displayed as "Gaeleg yr Alban" - the Welsh for "Scottish Gaelic". Should be whatever the Scottish Gaelic of "Scottish Gaelic" is. Presumably "Gàidhlig" as that's on the English page.
Page after the last one specified in url results in a page with ...
1. ... no indication its actually not the one specified, or that the one specified doesn't exist.
2. ... has 2 sets of page buttons! OR
3. ... has no buttons, twice...
Examples:
https://beta.webarchive.org.uk/en/ukwa/collection/329?page=3
gives no indication the page specified doesn't exist (there are only 2 pages).
https://beta.webarchive.org.uk/en/ukwa/collection/329?page=4
Shows 2 sets of buttons
https://beta.webarchive.org.uk/en/ukwa/collection/329?page=10
Shows the back arrow (with no page buttons) twice.
Lumped together as I'm assuming its the same underlying or related cause(s).
We should link to a Beta OpenWayback that uses the new theme/colours.
Mindy has been looking at this. See ukwa/waybacks#1
A Beta Wayback service is now running on the access
server, but needs connecting up to https://beta.webarchive.org.uk/wayback (as defined in https://github.com/ukwa/ukwa-access-services/blob/master/docker-compose.yml#L12)
One of a set of questions that arose during an internal code review.
Some minor issues that arose in general:
in the Java:
in the templates:
and in the JavaScript:
Or number of Titles should be displayed with the Collection Title.
Example
https://dev.webarchive.org.uk/en/ukwa/collection/1035
then
https://dev.webarchive.org.uk/en/ukwa/collection/1037
is empty
Example:
https://dev.webarchive.org.uk/en/ukwa/collection/1107
Black and Asian Britain
First site in the list:
A.M. Qattan Foundation
Search on "Qattan" in the Special Collection Search Box under the Collection Title
No results.
(I've checked that its not because the site above is in Arabic; another example is "Evaristo" which is in content of the Title of the same name further down the collection)
Another example is searching on "Brexit" within the Brexit collection.
Don't know if its a configuration, indexing or coding issue.
Related Cards listed on
https://trello.com/c/sQ1KwFI2/212-snapshots-vs-titles-search-considerations
We need to update the Cambridge logo.
NOTE: This is not necessarily part of this website engine, and is a proposal held here because it's a part of the front-end workflow.
We need to provide a URL resolution endpoint (for e.g. Document Harvester or other links in the catalogues) that bounces the user to the right end point. i.e. given this:
https://www.webarchive.org.uk/access/resolve/20170403111155/https://www.gov.uk
The resolver should look it up in the Open Access Wayback and then, if it's a 451, redirect to the appropriate LDL Reading Room system if the user is on a known Reading Room IP. If we have it, it should redirect to Open Access Wayback. If we don't have it, we could offer links to holdings in other web archives (via Memento). This latter functionality could be embedded in the Wayback page, but it's probably simpler to take the Mementos component and either update it (and run it standalone) or merge the logic into this website.
For extra credit, the system should check if the item is available in the Reading Room playback system and be apologetic if the item is known but not yet available.
In the meantime, we could just use Apache to redirect the user to re-written URLs based on IP.
The link to the NLS site in the footer goes to https://www.uk/
not https://www.nls.uk/
. Please fix that!
Currently we use a hard-coded IP address mapping table to work out whether to present the user with links to Ericom. This is a pain to maintain over time.
Alternatively, could we actually test for access in the web page itself? i.e. default to no-access
and then use JavaScript to check which (if any) of the secure playback gateways are visible to the user. If they can see one of them, change the page to offer direct access (probably storing the result in a cookie to avoid re-checking over and over again).
One of a set of questions that arose during an internal code review.
This part of common.js appears to be modifying image tags when there is no image given.
Surely this would be better done server-side? Switching to the default image if none is set? Or is there a reason this can't be done?
Currently, links to playback put a timestamp of 29991231999999 in the URL to direct to the latest copy. This seems to expose dead sites far too easily.
As our users are usually interested in the past, it makes more sense to default to showing our earliest copy rather than the latest, when we have no other timestamp information to go on.
So, can we use {playback-prefix}10001231999999/{url}
instead?
Do we need to add a random part to avoid locking problems? e.g. {playback-prefix}10001231######/{url}
?
See design for issue #31
The fix is not quite right but the exact rendering has been deferred to fix in this issue.
The WCT-era links, especially for Targets and Collections, need to be supported by the new website. The WCT IDs are stored in W3ACT but are not yet exposed in the same way.
Some of these look rather complicated, e.g. https://www.webarchive.org.uk/ukwa/target/28180500/collection/26312782/source/collection because the latter part of the URL is being used to store the context the target is being viewed in.
Others are complicated because the paging of other parameters: https://www.webarchive.org.uk/ukwa/collection/100757/page/1/source/collection
In practice, it should suffice to match the primary ID (discarding the remainder of the URL).
We should also review the main navigational URLs and provides mappings in the new site. e.g.
etc.
Current version of ukwa-ui
does not provide pages for Targets. As the design settles down, I suggest we plan to augment the ukwa-ui
app with a statically-generated site that covers Targets and Collections, and reduce ukwa-ui
over time to just the dynamic search part.
(Initially raised in #9)
@anjackson, previously we had this requirement for logging:
We have implemented 1 and 3.
What is implied by 2? Shall we just dump the contents of application.properties to the log file? After removing passwords of course.
When you say "Code has very little logging", how much logging do you expect? It currently logs Solr queries as per (1) above and stack traces as per (3). What else would you like logged?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.