archives-web's People

Contributors

dependabot[bot], epicfaace, greenkeeper[bot], hesyifei, hs4man21, matthewturk247, sbcatania, ufxela, zixianma

archives-web's Issues

Some dates are missing archive articles - need to combine METS files

We need to combine the METS files for both parts of the issue:
https://stanforddailyarchive.s3.amazonaws.com/data.2013-nov/data/stanford/1991/09/19_01/Stanford_Daily_19910919_0001-METS.xml

and

https://tiles.archives.stanforddaily.com/data.2013-nov/data/stanford/1991/09/19_02/Stanford_Daily_19910919_0001-METS.xml

Currently, we show only the latter (19_02) and miss all of the 19_01 pages; see https://archives.stanforddaily.com/1991/09/19?page=1&section=MODSMD_ARTICLE2#issue

This is important because we're actually missing articles on the new site (which were there previously on the old site).
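
One way to picture the merge is sketched below. It assumes each METS file has already been parsed into a list of pages; the `{ part, pages }` shape, the `mergeIssueParts` name, and the sample page labels are hypothetical and not taken from the actual pipeline.

```javascript
// Hypothetical input shape: each part is { part: "19_01", pages: [{ label }] }.
function mergeIssueParts(parts) {
  // Sort by part suffix so that 19_01 comes before 19_02.
  const ordered = [...parts].sort((a, b) => a.part.localeCompare(b.part));
  const pages = [];
  for (const { part, pages: partPages } of ordered) {
    for (const page of partPages) {
      // Number pages continuously across parts and remember the source part.
      pages.push({ ...page, part, pageNumber: pages.length + 1 });
    }
  }
  return pages;
}

const merged = mergeIssueParts([
  { part: "19_02", pages: [{ label: "1B" }] },
  { part: "19_01", pages: [{ label: "1A" }, { label: "2A" }] },
]);
console.log(merged.map((p) => `${p.pageNumber}:${p.part}:${p.label}`));
```

Keeping the source part on every page means the viewer could still resolve the right METS and ALTO files for each page after the merge.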

Position header tags vertically instead of horizontally on home page

"About this collection", "Search the Collection", and "Subscribe" are currently in the same row. Fix this by stacking each header and its components vertically, one after the other.

Bonus: Make this responsive by stacking vertically when the page is narrow and switching to a horizontal row when the page is wide.

Things to fix before release

  • Fix data repeating
  • Redirect old site to new site - use now.sh for this
  • Search result heights with an expand button - #36
  • Be able to search everything, add slider - #37
  • Get DNS control of stanforddailyarchive.com

Get this done by Nov 17

An in-range update of react-bootstrap is breaking the build 🚨

The dependency react-bootstrap was updated from 1.0.0-beta.12 to 1.0.0-beta.13.

🚨 View failing branch.

This version is covered by your current version range and after updating it in your project the build failed.

react-bootstrap is a direct dependency of this project, and it is very likely causing it to break. If other packages depend on yours, this update is probably also breaking those in turn.

Status Details

FAQ and help

There is a collection of frequently asked questions. If those don’t help, you can always ask the humans behind Greenkeeper.


Your Greenkeeper Bot 🌴

corrections

Create a form on the frontend for users to write and submit corrections to articles. Someone at TSD then needs to approve the changes and run the pipeline (to be implemented, see #55) to propagate them.

views

It would be cool to see how many times a page has been viewed. Not sure how to implement this yet, though.

search helper/info display

Create a guide, key, or help bar that the user can expand to see hints for searching, e.g. wrapping text in quotation marks does an exact text match, just like in a Google search.
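
To make the quotation-mark hint concrete, here is a minimal sketch of how a query string could be split into exact phrases and loose terms. `parseSearchQuery` is a hypothetical helper for illustration; how the real search backend interprets quotes may differ.

```javascript
// Split a raw query into quoted exact phrases and remaining loose terms.
function parseSearchQuery(query) {
  const phrases = [];
  // Pull out "quoted phrases" first and blank them out of the remaining text.
  const rest = query.replace(/"([^"]+)"/g, (_, phrase) => {
    phrases.push(phrase.trim());
    return " ";
  });
  const terms = rest.split(/\s+/).filter(Boolean);
  return { phrases, terms };
}

const parsed = parseSearchQuery('"big game" stanford 1982');
console.log(parsed);
```

The help bar could show the same split back to the user, so it is clear which parts of the query are matched exactly.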

Action required: Greenkeeper could not be activated 🚨

🚨 You need to enable Continuous Integration on Greenkeeper branches of this repository. 🚨

To enable Greenkeeper, you need to make sure that a commit status is reported on all branches. This is required by Greenkeeper because it uses your CI build statuses to figure out when to notify you about breaking changes.

Since we didn’t receive a CI status on the greenkeeper/initial branch, it’s possible that you don’t have CI set up yet. We recommend using Travis CI, but Greenkeeper will work with every other CI service as well.

If you have already set up a CI for this repository, you might need to check how it’s configured. Make sure it is set to run on all new branches. If you don’t want it to run on absolutely every branch, you can whitelist branches starting with greenkeeper/.

Once you have installed and configured CI on this repository correctly, you’ll need to re-trigger Greenkeeper’s initial pull request. To do this, please click the 'fix repo' button on account.greenkeeper.io.

suggester

Integrate the CloudSearch suggester into the frontend. The suggester suggests against article_text (high fuzzy matching), article_title (high fuzzy matching), or author (exact match), depending on which index field the user is searching by.
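
A rough sketch of picking the suggester by index field and building the request URL follows. The suggester names and the endpoint in the usage line are assumptions; the real names depend on how the CloudSearch domain is configured, and the fuzzy-matching level is set on each suggester in the domain, not in the request.

```javascript
// Hypothetical suggester names, one per index field the user can search by.
const SUGGESTERS = {
  article_text: "article_text_suggester",
  article_title: "article_title_suggester",
  author: "author_suggester",
};

// Build a CloudSearch suggest request URL for the given field and partial query.
function buildSuggestUrl(endpoint, field, partialQuery, size = 5) {
  const suggester = SUGGESTERS[field];
  if (!suggester) {
    throw new Error(`No suggester configured for field "${field}"`);
  }
  const params = new URLSearchParams({ q: partialQuery, suggester, size: String(size) });
  return `${endpoint}/2013-01-01/suggest?${params}`;
}

// The endpoint below is a placeholder, not the project's real search domain.
const suggestUrl = buildSuggestUrl("https://search-example.us-east-1.cloudsearch.amazonaws.com", "author", "jo");
console.log(suggestUrl);
```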

Getting errors when running the script - "Cannot read property 'nodeValue' of undefined"

Getting errors when running the script

The last successful file parsed was text/1894/4/26/DIVL151.txt Adv. 17 Page 4

(node:15545) UnhandledPromiseRejectionWarning: TypeError: Cannot read property 'nodeValue' of undefined
    at Paper._callee$ (/home/ec2-user/environment/node_modules/@thestanforddaily/archives-web/lib/classes/Paper.js:85:63)
    at tryCatch (/home/ec2-user/environment/node_modules/regenerator-runtime/runtime.js:45:40)
    at Generator.invoke [as _invoke] (/home/ec2-user/environment/node_modules/regenerator-runtime/runtime.js:271:22)
    at Generator.prototype.(anonymous function) [as next] (/home/ec2-user/environment/node_modules/regenerator-runtime/runtime.js:97:21)
    at asyncGeneratorStep (/home/ec2-user/environment/node_modules/@babel/runtime/helpers/asyncToGenerator.js:3:24)
    at _next (/home/ec2-user/environment/node_modules/@babel/runtime/helpers/asyncToGenerator.js:25:9)
    at process._tickCallback (internal/process/next_tick.js:68:7)
(node:15545) UnhandledPromiseRejectionWarning: Unhandled promise rejection. This error originated either by throwing inside of an async function without a catch block, or by rejecting a promise which was not handled with .catch(). (rejection id: 1)
(node:15545) [DEP0018] DeprecationWarning: Unhandled promise rejections are deprecated. In the future, promise rejections that are not handled will terminate the Node.js process with a non-zero exit code.
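
The trace suggests an attribute lookup returned undefined before `.nodeValue` was read. As a sketch only (the code at Paper.js:85 is not shown here, so this is an assumption about the failure mode), a defensive reader plus a per-file try/catch would let the script report the bad file instead of dying on an unhandled rejection. `readNumericAttribute`, `parseAll`, and `parseFile` are hypothetical names.

```javascript
// Return the attribute as a number, or null when the element or attribute
// is missing or the value is not numeric.
function readNumericAttribute(element, name) {
  const attribute = element && element.attributes ? element.attributes[name] : undefined;
  if (!attribute) return null;
  const value = Number(attribute.nodeValue);
  return Number.isNaN(value) ? null : value;
}

// Run parseFile over every file and collect failures instead of rejecting.
async function parseAll(files, parseFile) {
  const failures = [];
  for (const file of files) {
    try {
      await parseFile(file);
    } catch (error) {
      failures.push({ file, message: error.message });
    }
  }
  return failures;
}

const fakeBlock = { attributes: { hpos: { nodeValue: "120" } } };
console.log(readNumericAttribute(fakeBlock, "hpos"), readNumericAttribute(fakeBlock, "vpos"));
```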

Make search results more compact

Right now, a single search result shows around 2 paragraphs and basically fills up the entire screen. We'd like each search result to be smaller and show only a snippet of text, so that the user can view many more search results on the screen at the same time.
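
A minimal sketch of trimming a result to a short snippet is shown below. `makeSnippet` and the 200-character default are assumptions for illustration, not existing project code.

```javascript
// Collapse whitespace and cut the text at a word boundary near maxLength.
function makeSnippet(text, maxLength = 200) {
  const normalized = text.replace(/\s+/g, " ").trim();
  if (normalized.length <= maxLength) return normalized;
  const cut = normalized.slice(0, maxLength);
  const lastSpace = cut.lastIndexOf(" ");
  // Cut at the last word boundary when there is one, then mark the truncation.
  return (lastSpace > 0 ? cut.slice(0, lastSpace) : cut) + "...";
}

console.log(makeSnippet("The quick   brown fox jumps", 12));
```

The full text can stay in the result object so that an expand button only has to toggle between the snippet and the original.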

An in-range update of query-string is breaking the build 🚨


☝️ Important announcement: Greenkeeper will be saying goodbye 👋 and passing the torch to Snyk on June 3rd, 2020! Find out how to migrate to Snyk and more at greenkeeper.io


The dependency query-string was updated from 6.11.1 to 6.12.0.

🚨 View failing branch.

This version is covered by your current version range and after updating it in your project the build failed.

query-string is a direct dependency of this project, and it is very likely causing it to break. If other packages depend on yours, this update is probably also breaking those in turn.

Status Details

Release Notes for v6.12.0

v6.11.1...v6.12.0

Commits

The new version differs by 2 commits.

See the full diff

filter search by author_title

Users can view the available author titles and select one or more to filter search results by. When several are selected, results matching any of them are shown (union).
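
The union behaviour could look roughly like this on the client side. The result shape and the sample titles are assumptions; only the `author_title` field name comes from this issue.

```javascript
// Keep results whose author_title matches any of the selected titles.
// An empty selection means no filter is applied.
function filterByAuthorTitles(results, selectedTitles) {
  if (selectedTitles.length === 0) return results;
  const selected = new Set(selectedTitles);
  return results.filter((result) => selected.has(result.author_title));
}

const sampleResults = [
  { id: 1, author_title: "Staff writer" },
  { id: 2, author_title: "Editorial staff" },
  { id: 3, author_title: "Contributing writer" },
];
console.log(filterByAuthorTitles(sampleResults, ["Staff writer", "Contributing writer"]));
```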

Link papers/articles to their online counterparts

For example, https://archives.stanforddaily.com/2002/05/31 could have a link to https://web.archive.org/web/20020602164045/http://www.stanforddaily.com/.

And https://archives.stanforddaily.com/2002/05/31#page=1&section%5B%5D=MODSMD_ARTICLE4 could have a link to https://web.archive.org/web/20020629021038/http://www.stanforddaily.com/tempo?page=content&id=8524&repository=0001_article.

These online articles could also be the source for archives-text.

For earlier papers and articles, we probably need to find the links manually.

For later papers and articles, maybe there will be some patterns in the URL?
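
For the paper-level links, one possible pattern is sketched below: the Wayback Machine accepts a partial timestamp in the URL and redirects to the closest capture it has. `waybackUrl` is a hypothetical helper, and whether a capture exists near a given date would still need checking.

```javascript
// Build a Wayback Machine URL for the Daily's home page near a given issue date.
function waybackUrl(archiveDate, originalUrl = "http://www.stanforddaily.com/") {
  // archiveDate is "YYYY/MM/DD", the same shape as the path in archives URLs.
  const match = /^(\d{4})\/(\d{2})\/(\d{2})$/.exec(archiveDate);
  if (!match) throw new Error(`Unexpected date: ${archiveDate}`);
  const [, year, month, day] = match;
  return `https://web.archive.org/web/${year}${month}${day}/${originalUrl}`;
}

console.log(waybackUrl("2002/05/31"));
```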

An in-range update of interactjs is breaking the build 🚨

The dependency interactjs was updated from 1.5.1 to 1.5.2.

🚨 View failing branch.

This version is covered by your current version range and after updating it in your project the build failed.

interactjs is a direct dependency of this project, and it is very likely causing it to break. If other packages depend on yours, this update is probably also breaking those in turn.

Status Details

make search in navbar more visible

Right now, search is in the far right corner, which is a little hard to spot at first, especially on screens with wider aspect ratios. Maybe add some padding so it's not so far right?

Method for searching for article overlays

In the METS file, first traverse "Physical Structure" to get all pages (e.g. ALTO00001).
Example file: https://s3.amazonaws.com/stanforddailyarchive/data.2013-nov/data/stanford/1999/12/01_01/Stanford_Daily_19991201_0001-METS.xml

<structMap LABEL="Physical Structure" TYPE="PHYSICAL">
		<div ID="DIVP1" LABEL="The Stanford Daily" TYPE="Newspaper" DMDID="MODSMD_PRINT MODSMD_ELEC">
			<div ID="DIVP2" ORDER="1" ORDERLABEL="1A" TYPE="COVER_PAGE">
				<fptr>
					<par>
						<area FILEID="IMG00001"/>
						<area FILEID="ALTO00001" BETYPE="IDREF" BEGIN="P1"/>
					</par>
				</fptr>
			</div>
...

Then traverse the whole file to find all elements with the matching attribute, e.g. [FILEID="ALTO00001"]. For each one, find the ancestor that has TYPE="ARTICLE" and add the element to that article's overlays.

e.g.

					<div ID="DIVL10" TYPE="CONTENT">
						<div ID="DIVL11" TYPE="ARTICLE" DMDID="MODSMD_ARTICLE1" LABEL="Earthquakes rock Stanford in '06, '89">
							<div ID="DIVL12" TYPE="HEADING">
								<div ID="DIVL13" TYPE="TITLE">
									<fptr>
										<area BETYPE="IDREF" FILEID="ALTO00001" BEGIN="P1_TB00006"/>
									</fptr>
								</div>
								<div ID="DIVL14" TYPE="AUTHOR">
									<fptr>
										<area BETYPE="IDREF" FILEID="ALTO00001" BEGIN="P1_TB00007"/>
									</fptr>
								</div>
							</div>

This adds P1_TB00006 and P1_TB00007 to the overlay for MODSMD_ARTICLE1.
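
The grouping step can be sketched on a simplified stand-in for the parsed structure. The plain-object tree shape (`children`, `areas`) and the `collectOverlays` name are hypothetical. The sketch also walks top-down and carries the enclosing article along, which should give the same grouping as looking up each element's ARTICLE parent.

```javascript
// Walk the logical structure and group ALTO block IDs by the enclosing article.
function collectOverlays(node, fileId, currentArticle = null, overlays = {}) {
  // Entering an ARTICLE div switches the article that nested blocks belong to.
  const article = node.TYPE === "ARTICLE" ? node.DMDID : currentArticle;
  for (const area of node.areas || []) {
    if (article && area.FILEID === fileId) {
      (overlays[article] = overlays[article] || []).push(area.BEGIN);
    }
  }
  for (const child of node.children || []) {
    collectOverlays(child, fileId, article, overlays);
  }
  return overlays;
}

const logicalTree = {
  TYPE: "CONTENT",
  children: [{
    TYPE: "ARTICLE",
    DMDID: "MODSMD_ARTICLE1",
    children: [{
      TYPE: "HEADING",
      children: [
        { TYPE: "TITLE", areas: [{ FILEID: "ALTO00001", BEGIN: "P1_TB00006" }] },
        { TYPE: "AUTHOR", areas: [{ FILEID: "ALTO00001", BEGIN: "P1_TB00007" }] },
      ],
    }],
  }],
};
console.log(collectOverlays(logicalTree, "ALTO00001"));
```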

Then find the corresponding positions and sizes in the ALTO file (e.g. https://s3.amazonaws.com/stanforddailyarchive/data.2013-nov/data/stanford/1999/12/01_01/Stanford_Daily-ALTO/Stanford_Daily_19991201_0001_ALTO0001.xml).

getBlockPositionAndSize(id) {
  // Scale factor found by testing. See https://github.com/TheStanfordDaily/archives/issues/2#issuecomment-491481280.
  const scaleFactor = 0.0003;
  // Find the tag with `ID="{id}"`.
  // https://stackoverflow.com/a/17268477/2603230
  const textBlock = this.altoData.find(`[ID='${id}']`)[0];
  // Read a numeric attribute from the text block and scale it.
  const scaled = name => Number(textBlock.attributes[name].nodeValue) * scaleFactor;
  const results = {
    id: id,
    x: scaled("hpos"),
    y: scaled("vpos"),
    width: scaled("width"),
    height: scaled("height")
  };
  console.log(results);
  return results;
}

@epicfaace do you think there's any easier way? Also, do we need to highlight anything other than TYPE="ARTICLE"? (e.g. TYPE="TITLE_SECTION" and TYPE="ADVERTISEMENT")

First page of results from cloudsearch is not shown

open back up a way to add new articles to the archives

In the past, alumni were able to "sponsor" the addition of articles. Perhaps bring this feature back. Currently, the archives are missing all pre-2014 content that wasn't sponsored, as well as all post-2014 content.

Look into image-to-text services (Amazon Textract, for example).

An in-range update of react-scripts is breaking the build 🚨

The dependency react-scripts was updated from 3.4.0 to 3.4.1.

🚨 View failing branch.

This version is covered by your current version range and after updating it in your project the build failed.

react-scripts is a direct dependency of this project, and it is very likely causing it to break. If other packages depend on yours, this update is probably also breaking those in turn.

Status Details

Release Notes for v3.4.1

3.4.1 (2020-03-20)

v3.4.1 is a maintenance release that includes minor bug fixes and documentation updates including upgrading Babel to fix a bug in the 7.8 release line. This release also brings support for TypeScript 3.8.

🐛 Bug Fix

  • react-scripts
  • babel-preset-react-app
  • cra-template, eslint-config-react-app, react-scripts
    • #7790 Widen eslint-config-react-app peer dependency versions (@lukyth)

💅 Enhancement

  • cra-template-typescript, cra-template
  • react-scripts

📝 Documentation

  • Other
    • #8515 Fix proxying API request docs (@hjr3)
    • #8561 Indicate that the file structure is the template's (@Vinnl)
  • react-scripts

🔨 Underlying Tools

  • babel-preset-react-app, create-react-app, react-dev-utils, react-error-overlay, react-scripts
  • react-scripts

Committers: 9

Migrating from 3.4.0 to 3.4.1

Inside any created project that has not been ejected, run:

npm install --save --save-exact react-scripts@3.4.1

or

yarn add --exact react-scripts@3.4.1

Commits

The new version differs by 13 commits.

  • d2f813f Publish
  • 7641a3c Prepare 3.4.1 release
  • d5b527f Update to Babel 7.9 (#8681)
  • 6adb82a Add React.StrictMode to default templates (#8558)
  • a452ddc Bump dependencies (#8620)
  • 3f699fd Fix proxying API request docs (#8515)
  • 4d26208 Use native ESLint behaviour when extending (#8276)
  • 8ba0ccb Whitelist main in template.json (#8539)
  • 7d3b72c Update template example in docs (#8561)
  • 2030ee1 Fix optional chaining and nullish coalescing support (#8526)
  • 038e6fa Widen eslint-config-react-app peer dependency versions (#7790)
  • 7e6d6cd Closes webpack dev server and exits process on "end" stdin (#7203)
  • af926d5 Bump pnp-webpack-plugin (#8509)

See the full diff
