Giter VIP home page Giter VIP logo

corpus-joyce-portrait-tei's People

Contributors

alanburnett avatar charlesreid1 avatar garrettdonnelly avatar gitter-badger avatar goldieshen avatar jonathanreeve avatar joshuakolb avatar jrmcbride avatar jss2277 avatar rdeal25 avatar zk2020 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

corpus-joyce-portrait-tei's Issues

XSLT not working on Chrome

The prototype transformation on github.io works fine on Firefox, but fails to transform correctly on Chrome and similar WebKit-based browsers. Chrome complains that "This XML file does not appear to have any style information associated with it. The document tree is shown below."

This is very likely due to an XSL issue, since XSL complains that:

runtime error: file portrait.xsl line 98 element attribute
xsl:attribute: Cannot add attributes to an element if children have been already added to the element.

mark up Joyce's coinages and Joycean compound words

A word can count as a member of this category if:

  • it isn't in the OED
  • it's in the OED, but the first citation is from Joyce's Portrait (see pandybat)

Find these by using a spell checker.

Sub-tasks:

  • Chapter 1
  • Chapter 2
  • Chapter 3
  • Chapter 4
  • Chapter 5

change <milestone>s to empty tag

As @tcatapano pointed out in #25, <milestone> is supposed to be an empty tag. Change this to an empty tag and remove the text, noting for XSL that * * * is the desired rendering of rend="asterixes".

check Latin ligatures in MS

Is it "foetus" or "fœtus"? Norton edition gives "foetus,"

Also, "poena damni" circa line 4581, or "pœna damni"?

Ae ligatures in "Synopsis Philosophiae Scholasticae"

I'm tempted to think that the electronic version from the OTA has these ligatures, since they're represented by such strange characters: Philosophi.^#

build working bibliography

This will be an ongoing effort, of course, but it'd be a good idea to get a working bibliography filled out with the major relevant works, at least.

remove or convert `<i>` tags

The problems with <i> tags in the original SGML version are at least:

  • <i>s surround italicized lines, even if those lines aren't verse, but prose.
  • <i>s surround lines in line groups, when the entire line group should be italicized (in the style sheet), not each line individually

I'd recommend:

  1. remove <i> tags in any line group, since all verse groups will be rendered as italic in the stylesheet.
  2. convert inline <i>s to <hi rend="italic">

mark up names

Used named entity recognition software?

  • Chapter 1
  • Chapter 2
  • Chapter 3
  • Chapter 4
  • Chapter 5

finish removing $ marks

The original markup featured leading $s on lines of verse. Replace these with <lg> and <l>s, or more semantic tags, where appropriate.

add section <div>s

Bonus: mark these up with JP Riquelme's section division descriptions from the Norton edition.

some thoughts

  • Why are you using <TEI.2>
  • If you want to add Joyce's corrections and errors don't forget to ad an xml:id to Joyce, James.
  • I would do a basic difference between the typographical layer and the semantic one. Thus, I would avoid the use of tags like <emph>: is that because is a foreign language <foreign>, a <title>, etc. I would leave the typo for the final
  • Maybe here you would have to creat a @class for this kind of line: <l><emph rend="italic">Pull out his eyes,</emph></l> Why is in cursive?
  • Not sure this is a line: <l> * * * </l>
  • This is not either a line: <l>II</l>
  • Once the paragraphs are settled, you could give an @xml:id

make edition website

One reviewer comments:

Since there will be readers who are unfamiliar with GitHub, "it might be useful for the editor to explore ways of presenting the site with a separate, simple html page that explains what the project is, what to expect in GitHub, etc."

how should we mark up reported speech?

Currently:

<p><said who="Simon Dedalus">—<hi rend="italic">
<quote>I'll pay you your dues, father, when you cease turning 
the house of God into a pollingbooth.
</quote></hi></said></p>

Would like to get better attribution for the quote here.

what section do section dividers belong to?

Should it be:

<milestone unit="section" rend="asterixes"><l> * * * </l></milestone>

</div> <!-- 1.3 --> 

<div n="1.4" type="section"> 

or:

</div> <!-- 1.3 --> 

<milestone unit="section" rend="asterixes"><l> * * * </l></milestone>

<div n="1.4" type="section"> 

or:

</div> <!-- 1.3 --> 

<div n="1.4" type="section"> 

<milestone unit="section" rend="asterixes"><l> * * * </l></milestone>

@jamesosullivan, what do you think?

programmatically link passages mentioned in criticism

  1. Amass as many secondary works as possible in PDF.
  2. OCR the PDF files, if necessary.
  3. Identify passages discussed in secondary sources with quotation marks, block quotes, and/or >3 word matching phrases between the novel and source.
  4. Link passage in novel with secondary source. Include URL to full text database.

mark up biblical quotes

From Chapter I, Dante:

<p><said who="Dante">—
<quote>Woe be to the man by whom the scandal cometh!</quote> 
said Mrs Riordan. 
<quote>It would be better for him that a millstone were 
tied about his neck and that he were cast into the depth of the
sea rather than that he should scandalise one of these, my least
little ones.</quote> That is the language of the Holy Ghost.</said></p>

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.