open-editions / corpus-joyce-portrait-tei
The Open Scholarly Edition of James Joyce's A Portrait of the Artist as a Young Man
License: GNU General Public License v3.0
The prototype transformation on github.io works fine in Firefox, but fails to transform correctly in Chrome and similar WebKit-based browsers. Chrome complains that "This XML file does not appear to have any style information associated with it. The document tree is shown below."
This is very likely due to an XSL issue, since the XSLT processor complains that:
runtime error: file portrait.xsl line 98 element attribute
xsl:attribute: Cannot add attributes to an element if children have been already added to the element.
In other words, an xsl:attribute instruction around line 98 of portrait.xsl runs after child content has already been written to its parent element; attribute instructions have to come before any child nodes.
Or, roughly, the proportions of planned features that are finished (marked up in every chapter).
Maybe use <lg type="song"> and the source attribute?
A word can count as a member of this category if:
pandybat)
Find these by using a spell checker.
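A minimal sketch of the spell-checker pass, assuming the novel's text has already been stripped of markup and that a plain word list is available. The function name `unknown_words` and the word list are hypothetical; a real run would use a full dictionary file rather than the tiny list shown in the usage note.

```python
import re


def unknown_words(text, known_words):
    """Return words in `text` that are absent from `known_words`.

    Words are compared case-insensitively and reported once each,
    in order of first appearance.
    """
    known = {w.lower() for w in known_words}
    found = []
    # [^\W\d_]+ matches runs of letters, including non-ASCII ones such as "œ".
    for token in re.findall(r"[^\W\d_]+", text):
        word = token.lower()
        if word not in known and word not in found:
            found.append(word)
    return found
```

Called as `unknown_words("He felt the pandybat on his hand", ["he", "felt", "the", "on", "his", "hand"])`, this reports only `pandybat`. The candidates would still need manual review, since a word list also misses proper names and foreign words.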
Sub-tasks:
See also relevant issue at @aaronplasek's repo: aaronplasek/19th_C._novel_scraper#7
As @tcatapano pointed out in #25, <milestone> is supposed to be an empty tag. Change this to an empty tag and remove the text, noting for XSL that * * * is the desired rendering of rend="asterixes".
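The change could be scripted rather than made by hand. The following is only a sketch: the function name `empty_milestones` is hypothetical, and it assumes the elements carry no XML namespace (a namespaced TEI P5 file would need the namespace added to the tag name).

```python
import xml.etree.ElementTree as ET


def empty_milestones(xml_text):
    """Strip all text and child elements from every <milestone>.

    Attributes such as unit and rend are kept, so the stylesheet can
    still decide how to render rend="asterixes".
    """
    root = ET.fromstring(xml_text)
    # Collect first, so the tree is not modified while being iterated.
    for ms in list(root.iter("milestone")):
        for child in list(ms):
            ms.remove(child)
        ms.text = None
    return ET.tostring(root, encoding="unicode")
```

Given `<milestone unit="section" rend="asterixes"><l> * * * </l></milestone>`, the result is a self-closing milestone with the same two attributes. Text following the milestone is left untouched.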
Is it "foetus" or "fœtus"? The Norton edition gives "foetus."
Also, is it "poena damni" (around line 4581) or "pœna damni"?
Ae ligatures in "Synopsis Philosophiae Scholasticae"
I'm tempted to think that the electronic version from the OTA has these ligatures, since they're represented by such strange characters: Philosophi.^#
This is a far-future goal, but hopefully these guidelines will be useful for our editing process:
This is interior to the text but exterior to the novel.
Around line 1646. Declines to 'maria', for BVM.
Among them, show line numbers (10s) from HWG's edition merged here.
With screenshots, I imagine
Note that these appear in the manuscripts in exactly this form: three asterisks.
This will be an ongoing effort, of course, but it'd be a good idea to get a working bibliography filled out with the major relevant works, at least.
The Modernist Journals Project has the full run of The Egoist that contains the first serialized publication of Portrait. It might be fun to do OCR on these PDFs.
PR #54 is missing some </said> tags.
The problems with <i> tags in the original SGML version are at least:
<i>s surround italicized lines, even if those lines aren't verse, but prose.
<i>s surround lines in line groups, when the entire line group should be italicized (in the style sheet), not each line individually.
I'd recommend:
Removing <i> tags in any line group, since all verse groups will be rendered as italic in the stylesheet.
Changing the remaining <i>s to <hi rend="italic">.
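Both recommendations can be sketched as a text-level pass over the file. This is an illustration, not the project's actual conversion script: the function name `fix_italics` is hypothetical, and the regular expression assumes that <lg> elements are not nested and that <i> tags carry no attributes.

```python
import re

I_TAG = re.compile(r"</?i>")
LG_BLOCK = re.compile(r"<lg\b[^>]*>.*?</lg>", re.DOTALL)


def fix_italics(text):
    """Drop <i> inside line groups; turn other <i>s into <hi rend="italic">."""
    # Step 1: inside each <lg>...</lg>, remove <i> and </i> entirely,
    # since the stylesheet italicizes whole line groups.
    text = LG_BLOCK.sub(lambda m: I_TAG.sub("", m.group(0)), text)
    # Step 2: everywhere else, replace the remaining tags with TEI <hi>.
    text = text.replace("<i>", '<hi rend="italic">')
    return text.replace("</i>", "</hi>")
```

A string-based pass is used here because the SGML source may not parse as XML. Once the file is well-formed, an XML-aware transformation would be the safer choice.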
Not quite sure how to do this. The Origin of Species, the New Testament, Aquinas.
Right now it's:
<p>1 Pair Buskins. </p>
<p>1 D. Coat. </p>
<p>3 Articles and White. </p>
<p>1 Man's Pants. </p>
Use named entity recognition software?
The original markup featured leading $s on lines of verse. Replace these with <lg> and <l>s, or more semantic tags, where appropriate.
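A rough sketch of that replacement, assuming that each verse line starts with a single $ and that consecutive $ lines belong to one line group. The function name `wrap_verse` is hypothetical, and the sample lines in the usage note are only illustrative input.

```python
def wrap_verse(lines):
    """Wrap runs of $-prefixed lines in <lg>, one <l> per verse line.

    Lines without a leading $ are passed through unchanged.
    """
    out = []
    group = []

    def flush():
        # Emit the pending run of verse lines as one line group.
        if group:
            out.append("<lg>")
            out.extend(f"  <l>{verse}</l>" for verse in group)
            out.append("</lg>")
            group.clear()

    for line in lines:
        if line.startswith("$"):
            group.append(line[1:].strip())
        else:
            flush()
            out.append(line)
    flush()
    return out
```

For example, `wrap_verse(["<p>He sang:</p>", "$First verse line", "$Second verse line"])` keeps the paragraph as is and follows it with one <lg> holding two <l> elements. Choosing a more semantic wrapper, such as a type attribute on <lg>, would still be an editorial decision for each passage.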
Specification is here. There might be some advantages to using TEI Simple, especially when it comes to transforming the text for display. Maybe it'd be better to start from scratch.
Bonus: mark these up with JP Riquelme's section division descriptions from the Norton edition.
Sub-tasks:
<TEI.2>
xml:id to Joyce, James.
<emph>: is that because it is a foreign language <foreign>, a <title>, etc. I would leave the typo for the final
@class for this kind of line: <l><emph rend="italic">Pull out his eyes,</emph></l> Why is it in cursive?
<l> * * * </l>
<l>II</l>
@xml:id
One reviewer comments:
Since there will be readers who are unfamiliar with GitHub, "it might be useful for the editor to explore ways of presenting the site with a separate, simple html page that explains what the project is, what to expect in GitHub, etc."
Currently:
<p><said who="Simon Dedalus">—<hi rend="italic">
<quote>I'll pay you your dues, father, when you cease turning
the house of God into a pollingbooth.
</quote></hi></said></p>
Would like to get better attribution for the quote here.
It may be useful to extract all the songs from the novel at some point.
Should it be:
<milestone unit="section" rend="asterixes"><l> * * * </l></milestone>
</div> <!-- 1.3 -->
<div n="1.4" type="section">
or:
</div> <!-- 1.3 -->
<milestone unit="section" rend="asterixes"><l> * * * </l></milestone>
<div n="1.4" type="section">
or:
</div> <!-- 1.3 -->
<div n="1.4" type="section">
<milestone unit="section" rend="asterixes"><l> * * * </l></milestone>
@jamesosullivan, what do you think?
Document:
From Chapter I, Dante:
<p><said who="Dante">—
<quote>Woe be to the man by whom the scandal cometh!</quote>
said Mrs Riordan.
<quote>It would be better for him that a millstone were
tied about his neck and that he were cast into the depth of the
sea rather than that he should scandalise one of these, my least
little ones.</quote> That is the language of the Holy Ghost.</said></p>