etcbc / bhsa Goto Github PK
View Code? Open in Web Editor NEWHebrew Bible + Linguistic annotations in text-fabric format. Fixed and ongoing versions.
Home Page: https://etcbc.github.io/bhsa/
License: MIT License
Hebrew Bible + Linguistic annotations in text-fabric format. Fixed and ongoing versions.
Home Page: https://etcbc.github.io/bhsa/
License: MIT License
I assume there's no parsing information for qere variants, is that right? Like part of speech, number, gender, etc.
Bug/Problem
In the bhsa feature description it says that "nametype" is a feature of the objectype "lex". This looks to be the case indeed when checking SHEBANQ. Jer 1:1 shows the presence of three nametype values (2x "pers", 1x "topo):
When running a MQL query in SHEBANQ that looks for the value "topo" of the feature "nametype" of the object-type "lex" in Jer 1:1 it should find "Anathot". But it doesn't (https://shebanq.ancient-data.org/hebrew/query?version=2017&id=3479). Instead it finds only a 8 words in all of Jeremiah (there should be more then 500 topos in Jer). "Anathot" in Jer 1:1 is not found, even though it has received the value "topo". The same happens when looking for "pers" in Jer - only 20 are founds while there should be more than 1000.
A quick comparison with the bhsa TF app shows the same results:
However, in contrast to the feature description (https://etcbc.github.io/bhsa/features/nametype/) it seems that "nametype" is attached to the object-type word in the bhsa TF app where the accurate results can be retrieved:
The linking of "nametype" with the object-type "word" was not done in SHEBANQ, however:
Conclusion
Only a very limited amount of "nametype" values are linked with the object-type "lex". This is true for both the bhsa TF app as well as SHEBANQ 2017. However, all "nametype" values are linked with the object-type "word" in the bhsa TF app.
Suggestion
Change the official bhsa feature description and make "nametype" a feature of "word". This is already implemented in the bhsa TF app. The same should be done in SHEBANQ 2017.
I have a presentation given by the data specialist of the ETCBC that can function as a literature reference for the features domain/txt/pargr. Where can I add it? The presentation defines the use of participants/actors in relation to these features.
I noticed the 2017 version of gloss.tf seems to be missing most glosses. Is there a reason for this?
~/github/etcbc/bhsa$ wc -l tf/4b/gloss.tf tf/2017/gloss.tf
435818 tf/4b/gloss.tf
9245 tf/2017/gloss.tf
It has been brought to my attention that the links to the tutorials on https://etcbc.github.io/bhsa/#how-to-use do not work any more.
Hello, I am interested in using text-fabric for my research. I have downloaded the package, but I am having trouble downloading the BHSA data. After running "text-fabric bhsa" I get the following errors:
getConfig: Parent module 'tf.extra.bhsa-app' not loaded, cannot perform relative import
getConfig: Data source "bhsa" not found
In the 2021 dataset (unlike the 2017 data), Mesopotamia (Strong's H0763), אֲרַ֥ם נַֽהֲרַ֖יִם, occurs as two separate nodes with different lexical values: 11325 'אֲרַם' and 11326 'נַהֲר' (river). I think they should be a single node with a single lexical value.
print(F.g_lex_utf8.v(11325), F.g_lex_utf8.v(11326))
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.