
bhsa's People

Contributors

cmerwich, codykingham, dirkroorda, tolchi

bhsa's Issues

Parsing information for qere

I assume there's no parsing information (part of speech, number, gender, etc.) for qere variants. Is that right?

nametype feature not always implemented correctly (I assume)

Bug/Problem
In the bhsa feature description, "nametype" is listed as a feature of the object type "lex". Checking SHEBANQ appears to confirm this: Jer 1:1 shows three nametype values (2x "pers", 1x "topo"):
[Screenshot: Annotation 2020-09-26 114539]

An MQL query in SHEBANQ that looks for the value "topo" of the feature "nametype" on the object type "lex" in Jer 1:1 should find "Anathot", but it doesn't (https://shebanq.ancient-data.org/hebrew/query?version=2017&id=3479). Instead, it finds only 8 words in all of Jeremiah (there should be more than 500 topos in Jer). "Anathot" in Jer 1:1 is not found, even though it has received the value "topo". The same happens when looking for "pers" in Jer: only 20 are found, while there should be more than 1000.

[Screenshot: Annotation 2020-09-26 115837]

A quick comparison with the bhsa TF app shows the same results:
[Screenshot: Annotation 2020-09-26 120149]

However, in contrast to the feature description (https://etcbc.github.io/bhsa/features/nametype/), "nametype" seems to be attached to the object type "word" in the bhsa TF app, where the accurate results can be retrieved:

[Screenshot: Annotation 2020-09-26 120404]

This linking of "nametype" to the object type "word" was not done in SHEBANQ, however:

[Screenshot: Annotation 2020-09-26 120626]

Conclusion
Only a very limited number of "nametype" values are linked to the object type "lex". This is true for both the bhsa TF app and SHEBANQ 2017. However, all "nametype" values are linked to the object type "word" in the bhsa TF app.

Suggestion
Change the official bhsa feature description and make "nametype" a feature of "word". This is already implemented in the bhsa TF app. The same should be done in SHEBANQ 2017.
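
The comparison described in this issue could be reproduced in Text-Fabric with a small helper like the one below. It is only a sketch: the function name `nametype_counts` is made up for illustration, and it assumes an `F` object as provided by Text-Fabric after loading the BHSA, where `F.nametype.v(n)` returns the value for node `n` or `None` when the node has no value.

```python
from collections import Counter


def nametype_counts(F, nodes):
    """Count "nametype" values over the given nodes.

    F is assumed to be the Text-Fabric feature API of a loaded BHSA dataset;
    nodes is any iterable of node numbers. Nodes without a value are skipped.
    """
    counts = Counter()
    for n in nodes:
        value = F.nametype.v(n)  # None when the node carries no nametype
        if value is not None:
            counts[value] += 1
    return counts
```

With the dataset loaded, calling `nametype_counts(F, F.otype.s("lex"))` and `nametype_counts(F, F.otype.s("word"))` side by side should show how many values sit on each object type. The helper counts over the whole set of nodes passed in, so restricting it to Jeremiah means passing only the nodes of that book.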

presentation for definition of domain/txt/pargr

I have a presentation given by the data specialist of the ETCBC that can function as a literature reference for the features domain/txt/pargr. Where can I add it? The presentation defines the use of participants/actors in relation to these features.

Most slots missing in tf/2017/gloss.tf

I noticed the 2017 version of gloss.tf seems to be missing most glosses. Is there a reason for this?

~/github/etcbc/bhsa$ wc -l tf/4b/gloss.tf tf/2017/gloss.tf 
 435818 tf/4b/gloss.tf
   9245 tf/2017/gloss.tf
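
A line count alone does not say which nodes carry a gloss, because a .tf file can assign values to nodes sparsely. The sketch below lists the node numbers that have a value, so it can be checked whether they fall within the word-slot range or above it. It rests on a simplified reading of the Text-Fabric file format, not on the library's own parser: header lines run up to the first blank line, and each data line is either a bare value (for the node after the previous one) or a node specification, a tab, and a value. The function name `nodes_with_values` is hypothetical.

```python
def nodes_with_values(lines):
    """Return the set of node numbers that carry a value in a TF node feature.

    Assumed format (simplified): the header ends at the first blank line;
    a data line is 'value' (node = previous node + 1, starting at 1) or
    'nodespec<TAB>value', where nodespec is a comma-separated list of
    single nodes or 'start-end' ranges. An empty value stores nothing.
    """
    nodes = set()
    in_header = True
    implicit = 1  # node used when a data line gives no explicit node
    for raw in lines:
        line = raw.rstrip("\n")
        if in_header:
            if line == "":
                in_header = False
            continue
        fields = line.split("\t")
        if len(fields) == 2:
            current = set()
            for part in fields[0].split(","):
                start, _, end = part.partition("-")
                current.update(range(int(start), int(end or start) + 1))
            value = fields[1]
        else:
            current = {implicit}
            value = fields[0]
        implicit = max(current) + 1
        if value != "":
            nodes.update(current)
    return nodes


# Hypothetical usage against a local clone (path taken from the listing above):
# with open("tf/2017/gloss.tf", encoding="utf8") as fh:
#     nodes = nodes_with_values(fh)
# print(len(nodes), min(nodes), max(nodes))
```

If the node numbers in the 2017 file turn out to lie above the word slots, that would suggest the glosses are stored on a different object type than in 4b rather than being lost.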

Error downloading BHSA

Hello, I am interested in using text-fabric for my research. I have downloaded the package, but I am having trouble downloading the BHSA data. After running "text-fabric bhsa" I get the following errors:

getConfig: Parent module 'tf.extra.bhsa-app' not loaded, cannot perform relative import
getConfig: Data source "bhsa" not found

Word nodes 11325-11326: אֲרַ֥ם נַֽהֲרַ֖יִם

In the 2021 dataset (unlike the 2017 data), Mesopotamia (Strong's H0763), אֲרַ֥ם נַֽהֲרַ֖יִם, occurs as two separate nodes with different lexical values: 11325 'אֲרַם' and 11326 'נַהֲר' (river). I think they should be a single node with a single lexical value.

print(F.g_lex_utf8.v(11325), F.g_lex_utf8.v(11326))
