msg-systems / holmes-extractor Goto Github PK
View Code? Open in Web Editor NEWInformation extraction from English and German texts based on predicate logic
License: MIT License
Information extraction from English and German texts based on predicate logic
License: MIT License
Trying to install this in Ubuntu 22.04, I get these types of errors. Any suggestions on how to fix this?
`error: command '/usr/bin/x86_64-linux-gnu-gcc' failed with exit code 1
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building wheel for blis
Running setup.py clean for blis
Failed to build thinc preshed blis
Installing collected packages: wrapt, wasabi, srsly, plac, murmurhash, cymem, wheel, tqdm, six, setuptools, preshed, numpy, Cython, thinc_gpu_ops, blis, thinc
Running setup.py install for preshed: started
Running setup.py install for preshed: finished with status 'error'
error: subprocess-exited-with-error
× Running setup.py install for preshed did not run successfully.
│ exit code: 1
╰─> [14 lines of output]
/home/sdspieg/.env/lib/python3.10/site-packages/setuptools/installer.py:27: SetuptoolsDeprecationWarning: setuptools.installer is deprecated. Requirements should be satisfied by a PEP 517 installer.
warnings.warn(
running install
/home/sdspieg/.env/lib/python3.10/site-packages/setuptools/command/install.py:34: SetuptoolsDeprecationWarning: setup.py install is deprecated. Use build and pip and other standards-based tools.
warnings.warn(
running build
running build_py
running build_ext
building 'preshed.maps' extension
x86_64-linux-gnu-gcc -Wno-unused-result -Wsign-compare -DNDEBUG -g -fwrapv -O2 -Wall -g -fstack-protector-strong -Wformat -Werror=format-security -g -fwrapv -O2 -fPIC -I/usr/include/python3.10 -I/home/sdspieg/.env/include -I/usr/include/python3.10 -c preshed/maps.cpp -o build/temp.linux-x86_64-cpython-310/preshed/maps.o -O3 -Wno-strict-prototypes -Wno-unused-function
cc1plus: warning: command-line option ‘-Wno-strict-prototypes’ is valid for C/ObjC but not for C++
cc1plus: fatal error: preshed/maps.cpp: No such file or directory
compilation terminated.
error: command '/usr/bin/x86_64-linux-gnu-gcc' failed with exit code 1
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
error: legacy-install-failure
× Encountered error while trying to install package.
╰─> preshed
note: This is an issue with the package mentioned above, not pip.
hint: See above for output from the failure.
[end of output]
`
Hi! Good morning from Singapore!
Could u kindly give examples from how you'd use the word_match.type=='entity' / 'derivation'?
Any plans to add Spanish as a language, or any ideas how I could get this to work for spanish?
@richardpaulhudson thanks for this precious contribution to the open source community, not only the library itself but the documentation and NLP insights are awesome. I was wondering if you would have any scientific literature suggestion about the following topics:
Many thanks again!
I am executing example codes from the git page. But I get a type error.
holmes_manager = holmes.Manager(model='de_core_news_lg', number_of_workers=1)
TypeError: init() got an unexpected keyword argument 'number_of_workers'
If the parameter is removed, I get OSError.
OSError: [E050] Can't find model 'de_core_news_lg'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory.
Am I following it correctly? Has anyone faced similar issues?
When I try to run the holmes extractor using this example, it reaches the end of all the parsing, spawns a lot of (?) worker threads during indexing, and then the program gets killed, presumably because it exceeded system resources.
What's going on, and how to fix?
Hi, @richardpaulhudson
I have tested this tool with code
import holmes_extractor as holmes
holmes_manager = holmes.Manager(model='en_core_web_lg')
holmes_manager.register_search_phrase('be all ear')
holmes_manager.start_chatbot_mode_console()
I am trying to do "phrase" search, and found out that, once the sentence contains a "be" verb, this will lead to detection of "be all ear".
I suppose this search phrase should be a full sentence, a phrase like stuff is not supported when you saying Ungrammatical expressions. (Cause I found it a little bit misleading of search phrase and search "phrase")
I am using holmes extractor with ontology-based matching and it is working fine for taxonomies of nouns (or instances of these), even for compund nouns.
The same is not working for verbs: if we have a verb like 'move' and a subclass of it like 'walk' in the ontology, then 'walk' is not recognized (if 'move' is used in the search phrase), only synonyms (owl:equivalentClass) are working.
So my questions are:
is this the correct function or am I doing something wrong?
and if the function is missing, are there plans to extend the taxonomy support also for verbs?
Thank you in advance.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.