Comments (2)
I think you'll need to spin up separate worker servers and queue tasks, as you said. I don't think this is specific to the gpt_index project, and I wouldn't expect built-in support.
However, if you rewrote gpt_index in something lower-level like C or Rust, that could potentially make runtime indexing more plausible. I looked into Cython to maybe do this myself, but it might only make a marginal difference, since many built-in Python methods are already implemented in C (in CPython) and aren't really the underlying bottleneck.
In my use case, the bottleneck was the embedding step using Ada embeddings, because of the round trips to OpenAI's servers to generate the embeddings. Threading could be a solution, but you'll just run into other issues like throttling or 429s, which is why I don't think built-in gpt_index improvements should be expected.
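To make the threading-plus-throttling trade-off concrete, here is a rough sketch of running embedding calls in a thread pool and backing off when a request is throttled. Everything in it is an assumption for illustration: `fake_embed` and `RateLimitError` are hypothetical stand-ins for a real embedding request and its 429 error, not part of gpt_index or the OpenAI client.

```python
import random
import time
from concurrent.futures import ThreadPoolExecutor
from functools import partial


class RateLimitError(Exception):
    """Hypothetical stand-in for an HTTP 429 from the embedding API."""


def fake_embed(text):
    """Hypothetical placeholder for a real embedding request."""
    if random.random() < 0.3:  # simulate occasional throttling
        raise RateLimitError(text)
    return [float(len(text))]


def embed_with_backoff(text, embed=fake_embed, max_retries=8, base_delay=0.01):
    """Retry throttled calls, doubling the wait (plus jitter) each time."""
    for attempt in range(max_retries):
        try:
            return embed(text)
        except RateLimitError:
            delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
            time.sleep(delay)
    raise RuntimeError(f"gave up after {max_retries} throttled attempts")


def embed_all(texts, embed=fake_embed, max_workers=4):
    """Embed texts concurrently; results come back in input order."""
    worker = partial(embed_with_backoff, embed=embed)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(worker, texts))
```

Threads help here because the work is I/O-bound, but `max_workers` and the backoff parameters would still need tuning against whatever rate limit applies to your account, which is the part that is hard to build in generically.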
@teoh we are actively improving the runtime of index building and querying, so hopefully this will be less of an issue in the near future!
As @aliyeysides noted, the feature you are describing might be best handled in the application layer (i.e. outside of GPT Index). We are moving more and more of the underlying logic to run async, but the top-level API will remain blocking for ease of use.
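As a rough illustration of that general pattern (async internals behind a blocking entry point), here is a minimal sketch. It is not GPT Index's actual code: `_aembed`, `_abuild_index`, and `build_index` are hypothetical names, and the sleep stands in for a network call.

```python
import asyncio


async def _aembed(text):
    """Hypothetical async embedding call; the sleep stands in for network I/O."""
    await asyncio.sleep(0.01)
    return [float(len(text))]


async def _abuild_index(texts, max_concurrency=4):
    """Embed all texts concurrently, capped by a semaphore."""
    sem = asyncio.Semaphore(max_concurrency)

    async def bounded(text):
        async with sem:
            return await _aembed(text)

    vectors = await asyncio.gather(*(bounded(t) for t in texts))
    return dict(zip(texts, vectors))


def build_index(texts):
    """Blocking entry point: callers never touch the event loop."""
    return asyncio.run(_abuild_index(texts))
```

Note that `asyncio.run` raises a `RuntimeError` if an event loop is already running in the current thread (for example inside a notebook), so an application that is itself async would call the coroutine directly instead.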