Giter VIP home page Giter VIP logo

Comments (7)

PSeitz avatar PSeitz commented on June 9, 2024

That's not possible currently. What's your use case?

from tantivy.

gembin avatar gembin commented on June 9, 2024

I want every doc fully stored, so I dont need to query database using doc_id, and it's easy for the client to query data if we have a database as a store like tikv or mongodb etc., and it will also decouple the index storage from data storage.

from tantivy.

PSeitz avatar PSeitz commented on June 9, 2024

You can fully store the documents in tantivy, so you don't need to query a database using doc_id. doc_id or DocAddress is something that's internal to tantivy and should not be used external. It's possible to compress with zstd in the docstore for better compression.

from tantivy.

gembin avatar gembin commented on June 9, 2024

Thanks! Normally, how much data it can store without performance downgrade?

from tantivy.

fulmicoton avatar fulmicoton commented on June 9, 2024

You could use a fast field or a stored field to store a primary id from an external DB.

from tantivy.

PSeitz avatar PSeitz commented on June 9, 2024

Thanks! Normally, how much data it can store without performance downgrade?

Performance downgrade on indexing? It depends on your doc store settings (blocksize, compression algorithm, compression level). We use a separate thread to compress the doc store, so you may not notice any difference until doc store compression gets slower than regular search indexing.

Search performance is not really impacted by this.

from tantivy.

gembin avatar gembin commented on June 9, 2024

Thanks, very helpful!

from tantivy.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.