
Comments (6)

rom1504 commented on May 26, 2024

scann is open source but not really packaged well, and it doesn't quantize

but also it's not really a blocker either way in the current state of this repo;
doing more experiments with the LM and the LM+knn integration is

from retro-pytorch.

afcruzs commented on May 26, 2024

I agree it is a bit cumbersome to use, but it should have (rather efficient) quantization, no? See the last section of https://medium.com/@kumon/similarity-search-scann-and-4-bit-pq-ab98766b32bd

rom1504 commented on May 26, 2024

scann is fast, but no, it doesn't optimize for memory use: since it uses PQ4, it requires storing the embeddings at full precision for reranking to avoid loss of recall
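
To put rough numbers on the memory point, here is a back-of-the-envelope sketch. The function name, the 1M x 100-dim dataset, and the choice of one 4-bit code per 2 dimensions are all illustrative assumptions, not values taken from scann itself, and codebook and partitioning overhead are ignored.

```python
def index_memory_bytes(num_vectors, dim, dims_per_block=2, bits_per_code=4,
                       keep_full_precision=True, bytes_per_float=4):
    """Rough memory estimate for a PQ index (hypothetical helper).

    Returns (bytes for the PQ codes, bytes for the full-precision copy).
    Codebooks and any partitioning structures are not counted.
    """
    blocks = -(-dim // dims_per_block)  # ceiling division
    code_bytes = num_vectors * blocks * bits_per_code / 8
    raw_bytes = num_vectors * dim * bytes_per_float if keep_full_precision else 0
    return code_bytes, raw_bytes


# Assumed example: 1M float32 vectors of dimension 100.
code_bytes, raw_bytes = index_memory_bytes(1_000_000, 100)
print(f"PQ codes: {code_bytes / 1e6:.0f} MB, full-precision copy: {raw_bytes / 1e6:.0f} MB")
# prints "PQ codes: 25 MB, full-precision copy: 400 MB"
```

Under these assumptions the full-precision copy kept for reranking is about 16x larger than the codes, so it dominates the footprint.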

marcobellagente93 commented on May 26, 2024

@rom1504 thanks for the answer! So is it correct to say that the open source version of scann does use quantization to compute inner products faster? I mean, there are implemented options for brute force and 2 different quantizers (lut16 and lut256), but I see your point about storing the embeddings. That's also odd, since reorder (which I assume is what you mean by reranking) is optional
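
For readers following the reorder question, the two-stage idea can be sketched in plain Python. This is not scann's code or API: it swaps in simple 4-bit scalar quantization per dimension instead of product quantization, and every name here (`quantize_4bit`, `search`, the `reorder` argument) is hypothetical. It only shows why an optional exact rescoring pass needs the full-precision vectors.

```python
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def quantize_4bit(vec, lo, hi):
    """Map each component to one of 16 levels, i.e. a 4-bit code."""
    scale = (hi - lo) / 15
    return [min(15, max(0, round((x - lo) / scale))) for x in vec]


def dequantize_4bit(codes, lo, hi):
    """Map 4-bit codes back to approximate float values."""
    scale = (hi - lo) / 15
    return [lo + c * scale for c in codes]


def search(query, codes, lo, hi, k, full_precision=None, reorder=0):
    """Top-k by approximate inner product on the quantized vectors.

    If `full_precision` is given and `reorder` > 0, the best `reorder`
    candidates are rescored with exact inner products before the cut to k.
    """
    approx = sorted(
        ((dot(query, dequantize_4bit(c, lo, hi)), i) for i, c in enumerate(codes)),
        reverse=True,
    )
    if full_precision is None or reorder <= 0:
        return [i for _, i in approx[:k]]
    candidates = [i for _, i in approx[:max(reorder, k)]]
    exact = sorted(((dot(query, full_precision[i]), i) for i in candidates), reverse=True)
    return [i for _, i in exact[:k]]


# Assumed toy data: 200 random vectors of dimension 16 in [-1, 1].
random.seed(0)
data = [[random.uniform(-1, 1) for _ in range(16)] for _ in range(200)]
query = [random.uniform(-1, 1) for _ in range(16)]
codes = [quantize_4bit(v, -1.0, 1.0) for v in data]

fast_only = search(query, codes, -1.0, 1.0, k=5)  # quantized scores only
reranked = search(query, codes, -1.0, 1.0, k=5, full_precision=data, reorder=50)
print(fast_only, reranked)
```

Skipping the rescoring pass means `data` never has to stay in memory, at the cost of ranking purely on approximate scores; how much recall that costs depends on the dataset and quantizer.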

rom1504 commented on May 26, 2024

Yes, it uses quantization to compute inner products faster.
Do they report good results with PQ4 and no reordering?

vinnik-dmitry07 commented on May 26, 2024

https://github.com/erikbern/ann-benchmarks/
https://ann-benchmarks.com/glove-100-angular_10_angular.html
