Comments (2)
Hi @levan92 we use a proprietary feature extractor which we found to be both fast and gives accurate similarities for large variety of datasets. It is possible to support external models, for example CLIP is already supported. We are planning to release a demo notebook soon. Let us know if there are any features that are missing for your usecase?
from fastdup.
@levan92, You can get both the model name, layers names and output layer name and dim (~576) from the onnx model which is included as part of the package
For the nearest neighbour search they are probably using FAISS
~/.local/lib/python3.8/site-packages/fastdup $ nm -A libfastdup_shared.so
...
libfastdup_shared.so: U _ZN5faiss10read_indexEPKci
libfastdup_shared.so: U _ZN5faiss11write_indexEPKNS_5IndexEPKc
libfastdup_shared.so: U _ZN5faiss13index_factoryEiPKcNS_10MetricTypeE
libfastdup_shared.so: U _ZN5faiss14ParameterSpaceC1Ev
libfastdup_shared.so: U _ZN5faiss22NormalizationTransformC1Eif
libfastdup_shared.so: U _ZN6cppipc11must_cancelEv
...
from fastdup.
Related Issues (20)
- [Bug]: fastdup fails to create above 10M object crops on ubuntu 20 (due to file system exhaustion) HOT 1
- [Feature Request]: Allow search on multiple work dirs in parallel HOT 1
- [Feature Request]: Compile fastdup for arm to allow docker run on mac m1 HOT 1
- [Feature Request]: add jfif support for windows OS HOT 1
- [Feature Request]: add mkv video support for fastdup HOT 1
- [Feature Request]: fastdup video extraction to provide timing info for each extracted frame HOT 1
- [Bug]: Pinned `requests` makes `fastdup` incompatible with other packages HOT 3
- [Feature Request]: mean_distance in image cluster relative to centroïd + distance between different clusters (using centroïds) HOT 1
- [Bug]: Kernel keep crashing when trying to run fd.run() HOT 22
- [Bug]: Fix thumbnail resize to look better HOT 1
- [Bug]:AssertionError: For removing wrong labels created by the create_similarity_gallery() need to run stats_file=df where df is the output of create_similarity_gallery() HOT 1
- Use fastdup in code pipeline rather than reports HOT 3
- [Bug]: RuntimeError: fastdup detected your are running an old version 1.60 (10 versions or more vs. the latest) please upgrade fastdup) HOT 2
- [Bug]: Oxford pet dataset, fastdup fails on 8 bad images HOT 1
- [Bug]: bad images warning is gibberish HOT 1
- [Bug]: When running fastdup as two steps (and there are bad images) connected component ids do not match atrain_features.dat.csv
- [Bug]: Can't pip install HOT 1
- [Bug]: Run is crashing when specifying embeddings HOT 3
- [Bug]: UnicodeDecodeError when running fd.run HOT 1
- [Feature Request]: Reidentification mode HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fastdup.