Giter VIP home page Giter VIP logo

Comments (7)

danimesq avatar danimesq commented on June 3, 2024 2

I'm here cheering for this to become an opt-in feature (personally ASAP but for y'all no pressure)

from snowfs.

sebastianrath avatar sebastianrath commented on June 3, 2024 1

SnowFS supports copy-on-write for certain file systems like APFS, but it does not yet have deduplication implemented in the application layer. Currently, the main reason for this is performance, as fragmentation in binaries can have a higher impact on CPU and I/O. For the first implementation of SnowFS speed had a higher priority over disk space. However, we are considering adding this as an opt-in option, as these impacts may not be relevant for every project.

from snowfs.

danimesq avatar danimesq commented on June 3, 2024

@sebastianrath I was expecting snow-fs already had this

from snowfs.

sebastianrath avatar sebastianrath commented on June 3, 2024

Could you share some background info? What type of projects would that be beneficial to? How many files, and what are the overall file sizes? Thanks!

from snowfs.

danimesq avatar danimesq commented on June 3, 2024

@sebastianrath

What type of projects would that be beneficial to? How many files, and what are the overall file sizes? Thanks!

To have an idea, I have tons of GB of screenshots both on mobile and on desktop.
And it is sad to know that most of the GB of these files have shared bytes that could be dedupliced.

Imagine a screenshot of a notepad, where most of its pixels are white; so all of that could be dedupliced (for example, Windows start menu icon on these screenshots wouldn't be repeated).
I imagine GIFs and video file formats uses a similar approach for overlapping frames.

from snowfs.

danimesq avatar danimesq commented on June 3, 2024

BTW I'm working at a new symlink daemon that will support to form a single file from shared objects.
Its here: https://github.com/Floflis/witchlink

from snowfs.

danimesq avatar danimesq commented on June 3, 2024

@sebastianrath do you know libraries that finds duplicate bytes on files and moves these duplicates into separate files?

I would love if git natively had more than 1 object per file, so there wouldn't be "foo", "bar" and "foobar" objects but only "foo" and "bar".

from snowfs.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.