Giter VIP home page Giter VIP logo

Comments (6)

newren avatar newren commented on July 4, 2024

It sounds like you took a guess at what was taking up space and removed some files, but a lot of your space is in other files. Run python $gfr --analyze from your project, and look at the files in the created $GIT_DIR/filter_repo/analysis report directory after the run. It should tell you what is large.

from git-filter-repo.

KoningLeon avatar KoningLeon commented on July 4, 2024

I knew the files that were the problems because they are the only ones that hold data. The repo now only takes up 80 mb after we remove the troublesome files. Somehow though it's not reflecting in the .pack file shrinking.

I did however take up your advice and ran the analyze command and I might have found something that could explain why the pack isn't shrinking. Some large files still show up in the path-all-sizes as < present > even though the files and folder are no longer part of the repo
image

And the same goes for the directories-all-sizes. The marked folders are no longer part of the repo, yet they are still marked as < present >.
image

from git-filter-repo.

KoningLeon avatar KoningLeon commented on July 4, 2024

Managed to get the desired result by doing:

  1. git clone --depth 2000
  2. Delete the entire Repo/PowerBi folder
  3. Run the git-filter-repo as per my original post
  4. Place back the Repo/PowerBI folder

Resulting in our repo going from 13.2gb to 150mb. This means losing the entire history for that specific folder but that is a sacrifice were are willing to make.

from git-filter-repo.

newren avatar newren commented on July 4, 2024

Any chance you were using CMD to run your commands? If so, the problem may be that you used single quotes (') instead of double quotes ("). If you changed your command from:

python $gfr --invert-paths --path-glob '*/cache.abf' --path-glob '*.pbix' 

to

python $gfr --invert-paths --path-glob "*/cache.abf" --path-glob "*.pbix" 

that might have fixed things for you. Apparently (as I learned in #435), the former will cause CMD to tell git-filter-repo that you want to remove files matching '*/cache.abf' and '*.pbix', which you obviously don't have any of, while the latter correctly tells git-filter-repo that you want to remove files matching */cache.abf and *.pbix.

To my knowledge, this is unique to CMD; single quotes work fine in any other shell and don't do this crazy weirdness.

from git-filter-repo.

KoningLeon avatar KoningLeon commented on July 4, 2024

No, I was using the Powershell terminal from within VScode.

from git-filter-repo.

newren avatar newren commented on July 4, 2024

Well, in that case, I'd suggest adding a --debug flag to your command so we can see what git-filter-repo actually saw; I have no idea if VScode did some weird interpretation either. And it'd be nice to see the large paths from the --analyze report both before and after you run git-filter-repo with the --debug flag.

That said, it sounds like you did find a solution, so if you don't want to debug further that's fine. But if you'd like to know what happened, the --debug output is the next piece of output I'd need.

from git-filter-repo.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.