Giter VIP home page Giter VIP logo

Comments (2)

bsmock avatar bsmock commented on July 18, 2024

Hi, thanks for bringing this up, it's a great point.

If I understand you correctly, this situation can happen when two predicted columns are non-overlapping but stacked vertically, one above the other. Is that correct?

This situation seems like it would happen rarely or never for a model that is performing very well on a particular test dataset. But if the model is not performing well, and this situation did happen, then just thinking about it, yes it does seem to me like the current post-processing code could have an undesired outcome.

Can you share the GriTS score you had before and after making the change? An improvement of 0.01 is actually pretty significant. Because models that perform very well can already have GriTS scores above 0.98, I do suspect this improvement will only happen for models whose performance is less than desired on the test data. But until table structure recognition is completely solved, that's a valid case to consider!

Best,
Brandon

from table-transformer.

iamayushanand avatar iamayushanand commented on July 18, 2024

If I understand you correctly, this situation can happen when two predicted columns are non-overlapping but stacked vertically, one above the other. Is that correct?

Yup, that's right. It can also happen for rows stacked horizontally but not overlapping.

Can you share the GriTS score you had before and after making the change?

Before Change: Grits_Top: 0.9161 Grits_loc: 0.7638
After Change: Grits_Top:0.9257 Grits_loc: 0.7707
The test set consists of 1500 samples.

Thanks for taking note of this! Let me know if a PR is needed to add these changes, would be happy to open one.

from table-transformer.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.