Comments (3)
@fdosani thanks for the response! Great to hear you think it's a good feature. I'll make sure to submit the pull request over the coming weeks when I find the time. Enjoy your holiday!
Coming back to the original post, I was thinking of adding it like this:
Number of columns in Original but not in New: 1. The missing column(s) in New are: date_fld
In the case of multiple columns I can format it like this:
Number of columns in Original but not in New: 1. The missing column(s) in New are: date_fld, X, Y
where X and Y are the other potential columns that could be missing in New
. I could do the same for the the other line related to the New
data frame in the situation there are any columns missing there.
Please let me know if you think this is a good way to format the output.
from datacompy.
@lva290 thanks for your message, appreciate you providing feedback and the kind words. Iām on vacation at the moment. Will be back on Wednesday. Iād love you have you contribute a PR if you have the time. I think displaying the columns would be a nice feature. No rush, happy to continue the discussion on Wednesday. @NikhilJArora and @ak-gupta of you have time would appreciate your thoughts too.
from datacompy.
@lva290 Sorry for the delay here. Yes that is fine. I'm onboard with your proposal.
from datacompy.
Related Issues (20)
- Pandas 2.0 support
- Fugue support for extra helper functions from core HOT 2
- No objects to concatenate issue with Fugue HOT 3
- The intersection logic of Compare has problems. HOT 3
- Speed up spark unit tests HOT 2
- Python 3.11 support HOT 12
- Feature Request: Ability to Update Compare Object Over Multiple Chunks HOT 4
- Datacompare for Date field is not working HOT 4
- SparkCompare() not working for dask - dropDuplicates HOT 1
- Add list of dissimilar columns to report HOT 8
- Restrictive dependency versions - NumPy 1.24.4 blocked HOT 6
- confused about df_unq_rows HOT 2
- Add mypy to the project HOT 4
- Add new action for running tests when PySpark is NOT installed HOT 1
- Comparison fails on dataframes with a single column HOT 8
- Benchmark Documentation between pandas, fugue, and native spark. HOT 1
- who can help make the result significantly HOT 2
- Issue in writing report HOT 9
- Look into porting Compare to a polars backend for performance testing. HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
š Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ššš
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ā¤ļø Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from datacompy.