data-diffο
Data-diff is a command-line tool and Python library to efficiently diff rows across two different databases.
β Verifies across many different databases (e.g. PostgreSQL -> Snowflake) !
π Outputs diff of rows in detail
π¨ Simple CLI/API to create monitoring and alerts
π₯ Verify 25M+ rows in <10s, and 1B+ rows in ~5min.
βΎοΈ Works for tables with 10s of billions of rows
For more information, See our README
Resourcesο
Source code (git): https://github.com/datafold/data-diff
The rest of the documentation