Original Reddit post

Running into an issue where pipeline metrics look fine: the DAG is green, there are no errors in the logs, and data volumes match expectations, but downstream tables have incorrect values. Sums are off by 10-20%, joins are missing rows, things like that. I checked the usual suspects: schema changes, null handling, duplicate keys. I even reran full loads, and it's still wrong. What do you check when upstream looks fine but downstream is off? Any gotchas or checks that helped you catch this?
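For anyone who wants something concrete to poke at, here is a minimal sketch of the kind of check the symptoms above point to: comparing per-key sums between an upstream and a downstream table, so that missing join rows and inflated sums show up even when total row counts match. Everything here is hypothetical and only for illustration (the `reconcile` and `sum_by_key` helpers, the `order_id` and `amount` columns, the sample rows); it is not taken from the actual pipeline.

```python
from collections import defaultdict


def sum_by_key(rows, key, value):
    """Sum `value` per `key`, skipping rows where the value is None."""
    totals = defaultdict(float)
    for row in rows:
        if row[value] is not None:
            totals[row[key]] += row[value]
    return dict(totals)


def reconcile(upstream, downstream, key, value, tolerance=1e-9):
    """Compare per-key sums and report missing, extra, and mismatched keys."""
    up = sum_by_key(upstream, key, value)
    down = sum_by_key(downstream, key, value)
    return {
        # Keys present upstream but absent downstream (e.g. dropped by a join).
        "missing": sorted(set(up) - set(down)),
        # Keys present downstream that never existed upstream.
        "extra": sorted(set(down) - set(up)),
        # Keys in both tables whose sums differ (e.g. join fan-out).
        "mismatched": {
            k: (up[k], down[k])
            for k in up.keys() & down.keys()
            if abs(up[k] - down[k]) > tolerance
        },
    }


# Hypothetical sample data: both tables have 3 rows, so a row-count
# check passes, yet one key is missing and another is duplicated.
upstream = [
    {"order_id": 1, "amount": 10.0},
    {"order_id": 2, "amount": 20.0},
    {"order_id": 3, "amount": 30.0},
]
downstream = [
    {"order_id": 1, "amount": 10.0},
    {"order_id": 2, "amount": 20.0},
    {"order_id": 2, "amount": 20.0},  # duplicated row
]

report = reconcile(upstream, downstream, "order_id", "amount")
print(report)
```

In this made-up sample the row counts agree, which is why a volume check alone would stay green; the per-key comparison is what surfaces the dropped key and the doubled sum.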

Originally posted by u/Distinct_Highway873 on r/ArtificialInteligence