Invoice audit with PySpark, Databricks and Genie
Analytical tables and PySpark queries built in Databricks notebooks to investigate transaction issues affecting invoice analysis.
- Goal: structure a reliable view of inconsistencies in high-volume data.
- Delivery: analytical tables, PySpark queries and views by inconsistency type.
- Stack: PySpark, Databricks notebooks, analytical modeling, Genie and dashboard layer.
- Technical highlight: big data analytics engineering, data quality and automated refresh.