Skip to content

levisrum23/Pulp-Mill-Effluent-Analytics.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

23 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Pulp Mill Effluent Compliance: An Engineering-First Data Pipeline

πŸ“Š Business Intelligence & Industrial Compliance

This project transforms raw effluent sensor data from a pulp mill into an interactive regulatory dashboard. It bridges the gap between process engineering and data science by implementing industrial standards directly into the ETL pipeline.

🎯 Key Performance Indicators (KPIs)

  • Overall Compliance Rate: 36.7%
  • Financial Risk: Calculated based on a $500/violation model.
  • Primary Violation Driver: Turbidity (averaging 4.45 NTU during non-compliance).

πŸ› οΈ Data Engineering & "Data Dignity"

The pipeline was built in Power Query with a focus on data integrity:

  1. Locale-Aware Ingestion: Applied en-US locale settings to ensure decimal precision (protecting pH and Conductivity readings).
  2. Dynamic Mean Imputation: Automated null-handling for 491 missing sensor records using a scalable M-code script (List.Accumulate).
  3. Biological Sanity Checks: Verified all pH data falls within the 0.0–14.0 range to eliminate sensor artifacts.

πŸ“‚ Project Structure

  • /data_raw: Original sensor dataset.
  • /data_processed: Final Excel dashboard featuring DAX measures and Root Cause Heatmaps.
  • /scripts: Full M-code documentation for the ETL process.
  • /docs: Audit logs, terminology, and industrial standards used for the model.

πŸ”— Links

About

An engineering-first data pipeline for pulp mill effluent compliance using Power Query, DAX, and industrial root cause analysis.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors