alwyndsouza/README.md

Hi, I'm Alwyn D'Souza | Data & AI Engineering

I build scalable data platforms and AI-powered products that drive business outcomes. I currently lead strategic initiatives across data mesh architecture, semantic layers, and modern data stack implementation.

🤝 Let's Connect

LinkedIn · Medium · GitHub · Location

🎯 Focus Areas & Tech Stack

dltHub Advanced · Databricks · dbt · Terraform · AWS · GCP · Python · Docker · GitHub Actions · CI/CD

  • Data Engineering & Lakehouse: Data Mesh · AWS Glue · Lake Formation · Iceberg · Databricks · dbt · Redshift · GCP
  • DataOps: Data Contracts · CI/CD for Pipelines · Semantic Layers · End-to-End Observability · Incident & Change Management
  • AI & ML Engineering: AIOps · Production LLM Integration · Agentic Pipelines (MCP) · RAG · AI Guardrails for Data Quality
  • MLOps: Pipeline Orchestration · Model Lifecycle Governance · ML Observability
  • Streaming: Flink · Kafka · Redpanda · RisingWave
  • Infra & Orchestration: Terraform · Step Functions · GitHub Actions · dlt
  • Leadership: COE Capability Uplift · Vendor Engagement (dbt Labs · AWS · Databricks) · Technical Mentorship

Pinned

  1. dbt-ci-cd · Public

    A production-ready framework for dbt CI/CD that automates code validation, testing, and deployment workflows to maintain a resilient and scalable data platform.

    1 1

  2. dbt-conversation-ai-local · Public

    Conversational AI for dbt: A Streamlit-based local agent powered by Ollama and MCP to query, document, and analyze dbt semantic models and metrics in a private environment.

    Python 2

  3. mds-databricks-semantic-layer · Public

    A production-grade Modern Data Stack reference implementation using dlt, dbt-core, and Databricks Unity Catalog, featuring a governed semantic layer with MetricFlow.

    Python 1

  4. mds-duckdb-semantic-layer · Public

    A local-first Modern Data Stack (MDS) reference architecture using dlt for ingestion, dbt for transformation, and DuckDB as the high-performance compute engine and semantic layer.

    Python 2 1

  5. rp-dbt-rw-fraud-monitor · Public

    SQL-first real-time fraud detection using Redpanda, dbt, RisingWave, and Grafana.

    Python