AI / Data Engineer
John Holland Group
2022—present
- Built ERD Studio, an open-source app that leverages AI for developing data engineering pipelines — cut data-pipeline delivery from five months for a whole team down to two weeks for one engineer with AI.
- Designed full-stack data workflows across AWS, dbt, Databricks and Power BI — ingestion, transformation, modelling and reporting.
- Modelled data with bronze / silver / gold architecture for aggregated, scalable datasets.
- Built high-performance DirectQuery KPI dashboards in Power BI — raw operational data to executive summaries with minimal latency.
- Automated Power BI measure generation with LLMs, cutting report maintenance time.
- Shipped CI/CD pipelines (GitHub Actions) for dbt and BI deployments.
- Deployed MLflow pipelines for classification — continuous training and model governance.
- Built a Retrieval-Augmented Generation pipeline with vector search on Delta Live Tables for an enterprise chatbot.
- Engineered ML pipelines for Tunnel Boring Machine analytics on AWS Glue, SageMaker and Airflow — telemetry to insight.