Flagship work
Start with the strongest public data systems work: feature workflows, batch scoring, monitoring, and one secondary data tool that shows validation discipline.
Feature store (mini)
Batch compile turns raw extracts into a versioned feature table under locked column specs and validation rules.
Batch scoring pipeline
Batch job processes CSV rows with a fixed preprocessing path and writes score, label, model version, and timestamp on each row.
Data quality & monitoring pipeline
Validates each incoming batch against a fixed reference and surfaces shifts in inputs, categories, and prediction patterns.
Data cleaning toolkit
Upstream prerequisite, not a side utility: downstream data workflows ingest the same reviewed tables.
How I Build
Core stack
Open to Roles
If you are hiring for data systems, data engineering, automation, API, or batch-workflow roles, feel free to reach out. Portfolio is the main evaluation path; Data tools is secondary evidence for smaller utilities and validation-heavy tooling.