I'm a lead data scientist, sorting out metadata and championing linked data over data linkage. 10 years experience as a data analyst and engineer; recently completed a undergraduate degree in computing science and statistics.

📱 Contact Me

🔗 LinkedIn


👩🏻‍💻 Work experience

Lead Data Scientist

Office for National Statistics, Fully Remote - May 2022 to Present

Promoting using open standards to speed up data dissemination across government is my jam. Engaging with other government departments and academics to adopt a cube-styled CSV-W where all the data exchange is provided using common vocabularies and ontologies with the data structure definition rolled in. I've been solving the thorny problems within ONS and outside by tweaking pre-post ELT and RAP activities to get data in a better and standardised shape. From this we get faster analysis, better portability, and consistency. Throw a CSV-W at a configured plotting tool and your y-axis comes pre-labelled. It's almost magic. By pushing open standards, I get to say I'm helping deliver the promise of the open web as envisioned by Tim Berners-Lee.

Senior Data Engineer

Office for National Statistics, Fully Remote - November 2020 to April 2022

As Senior Data Engineer in the Integrated Data Platform, I am responsible for transforming public data in various formats from government departments to high quality structured 5* Data. As part of a software engineering team, I work with other engineers using a developer stack of Python, Jekins, Docker, and using common Python data libraries like Pandas, Numpy, Scipy, rdflib, and Databaker. In addition to transformations, I am programming a new OSS package to convert CSV to CSV-W called csvcubed.

Data Engineer and Lead Analyst

One Manchester*, Manchester – July 2019 to October 2020*

Working within the Business Insights team, I was responsible for the roadmap of automating internal reporting from SQL Server Reporting Server/Excel to SQL/Power BI. Generated stakeholder buy-in to new process by creating tailored dashboard prototypes in Power BI using existing internal data sources and external sources under investigation for suitability. Championed Agile methodology for development of insight solutions, enabling bandwidth for valuable investigations of longstanding but not urgent questions.