Falsehoods this programmer believed about half-hourly energy data

It is common for energy generation and consumption values to be presented as half-hourly readings, giving 48 readings over the course of a single 24-hour period. This is the type of data we're working with on a daily basis in Energy Sparks. I thought I'd share a few things that I learned about working …

Comments on “A data for AI taxonomy”

Jack Hardinges and Elena Simperl recently published a taxonomy to describe the data relevant to AI models and systems. Their goal is to better distinguish between the different types of data relevant to developing, using and monitoring AI models and systems, and thereby add some nuance to …

What datasets have been classified as Digital Public Goods?

Update: 2024-04-14, I've updated this post with some corrections. See below. A couple of years ago I wrote a short series of posts looking at some different approaches for assessing data infrastructure. It includes this post on the Digital Public Goods standard and registry. Digital Public Goods are defined as: open-source software, open data, open …

Confused by SOLID

I keep checking in on the Solid project. But I'm baffled by its lack of functionality. I've written up some of my questions.

Data format design is a UX issue

I've been getting frustrated by CSV files again. The context for this is my day job at Energy Sparks. I've written about the wide range of different CSV formats that we have to contend with in order to accept data from a range of energy suppliers and meter operators. While there are a number of loose …

Will AI hamper our ability to crawl the web for useful data?

As websites start to block Common Crawl, and as the project leans in to its role in training LLMs, will it become harder to use data from the web for other purposes?

Increasing consistency of data with FAIR Implementation Profiles

FAIR implementation profiles offer a means to increase consistency around how data is shared.

The complexities of working with non-domestic half-hourly meter data

Energy consumption in non-domestic properties is more complex to analyse than in domestic settings. Some notes on those challenges, particularly around metering.

First impressions of the Octopus Home Mini

For the last couple of months I've had an Octopus Home Mini in the house. I thought I'd share some first impressions. Firstly, what is it? It's a small pink device that is designed to give you a more detailed view of your current energy usage, as measured by your smart meter. It does this …

Useful resources for designing data rich pages

One of the big projects we've currently got under way at Energy Sparks is redesigning the collection of pages that present the results of our detailed analysis of their energy data to school users. The existing pages have been around for a few years and our metrics and user testing have shown that they aren't …