https://github.com/ellennickles/ml5js-model-and-data-provenance-project

What questions do you still have about the model and the associated data? Are there elements you would propose including in the biography?

I’m left with some questions about the models and datasets used in ml5.js. One of my main concerns is understanding exactly who created these models and datasets. Sometimes it’s clear, like with Google or academic teams, but in other cases, the origins are vague or the developers’ affiliations are unknown. I also wonder about the data itself: what images or recordings were used, how they were collected, and what methods were employed for labeling. For some models, there’s detailed documentation, but for others, it’s hard to find out where the data came from, who contributed to it, or what the intended use was. This lack of transparency makes it difficult to fully grasp potential biases or ethical issues.

If I were to propose additions to the “biography” of a model or dataset, I’d want clear information about what the model does, who developed it, when and why it was created, and where it’s hosted. For the data, I’d include a description of its contents, its source, who collected it and how, and its intended purpose and users. I also think it’s important to include details about licensing, known biases, annotation processes, and any usage restrictions. Links to original papers or documentation should be standard, along with explicit warnings about ethical concerns like privacy or consent.
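As a rough sketch, the biography fields I listed above could be written down as a plain JavaScript checklist, with a small helper that reports which entries are still blank. The field names and the `missingFields` helper are my own hypothetical choices for illustration; they are not part of ml5.js or any existing standard.

```javascript
// Hypothetical checklist for a model/data "biography".
// Field names are illustrative assumptions, not an ml5.js API.
const BIOGRAPHY_FIELDS = {
  model: ["description", "developers", "created", "purpose", "hostedAt"],
  data: [
    "description",
    "source",
    "collectedBy",
    "collectionMethod",
    "intendedUse",
    "annotationProcess",
  ],
  governance: [
    "license",
    "knownBiases",
    "usageRestrictions",
    "ethicalWarnings",
    "references",
  ],
};

// Return the "section.field" paths that are absent or empty in a biography.
function missingFields(biography) {
  const missing = [];
  for (const [section, fields] of Object.entries(BIOGRAPHY_FIELDS)) {
    const entry = biography[section] || {};
    for (const field of fields) {
      const value = entry[field];
      const isEmpty =
        value === undefined ||
        value === null ||
        value === "" ||
        (Array.isArray(value) && value.length === 0);
      if (isEmpty) missing.push(`${section}.${field}`);
    }
  }
  return missing;
}

// Placeholder entry (made-up value, not a real model's documentation).
const example = {
  model: { description: "placeholder description" },
  data: {},
  governance: {},
};

console.log(missingFields(example));
```

Running the helper on a partly documented model makes the gaps visible at a glance, which is the kind of transparency I was missing when the origins of a model or dataset were vague.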

How does understanding the provenance of the model and its data inform your creative process?

Understanding the provenance of a model and its data definitely shapes my creative process. When I know where a model comes from, how its data was collected, and what biases might be present, I can make more responsible and intentional choices in my work. It pushes me to be transparent with my audience, acknowledge the limitations and strengths of the technology, and design projects that are both ethical and effective. Being aware of these factors also helps me avoid unintentional misuse and encourages me to document my own work with the same level of care, so others can build on it responsibly.

Code Link: https://editor.p5js.org/my3037/full/wW1_s1f3x

Demo Video: https://drive.google.com/file/d/18NPl0VdxsAbqBeKgpaXR8jBVEIZmqs4v/view?usp=sharing