- Something you find online. For example, take a look at Kaggle, awesome datasets or this list of datasets.
- https://www.registry.jockeyclub.com/registry.cfm?page=releasedNames&CFID=75624210&CFTOKEN=146476e202081137-69D31F18-5056-BE0C-977E746CB080EB2A
- Above is the dataset of horse games in Jockey Clubs. I particularly enjoyed this dataset because, to be honest, it was a funny data. The names are often comical, and they remind me of names that are generated by programs such as “random name generators.”
- I think I was also drawn to this dataset because I am taking an ITP class called Alter Egos, and we have to pick an alter ego of ourselves and name it; the horse name dataset reminds me of the names that people could use. I also feel like a lot of these names sound like cartoon characters and/or superheroes, which I enjoy.
- I’m glad most of the names are not human-like (ex: John Smith, Sarah Meyers, James Brown, etc.). I think having such literal names would throw a lot of people off.
- Find a dataset that you collect yourself or is already being collected about you. For example, personal data like steps taken per day, browser history, minutes spent on your mobile device, sensor readings, and more.
- Below are a few that I could think of on top of my head:
- screen time – collected automatically
- brwoser hisroey -collected automatically
- heart rate monitor on the treadmil – collected automatically
- miles walked
- calories burnt
- time stamp of how long the workout was
- search history -collected automatically
- youtube history – collected automatically
- what i eat – manually tracked by me
- sometimes calories intake too
- what i like/save on social media – collected automatically/and manually in a way, since I am actively liking them?
- my weekly hours for work – clocking in hours. – collected manually
- duty logs (I need to write down all the interactions I have with the dorm residents in a log) – collected manually
- card usage – collected automatically