An analysis of nearly 4,000 public datasets finds that over 90% of AI training datasets came from Europe and North America, and fewer than 4% came from Africa
that's reshaping the infrastructures of our world in ways that reflect the interests of those big corporations.”

Niall Firth / @niallfirth : New findings show how the sources of data are concentrating power in the hands of the most powerful tech companies - new today from @melissahei.bsky.social www.technologyreview.com/2024/12/18/ 1...

@hypervisible : Data Provenance Initiative “audited nearly 4,000 public data sets spanning over 600 languages, 67 countries, and three decades. The data came from 800 unique sources and nearly 700 organizations.”

Melissa Heikkilä / @melissahei : New research reveals a worrying trend: AI's data practices risk concentrating power overwhelmingly in the hands of dominant technology companies. With analysis from @shaynelongpre.bsky.social @sarahooker.bsky.social @smw.bsky.social @giadapistilli.com www.technologyreview.com/2024/12/18/ 1...