A look at Indian startups like TuluAI, which are building LLMs for low-resource languages by creating data sets nearly from scratch with community involvement
Indian founders are building digital data sets for low-resource languages by involving the community, and are counting on local relevance to compete with big tech platforms.

X:
@restofworld: “Our language is vulnerable and at risk of disappearing. So I took matters into my own hands.” The mission to build better AI tools for India's less common languages https://restofworld.org/...
@restofworld: “Most AI systems are built in the U.S. They don't understand Indian languages or contexts. We need our own models that represent us,” said the founder of TuluAI, one of several AI startups hoping to compete with ChatGPT by focusing on local languages https://restofworld.org/...
@restofworld: “We don't compete with GPT on scale. We compete on relevance.” The mission to build better AI tools for India's less common languages https://restofworld.org/...
@restofworld: India has more than 1,600 languages and dialects. ChatGPT supports around a dozen. That's where these small startups see an opportunity to compete https://restofworld.org/...

LinkedIn:
Rina Chandran: ChatGPT is huge in India. Some startups are building AI tools for less common languages, painstakingly gathering data …