In Amazon CTO Werner Vogels’ technology predictions for 2024, he envisions a near future where generative AI (Artificial Intelligence) becomes more culturally aware. This involves training large language models (LLMs), a type of AI algorithm, on a diverse range of data, leading to more nuanced and accurate results. The goal is to make generative AI more accessible and useful to users worldwide.
However, LLMs rely on data from the internet, which is mostly associated with high-resource languages. This term typically refers to languages, such as English, which have an abundance of linguistic resources available for natural language processing (NLP) tasks.
It is critical that organisations gain the ability to easily customise their LLMs with local data in their native languages to foster social inclusion, stimulate economic growth by opening up new markets, and create improved citizen experiences.
Culture influences everything. Recognising this, AI Singapore (AISG), a national program launched by Singapore’s National Research Foundation to enhance the country’s AI capabilities, is making its LLMs more culturally-accurate, localised, and tailored to Southeast Asia.
SEA-LION Family—a First for the Region
Building on AWS’s scalable compute infrastructure, AISG developed SEA-LION, a family of LLMs that is specifically pre-trained and instruct-tuned (a powerful fine-tuning method that allows for greater control over LLM behavior) for Southeast Asian languages and cultures.
A first for the region, SEA-LION also serves as the foundation for Singapore’s National Multimodal LLM Program, contributing to the island’s capabilities in AI research and innovation. This initiative aligns with the National AI Strategy 2.0 that outlines plans to deepen the use of AI in Singapore.
The model will focus on more commonly used languages in Southeast Asia, including Bahasa Indonesia, Bahasa Melayu, Thai, and Vietnamese, and will eventually be extended to include other Southeast Asian languages like Burmese and Lao.
Building regionally represented LLMs also requires rich, hyper-local data in relevant languages. An example of local language nuances is the term “LOL” (an abbreviation of “laughing out loud” in English). In Thailand, people commonly use “55555” while Indonesians often use “wkwkwk.” LLMs trained on culturally diverse training data, like SEA-LION, enhances the ability of generative AI applications to grasp nuanced aspects of human experiences and navigate complex societal challenges.
Accelerating Hyper-Local Generative AI in Southeast Asia
SEA-LION will be available on Amazon SageMaker JumpStart by February 2024. The platform provides pre-trained, publicly available models to help customers around the world get started with machine learning (ML).
The relatively small 3-billion and 7-billion parameter variants of SEA-LION that have been released so far were trained using Amazon EC2, a service that provides scalable compute capacity in the cloud.
These smaller variants are more flexible and accessible than commonly used LLMs in the market today, which typically have hundreds of billions of parameters.
By the end of January 2024, AISG will launch a commercial version of instruction tuning parameters for SEA-LION, which will have greater capacity to capture nuances in Southeast Asian languages, improve contextual understanding, enhance multilingual reasoning, and generate context-rich outputs.
“AISG’s focus on underrepresented languages throughout Southeast Asia is significant. As the majority of our customers at Tokopedia interact with us in Bahasa Indonesia, developing models with that local fluency will enhance our ability to connect with customers and improve their experiences,” said Paul Condylis, Associate Vice President of Data Science at Tokopedia. “We’re in the early stages of experimenting with SEA-LION and the progress is promising.”
“As technology rapidly evolves in Asia-Pacific, we see an exciting opportunity to tailor language models to reflect the region’s rich linguistic diversity and revolutionise the way information is processed and shared. We are pleased to work closely with AISG on the SEA-LION LLM project that leverages AWS resources. This project helps to bridge the digital divide by improving cross-cultural communication and providing enhanced localised experiences. Furthermore, businesses and governments can tap into NCS’ deep experience in addressing business needs through the use of customised LLM-powered applications and harness the full potential of the SEA-LION family of LLMs,” said Ying Shaowei, Chief Scientist at NCS.
Cost-Effective Machine Learning and Compute
Building, training, and deploying an LLM requires time, significant compute resources, and expertise. AISG is working with AWS to address these challenges by harnessing the cloud’s powerful performance for tasks like Machine Learning (ML) training and high-performance computing using NVIDIA A100 Tensor Core GPUs, which deliver top-notch throughput and fast, responsive networking.
The compact size of SEA-LION also makes it relatively more cost-effective and efficient than larger LLMs, which can have hundreds of times more parameters. Smaller LLMs allow developers to deploy faster, are cheaper to fine-tune, and perform more quickly during inference. The ease of deploying smaller LLMs on mobile devices, or at the edge, also helps businesses more easily adopt and create applications.
“Building an LLM requires reliable cloud infrastructure that is readily available exactly when needed, and AWS is instrumental in helping us scale cost-effectively. We built a 3 billion parameter LLM in just three months with AWS, and we have since scaled the model to 7 billion parameters, extending its reach to more audiences,” said Dr Leslie Teo, senior director of AI Products at AISG.
“By working with AWS, we can focus solely on training our models instead of managing infrastructure. This accelerates the development of unique LLMs that reflect our region’s diversity,” added Teo.
Elsie Tan, Country Manager of Worldwide Public Sector in Singapore at AWS, said: “Language and culture-specific LLMs like AISG’s SEA-LION will enable smoother cross-cultural communication and understanding, preserve cultural nuances, and help governments and businesses better serve citizens and customers in Southeast Asia.
She added: “We’re excited to help AISG unlock the potential of generative AI to deliver significant benefits to ASEAN’s diverse economies. We look forward to helping AISG extend the reach of its LLMs with our extensive network of customers and partners in the region.”
Archive
- October 2024(44)
- September 2024(94)
- August 2024(100)
- July 2024(99)
- June 2024(126)
- May 2024(155)
- April 2024(123)
- March 2024(112)
- February 2024(109)
- January 2024(95)
- December 2023(56)
- November 2023(86)
- October 2023(97)
- September 2023(89)
- August 2023(101)
- July 2023(104)
- June 2023(113)
- May 2023(103)
- April 2023(93)
- March 2023(129)
- February 2023(77)
- January 2023(91)
- December 2022(90)
- November 2022(125)
- October 2022(117)
- September 2022(137)
- August 2022(119)
- July 2022(99)
- June 2022(128)
- May 2022(112)
- April 2022(108)
- March 2022(121)
- February 2022(93)
- January 2022(110)
- December 2021(92)
- November 2021(107)
- October 2021(101)
- September 2021(81)
- August 2021(74)
- July 2021(78)
- June 2021(92)
- May 2021(67)
- April 2021(79)
- March 2021(79)
- February 2021(58)
- January 2021(55)
- December 2020(56)
- November 2020(59)
- October 2020(78)
- September 2020(72)
- August 2020(64)
- July 2020(71)
- June 2020(74)
- May 2020(50)
- April 2020(71)
- March 2020(71)
- February 2020(58)
- January 2020(62)
- December 2019(57)
- November 2019(64)
- October 2019(25)
- September 2019(24)
- August 2019(14)
- July 2019(23)
- June 2019(54)
- May 2019(82)
- April 2019(76)
- March 2019(71)
- February 2019(67)
- January 2019(75)
- December 2018(44)
- November 2018(47)
- October 2018(74)
- September 2018(54)
- August 2018(61)
- July 2018(72)
- June 2018(62)
- May 2018(62)
- April 2018(73)
- March 2018(76)
- February 2018(8)
- January 2018(7)
- December 2017(6)
- November 2017(8)
- October 2017(3)
- September 2017(4)
- August 2017(4)
- July 2017(2)
- June 2017(5)
- May 2017(6)
- April 2017(11)
- March 2017(8)
- February 2017(16)
- January 2017(10)
- December 2016(12)
- November 2016(20)
- October 2016(7)
- September 2016(102)
- August 2016(168)
- July 2016(141)
- June 2016(149)
- May 2016(117)
- April 2016(59)
- March 2016(85)
- February 2016(153)
- December 2015(150)