Stability AI Launches Japanese StableLM Alpha, Redefining the Generative AI Landscape

AI News

Stability AI Launches Japanese StableLM Alpha, Redefining the Generative AI Landscape

Jimmy W.

August 14, 2023

Stability AI Launches Japanese StableLM Alpha, Redefining the Generative AI Landscape

The Introduction of Japanese StableLM Alpha: A Breakthrough in Generative AI

Stability AI, a leading generative AI company, has made a significant advancement in the Japanese generative AI landscape with the introduction of its first Japanese Language Model (LM), Japanese StableLM Alpha. This groundbreaking model is touted as the most advanced publicly available model for Japanese speakers. It has undergone extensive benchmark evaluation against four other Japanese LMs, solidifying its position as an industry leader.

The Impressive Features of Japanese StableLM Alpha

Japanese StableLM Alpha is a versatile and high-performing tool for various linguistic tasks. With its architecture of 7 billion parameters, it showcases Stability AI’s commitment to technological advancement. It outperforms other models in multiple categories, establishing itself as a standout in the field.

The Development Process and Collaborative Efforts

The Japanese StableLM Alpha 7B commercial iteration, which will be released under the Apache License 2.0, is the result of extensive training on a massive dataset of 750 billion tokens from both Japanese and English texts. Stability AI collaborated with the EleutherAI Polyglot project’s Japanese team and its Japanese community to create the datasets. The development process also involved the use of an extended version of EleutherAI’s GPT-NeoX software.

The Japanese StableLM Instruct Alpha: A Milestone for Research

In addition to Japanese StableLM Alpha, Stability AI has also introduced the Japanese StableLM Instruct Alpha 7B. This model is specifically designed for research purposes and boasts a unique ability to adhere to user instructions. It utilizes a methodical approach called Supervised Fine-tuning (SFT) using multiple open datasets.

Evaluation and Comparison with Competitors

The Japanese StableLM models underwent rigorous evaluations using EleutherAI’s Language Model Evaluation Harness. They were tested across various domains, including sentence classification, sentence pair classification, question answering, and sentence summarization. With an average score of 54.71%, the Japanese StableLM Instruct Alpha 7B outperformed its competitors, showcasing its superior performance and capabilities.

SoftBank’s Recent Announcement and the Future of Japanese Language Models

SoftBank, a major player in the Japanese market, recently announced its venture into homegrown Large Language Models (LLMs) for the Japanese market. The company has allocated a substantial budget of 20 billion JPY ($140 million) for its generative AI computing platform. As the competition in the field of generative AI continues to unfold, it remains to be seen which Japanese Language Model will ultimately establish its supremacy.

Source link