This week in AI | Week 2


This week in artificial intelligence, Google revealed Gemma and Hugging Face introduced Cosmopedia. Additionally, Amazon announced that Mistral AI’s open-source, high-performing models will soon be accessible on Amazon Bedrock. By bringing such powerful AI to more users via cloud-based services, these moves open the door to new applications. The advancements show how quickly AI is developing; in certain areas, systems are already capable of tasks that humans could not perform on their own, and the frontier of what is possible is being redefined at a rapid rate. With the AI community delivering new and seemingly unimaginable achievements every week, there are exciting times ahead.


Google’s Gemma: A New Family of Open-Source AI Models


Google introduced Gemma, marking a pivotal moment in the democratisation of AI technology. Derived from the same technological lineage as the acclaimed Gemini models, Gemma is designed to foster responsible AI development, offering a suite of lightweight, state-of-the-art open models that promise to revolutionise how developers and researchers build and innovate.

Key Highlights:

  • Innovative Model Architecture: Gemma models, available in 2B and 7B variants, are designed to be both powerful and accessible, ensuring developers can leverage state-of-the-art AI capabilities without the need for extensive resources.
  • Responsible AI Toolkit: Accompanying the models is a toolkit aimed at guiding developers towards creating safer AI applications, reflecting Google’s commitment to ethical AI development.
  • Broad Accessibility: With support for major frameworks and integration with popular tools, Gemma promises to be a versatile option for developers, ensuring the benefits of AI are broadly accessible (see the loading sketch below).
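As one example of that accessibility, the Gemma checkpoints can be loaded through the standard Hugging Face Transformers API. The snippet below is a minimal sketch, assuming the gated google/gemma-2b-it checkpoint on the Hugging Face Hub (you must accept Google’s licence and authenticate before downloading):

```python
# A minimal sketch of running the Gemma 2B instruction-tuned model with
# Hugging Face Transformers. The "google/gemma-2b-it" checkpoint id is
# the one published on the Hub; access is gated behind Google's licence.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/gemma-2b-it"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Tokenise a prompt, generate a completion, and decode it.
inputs = tokenizer("What makes an AI model 'open'?", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```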

Hugging Face Introduces Cosmopedia: The Largest Open Synthetic Dataset


Hugging Face has recently unveiled Cosmopedia v0.1, the largest open synthetic dataset to date, consisting of over 30 million samples generated by Mixtral 8x7B. The dataset spans content types such as textbooks, blog posts, stories, and WikiHow-style articles, totalling an impressive 25 billion tokens.

The Genesis of Cosmopedia

Cosmopedia v0.1 stands out as an ambitious attempt to assemble world knowledge, building on seed samples mapped from web datasets such as RefinedWeb and RedPajama. It is divided into eight distinct splits, each created from a different type of seed sample, and covers a wide range of topics aimed at a variety of audiences and interests.

Accessing Cosmopedia

Hugging Face provides code snippets for loading individual dataset splits, making the dataset easy to work with (a minimal example follows below). A smaller subset, Cosmopedia-100k, is also available for those who want a more manageable starting point, and Cosmo-1B, a language model trained on Cosmopedia, further demonstrates the dataset’s scalability and adaptability.
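This sketch assumes the HuggingFaceTB/cosmopedia repository id and the "stories" configuration name from the Hugging Face dataset card, along with its "prompt" and "text" field names; streaming avoids downloading all 25 billion tokens up front:

```python
# A minimal sketch of loading one Cosmopedia split with the `datasets`
# library. The repo id ("HuggingFaceTB/cosmopedia"), config ("stories"),
# and field names are taken from the Hugging Face dataset card.
from datasets import load_dataset

# Stream rather than download: the full dataset is ~25B tokens.
ds = load_dataset("HuggingFaceTB/cosmopedia", "stories", split="train", streaming=True)

for sample in ds.take(1):
    print(sample["prompt"][:200])  # the seed prompt given to Mixtral
    print(sample["text"][:200])    # the generated synthetic text
```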

Minimising Redundancy

Cosmopedia was designed to maximise diversity while minimising redundancy. Through targeted prompt styles and audiences, continual prompt refinement, and MinHash deduplication, the dataset achieves a remarkable breadth of coverage and originality of content.
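In general, MinHash deduplication works by hashing each document’s token set into a compact signature and dropping documents whose signatures collide above a similarity threshold. The sketch below illustrates the general technique with the `datasketch` library; it is not Cosmopedia’s actual pipeline, and the 128 permutations, unigram shingles, and 0.8 Jaccard threshold are illustrative assumptions:

```python
# An illustrative sketch of MinHash near-duplicate filtering with the
# `datasketch` library. This shows the general technique, not the exact
# pipeline used for Cosmopedia; the parameters below are assumptions.
from datasketch import MinHash, MinHashLSH

def minhash(text: str, num_perm: int = 128) -> MinHash:
    m = MinHash(num_perm=num_perm)
    for word in set(text.lower().split()):  # unigram shingles
        m.update(word.encode("utf-8"))
    return m

docs = {
    "a": "the quick brown fox jumps over the lazy dog",
    "b": "the quick brown fox jumps over the lazy dog today",  # near-duplicate
    "c": "synthetic textbooks cover a wide range of topics",
}

lsh = MinHashLSH(threshold=0.8, num_perm=128)
kept = []
for key, text in docs.items():
    sig = minhash(text)
    if lsh.query(sig):  # a sufficiently similar doc was already kept
        continue        # -> drop this one as a near-duplicate
    lsh.insert(key, sig)
    kept.append(key)

print(kept)  # likely ["a", "c"]: "b" is filtered as a near-duplicate of "a"
```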


Mistral AI Models on Amazon Bedrock

Mistral and Amazon Bedrock

In another exciting development, Mistral AI, a France-based AI company known for its fast and secure large language models (LLMs), is set to make its models available on Amazon Bedrock. Mistral AI will join as the 7th foundation model provider on Amazon Bedrock, alongside leading AI companies.

Overview of Mistral AI Models:

  • Mistral 7B: This foundation model excels at English text-generation tasks, including natural coding capabilities, and is optimised for low latency, low memory requirements, and high throughput.
  • Mixtral 8x7B: A high-quality sparse Mixture-of-Experts (MoE) model, ideal for tasks ranging from text summarisation to code generation, offering a strong balance of cost and performance.

Why Mistral AI Models Stand Out:

  • Cost-Performance Balance: Mistral AI models offer an efficient, affordable, and scalable solution, striking a remarkable balance between cost and performance.
  • Fast Inference Speed: Optimised for low latency, these models are designed for scalable production use cases.
  • Transparency and Trust: Mistral AI’s commitment to transparency and customisation meets the needs of organisations facing stringent regulatory requirements.
  • Accessibility: Mistral AI models make it possible for organisations of any size to integrate generative AI features into their applications.

Availability:

Mistral AI models are set to be publicly available on Amazon Bedrock soon, promising to provide developers and researchers with more tools to innovate and scale their generative AI applications.
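Once the models go live, invoking them should follow Amazon Bedrock’s standard runtime pattern. The sketch below uses boto3; the model identifier and the request and response schemas are assumptions based on Bedrock’s conventions, since the Mistral models were not yet publicly available at the time of writing:

```python
# A hedged sketch of calling a Mistral model through the Amazon Bedrock
# runtime API with boto3. The model id and request/response schemas are
# assumptions: confirm them against the Bedrock documentation once the
# models are actually available in your region.
import json
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

body = {
    "prompt": "<s>[INST] Summarise what a Mixture-of-Experts model is. [/INST]",
    "max_tokens": 200,
    "temperature": 0.5,
}

response = client.invoke_model(
    modelId="mistral.mistral-7b-instruct-v0:2",  # assumed identifier
    body=json.dumps(body),
)
result = json.loads(response["body"].read())
print(result["outputs"][0]["text"])  # assumed response shape
```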


Conclusion

This week’s developments highlight a strong push towards more open, accessible, and responsible AI technologies. With Google’s Gemma offering cutting-edge models for responsible development, Hugging Face’s Cosmopedia expanding the scope of synthetic-data research, and Mistral AI’s strategic move to Amazon Bedrock, the AI community continues to drive inclusive, diverse, and ethically grounded innovation.

Neha
February 24, 2024
