Cohere Labs unveils AfriAya, a vision-language dataset aimed at improving how AI models understand African languages and ...
AnyGPT is a new multimodal large language model (LLM) that can be trained stably without changing the architecture or training paradigm of existing LLMs. AnyGPT relies solely on data-level ...
Mistral Small 3.1 is a new advanced open-source language model designed to handle both text and image-based tasks with remarkable efficiency and precision. Released under the Apache 2.0 license, it ...
Chinese AI startup Zhipu AI, also known as Z.ai, has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and ...
Artificial Intelligence (AI) has undergone remarkable advancements, revolutionizing fields such as general computer vision and natural language processing. These technologies, integral to the broader ...