Solving the data crisis in generative AI: Tackling the LLM brain drain

Photo of a person standing on a drain illustrating an article on tackling the generative AI data crisis, or the LLM brain drain, using KaaS.

Today’s generative AI models, particularly large language models (LLMs), rely on training data of an almost unimaginable scale and terabytes of text sourced from the vast expanse of the internet. While the internet has long been viewed as an infinite resource with billions of users contributing new content daily, researchers are beginning to scrutinise the impact of relentless data consumption on the broader information ecosystem.

A critical challenge is emerging. As AI models...

Alibaba introduces new AI models and tools at developer summit

Photo of an Alibaba office as the company announces updates at its developer summit including new Qwen AI models, development tools, and cloud infrastructure upgrades.

Alibaba Cloud has announced several enhancements to its AI offerings during its annual developer summit.

Among the updates are new large language models (LLMs), advanced AI development tools, upgraded cloud infrastructure, and a dedicated program to support global developers.

"Alibaba Cloud is committed to delivering real value to global developers through cutting-edge AI models, enhanced cloud infrastructure, and accessible support programs," said Dongliang Guo, VP of...

Couchbase tackles agentic AI development challenges

Image of a bit illustrating the release of the Capella AI Services suite of tools by Couchbase that promises to streamline the development of secure agentic artificial intelligence applications.

Couchbase has unveiled tools designed to help streamline the development of secure agentic AI applications at scale.

The developer data platform firm's latest offering, Capella AI Services, includes model hosting, automated vectorisation, unstructured data preprocessing, and AI agent catalogue services.

Capella AI Services’ features promise to enable developers to prototype, build, test, and deploy AI agents while maintaining proximity between models and data—helping...

GitHub Copilot now supports multiple LLMs

Picture of a person with a digital brain with multiple coloured waves illustrating the GitHub Copilot AI software development assistant gaining accessing to multiple new LLMs (large language models)

GitHub is bringing more flexibility and choice to Copilot through the integration of multiple large language models (LLMs).

Since its inception, GitHub Copilot has utilised different LLMs for varied uses. The journey began with the deployment of Codex, an early iteration of OpenAI's GPT-3, that was fine-tuned specifically for coding tasks. The evolution continued with the launch of Copilot Chat in 2023, initially using GPT-3.5 and subsequently transitioning to GPT-4. As demands...

JetBrains launches AI model for software development tasks

Image of a brain illustrating the launch of the new Mellum LLM by JetBrains that enhances its AI assistant for developers that specialises in software development tasks.

JetBrains has announced the launch of Mellum, its own AI model specifically engineered for software development tasks.

Mellum has been integrated exclusively into JetBrains’ AI Assistant, reporting dramatic improvements in both speed and accuracy of code completions compared to previous implementations.

Unlike more extensive language models, Mellum has been purposefully designed with a smaller footprint to deliver near-instantaneous coding suggestions. The model...

OpenAI cures structured data headache for developers

OpenAI has unveiled "Structured Outputs", a new API feature designed to address the long-standing challenge of reliably generating structured data from large language models (LLMs). The feature, available now, guarantees that model-generated outputs will adhere to developer-defined JSON Schemas.

Generating structured data from unstructured input is a cornerstone of many AI applications today. Developers leverage the OpenAI API to build sophisticated assistants capable of fetching...

Brave Search now answers coding queries

Brave Search has integrated a new AI feature called CodeLLM designed to provide high-quality answers to coding-related queries. 

CodeLLM summarises potential solutions located across the web and generates step-by-step explanations to common coding problems. It aims to save developers time by delivering concise and actionable responses without having to sift through dozens of search results.

The AI is powered by a large language model called Mixtral which can generate...

Xbox and Inworld AI forge game-changing alliance

Xbox has unveiled a partnership with Inworld AI aimed at transforming game development using AI-powered tools.

In a blog post, Haiyan Zhang, General Manager of Gaming AI at Xbox, reflected on the evolution of gaming AI from the days of Ms Pac-Man to the present. Zhang highlighted the transformative potential of modern AI in creating living worlds, dynamic narratives, and intricate characters.

While traditional rule-based AI set the foundation, the integration of Large...

Docker and partners launch GenAI Stack for developers

During the day two DockerCon keynote, Docker – in collaboration with partners Neo4j, LangChain, and Ollama – introduced the GenAI Stack.

This innovative platform is meticulously designed to empower developers to kickstart their generative AI applications within minutes, eliminating the complexities associated with integrating diverse technologies.

The GenAI Stack offers a seamless solution by providing pre-configured, ready-to-code, and secure components. These...

Reddit to charge for API access over AI training concerns

Social news aggregation and discussion website Reddit will begin charging companies for access to its API.

Reddit says it’s making the decision over concerns about companies using the API to train large language models (LLMs).

The company says that its pricing will be divided into tiers to support companies of different sizes, with different usage limits and broader usage rights offered at each tier. However, the exact pricing details have not yet been...