BETA
This is a BETA experience. You may opt-out by clicking here

More From Forbes

Edit Story

Databricks’ New Open Source LLM

Following

Data analytics company Databricks says its mission is to deliver data intelligence to every enterprise by allowing organizations to understand and use their unique data to build their own AI systems. Central to that mission is the ability to use a large language model tailored to the needs of the enterprise.

Databricks addresses the need for open LLMs with the release of DBRX, a new open, general-purpose large language model that sets new benchmarks for performance and efficiency. The announcement continues the recent trend of open large language models adapted for the needs of the enterprise.

What is DBRX?

The open-source DBRX large language model was developed by Databricks’ Mosaic Research team, which it acquired in June 2023 as part of its MosaicML acquisition.

Built as a Mixture-of-Experts model, DBRX achieves remarkable processing speed efficiency without compromising model performance. It operates with 36 billion parameters at any given time out of 132 billion, showcasing a balance between speed and quality.

Databricks showed off DBRX’s performance relative to competing LLMs, releasing performance numbers that show the model exceeding existing open-source models and OpenAI’s GPT-3.5 across multiple standard benchmarks.

Databricks emphasizes the model's customizability, enabling enterprises to leverage DBRX to create AI applications tailored to their unique data and requirements. The open-source license allows enterprises to tailor DBRX to their specific needs, offering the potential for better performance compared to proprietary models.

Databricks Integration

DBRX was developed entirely on the Databricks platform, utilizing tools like Unity Catalog for data governance, Apache Spark for data processing, and Mosaic AI Training for model training and fine-tuning. It's this deep integration that brings out the total value of the solution.

DBRX is accessible to Databricks customers through APIs, allowing seamless integration into existing workflows and applications. This enables users to leverage DBRX's capabilities directly within the Databricks environment.

Databricks also provides the tools and infrastructure for customers to pre-train their DBRX-class models from scratch or continue training on top of Databricks-provided checkpoints. This flexibility allows organizations to tailor the model's capabilities to enterprise-specific needs.

DBRX has been integrated into Databricks' GenAI-powered products and is already showing promising results. For instance, in applications like SQL query generation and optimization, Databricks tells us that early rollouts of DBRX have surpassed previous models' capabilities and demonstrate competitive performance against even more advanced models.

Analyst’s Take

The release of DBRX continues the emerging trend of open LLMs to allow enterprises to tailor the technology to meet the specific needs of their environment. This follows the releases of Meta’s LLaMa models, Google's Mistral, and EleutherAI’s GPT-Neo and GPT-J LLMs. It also exists in a market dominated by commercial solutions from companies like OpenAI and Anthropic.

The MoE architecture used in DBRX, which balances high performance with efficient processing speed, addresses one of the critical challenges in AI model deployment—resource-intensive operations. This makes DBRX a powerful tool for AI tasks and an economically viable option for a broader range of enterprises.

The work done by Databricks' Mosaic team is impressive. Outperforming established models, including GPT-3.5, on several benchmarks positions Databricks as a formidable player in the AI space, setting the stage for the company to attract a broader base to its platform.

For enterprises, accessing and integrating DBRX through Databricks offers a direct pathway to incorporating advanced AI into their operations. The model's language understanding and code generation strengths are particularly relevant for automating tasks, enhancing data analysis, and improving decision-making processes. This is an area where Databricks has always excelled.

Open-sourcing DBRX opens opportunities for the model to be improved and expanded upon by the global AI research community. Databricks can further harness this collective expertise to enhance DBRX, fostering a vibrant ecosystem around its AI technologies.

At the same time, DBRX's success in the broader market will depend on how easily organizations can integrate and customize the model for their specific use cases. Databricks must ensure robust support and resources are available to facilitate adoption.

The release of DBRX underlines Databricks' commitment to democratizing AI while also setting a new benchmark for the performance, customization, and efficiency of AI models within the industry. Databricks' initiative could lead to more widespread and effective use of AI across industries, driving the next wave of AI-enabled transformation–precisely what you'd expect of a company whose mission is to deliver data intelligence to every enterprise.

Follow me on Twitter or LinkedIn