Databricks releases Dolly 2.0, a powerful and open large language model for commercial use

Databricks is introducing Dolly 2.0, an open source, instruction-following LLM, fine-tuned on a human-generated instruction dataset licensed for research and commercial use.

Dolly 2.0 is a 12B-parameter language model based on the EleutherAI Pythia model family and fine-tuned exclusively on a new, high-quality, human-generated instruction-following dataset crowdsourced among Databricks employees.

Databricks is open-sourcing the entirety of Dolly 2.0, including the training code, the dataset, and the model weights, all suitable for commercial use. This means that any organization can create, own, and customize powerful LLMs that can talk to people, without paying for API access or sharing data with third parties.

The company heard repeatedly from its customers that they would be best served by owning their models, which would allow them to create higher-quality models for their domain-specific applications without handing their sensitive data over to third parties.

Databricks also believes that the important issues of bias, accountability, and AI safety should be addressed by a broad community of diverse stakeholders rather than just a few large companies. Open-sourced datasets and models encourage commentary, research and innovation that will help to ensure everyone benefits from advances in artificial intelligence technology, according to the company.

To download the Dolly 2.0 model weights, visit the Databricks Hugging Face page. To download the databricks-dolly-15k dataset, visit the Dolly repo on databricks-labs.

For more information about this news, visit www.databricks.com.
