Falcon 180B — the largest open LLM

Is this the best open LLM yet? Better than Llama 2?

10 min readSep 10, 2023

Falcon 180B is a Large Language Model (LLM) that was released on September 6th, 2023 by the Technology Innovation Institute. This model is a descendant of the Falcon 40B model. Here’s a quick overview of the model:

180B parameter model in two versions (base and chat)
trained on 3.5 trillion tokens using the RefinedWeb dataset
context width of 2048 tokens

Falcon 180B is the largest publicly available model on the Hugging Face model hub. It is about the size of ChatGPT (GPT-3.5) which has 175B parameters. Is it the best?

Fine-tuning Large Language Model (LLM) on a Custom Dataset with QLoRA | MLExpert - Crush Your…

Can you train your own LLM using your own data? Can you accomplish this without sharing your data with third-party…

www.mlexpert.io

While the Falcon 180B model is publicly available, the commercial use is very restrictive. Please, refer to the license for more details and consult your legal team.

Model Variants (Base and Chat)

Falcon 180B — the largest open LLM

Is this the best open LLM yet? Better than Llama 2?

Fine-tuning Large Language Model (LLM) on a Custom Dataset with QLoRA | MLExpert - Crush Your…

Can you train your own LLM using your own data? Can you accomplish this without sharing your data with third-party…

Model Variants (Base and Chat)

Written by Venelin Valkov