Cameron R. Wolfe, Ph.D.

3.6K Followers

Published in

Towards Data Science

·5 days ago

The History of Open-Source LLMs: Imitation and Alignment (Part Three)

Open-source LLMs need alignment to become truly remarkable… — A majority of prior research on open-source large language models (LLMs) focused heavily upon creating pre-trained base models. However, these models have not undergone any fine-tuning, so they fail to match the quality of top closed-source LLMs (e.g., ChatGPT or Claude) due to their lack of alignment. Paid models are…

Artificial Intelligence

20 min read


Published in

Towards Data Science

·Nov 18

The History of Open-Source LLMs: Better Base Models (Part Two)

How LLaMA, MPT, Falcon, and LLaMA-2 put open-source LLMs on the map… — Open-source research on large language models (LLMs) is incredibly valuable, as it aims to democratize a powerful and influential technology. Although open-source LLMs are now commonly used and widely studied, this area of research saw some initial struggles that were difficult to overcome. Namely, open-source LLMs performed poorly at first…

Artificial Intelligence

16 min read


Published in

Towards Data Science

·Nov 7

The History of Open-Source LLMs: Early Days (Part One)

Understanding GPT-Neo, GPT-J, GLM, OPT, BLOOM, and more… Research on language modeling has a long history that dates back to models like GPT and GPT-2 or even RNN-based techniques (e.g., ULMFit) that predate modern, transformer-based language models. Despite this long history, however, language models have only become popular relatively recently. The first rise in popularity came with the…

Artificial Intelligence

20 min read


Published in

Towards Data Science

·Oct 29

Data is the Foundation of Language Models

How high-quality data impacts every aspect of the LLM training pipeline… — Large Language Models (LLMs) have been around for quite some time, but only recently has their impressive performance warranted significant attention from the broader AI community. With this in mind, we might begin to question the origin of the current LLM movement. What was it that actually made recent models…

Artificial Intelligence

16 min read


Published in

Towards Data Science

·Oct 24

Falcon: The Pinnacle of Open-Source LLMs

The gap between open-source and proprietary LLMs continues to shrink… — Recent research in open-source large language models (LLMs) has mostly focused upon two areas: imitation learning and pre-training open-source base models. Though both approaches are viable, the creation of high-quality, open-source base models is especially enticing, as these models can be further fine-tuned (at a lower cost) and used in…

Artificial Intelligence

14 min read


Published in

Towards Data Science

·Oct 15

Democratizing AI: MosaicML’s Impact on the Open-Source LLM Movement

How high-quality base models unlock new possibilities for an entire industry… Recently, we have surveyed much of the current research on the creation of open-source large language models (LLMs). Across all of this work, models are created using a common framework with a few simple components.

Artificial Intelligence

13 min read


Published in

Towards Data Science

·Sep 30

Orca: Properly Imitating Proprietary LLMs

Leveraging imitation to create high-quality, open-source LLMs… — As research progresses on large language models (LLMs), one key question that remains unanswered is whether an existing, high-quality LLM can be used to effectively train another LLM. Currently, there is a lot of debate and contention around this topic. The recent explosion of open-source imitation models initially indicated that…

Artificial Intelligence

16 min read


Published in

Towards Data Science

·Sep 27

Imitation Models and the Open-Source LLM Revolution

Are proprietary LLMs like ChatGPT and GPT-4 actually easy to replicate? The proposal of the LLaMA suite [2] of large language models (LLMs) led to a surge in publications on the topic of open-source LLMs. In many cases, the goal of these works was to cheaply produce smaller, open-source LLMs (for research purposes) that have comparable quality to proprietary models like…

Artificial Intelligence

15 min read


Published in

Towards Data Science

·Sep 17

Can Language Models Make Their Own Tools?

LaTM, CREATOR, and other closed-loop frameworks for LLM tool usage… — In recent overviews, we have explored the utility of augmenting large language models (LLMs) with external tools. These models can be taught to leverage tools in a variety of ways. However, we should realize that existing tool-following LLMs leverage only a limited set of potential tools [3], whereas the range…

Artificial Intelligence

16 min read


Published in

Towards Data Science

·Sep 4

Language Models and Friends: Gorilla, HuggingGPT, TaskMatrix, and More

What happens when we give LLMs access to thousands of deep learning models? Recently, foundation models have risen to popularity within deep learning research. Pre-trained large language models (LLMs) have led to a new paradigm in which a single model can be used, with surprising success, to solve many different problems. Despite the popularity of generic LLMs…

Artificial Intelligence

18 min read

Director of AI @ Rebuy • Deep Learning Ph.D. • I make AI understandable
