Logic Labyrinth
Dall-E 3 illustration of a selection of LLM models
Dall-E 3 illustration of a selection of LLM models

Large Language Models and where to find them

Written by Dan Hubbert

Introduction

This isn’t meant to be a comprehensive list of Large Language Models and hosts, rather it lists models I’ve used and where online they were hosted. When I started writing scripts that ran prompts against multiple models, I found it difficult to find the models I wanted to use - or at least to find where they were hosted online.

Models

Open AI GPT3.5, GPT4 & GPT4 Turbo

Pretty obvious entry for the list. You can apply for API access, but it’s not guaranteed. It does seem to be more available than it was when GPT3.5 and GPT4 were new.

Open AI has different specialized versions of its models for different applications, such as versions for working with Tools/Functions, more up-to-date versions trained on more recent data or versions with larger content windows. Unfortunately, not all of these are generally available through the API interface.

Open AIs full list of models is pretty extensive - here is the highlights as of November 2023:

MODELDESCRIPTION
GPT-4 and GPT-4 TurboA set of models that improve on GPT-3.5 and can understand as well as generate natural language or code
GPT-3.5A set of models that improve on GPT-3 and can understand as well as generate natural language or code
DALL·EA model that can generate and edit images given a natural language prompt
TTSA set of models that can convert text into natural sounding spoken audio
WhisperA model that can convert audio into text
EmbeddingsA set of models that can convert text into a numerical form
ModerationA fine-tuned model that can detect whether text may be sensitive or unsafe
GPT baseA set of models without instruction following that can understand as well as generate natural language or code

Anthropic Claude-Instant and Claude-2

Anthropic has Claude-Instant and Claude-2, positioned as direct competitors to GPT3 and GPT4. The performance of the models is very good, though they specialize in applications like chat interfaces and summarization. I haven’t seen anything from them to rival Open AI’s specialized model versions for Functions.

Anthropic models also have a huge context window of 100k tokens, which means you can do summarization without needing to use complex strategies to process the whole of large documents. On its own that makes them worth experimenting with.

Unfortunately, API access is still quite restricted - you can apply but there is no guarantee of being accepted.

Meta AI Llama models

Meta AI has released its model as Open Source. This is a tremendous resource for the community, allowing experiments with fine-tuning a powerful model made with billions of parameters. Unfortunately, it also means that to work with it you either need to host it yourself or find someone who has made it available for public use.

I found DeepInfra perfect for my purposes, as they have the 7 billion, 13 billion and 70 billion parameter Llama models available. They also host a wide variety of other open-source models, including text generation, image generation and speech recognition models.

They also have a chat interface available with a selection of open-source text generation models available for use.

Cohere Command

Command is Cohere’s flagship model, and is available through their API. It’s also available through their Coral chat interface.

Command is a powerful model, not quite in the league of Open AI or Anthropic models, but able to compete or even outdo Meta’s Llama models. I include it in model comparisons because it’s a good benchmark to show how well the more sophisticated models perform.

Others

Trying to stay on top of developments in LLMs is almost impossible due to the incredible pace the field is moving. I haven’t listed HuggingFace on here, despite having used models hosted there before. They have a huge selection of models available, but only host some of the lower-powered versions and the performance of their API is poor. Having got a lot of errors from it I dropped it and swapped to using DeepInfra instead.

HuggingFace does have a lot of data sets and other resources though, so it’s worth checking out.

Conclusion

There are so many models available now, many of which I’d love to try out. I’m keeping the scope of this list to models I’ve used though, as I know all of these work reliably.