10 Comments

Really liking the idea of a CHAT stack. Fantastic explanation, and the first explanation I've seen of prompt tokens vs. completion tokens. Great stuff, Jon!


I'm doing a writeup on smaller models that can run locally. That field is advancing rapidly, and it can no longer be handwaved away in favor of OpenAI's brute-force magic "in the cloud".


Thank you. This is very helpful.


Jon, I have a question (or anyone can answer). In the four-stage chatbot setup, you show a CONTENT icon that I think is the user's (or company's) specific content, kept separate from a public LLM. I'm wondering if I understand this correctly: that CONTENT store is a vector database of imported content that has been converted to embeddings (vectors). This makes it possible for a user to leverage the strength of an LLM as a conversational interface to get an answer from the CONTENT store?

I'm just trying to understand this. I assume this is, for example, what PINECONE.IO is offering as a solution provider.
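For what it's worth, the lookup step described above can be sketched roughly like this. This is a toy illustration, not Pinecone's actual API: the hand-made 3-d vectors and the `retrieve` helper are hypothetical stand-ins for real embeddings from a model and a real vector database.

```python
import math

# Toy vector store: (text chunk, embedding) pairs. In practice the
# embeddings would come from an embedding model; here they are
# hand-made 3-d vectors just to illustrate the lookup.
store = [
    ("Refunds are processed within 5 business days.", [0.9, 0.1, 0.0]),
    ("Our office is open Monday through Friday.",     [0.1, 0.8, 0.2]),
]

def cosine(a, b):
    # Cosine similarity: how closely two embedding vectors point
    # in the same direction, regardless of their length.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def retrieve(query_vec):
    # Return the stored chunk whose embedding is closest to the query.
    return max(store, key=lambda item: cosine(item[1], query_vec))[0]

# A question about refunds, embedded the same way, lands nearest the
# first chunk; that chunk is then pasted into the LLM prompt as context,
# so the model answers from the CONTENT store rather than from memory.
context = retrieve([0.85, 0.15, 0.05])
prompt = f"Answer using only this context:\n{context}\n\nQ: How long do refunds take?"
```

The LLM never stores the private content; it only sees whatever chunks the similarity search pulls into the prompt on each turn.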


This is what LoRAs are supposed to handle, isn't it?


The other thing people need, even if they don't realize it yet, is to avoid becoming dependent on the whims of OpenAI.
