> While I haven't tested it extensively, 70B model is supposed to rival Chat GPT...

scarface_74 · on Aug 14, 2023

To your first point, I was trying use ChatGPT to generate some examples of negative interactions with customer service to show sentiment analysis in action for a project I was working on.

I had to do all types of workarounds for it to generate something useful without running into the guardrails.

pseudosavant · on Aug 14, 2023

I’ll second the context window too. I’ve been really impressed with Claude 2 because it can address such a larger context than I could feed into GPT4.

ramraj07 · on Aug 14, 2023

Could you give examples of smaller models trained on specific datasets?

antupis · on Aug 14, 2023

it can be almost anything like your HN comments or some corporate wiki, then get colab pro 10$ month or some juicy gaming machine and fine-tune that using eg this tutorial https://www.philschmid.de/instruction-tune-llama-2 but https://www.reddit.com/r/LocalLLaMA/ is full of different fine tuned models.

CodeCompost · on Aug 14, 2023

Can it handle other languages besides English?

e12e · on Aug 14, 2023

Not anywhere near as well as ChatGPT 4 (for chat anyway - maybe the model is better)?

Prompt:

> Hvad tycks om at fika nu?

ChatGPT 4

> Det låter som en trevlig idé! Fika är ju alltid gott. Vad skulle du vilja ha till din fika? (Oj, ursäkta för emojis! )

https://chat.openai.com/share/8e89a16f-f182-4f62-b9fa-f93cd5...

Llama2:

> I apologize, but I don't understand what you mean by "fika nu." Could you please provide more context or clarify your question so I can better assist you?

https://hf.co/chat/r/kOF2qst

jmorgan · on Aug 14, 2023

RE 2 - neat! What are some tasks you've been using smaller models (with perhaps larger context sizes) for?