What can and can't language models do? Lessons learned from BIGBench
Por um escritor misterioso
Descrição
So what exactly can and can’t language models do? What's the least impressive thing GPT-4 won't be able to do? What will GPT-4 be incapable of?
BIGBench is kind of a way to figure this out. BigBench, aka “The Beyond the Imitation Game” Benchmark, is an attempt to explore the capabilities of large language models over a wide variety of tasks. All the tasks are enumerated here.
I looked through every BIGBench task and took the ones that compared both GPT3 and PaLM against humans.
* Spreadsheet

The Best Large Language Models in 2023: Top LLMs - UC Today

Pathways Language Model (PaLM): Scaling to 540 Billion Parameters

The Flan Collection: Advancing open source methods for instruction
Gemini in-depth analysis. ChatGPT killer or scam?
Extrapolating GPT-N performance — AI Alignment Forum
InstructZero: Efficient Instruction Optimization for Black-Box

A Big Year For AI - Ahead of AI #4
GitHub - uncbiag/Awesome-Foundation-Models: A curated list of

2205.11916] Large Language Models are Zero-Shot Reasoners

What can and can't language models do? Lessons learned from BIGBench

Language Models Perform Reasoning via Chain of Thought – Google

Hidden abilities of large language models: Is emergence the norm?
de
por adulto (o preço varia de acordo com o tamanho do grupo)