BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts Paper • 2408.08274 • Published Aug 15, 2024
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models Paper • 2306.11698 • Published Jun 20, 2023
Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT Paper • 2402.07440 • Published Feb 12, 2024
Simple linear attention language models balance the recall-throughput tradeoff Paper • 2402.18668 • Published Feb 28, 2024
Just read twice: closing the recall gap for recurrent language models Paper • 2407.05483 • Published Jul 7, 2024
LoLCATs: On Low-Rank Linearizing of Large Language Models Paper • 2410.10254 • Published Oct 14, 2024
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling Paper • 2407.21787 • Published Jul 31, 2024
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Paper • 2406.04391 • Published Jun 6, 2024
Zoology: Measuring and Improving Recall in Efficient Language Models Paper • 2312.04927 • Published Dec 8, 2023
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture Paper • 2310.12109 • Published Oct 18, 2023
RELIC: Investigating Large Language Model Responses using Self-Consistency Paper • 2311.16842 • Published Nov 28, 2023