Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2307.09288

It is a collection of papers that are useful in studying LLM.

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 61
LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 39
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 58
Lost in the Middle: How Language Models Use Long Contexts

Paper • 2307.03172 • Published Jul 6, 2023 • 40

Qwen/Qwen3-8B

Text Generation • Updated 10 days ago • 218k • 251
Qwen/Qwen3-4B

Text Generation • Updated 10 days ago • 154k • • 168
Qwen/Qwen3-0.6B

Text Generation • Updated 10 days ago • 246k • 213
google/gemma-3-4b-it

Image-Text-to-Text • Updated Mar 21 • 612k • 502

A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book.

Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning

Paper • 2211.04325 • Published Oct 26, 2022
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Paper • 1810.04805 • Published Oct 11, 2018 • 18
On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Paper • 2204.07705 • Published Apr 16, 2022 • 1

Source papers of LLM Giants

Qwen Technical Report

Paper • 2309.16609 • Published Sep 28, 2023 • 35
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models

Paper • 2311.07919 • Published Nov 14, 2023 • 10
Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 163
Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15, 2024 • 60

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 48
Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 243
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

Paper • 2309.11235 • Published Sep 20, 2023 • 15
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 391

royalmatrimonial

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 615
Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 367
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 257
LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 257

Paused

2.09k

2.09k

Anycoder

🏢

Select and view code snippets for different providers
Running

271

271

Qwen2.5 Coder Artifacts

🐢

Generate application code with Qwen2.5-Coder-32B
Running

915

915

QwQ-32B-Preview

🔍

QwQ-32B-Preview
Running on CPU Upgrade

13k

13k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

New Tools For Oct 2024

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 2.72M • • 10.1k
openai/whisper-large-v3-turbo

Automatic Speech Recognition • Updated Oct 4, 2024 • 7.07M • • 2.35k
meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • Updated Dec 4, 2024 • 597k • • 1.43k
deepseek-ai/DeepSeek-V2.5

Text Generation • Updated Dec 11, 2024 • 1.69k • 706

LLM Tech Report

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 367
Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 147
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Paper • 2409.12122 • Published Sep 18, 2024 • 3
Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 186

how to verify bank account on Wise business

If you want to know more or have any queries, just knock us here– Email: [email protected] Telegram: @Smmtoperofficial Skype: Smmtoperofficial WhatsA

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 243

Previous
1
2
3
...
9
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs