SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models Paper • 2410.03750 • Published Oct 1, 2024 • 2
Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models Paper • 2501.17088 • Published Jan 28 • 2
SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models Paper • 2410.03750 • Published Oct 1, 2024 • 2
Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models Paper • 2501.17088 • Published Jan 28 • 2
SADGA: Structure-Aware Dual Graph Aggregation Network for Text-to-SQL Paper • 2111.00653 • Published Nov 1, 2021
Low-Rank Adapters Meet Neural Architecture Search for LLM Compression Paper • 2501.16372 • Published Jan 23 • 9
Low-Rank Adapters Meet Neural Architecture Search for LLM Compression Paper • 2501.16372 • Published Jan 23 • 9
ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution Paper • 2408.15993 • Published Aug 28, 2024 • 8
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Paper • 2408.02545 • Published Aug 5, 2024 • 38
CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity Paper • 2404.10513 • Published Apr 16, 2024 • 2
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models Paper • 2402.16786 • Published Feb 26, 2024
Using Imperfect Surrogates for Downstream Inference: Design-based Supervised Learning for Social Science Applications of Large Language Models Paper • 2306.04746 • Published Jun 7, 2023
Why do LLaVA Vision-Language Models Reply to Images in English? Paper • 2407.02333 • Published Jul 2, 2024
Shears: Unstructured Sparsity with Neural Low-rank Adapter Search Paper • 2404.10934 • Published Apr 16, 2024
A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities Paper • 2205.10358 • Published May 19, 2022