1 9 21

yotoshihiro

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

allenai/olmOCR-7B-0225-preview

liked a model 3 months ago

unsloth/r1-1776-GGUF

liked a Space 3 months ago

nanotron/ultrascale-playbook

View all activity

Organizations

None yet

yotoshihiro's activity

liked a model about 2 months ago

allenai/olmOCR-7B-0225-preview

Image-Text-to-Text • Updated Feb 25 • 420k • 634

liked a model 3 months ago

unsloth/r1-1776-GGUF

Text Generation • Updated Feb 19 • 1.12k • 101

liked 2 Spaces 3 months ago

2.56k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

2.09k

Anycoder

🏢

Select and view code snippets for different providers

liked a model 4 months ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated Mar 27 • 657k • • 3.83k

upvoted a paper 4 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 98

liked a Space 5 months ago

557

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

liked a Space 6 months ago

Compare Llms

🌍

liked a model 7 months ago

Snowflake/snowflake-arctic-embed-m

liked a dataset 7 months ago

bigcode/the-stack-smol

Viewer • Updated May 2, 2023 • 300k • 512 • 53

liked a dataset 8 months ago

yuxiang630/hqcode

Viewer • Updated Aug 1, 2024 • 221k • 61 • 16

liked a Space 8 months ago

364

Reward Bench Leaderboard

📐

Explore and analyze RewardBench leaderboard data

liked a model 10 months ago

meta-llama/Llama-3.1-405B

Text Generation • Updated Sep 25, 2024 • 17.9k • 929

liked a Space 11 months ago

934

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

upvoted a collection 12 months ago

Llama3-ChatQA-1.5

Collection

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 3 days ago • 44

liked a model about 1 year ago

MediaTek-Research/Breexe-8x7B-Instruct-v0_1

Text Generation • Updated Aug 2, 2024 • 17 • 55