Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

golaxy
/
gogpt-560m

Text Generation
Transformers
PyTorch
Chinese
bloom
text-generation-inference
Model card Files Files and versions Community
3
  • GoGPT
    • 测试效果
      • TODO
        • 感谢

          GoGPT

          基于中文指令数据微调BLOOM img.png

          训练第一轮足够了,后续第二轮和第三轮提升不大

          • 🚀多样性指令数据
          • 🚀筛选高质量中文数据
          模型名字 参数量 模型地址
          gogpt-560m 5.6亿参数 🤗golaxy/gogpt-560m
          gogpt-3b 30亿参数 🤗golaxy/gogpt-3b

          测试效果

          img.png img.png img.png img.png img.png img.png

          TODO

          • 进行RLFH训练
          • 后续加入中英平行语料

          感谢

          • @hz大佬-zero_nlp
          • stanford_alpaca
          • Belle数据
          Downloads last month
          10
          Inference Providers NEW
          Text Generation
          This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

          Datasets used to train golaxy/gogpt-560m

          BelleGroup/school_math_0.25M

          Viewer • Updated Apr 8, 2023 • 248k • 462 • 104

          BelleGroup/train_0.5M_CN

          Viewer • Updated Apr 3, 2023 • 519k • 411 • 108

          BelleGroup/train_3.5M_CN

          Viewer • Updated Aug 16, 2023 • 3.61M • 356 • 131

          Spaces using golaxy/gogpt-560m 23

          🏆
          Intel/low_bit_open_llm_leaderboard
          🏆
          BAAI/open_cn_llm_leaderboard
          😻
          GTBench/GTBench
          🏆
          Vikhrmodels/small-shlepa-lb
          🏆
          kz-transformers/kaz-llm-lb
          🔢
          Vikhrmodels/DOoM-lb
          🎨
          OPTML-Group/UnlearnCanvas-Benchmark
          🥇
          BAAI/open_flageval_vlm_leaderboard
          🏆
          gsaivinay/open_llm_leaderboard
          🏆
          felixz/open_llm_leaderboard
          🌍
          neubla/neubla-llm-evaluation-board
          🏆
          rodrigomasini/data_only_open_llm_leaderboard
          Company
          TOS Privacy About Jobs
          Website
          Models Datasets Spaces Pricing Docs