About 14,600 results
Open links in new tab
  1. openai/gsm8k · Datasets at Hugging Face

    GSM8K (Grade School Math 8K) is a dataset of 8.5K high quality linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic …

  2. 【搬运】GSM8K 数据集介绍-CSDN博客

    Sep 28, 2025 · 本文介绍了OpenAI发布的GSM8K数据集,该数据集由8.5K高质量小学数学问题组成,分为训练和测试集。 为解决模型计算不准确问题,通过注入计算注释训练模型使用计算器,还介绍了 …

  3. GSM8k中文数据集 · Datasets

    Dataset GSM8K_zh is a dataset for mathematical reasoning in Chinese, question-answer pairs are translated from GSM8K (https://github.com/openai/grade-school-math/tree/master) by GPT-3.5 …

  4. GSM8K 评测基准详情 | 大模型排行榜 | DataLearnerAI

    一个包含 8500 道小学数学题的基准,用于评估模型的数学推理能力。 。 查看GSM8K介绍、评测指标、官方数据集链接、详细测试结果及大模型排名,掌握 AI 评测趋势!

  5. 完整教程:GSM8K:评估大模型数学推理能力的关键数据集 - 博客园

    Oct 13, 2025 · GSM8K(Grade School Math 8K)是一个包含 8,500 个高质量、语言多样化的小学数学单词问题 (Math Word Problems)的数据集。 该数据集由 OpenAI 团队创建,并于 2021 年通过论 …

  6. GitHub - openai/grade-school-math

    To diagnose the failures of current models and support research, we're releasing GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems.

  7. [2110.14168] Training Verifiers to Solve Math Word Problems

    Oct 27, 2021 · To diagnose the failures of current models and support research, we introduce GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems.

  8. gsm8k: Mirror of https://huggingface.co/datasets/gsm8k

    GSM8K (Grade School Math 8K) is a dataset of 8.5K high quality linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic …

  9. GSM8K Example — verl documentation

    Mar 25, 2025 · Here’s an example for GSM8k dataset and deepseek-llm-7b-chat model. Users could replace the data.train_files , data.val_files, actor_rollout_ref.model.path and critic.model.path based …

  10. GSM8K | DeepEval by Confident AI - The LLM Evaluation Framework

    Jan 6, 2026 · The GSM8K benchmark comprises 1,319 grade school math word problems, each crafted by expert human problem writers. These problems involve elementary arithmetic operations (+ − ×÷) …