  1. [2310.10631] Llemma: An Open Language Model For Mathematics …

    Oct 16, 2023 · We present Llemma, a large language model for mathematics. We continue pretraining Code Llama on the Proof-Pile-2, a mixture of scientific papers, web data containing mathematics, and mathematical code, yielding Llemma.

  2. Llemma: An Open Language Model For Mathematics - EleutherAI …

    Today we release Llemma: 7 billion and 34 billion parameter language models for mathematics. The Llemma models were initialized with Code Llama weights, then trained on the Proof-Pile II, a 55 billion token dataset of mathematical and scientific documents.

  3. EleutherAI/llemma_7b - Hugging Face

    Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weights, and trained on the Proof-Pile-2 for 200B tokens. This model also comes in a 34B parameter version: Llemma 34B.

  4. Paper page - Llemma: An Open Language Model For Mathematics …

    Oct 16, 2023 · We present Llemma, a large language model for mathematics. We continue pretraining Code Llama on the Proof-Pile-2, a mixture of scientific papers, web data containing mathematics, and mathematical code, yielding Llemma.

  5. GitHub - EleutherAI/math-lm

    Repository for Llemma: an open language model for mathematics [Azerbayev et al 2023]. This repository hosts data and training code related to the following artifacts: This repository also contains submodules related to the overlap, fine-tuning, and theorem proving experiments described in the paper.

  6. We present LLEMMA, a large language model for mathematics. We continue pretraining Code Llama on Proof-Pile-2, a mixture of scientific papers, web data containing mathematics, and mathematical code, yielding LLEMMA. On the MATH benchmark LLEMMA outperforms all known open base models, as well as the unreleased Minerva …

  7. Llemma: an open language model for mathematics - arXiv.org

    On the MATH benchmark Llemma outperforms all known open base models, as well as the unreleased Minerva model suite on an equi-parameter basis. Moreover, Llemma is capable of tool use and formal theorem proving without any further finetuning.

  8. EleutherAI/llemma_34b - Hugging Face

    Oct 17, 2023 · Llemma 34B is a language model for mathematics. It was initialized with Code Llama 34B weights, and trained on the Proof-Pile-2 for 50B tokens. This model also comes in a 7B parameter version: Llemma 7B.

  9. GitHub - wellecks/llemma_formal2formal: Llemma formal2formal …

    We observe a Ray error when running the 34b script (with VLLM --tp-degree > 1) on an untraced LeanDojo repo. A workaround is to run the 7b script with --tp-degree 1 such that LeanDojo completes tracing the repo. Then run the 34b script with --tp-degree > 1. Please cite the following: title={Llemma: An Open Language Model For Mathematics}, .

  10. Yema or Llema - How it is spelled

    The term llema is spelled incorrectly. The correctly spelled word is yema, with a y.
