Llema - Search

About 23,500 results

Open links in new tab

Any time

arxiv.org
https://arxiv.org › abs
[2310.10631] Llemma: An Open Language Model For Mathematics …
Oct 16, 2023 · We present Llemma, a large language model for mathematics. We continue pretraining Code Llama on the Proof-Pile-2, a mixture of scientific papers, web data containing mathematics, and mathematical code, yielding Llemma.
eleuther.ai
https://blog.eleuther.ai › llemma
Llemma: An Open Language Model For Mathematics - EleutherAI …
Today we release Llemma: 7 billion and 34 billion parameter language models for mathematics. The Llemma models were initialized with Code Llama weights, then trained on the Proof-Pile II, a 55 billion token dataset of mathematical and scientific documents.
huggingface.co
https://huggingface.co › EleutherAI
EleutherAI/llemma_7b - Hugging Face
Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weights, and trained on the Proof-Pile-2 for 200B tokens. This model also comes in a 34B parameter version: Llemma 34B.
huggingface.co
https://huggingface.co › papers
Paper page - Llemma: An Open Language Model For Mathematics …
Oct 16, 2023 · We present Llemma, a large language model for mathematics. We continue pretraining Code Llama on the Proof-Pile-2, a mixture of scientific papers, web data containing mathematics, and mathematical code, yielding Llemma.
github.com
https://github.com › EleutherAI › math-lm
GitHub - EleutherAI/math-lm
Repository for Llemma: an open language model for mathematics [Azerbayev et al 2023]. This repository hosts data and training code related to the following artifacts: This repository also contains submodules related to the overlap, fine-tuning, and theorem proving experiments described in the paper.
arxiv.org
https://arxiv.org › pdf
[PDF]
1 arXiv:2310.10631v3 [cs.CL] 15 Mar 2024
a large language model for mathematics. We continue pretraining Code Llama on Proof-Pile-2, a mixture of scientific papers, web data containing mathematics. and mathematical code, yielding LLEMMA. On the MATH benchmark LLEMMA outperforms all known open base models, as well as the unreleased Minerv.
arxiv.org
https://arxiv.org › html
Llemma: an open language model for mathematics - arXiv.org
On the MATH benchmark Llemma outperforms all known open base models, as well as the unreleased Minerva model suite on an equi-parameter basis. Moreover, Llemma is capable of tool use and formal theorem proving without any further finetuning.
huggingface.co
https://huggingface.co › EleutherAI
EleutherAI/llemma_34b - Hugging Face
Oct 17, 2023 · Llemma 34B is a language model for mathematics. It was initialized with Code Llama 34B weights, and trained on the Proof-Pile-2 for 50B tokens. This model also comes in a 7B parameter version: Llemma 7B.
github.com
https://github.com › wellecks
GitHub - wellecks/llemma_formal2formal: Llemma formal2formal …
We observe a Ray error when running the 34b script (with VLLM --tp-degree > 1) on an untraced LeanDojo repo. A workaround is to run the 7b script with --tp-degree 1 such that LeanDojo completes tracing the repo. Then run the 34b script with --tp-degree > 1. Please cite the following: title={Llemma: An Open Language Model For Mathematics}, .
como-se-escribe.com
https://www.como-se-escribe.com › yema-o-llema
Yema o Llema - Cómo se escribe
El término llema está escrito de forma incorrecta. La palabra escrita correctamente es yema con y.
Some results have been removed
Pagination
- 1
- 2
- 3
- 4
- Next

[2310.10631] Llemma: An Open Language Model For Mathematics …

Llemma: An Open Language Model For Mathematics - EleutherAI …

EleutherAI/llemma_7b - Hugging Face

Paper page - Llemma: An Open Language Model For Mathematics …

GitHub - EleutherAI/math-lm

1 arXiv:2310.10631v3 [cs.CL] 15 Mar 2024

Llemma: an open language model for mathematics - arXiv.org

EleutherAI/llemma_34b - Hugging Face

GitHub - wellecks/llemma_formal2formal: Llemma formal2formal …

Yema o Llema - Cómo se escribe