This checkpoint has been reproduced based on the code provided in the facebookresearch/coconut repository and the experimental settings described in the paper Training Large Language Models to Reason in a Continuous Latent Space. Please refer to these sources for further details on the methodology and configuration used in this experiment.
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.
Model tree for Esther22/coconut_Reproduction
Base model
openai-community/gpt2