allenai/olmOCR-7B-0225-preview


jakep-allenai committed 24 days ago

Commit 22314c3 (verified) · 1 parent: 2ad3a7c

Update README.md

Files changed (1)

README.md +15 -0


@@ -13,6 +13,21 @@ library_name: transformers

 This is a preview release of the olmOCR model that's fine tuned from Qwen2-VL-7B-Instruct.
+Quick links:
+- 📃 [Paper](link-to-paper)
+- 🤗 [Dataset](https://huggingface.co/allenai/olmOCR-mix-0225)
+- 🛠️ [Code](https://github.com/allenai/olmocr)
+- 🎮 [Demo](https://olmocr.allenai.org/)
+The best way to use this model is via the [olmOCR toolkit](https://github.com/allenai/olmocr).
+## Prompting
+This model expects as input a single document image, rendered such that the longest dimension is 1024 pixels.
+The prompt must then contain the additional metadata from the document, and the easiest way to generate this
+prompt is via the [olmOCR toolkit](https://github.com/allenai/olmocr).
 ## License and use
 olmOCR is licensed under the Apache 2.0 license.
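
The Prompting section added in this commit says the page image should be rendered so that its longest dimension is 1024 pixels. As a rough illustration of that sizing rule only, the sketch below computes the target pixel dimensions for a page of arbitrary size. The helper name `target_size` is hypothetical and is not part of the olmOCR toolkit, which handles rendering and prompt construction itself; the resulting size would then be passed to whatever image library does the actual resampling.

```python
def target_size(width: int, height: int, longest: int = 1024) -> tuple:
    """Return (width, height) scaled so the longer side equals `longest`.

    Hypothetical helper for illustration; not an olmOCR toolkit function.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    longest_side = max(width, height)
    # Multiply before dividing so the longer side lands exactly on `longest`.
    new_width = max(1, round(width * longest / longest_side))
    new_height = max(1, round(height * longest / longest_side))
    return new_width, new_height


# A US Letter page rasterized at 200 DPI is 1700 x 2200 pixels.
print(target_size(1700, 2200))  # (791, 1024)
```

The aspect ratio is preserved and only the longer side is pinned to 1024, so portrait and landscape pages are handled by the same call.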