Update README.md
Browse files
README.md
CHANGED
@@ -13,6 +13,21 @@ library_name: transformers
|
|
13 |
|
14 |
This is a preview release of the olmOCR model that's fine tuned from Qwen2-VL-7B-Instruct.
|
15 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
16 |
## License and use
|
17 |
|
18 |
olmOCR is licensed under the Apache 2.0 license.
|
|
|
13 |
|
14 |
This is a preview release of the olmOCR model that's fine tuned from Qwen2-VL-7B-Instruct.
|
15 |
|
16 |
+
Quick links:
|
17 |
+
- 📃 [Paper](link-to-paper)
|
18 |
+
- 🤗 [Dataset](https://huggingface.co/allenai/olmOCR-mix-0225)
|
19 |
+
- 🛠️ [Code](https://github.com/allenai/olmocr)
|
20 |
+
- 🎮 [Demo](https://olmocr.allenai.org/)
|
21 |
+
|
22 |
+
The best way to use this model is via the [olmOCR toolkit](https://github.com/allenai/olmocr).
|
23 |
+
|
24 |
+
## Prompting
|
25 |
+
|
26 |
+
This model expects as input a single document image, rendered such that the longest dimension is 1024 pixels.
|
27 |
+
|
28 |
+
The prompt must then contain the additional metadata from the document, and the easiest way to generate this
|
29 |
+
prompt is via the [olmOCR toolkit](https://github.com/allenai/olmocr).
|
30 |
+
|
31 |
## License and use
|
32 |
|
33 |
olmOCR is licensed under the Apache 2.0 license.
|