Requirements?
#14, opened by sheyenrath
I see the following requirements:
- This model expects as input a single document image, rendered such that the longest dimension is 1024 pixels.
- The prompt must then contain the additional metadata from the document.
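For reference, this is how I currently read the first requirement, as a minimal sketch. It assumes Pillow for the actual resize; the helper names `target_size` and `resize_longest_side` are my own and do not come from the model's documentation, and I am assuming that smaller images should be scaled up as well as larger ones scaled down.

```python
from __future__ import annotations


def target_size(width: int, height: int, longest: int = 1024) -> tuple[int, int]:
    """Scale (width, height) so the longer side equals `longest`, keeping the aspect ratio."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    scale = longest / max(width, height)
    # Round to the nearest pixel, but never let a side collapse to 0.
    return max(1, round(width * scale)), max(1, round(height * scale))


def resize_longest_side(path: str, longest: int = 1024):
    """Load an image with Pillow and resize it so its longest side is `longest`."""
    from PIL import Image  # third-party dependency: pip install Pillow

    img = Image.open(path).convert("RGB")
    # Assumption: LANCZOS resampling; the requirement does not name a filter.
    return img.resize(target_size(img.width, img.height, longest), Image.LANCZOS)
```

With this reading, a 2048x1024 page would be scaled down to 1024x512, and a 512x512 page would be scaled up to 1024x1024.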
My questions:
- Is it really necessary that the longest dimension is 1024 pixels? What happens if it's not?
- Why does the prompt need to contain the additional metadata? Which metadata exactly is mandatory, and which is just useful?