technologyneutral
Exploring DOGE: A Leap in Visual Document Understanding
Monday, January 6, 2025
Using DOGE-Engine, we built DOGE-Bench, a benchmark that covers seven grounding and referring tasks across three types of documents: charts, posters, and PDFs. It gives us a thorough way to evaluate how well document-understanding models handle these tasks.
Using the data generated by the engine, we developed a strong baseline model called DOGE. It can accurately refer to and recognize text at multiple levels within document images. We are also making the code, data, and model publicly available.