Intro

Body

<aside>

Topics

AdaptThink: Reasoning Models Can Learn When to Think

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

SCREENCODER: ADVANCING VISUAL-TO-CODE GENERATION FOR FRONT-END AUTOMATION VIA MODULAR MULTIMODAL AGENTS

</aside>

Conclusion

Reference