Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
The advancements in large language models have significantly accelerated the development of…
LoReFT: Representation Finetuning for Language Models
Parameter-efficient fine-tuning (PEFT) methods seek to adapt large language models via…
POKELLMON: A Human-Parity Agent for Pokemon Battles with LLMs
Large Language Models and Generative AI have demonstrated unprecedented success on a…
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
The advent of GPT models, along with other autoregressive (AR) large…
InstructIR: High-Quality Image Restoration Following Human Instructions
An image can convey a great deal, yet it may also be…