publications
2025
-
PairBench: Are Vision-Language Models Reliable at Comparing What They See? arXiv preprint arXiv:2502.15210, 2025
-
WebMMU: A benchmark for multimodal multilingual website understanding and code generation (EMNLP 2025). arXiv preprint arXiv:2508.16763, 2025
-
AlignVLM: Bridging vision and language latent spaces for multimodal understanding (NeurIPS 2025). arXiv preprint arXiv:2502.01341, 2025
-
Rendering-Aware Reinforcement Learning for Vector Graphics Generation (NeurIPS 2025). arXiv preprint arXiv:2505.20793, 2025
2024
-
GPS-SSL: Guided Positive Sampling to Inject Prior Into Self-Supervised Learning. arXiv preprint arXiv:2401.01990, 2024
-
FairLoRA: Unpacking Bias Mitigation in Vision Models with Fairness-Driven Low-Rank Adaptation. arXiv preprint arXiv:2410.17358, 2024
2022
-
DataPerf Workshop, ICML 2022
2020
-
EMNLP 2020