2025

ICLR 2025
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Rodriguez, Juan A, Jian, Xiangru, Panigrahi, Siba Smarak, Zhang, Tianyu, Feizi, Aarash, Puri, Abhay, Suresh, Akshay Kalkunte, Savard, François, and others
In International Conference on Learning Representations, 2025

2024

arXiv
GPS-SSL: Guided Positive Sampling to Inject Prior Into Self-Supervised Learning
Feizi, Aarash, Balestriero, Randall, Romero-Soriano, Adriana, and Rabbany, Reihaneh
arXiv preprint arXiv:2401.01990, 2024

2022

DataPerf WS ICML 2022
Revisiting Hotels-50K and Hotel-ID
Feizi, Aarash, Casanova, Arantxa, Romero-Soriano, Adriana, and Rabbany, Reihaneh
arXiv preprint arXiv:2207.10200, 2022

2020

EMNLP 2020
Structure Aware Negative Sampling in Knowledge Graphs
Ahrabian, Kian, Feizi, Aarash, Salehi, Yasmin, Hamilton, William L., and Bose, Avishek Joey
arXiv preprint arXiv:2009.11355, 2020