Olesia Khrapunova. “Unified Benchmark for Evaluating Performance, Bias, and Consistency in LLM Binary Question Answering”. International Journal of Computer (IJC) 56, no. 1 (December 27, 2025): 319–338. Accessed January 9, 2026. https://ijcjournal.org/InternationalJournalOfComputer/article/view/2470.