1. Olesia Khrapunova. Unified Benchmark for Evaluating Performance, Bias, and Consistency in LLM Binary Question Answering. IJC [Internet]. 2025 Dec. 27 [cited 2026 Jan. 9];56(1):319-38. Available from: https://ijcjournal.org/InternationalJournalOfComputer/article/view/2470