(1) Olesia Khrapunova. Unified Benchmark for Evaluating Performance, Bias, and Consistency in LLM Binary Question Answering. IJC 2025, 56 (1), 319-338.