Return to Article Details Unified Benchmark for Evaluating Performance, Bias, and Consistency in LLM Binary Question Answering Download Download PDF