Large language models (LLMs) have shown strong language generation performance across diverse domains. For example, they have achieved passing grades on examinations in the style of the US bar examination 1 ...